Go Forth and Be a Fisher of Data

Written by Eric Olsen
Published September 27, 2003

This new meta-data system for fishing tasty information from the vast web sea is pretty amazing:

    In short, there might be information hidden on the web that cannot be gleaned from any individual page, but becomes apparent when many pages are examined together. And that information could be of great commercial value.

    The result is a new online service called WebFountain. A big computer at IBM hoovers up web pages and information from other sources such as newsgroups, syndicated content and newswires. Each incoming page is analysed to determine what language it is in. The context--a news report, a page on a company's website, a web-log entry--is determined. Verbs, nouns, adjectives, proper nouns, place names and even entire phrases are extracted, and are analysed for positive or negative connotations. The page is also classified by category - is it about baseball, Iranian politics or global warming?

    All the results from these various tests are then fed upwards into another layer of software that gathers statistics across multiple pages, counting references to particular words or phrases in particular contexts, and looking for trends. All of this is then wrapped up in another layer of software that allows users to query the system remotely across the internet as a "web service".

    Dr Tomkins hopes to create an ecosystem of service providers who will use the WebFountain service to analyse the web in different ways to serve different markets. A clipping service, for example, which monitors the press for mentions of a particular company or product, could easily be constructed using WebFountain. A corporate public-relations firm could use WebFountain to monitor public attitudes towards its clients or track which other firms they are mentioned alongside. How have new products been received by different age groups? Are customers grumbling about a product in one part of the world, but not in another? WebFountain can send an alert if anything unexpected happens, such as a sudden surge in mentions of a particular keyword. [The Economist]

I believe the first result they will glean from this is that Blogcritics rules.

Career media professional Eric Olsen is honored to be the founder and publisher of Blogcritics.org, which, quite frankly, rules - as do his wife and four children.
Keep reading for information and comments on this article, and add some feedback of your own!
Go Forth and Be a Fisher of Data
Published: September 27, 2003
Type:
Section: Sci/Tech
Filed Under: Sci/Tech: Internet, Sci/Tech: Software
Writer: Eric Olsen
Eric Olsen's BC Writer page
Eric Olsen's personal site
Spread the Word
Like this article?
Email this
Submit to del.icio.us Save to del.icio.us
RSS Feeds
All RSS Feeds (240+)
Comments on this article
BC articles by Eric Olsen
Sci/Tech: Internet
Sci/Tech: Software
All Sci/Tech Articles
Eric Olsen's personal weblog
All BC articles
All BC Comments

Comments

#1 — September 29, 2003 @ 18:06PM — Phillip Winn [URL]

Whatever it is they devise, it sounds like Google is probably about a half-step behind them. I have been consistently surprised at how well news.google.com works. Despite occasional miscategorizations, it's right more often than I expect.

Want comments emailed to you? No spam, promise! Address:

Add your comment, speak your mind

(Or ping: http://blogcritics.org/mt/tb/8724)

Personal attacks are not allowed. Please read our comment policy.





Remember Name/URL?

Please preview your comment!

Fresh
Articles
Fresh
Comments