
Cuil and Semantic Search [2008]

[Originally written on 2008-08-13, http://tw.rpi.edu/weblog/2008/08/13/cuil-semantic-search/]

Cuil, Semantic Search

Last week, Cuil.com caught my eye. It made a very good impression within just 5 seconds (BTW, 10 seconds is the most any website gets from me before I move on). First I tried, as many people probably do, my own name. It didn’t disappoint: it hit my pages quite precisely. I also love the grid-based layout. A few minutes later, I found its “Explore by Category” option. It looks like Cuil has some sort of ontology hierarchy for web pages.

A few “google” results reveal that Cuil may use some clustering technique to build such hierarchies. An interesting question is whether such hierarchies will indeed improve the search experience. When I search for “Semantic Web”, Cuil recommends that I browse “Ontology (computer Science)” and some of its subcategories; it also suggests that I look at James Hendler’s homepage. I would say this will be very useful for exploring.

Building metadata using machine learning is a cool thing. On the other hand, I believe that human intervention is also critical. When Wikipedia knowledge is used in clustering, I expect some gain in recall or precision. Since “Ontology (computer Science)” is a Wikipedia page, I guess that Cuil may already be using Wikipedia information in its results.

Also, don’t forget the “network effect”. I have kept a prefix-based, syntactic Gmail label hierarchy for a while. I would really like to share part of that hierarchy with my friends, so that when I send a mail labeled “party”, they don’t need to relabel it. If millions of users could share their small hierarchies (not only on Gmail, but also on Flickr, YouTube, Twine, etc.), each connected somehow to the hierarchies of friends and family, eventually we would have a very large network of ontologies, which might improve search far beyond what we can do now. Just a random thought.
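To make the idea a bit more concrete, here is a minimal sketch of how prefix-based labels could be turned into a small hierarchy and merged with a friend’s. Everything in it is an assumption for illustration: the “/” separator, the example labels, and the function names `build_hierarchy` and `merge_hierarchies` are made up, and this is not how Gmail or Cuil actually work.

```python
def build_hierarchy(labels, sep="/"):
    """Turn flat, prefix-style labels into a nested dict tree.

    The "/" separator is an assumption; any prefix convention would do.
    """
    tree = {}
    for label in labels:
        node = tree
        for part in label.split(sep):
            # Walk down the tree, creating missing levels on the way.
            node = node.setdefault(part, {})
    return tree


def merge_hierarchies(a, b):
    """Return a new tree containing every path found in either input tree.

    Neither input is modified.
    """
    # Start from a deep copy of `a` ...
    merged = {key: merge_hierarchies(sub, {}) for key, sub in a.items()}
    # ... then fold in every branch of `b`, recursing where both share a key.
    for key, sub in b.items():
        merged[key] = merge_hierarchies(merged.get(key, {}), sub)
    return merged


# Hypothetical label sets for two users.
mine = build_hierarchy(["social/party", "social/family", "work/papers"])
friend = build_hierarchy(["social/party/birthday", "travel"])
shared = merge_hierarchies(mine, friend)
```

Here the merge is a plain union of paths, so the shared “social/party” branch lines up only because both of us happen to use the same label names. Connecting hierarchies whose owners name things differently is the hard part, and this sketch does not attempt it.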

P.S. I found one interesting thing. Cuil caches my wiki page at Iowa State University. However, that page should have been offline since May 2008 at the latest, while Cuil officially went online only on July 28, 2008. It seems its crawler had been alive for a while.

Jie Bao

P.S. 2012-01-03: cuil.com is offline now; see its Wikipedia page: http://en.wikipedia.org/wiki/Cuil
