Thanks to the proliferation of web search engines and their increased efficiency, it has been possible to develop other types of similarity measures based in this type of application.
The main advantage of using search engines is that almost any possible word or meaning can be indexed, so it is not necessary to rely on limited data sources or vocabularies, where the descriptions might be limited or even non-existent.
One of the first works based on Web search engines is the one developed
by Strube. It performs a basic measure such as taking the results obtained when performing a search (hits, page counts) from a search engine and applying the so-called Jaccard coefficient.
続きをみる