Googlewhack
==Research applications==
The probabilities of internet search result values for multi-word queries were studied in 2008 with the help of Googlewhacks.<ref>{{cite journal |title=Internet Search Result Probabilities, Heaps' Law and Word Associativity |author=Lansey JC, Bukiet B |journal=Journal of Quantitative Linguistics |date=January 2009 |volume=16 |number=1 |pages=40–66 |url=http://www.jonathan.lansey.net/publications/googlewhack.html |doi=10.1080/09296170802514153|s2cid=1808897 |url-access=subscription }}</ref><ref>{{YouTube|R0Z-PybQ8Gw|Googlewhacks for Fun and Profit}} Google Tech Talk 2008</ref><ref>{{cite web|url=http://www.jonathan.lansey.net/publications/Googlewhack_Poster.pdf|title=Poster Presentation|access-date=2014-03-28|archive-date=23 July 2011|archive-url=https://web.archive.org/web/20110723175938/http://www.jonathan.lansey.net/publications/Googlewhack_Poster.pdf|url-status=live}}</ref> Based on data from 351 Googlewhacks from the "WhackStack", a list of previously documented Googlewhacks,<ref>{{Cite web|date=Feb 13, 2010|title=The Whack Stack|url=http://www.googlewhack.com/tally.pl|url-status=dead|archive-url=https://web.archive.org/web/20130121224941/http://www.googlewhack.com/tally.pl|archive-date=Jan 21, 2013|access-date=Dec 17, 2021|website=Googlewhack}}</ref> the [[Heaps' law]] <math>\beta</math> coefficient for the indexed [[World Wide Web]] (about 8 billion pages in 2008) was measured to be <math>\beta=0.52</math>. This result is in line with previous studies that used fewer than 20,000 pages.<ref>Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Modern Information Retrieval, ACM Press, 1999.</ref> The Googlewhacks were key to calibrating the model so that it could be extended automatically to analyse the relatedness of word pairs.
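
As a rough illustration of what the <math>\beta</math> coefficient describes, Heaps' law models the number of distinct words <math>V</math> in a collection of <math>n</math> words as <math>V(n) = K n^\beta</math>. The following minimal sketch generates synthetic data with <math>\beta=0.52</math> and recovers the exponent by a least-squares fit on log-log axes. It is not the procedure used in the cited study, which worked from search result counts; the constant <code>K = 10.0</code>, the sample sizes, and the function names are assumptions chosen only for the example.

```python
import math


def heaps_vocabulary(n_tokens, k, beta):
    """Heaps' law estimate of vocabulary size: V(n) = K * n**beta."""
    return k * n_tokens ** beta


def fit_heaps(samples):
    """Fit log V = log K + beta * log n by ordinary least squares.

    samples is a list of (n_tokens, vocabulary_size) pairs.
    Returns the estimated (K, beta).
    """
    xs = [math.log(n) for n, _ in samples]
    ys = [math.log(v) for _, v in samples]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    k = math.exp(mean_y - beta * mean_x)
    return k, beta


# Synthetic, noise-free data: K = 10.0 is a hypothetical constant,
# beta = 0.52 is the value reported in the text above.
samples = [(n, heaps_vocabulary(n, 10.0, 0.52)) for n in (10**3, 10**4, 10**5, 10**6)]
k_hat, beta_hat = fit_heaps(samples)
print(f"K = {k_hat:.3f}, beta = {beta_hat:.3f}")
```

Because the synthetic data contain no noise, the fit returns the input parameters; with real measurements the estimate would depend on the sample and on how vocabulary size is counted.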