Joel Newkirk wrote:
> On Tue, 7 Oct 2008 10:52:13 +1100, Carsten Haitzler (The Rasterman)
> <[EMAIL PROTECTED]> wrote:
>> On Thu, 02 Oct 2008 20:52:00 +0200 "Marco Trevisan (Treviño)"
>> <[EMAIL PROTECTED]>
>> babbled:
>>
>>> So this is a little utility I wrote [1] to check the frequency of each
>>> word and writing back a new dictionary with frequency data.
>>>
>>> To run it you need php-cli (I guess v5 or above), set the given options,
>>> do "php words-popularity.php" and wait the work to be finished! :P
>>>
>>> It could be a long work, but it should give good results.
>> yes. it would. who wants to run it? :)
>
> Ummm. Nice try, but see the results below. I ran this remotely on my
> workstation at work, since it's right on a 20mb fiber... Hopefully I'll be
> allowed to use Google again by the time I get in to work tomorrow... ;)
>
> j
>
>
> [1328/98568]
> [1329/98568]
> [1330/98568]
> Unknown Popularity of Aeneid's [1331/98568]
> Unknown Popularity of Aeolus [1332/98568]
> Unknown Popularity of Aeolus's [1333/98568]
> Unknown Popularity of aeon [1334/98568]
> Unknown Popularity of aeon's [1335/98568]
> Unknown Popularity of aeons [1336/98568]
> Unknown Popularity of aerate [1337/98568]
> Unknown Popularity of aerated [1338/98568]
> Unknown Popularity of aerates [1339/98568]
> Unknown Popularity of aerating [1340/98568]
> Unknown Popularity of aeration [1341/98568]
> Unknown Popularity of aeration's [1342/98568]
> Unknown Popularity of aerator [1343/98568]
> Unknown Popularity of aerator's [1344/98568]
> Unknown Popularity of aerators [1345/98568]
>
> After which browsing to www.google.com results in:
> Google
> Error
As I said above, after I had collected information for about 420000 words,
Google blocked it, since bigG doesn't allow this kind of batch search (I
didn't know :P). The only way to do it the way I did is to ask Google for
permission, I guess.

-- 
Treviño's World - Life and Linux
http://www.3v1n0.net/