At 12:29 PM 10/3/2001 -0700, Karsten M. Self wrote:
>on Wed, Oct 03, 2001 at 11:00:04AM -0400, Declan McCullagh ([EMAIL PROTECTED])
>wrote:
> >
> > On Wed, Oct 03, 2001 at 06:38:05AM -0700, Khoder bin Hakkin wrote:
> > > Must've never heard of caching..
> > >
> > > http://www.latimes.com/news/nationworld/nation/la-100301safe.story
>
> > Inevitable next step: Enterprising cypherpunk registers
> > censoredfedinfo.org, hunts through google's cache, posts everything
> > there, etc.
>
>Note that there are a relatively small number of Googles on the Net.

The trouble with Google and most other spiders is that they cannot access
the DBs behind the sites. Various industry estimates place the amount of
data not accessible to crawlers at up to 500x the HTML content. What's
needed are open-access data mining sites using more sophisticated crawlers
like http://telegraph.cs.berkeley.edu/

steve