I've made a Perl CGI script (compatible with wwwoffle CGI interface) to replace lasttime indexes. It has features:
- next of every URL and host is [HideThis] link. Until you click it, the URL stays forever on index. - uses Berkeley DB file -> faster than wwwoffle's lasttime. Update script shoul be run by wwwoffle -offline. - looks into every URL headers + into outgoing index and uses simple scoring mechanism to autohide obvious junk - tries to compute MD5 of content and shows URL only if it is new or if MD5 changed. This doesn't work very well, as many sites contain some date information. If anyone would be interested, I could provide it. Beware, it's just chunk of Perl code with no documentation (albeit I can answer any questions). Probably I won't now have the need to polish it, becuse I got 24x7 Internet connection..... Juraj On Sunday 05 September 2004 21:11, Dan Jacobson wrote: > Andrew> recompile WWWOFFLE > > With my http://jidanni.org/comp/wwwoffle/wwwoffle-chunks package, even > 76 year-olds are enjoying a lifetime of indexes that don't go away > until YOU the user say they do. > > Some crud from 2002 that you haven't finished reading? No > problemo. There it stays indexed, solid as a rock. > (What? You purged the files it points to? Well with the big discs of > today who asked you to do that?)
