I've written a Perl CGI script (compatible with the wwwoffle CGI interface)
to replace the lasttime indexes. Its features:

- next to every URL and host there is a [HideThis] link. Until you click it,
the URL stays on the index forever.
- uses a Berkeley DB file, so it is faster than wwwoffle's lasttime. The
update script should be run by wwwoffle -offline.
- looks at every URL's headers and at the outgoing index, and uses a simple
scoring mechanism to autohide obvious junk
- tries to compute the MD5 of the content and shows a URL only if it is new
or if its MD5 has changed. This doesn't work very well, as many sites embed
some date information in their pages.
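
For illustration, the MD5 check in the last item could look roughly like
this. It is only a minimal sketch, not the actual script: should_show and
%seen are made-up names, and a plain hash stands in for the Berkeley DB file
(with DB_File it could presumably be tied to disk instead).

```perl
use strict;
use warnings;
use Digest::MD5 qw(md5_hex);

# Stand-in for the Berkeley DB file: maps URL -> last seen MD5 (assumption).
my %seen;

# Hypothetical helper: returns 1 if the URL is new or its content digest
# has changed since the last call, 0 otherwise. Always stores the new digest.
sub should_show {
    my ($seen, $url, $content) = @_;
    my $digest = md5_hex($content);
    my $old    = $seen->{$url};
    $seen->{$url} = $digest;
    return (!defined $old || $old ne $digest) ? 1 : 0;
}

my $url = 'http://example.com/';
print should_show(\%seen, $url, "<p>news</p> 2004-09-05"), "\n";  # 1, new URL
print should_show(\%seen, $url, "<p>news</p> 2004-09-05"), "\n";  # 0, unchanged
print should_show(\%seen, $url, "<p>news</p> 2004-09-06"), "\n";  # 1, only the date changed
```

The last call shows the weakness: the page text is the same, but the
embedded date alone changes the digest, so the URL is shown again.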

If anyone is interested, I could provide it. Beware, it's just a chunk of
Perl code with no documentation (although I can answer any questions). I
probably won't need to polish it now, because I've got a 24x7 Internet
connection...

Juraj

On Sunday 05 September 2004 21:11, Dan Jacobson wrote:
> Andrew> recompile WWWOFFLE
>
> With my http://jidanni.org/comp/wwwoffle/wwwoffle-chunks package, even
> 76 year-olds are enjoying a lifetime of indexes that don't go away
> until YOU the user say they do.
>
> Some crud from 2002 that you haven't finished reading? No
> problemo. There it stays indexed, solid as a rock.
> (What? You purged the files it points to? Well with the big discs of
> today who asked you to do that?)

