> No, this is incorrect: the 404 error page is cached by WWWOFFLE. It would be better to verify all the URLs first anyway, before feeding the list of (by then only existing) pages to wwwoffle. I don't know yet how to do that, maybe with a wget script?
Basically, I would process all cache entries whose content is (X)HTML, skipping images and other media, as I'm mostly interested in text. I can imagine a second pass where I send wwwoffle out to fetch all the missing content of the newly cached pages. Would it work to simply request all of them with recursion depth 0?
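
For the "verify the URLs first" step, here is a rough sketch in Python rather than a wget script. It is only an illustration: the function names `url_exists` and `filter_existing` are made up for this example, and it assumes a HEAD request is enough to tell whether a page exists (some servers answer HEAD differently from GET, so treat the result as a hint).

```python
import urllib.error
import urllib.request


def url_exists(url, timeout=10):
    """Return True if a HEAD request for url ends in a 2xx/3xx status.

    Hypothetical helper: 404 and other HTTP errors, DNS failures,
    timeouts and malformed URLs all count as "does not exist".
    """
    try:
        req = urllib.request.Request(url, method="HEAD")
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (urllib.error.URLError, ValueError, OSError):
        # HTTPError (e.g. 404) is a subclass of URLError.
        return False


def filter_existing(urls, check=url_exists):
    """Keep only the URLs for which check(url) is true, preserving order."""
    return [u for u in urls if check(u)]
```

Calling `filter_existing` on the list of URLs returns only the ones that answered, and that shorter list is what would then be handed to wwwoffle. The `check` parameter is there so the filter can be tried out without touching the network.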
