Dan Jacobson wrote: > To see what files have been fetched repeatedly recently, try > cd /var/cache/wwwoffle && find *time* -name U\*|xargs sed :|sort|uniq -c|sort -nr
OK - did that; a relevant extract is attached. Some of the attached entries are monitored URLs, where the content of the page is known to change each day. Others (mainly .gif entries) are unsolicited additions to entries that had been flagged for monitoring and that <may|may not> be changed; the latter are only a small fraction of the URLs that WWWOFFLE lists daily during the download with the comment "Unchanged; not fetched" (or words to that effect). So is there a remedy or is it just an opportunity for teeth-nashing? Felix Karpfen -- Felix Karpfen [EMAIL PROTECTED] Public Key 72FDF9DF (DH/DSA)
8 http://abc.net.au/common/logos/whtblkwht.gif 8 http://abc.net.au/news/default.htm 8 http://canberra.yourguide.com.au/ 8 http://canberra.yourguide.com.au/home.asp 8 http://canberra.yourguide.com.au/setup.asp?mast_id=133 8 http://derstandard.at/Text/?ressort=politik 8 http://diepress.oewabox.at/blank.gif 8 http://diepress.oewabox.at/cgi-bin/ivw/CP/RedCont/Nachrichten/zaehlgif.ivw 8 http://finance.yahoo.com/mo?u 8 http://ichart.yahoo.com/t?s=%5EDJI 8 http://ichart.yahoo.com/t?s=%5EIXIC 8 http://news.google.com.au/ 8 http://www.abc.net.au/news/default.htm 8 http://www.bom.gov.au/cgi-bin/wrap_fwo.pl?IDN10049.txt 8 http://www.diepresse.at/ 8 http://www.diepresse.at/img/white.gif 8 http://www.ebroadcast.com.au/cgi-bin/csDynamic/csDynamic.cgi?command=view&cid=1&j=1 8 http://www.ebroadcast.com.au/tv/pic/2003_nav_1stlinespace.gif 8 http://www.ebroadcast.com.au/tv/static/CanberraNight.html 8 http://www.ecars.com.au/pic/2003_nav_1stlinespace.gif
