Marc Boucher wrote: > At 07:28 18/08/2002 +1000, Felix Karpfen wrote: > > The "fetch" mode is completely different. In this "mode" WWWOFFLE also > plays the role of the browser, in that it can analyze the content of > html pages and request images used in them.*** So WWWOFFLE can filter > what it need or shouldn't fetch. This can't be done in "proxy" > mode.***
I believe that the "DontGet", the "Outgoing" and the "Monitor" lists relate to that mode. And that was where my problem was. > > To make the problem behind the initial query more concrete - one of my > > monitored web sites is "dailynews.yahoo.com". The actual pages fetched > > include the current unrequested content of: > > > > us.a1.yimg.com > >**** us.i1.yimg.com ***** > > I've visited this site (dailynews.yahoo.com) and haven't seen any > advertising. After examining my "dontget" section, here is the line you > should add to yours. > > > ### yahoo us, fr & uk > *://us.a1.*/*/a/* > *://eur.a1.*/*/a/* > *://eur.yimg.com/a/uk/* > *://us.yimg.com/a/* > > > It works perfectly for me (it doesn't filter non-ads). > As happens with depressing frequency, after posting my query I took another look at the web sites that were actually fetched. And, as an experiment, I added: *://.*.a1.yimg.com to my "DontGet" list. Based on _one_ test, that addition + setting my (Opera) browser to fetch only cached images has solved the problem. Yahoo non-ad images appear to come from "*.i1.yimg.com" ; these are still fetched automatically by WWWOFFLE. And, when looking at the contents of the web pages in WWWOFFLE cache while online, the browser refrains from automatically loading links to other web addresses when it displays the cached Yahoo pages. Again, thank you for taking the time to share your experience. Felix -- Felix Karpfen [EMAIL PROTECTED] Public Key 72FDF9DF (DH/DSA)
