Ah, I know what the problem is. The wwwoffle.conf file needs beefed-up examples:

#### Example ####
# Don't get from any hosts in the barfoo.com domain.
# *://*.barfoo.com/                       #1
# Don't get any gzipped or tar files.
# *://*/*.gz
# *://*/*.tar
# Don't get any files from /adverts in the foo.com domain.
# *://*.foo.com/adverts*                  #2

#1 and #2 need to be moved side by side, with an explicit note about
 the slight differences you mention below. You gotta admit, the way it
 is worded above makes one believe one will never be bothered with any
 BarFoo corporation junk again, when in fact it is merely their home
 page that is being blocked.
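
For instance, the revised example might read something like the sketch
below. It assumes the 2.7-beta (and later) matching that Andrew describes
in the message quoted below; the comment wording is my own suggestion,
not text from the shipped wwwoffle.conf.

```
DontGet
{
# Don't get ONLY the front page of hosts in the barfoo.com domain
# (matches http://www.barfoo.com/ but not http://www.barfoo.com/foo).
 *://*.barfoo.com/
# Don't get ANYTHING from hosts in the barfoo.com domain
# (matches http://www.barfoo.com/ and http://www.barfoo.com/foo).
 *://*.barfoo.com/*
# Don't get any files from /adverts in the foo.com domain.
 *://*.foo.com/adverts*
}
```

With the two barfoo.com lines next to each other, the effect of the
trailing "*" is hard to miss.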

>>>>> "ab" == Andrew M Bishop <[EMAIL PROTECTED]> writes:

ab> Hi,
>> OK, next release I hope this works as I expect
>> DontGet
>> {

ab> ...

>> *://*.amazon.com/

>> }

ab> If you want to get any pages on the www.amazon.com server EXCEPT the
ab> front page http://www.amazon.com then this will work as you expect.

ab> Previously I answered this question with:

ab> : With version 2.7-beta (and later versions) of WWWOFFLE this is the
ab> : case.  There was an error in the implementation of the
ab> : URL-SPECIFICATIONs in earlier versions that meant that
ab> : *://*.amazon.com/ got parsed as all files on *.amazon.com when I
ab> : intended only *://*.amazon.com/* or *://*.amazon.com to mean that.
ab> : The problem was that it was impossible to specify the specific URL of
ab> : http://www.amazon.com/ with the existing URL-SPECIFICATION format.

ab> *://*.amazon.com/ means http://www.amazon.com/ matches but
ab> http://www.amazon.com/foo does not.

ab> *://*.amazon.com/* means http://www.amazon.com/ matches and
ab> http://www.amazon.com/foo does too.

ab> -- 
ab> Andrew.
ab> ----------------------------------------------------------------------
ab> Andrew M. Bishop                             [EMAIL PROTECTED]
ab>                                       http://www.gedanken.demon.co.uk/


-- 
http://www.geocities.com/jidanni/ Taiwan(04)25854780
