At 11:15 AM +0100 6/5/01, Adrian Taylor wrote:
>allow: /cgi-bin/item.cgi?id=35
>allow: /cgi-bin/item.cgi?s=356
>disallow: /cgi-bin/item.cgi?(anything else)

limit_urls_to: ... [/cgi-bin/item.cgi(\?id=35|\?s=356)]

(That's off the cuff, don't know if the regex is quite right.) 
Anything that doesn't match a limit_urls_to pattern is excluded right 
off.

>I can't see how these would be possible with a list of exclusion regexes,
>which is what I find in 3.1.5 and 3.2.0b3. At least, not without negative
>lookahead assertions and all the whizzy Perl stuff that the system
>routines don't provide.

I don't think this is necessary. Granted, you can make regexp more 
easily with various Perl features, but you certainly can make 
equivalent regexp using system routines AFAIK. (I'm thinking back to 
my CS theory courses...)

I mean the "allow" and "disallow" attributes you mention sound a 
*lot* like limit_urls_to and exclude_urls to me. What differences do 
you see between these sets of attributes?

Also, many of your examples, you've given "allow" attributes multiple 
times--unless you've patched the Configuration class, attributes 
later in the file override earlier ones. This lets people use 
"include" attributes to include a global config file and then 
override in a different one.

-Geoff

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to