Christopher Murtagh's bits of Thu, 20 Jun 2002 translated to:

> Currently, I have htDig configured to ignore any URLs with '?' in them
>because it indexes thousands of pages within our University that shouldn't
>be and has potential infinite loop problems. However, there is one URL
>that I would like htDig that has a list of these URLs that I would also
>like it to include. So, my question is:
>
> Is it possible to tell htDig to exclude pattern '?', but index URLs that
>match 'www.foobar.com/?foo=' ?

You could dig the two cases separately and then merge the
resulting databases (see the -m option of htmerge). You might
also take a look at htdig's -m option. If you only have one (or a
few) exceptions, you might be able to get away with just running
htdig again with -m before running htmerge.

Jim



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
We have stuff for geeks like you.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to