At 4:52 PM -0700 8/17/00, [EMAIL PROTECTED] wrote: > Rejected: Item in the exclude list: item # 3 length: 1 > > url rejected: (level 1)http://www.DOMAIN.com/index.html > >My problem is likely in this "exclude list" but I don't know where that's >coming from. There's nothing in the htdig.conf file that would indicate >such a list, and I don't think I'm intentionally doing anything. There are several reasons that it's rejected. There are the limit_urls_to and exclude_urls attributes in your htdig.conf as well as the robots.txt file you mentioned in the subject of your message. The latter is included if you've turned on this much debugging information--the patterns in the robots.txt file are spit out when htdig first starts indexing the server. It's hard to say more since you haven't given a concrete example or the relevent sections of your htdig.conf or robots.txt files. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ ------------------------------------ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: <http://www.htdig.org/mail/menu.html> FAQ: <http://www.htdig.org/FAQ.html>
