Seed URLs are filtered during inject.
A URL rejected by the URL filter rules does not get
into the CrawlDb and is not crawled.

You have to take care that seed URLs pass the filters.
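
To check this outside of Nutch, here is a rough Python sketch of how I
understand the regex URL filter to evaluate rules: a leading '+' accepts,
a leading '-' rejects, the first rule whose pattern is found in the URL
decides, and a URL that matches no rule is rejected. The rules below are
made up for illustration (they are not your actual regex-urlfilter.txt),
and the helper names parse_rules and passes are hypothetical. Java and
Python regex syntax differ in places, so treat this as an approximation.

```python
import re

# Hypothetical rules in the style of conf/regex-urlfilter.txt.
# Assumption: '+' accepts, '-' rejects, first matching rule wins.
RULES = r"""
# skip URLs containing characters that usually indicate queries
-[?*!@=]
# accept only plain http on this host (note: https is NOT covered)
+^http://my\.domain\.name/
"""


def parse_rules(text):
    """Turn rule lines into (accept, compiled_pattern) pairs."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # ignore blank lines and comments
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError("rule must start with '+' or '-': " + line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def passes(url, rules):
    """Return True if the first matching rule accepts the URL."""
    for accept, pattern in rules:
        if pattern.search(url):  # substring match unless anchored
            return accept
    return False  # no rule matched: the URL is dropped


if __name__ == "__main__":
    rules = parse_rules(RULES)
    for seed in ("http://my.domain.name/inside/test/",
                 "https://my.domain.name/inside/test/"):
        print(seed, "passes" if passes(seed, rules) else "is rejected")
```

With these example rules the https seed is rejected because the only
accepting rule is anchored to http://. Changing that rule to
+^https?://my\.domain\.name/ would let both seeds through. Whether this
is what happens in your setup depends on your own filter files.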

On 08/02/2013 08:49 PM, stone2dbone wrote:
> Sebastian,
> 
> Can you please clarify what you mean?  Why can I not use
> https://my.domain.name/inside/test/ as a seed URL?
