Seed URL are filtered during inject. A URL rejected by rules does not get into CrawlDb and is not crawled.
You have to take care that seed URLs pass the filters. On 08/02/2013 08:49 PM, stone2dbone wrote: > Sebastian, > > Can you please clarify what you mean? Why can I not use > https://my.domain.name/inside/test/ as a seed URL? > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Nutch-returns-index-as-document-tp4080323p4082258.html > Sent from the Nutch - User mailing list archive at Nabble.com. >

