UPDATE: Changing the value of db.max.outlinks.per.page to -1 seems to have
fixed this issue.

Based on the description of this setting ("The maximum number of outlinks
that we'll process for a page.") I'm not 100% sure why this increases the
number of files that get indexed.  I'm curious to know, but at least it's
working now.
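
For anyone who finds this thread later, the override looks roughly like the
fragment below. This is a sketch that assumes the usual Nutch layout, where
site-specific overrides go in conf/nutch-site.xml inside the <configuration>
element; the property name and the -1 value are the ones mentioned above, and
the description text is my own wording, not the one shipped with Nutch.

```xml
<!-- conf/nutch-site.xml (assumed location for site-specific overrides) -->
<property>
  <name>db.max.outlinks.per.page</name>
  <!-- a negative value is understood to remove the per-page outlink cap -->
  <value>-1</value>
  <description>Process all outlinks found on a page.</description>
</property>
```

My best guess at the cause, which I have not verified: if the files are
reached through a directory listing or index page, each file is one outlink
of that page, so a cap on outlinks per page would also cap how many of those
files ever get fetched and indexed.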

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-Files-Limited-to-200-tp2825662p2825780.html
Sent from the Nutch - User mailing list archive at Nabble.com.
