Doesn't it always work out that you find the answer right after asking publicly?

db.max.outlinks.per.page

seemed to do the trick.

Steven

Steven Yelton wrote:

I have a page that has an enormous amount of links on it:

http://www.devdaily.com/unix/man/longlist.shtml

I would like nutch to fetch and index all the pages, but it stops after 80 or so. I have made sure that the http.content.limit setting exceeds the size of the page. I surveyed all the other settings, and don't see one that seems applicable.

This appears to be the last hurdle for me to replace a proprietary search with nutch.

Thanks in advance,
Steven

Reply via email to