Hello everyone,

first of all, I am new to nutch. I installed nutch on my internet server and tried to start crowling the internet. I unterstood that there are two opportunities to generate a fetchlist. First, using the parameter -topN to generate a limited list of the top rated domains. Second, without the topN parameter it generated a fetchlist of all unfetched urls. That's what i want to do now, but i don't want to fetch ALL uncrawled domains at a time. So is there an opportunity to crawl unfetched urls, but limit that to 1000 urls, ar what else?


Thank you and best regars,
Markus Thomas

Reply via email to