Hello everyone,
first of all, I am new to nutch. I installed nutch on my internet server
and tried to start crowling the internet.
I unterstood that there are two opportunities to generate a fetchlist.
First, using the parameter -topN to generate a limited list of the top
rated domains. Second, without the topN parameter it generated a
fetchlist of all unfetched urls. That's what i want to do now, but i
don't want to fetch ALL uncrawled domains at a time.
So is there an opportunity to crawl unfetched urls, but limit that to
1000 urls, ar what else?
Thank you and best regars,
Markus Thomas