Hi! I use apache-nutch-2.2.1 and for my database mysql. I run the crawl script and each time I check my database, I see that the number of links with status=1 are many more than the number of links with status=2(=link successfully fetched) (~ 80%-10%). Does anyone know why I have so many links with status=1(= link unfetched); Can I improve that percentage so i can extract more information;

Reply via email to