Hello Everyone,

I am still working on Nutch 2.1 with MySQL.

Today I spent the whole day testing different ways to crawl with Nutch.
The only way I know to make Nutch run is:

    user@linux # bin/nutch crawl urls -depth 3 -topN 5

I tested many websites and the results are always the same. I cannot
find a way to get more URLs parsed.

For example, I specify a website, and during the crawl Nutch finds about 150
URLs, but only about 10 of them are parsed (counted with a query on the
webpage table where the status field = 2).

Now I don't know what I should do, or what I should read or search for...


Sorry,
Weder
