Doğacan Güney wrote:
> I think you may also run a segment merge. If you run segmerge on a
> single segment(where you set number of reduce tasks to the desired
> number of fetchers) segmerge will put equal number of urls to every
> part. Then set fetcher.max.threads.per.host to a value greater than 1
> and  you have a very unpolite fetcher. Please don't run this to fetch
> a site you don't control :)

.. because it destroys the built-in controls that Nutch uses to avoid 
making multiple concurrent requests to the same site, or to make them 
too quickly.


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to