Doğacan Güney wrote:
> I think you may also run a segment merge. If you run segmerge on a
> single segment (where you set number of reduce tasks to the desired
> number of fetchers) segmerge will put equal number of urls to every
> part. Then set fetcher.max.threads.per.host to a value greater than 1
> and you have a very unpolite fetcher. Please don't run this to fetch
> a site you don't control :)

.. because it destroys the built-in controls that Nutch uses to avoid
making multiple concurrent requests to the same site, or making them
too quickly.

-- 
Best regards,
Andrzej Bialecki     <><
Information Retrieval, Semantic Web
Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
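
To illustrate why an even split of URLs defeats per-host politeness, here is a minimal Python sketch. It is not Nutch code: the function names, the example hosts and the CRC-based host hash are all assumptions made for illustration. It compares a split that keeps each host in a single part with a split that simply deals URLs out evenly, and counts how many parts (and therefore how many concurrent fetchers) end up holding URLs for the same host.

```python
from collections import Counter
from urllib.parse import urlparse
import zlib


def partition_by_host(urls, num_parts):
    """Send every URL of a host to the same part (stable hash of the host name)."""
    parts = [[] for _ in range(num_parts)]
    for url in urls:
        host = urlparse(url).hostname
        parts[zlib.crc32(host.encode()) % num_parts].append(url)
    return parts


def partition_evenly(urls, num_parts):
    """Deal URLs round-robin so every part has the same size, ignoring hosts."""
    parts = [[] for _ in range(num_parts)]
    for i, url in enumerate(urls):
        parts[i % num_parts].append(url)
    return parts


def max_parts_per_host(parts):
    """Largest number of parts (concurrent fetchers) that share a single host."""
    seen = Counter()
    for part in parts:
        # Count each host once per part, however many of its URLs the part holds.
        for host in {urlparse(u).hostname for u in part}:
            seen[host] += 1
    return max(seen.values())


# Hypothetical input: 3 hosts with 8 pages each, split across 4 fetchers.
urls = [f"http://site{h}.example/page{p}" for h in range(3) for p in range(8)]

print(max_parts_per_host(partition_by_host(urls, 4)))  # 1: one fetcher per host
print(max_parts_per_host(partition_evenly(urls, 4)))   # 4: every fetcher hits site0
```

With the host-based split no host is ever fetched by more than one part at a time, so a per-host delay can be enforced locally. With the even split each part is the same size, but a single host can be hit by all fetchers at once, which is the behaviour warned about above.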
