[Nutch-general] slow distributed crawling

Des Sant Fri, 22 Jun 2007 08:31:06 -0700

hello

I tried to start a crawl on a single machine, but with a distributed
configuration (the single machine acting as master and slave at the same
time). The server communicates with itself through ssh.
It works and it crawls, but with very bad performance, much slower than
a local crawl on the same machine. Is it due to the Hadoop overhead,
or did I do something wrong?



thanks for the help





