hello I tried to start crawl on a single machine, but with ditributed configuration (single machine as master and slave at the same time). Server communicates with itself throgh ssh. It works and it crawls, but with very bad performance, much slower than with local crawl on the same machine. Is it due to the hadoop overhead or did I something wrong?
thanks for help ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
