I've been running multiple standalone machines with some different
config files and very different crawldb for a while now. However I'd
like to start running them all distributed over using the same cluster.
Do configuration files, specifically nutch-site.xml *-urlfilter.txt get
read at the beginning of a job on all machines? For things like generate
which immediately run partion afterwards, does the partition job pick up
the same config as the generate job, or are they read again from the
filesystem?


patrik
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to