Great stuff, Paul! A few minor corrections.
Apache Wiki wrote:
1. The env var NUTCH_MASTER is set to the hostname of the master machine.
This is optional. The alternative is to mount a common home directory with NFS, as many clusters do, and keep the Nutch software there.
Also, NUTCH_MASTER is an rsync path, so it should be set to something of the form host:/path/to/nutch, e.g., "foo.bar.com:/home/$USER/src/nutch".
2. The slave nodes are defined by putting list of hostnames, one per line, in ~/.slaves (alternatively, use NUTCH_SLAVES to refer to a different file).
This location can be altered with the environment variable NUTCH_SLAVES. Thanks for writing this. Doug ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
