Hi Ali

You need to modify $NUTCH_HOME/conf/nutch-site.xml and rebuild the job file
with 'ant job'. In distributed mode the conf files are taken from within
the job file

HTH

Julien


The configuration files for the "local" mode are setup fine (since a crawl
> in local mode succeeded). However, for running in deploy mode (as output
> above), since the "deploy" folder did not have any "conf" subdirectory, I
> assumed that either:
> a) the conf files need to be copied over under "deploy/conf", OR
> b) the conf files need to be placed onto HDFS.
>
> I have verified that option (a) above does not fix the issue. So, I'm
> assuming that the Nutch configuration files need to exist in HDFS, for the
> HDFS fetcher to run successfully? However, I don't know at what path within
> HDFS I should place these Nutch conf files, or perhaps I'm barking up the
> wrong tree?
>
> If Nutch reads config files during "deploy" mode from the files under
> "local/conf", then why is it that the local crawl worked fine, but the
> deploy-mode crawl isn't?
>



-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

Reply via email to