Re: [Nutch-general] Separating nutch and hadoop configurations.

Andrzej Bialecki Wed, 11 Jul 2007 10:57:51 -0700

Briggs wrote:
> I am currently trying to figure out how to deploy Nutch and Hadoop
> separately.  I want to configure Hadoop outside of Nutch and have
> Nutch use that service, rather than configuring hadoop within nutch.
> I would think all that Nutch should need to know is the urls to
> connect to Hadoop, but can't figure out how to get this to work.
> 
> Is this possible?  If so, is there some sort of document, or archive
> of another list post for this?
> 
> Sorry for the ignorance.


If you have a clean hadoop installation up and running (made e.g. from 
one of the official Hadoop builds), it should be enough to put the 
nutch*.job file in ${hadoop.dir}, and copy bin/nutch (possibly with some 
minor modifications - my memory is a little vague on this ...).


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Re: [Nutch-general] Separating nutch and hadoop configurations.

Reply via email to