agreed. i'm interested in running only dfs at this time, and have just about boiled a runtime down so that it is independent of the default scripts, which assume ssh/rsync/etc. my goal is a deployable zip to which one can add prebaked configurations specific to the target environment: 1 sh script, 1 lib dir, and optional configs.
so, i'd like to see the sort of subsystem breakout you are referring to, and i can possibly help w/ the dfs aspects.

hth,
- james

On 10/26/06, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
I know Hadoop is separate from Nutch, but I found the Hadoop Tutorial (http://wiki.apache.org/nutch/NutchHadoopTutorial) on the Nutch Wiki to be quite informative in filling in some gray areas for me on how to get Hadoop working, so I was wondering if it is all right to link to this one, or should some effort be made (by me) to extract the relevant Hadoop pieces from this link and put them in a new page on the Hadoop wiki? I know some users may be confused by the talk of Nutch in it.

Thanks,
Grant
