Actually, what I would like to see is a Nutch 2.x tutorial at the same level of detail as the NutchHadoopTutorial page: http://wiki.apache.org/nutch/NutchHadoopTutorial

What is the process for contributing to that wiki page?
On Tue, Jan 21, 2014 at 9:33 PM, Julien Nioche <lists.digitalpeb...@gmail.com> wrote:

> Hi
>
> The whole thing has been replaced with
> http://wiki.apache.org/nutch/NutchHadoopSingleNodeTutorial
> which does exactly what you described. +1 to remove the old
> nutchhadooptutorial page.
>
> J.
>
>
> On 21 January 2014 17:44, Tejas Patil <tejas.patil...@gmail.com> wrote:
>
>> Hi nutch-dev,
>>
>> I was looking at [0] and realized that, with the massive number of Hadoop
>> setup tutorials out there on the internet, we need not repeat the same on
>> the Nutch wiki page and can instead assume that the user has already done
>> the Hadoop setup. For convenience, we could direct users to the Hadoop
>> wiki page which has the Hadoop setup details.
>> Plus, I propose the following:
>>
>> - Section "Downloading Hadoop and Nutch": Remove the Hadoop portions and
>> let the Nutch stuff stay.
>> - Section "Setting Up The Deployment Architecture" must be removed.
>> - Sections "Deploy Nutch to Single Machine" and "Deploy Nutch to Multiple
>> Machines" can be merged together.
>> - Sections "Performing a Nutch Crawl", "Testing the Crawl" and "Performing
>> a Search" must be merged, and their contents must be updated.
>> - Sections "Rsyncing Code to Slaves" and "Updates" can be completely
>> removed.
>>
>> Any comments?
>>
>> [0] : http://wiki.apache.org/nutch/NutchHadoopTutorial
>>
>> Thanks,
>> Tejas
>>
>
> --
> Open Source Solutions for Text Engineering
>
> http://digitalpebble.blogspot.com/
> http://www.digitalpebble.com
> http://twitter.com/digitalpebble