I thought that running the fetch command (bin/nutch fetch ...) already does some kind of parsing; otherwise, how does it get the next level of URLs?

And in that case, in which part of the fetch process is the parsing done: the map or the reduce?

Thanks again,
Rafit



From: "Håvard W. Kongsgård" <[EMAIL PROTECTED]>
Reply-To: [email protected]
To: [email protected]
Subject: Re: The parsing is part of the Map or part of the Reduce?
Date: Sat, 28 Jan 2006 23:05:05 +0100

So you have been following the quick tutorial for Nutch 0.8 and later at media-style...
The author has left out the parse and updatedb steps.
After fetch, simply run bin/nutch parse segment/2006xxxx and then bin/nutch updatedb crawldb segment/2006xxxx.
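
As a rough sketch of those two post-fetch steps, the script below only prints the commands for a given segment rather than running them. It assumes that updatedb takes the crawldb directory first and the segment second; the NUTCH and CRAWLDB variables, the post_fetch_steps helper, and the segment path are placeholders for illustration, not values from the tutorial.

```shell
#!/bin/sh
# Dry-run sketch: prints the post-fetch steps for one segment instead of running them.
# NUTCH, CRAWLDB and the segment path are assumed placeholders.
NUTCH="${NUTCH:-bin/nutch}"
CRAWLDB="${CRAWLDB:-crawldb}"

# post_fetch_steps is a hypothetical helper, not part of Nutch.
post_fetch_steps() {
    segment="$1"
    # Step 1: parse the fetched content of the segment.
    echo "$NUTCH parse $segment"
    # Step 2: feed the parse results back into the crawldb
    # (assumed argument order: crawldb directory, then segment).
    echo "$NUTCH updatedb $CRAWLDB $segment"
}

post_fetch_steps "segment/2006xxxx"
```

Once the printed commands look right for your layout, they can be run by hand in that order.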

Rafit Izhak_Ratzin wrote:

Hi,
In which part of the mapred job is the parsing done: the Map part or the Reduce part?

Thanks,
Rafit

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
