There's a straight-forward tutorial on writing a plugin to add custom
fields, albeit based on v0.9, at
http://sujitpal.blogspot.com/2009/07/nutch-custom-plugin-to-parse-and-add.html

More info on plugins at:
http://wiki.apache.org/nutch/AboutPlugins
http://wiki.apache.org/nutch/WritingPluginExample-1.2

Thank you,

Mo

On Wed, Mar 18, 2015 at 10:25 AM, julien <[email protected]> wrote:

> Hello,
>
> After a crawl nutch : I would like recover all status urls.
> Do you know how I can retrieve the status of urls? Code 200, 404, 503 ...?
>
> Thank
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/How-to-get-the-status-page-after-crawl-tp4193761.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>

Reply via email to