[ https://issues.apache.org/jira/browse/NUTCH-359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrzej Bialecki closed NUTCH-359.
-----------------------------------

       Resolution: Fixed
    Fix Version/s: 1.0.0

> extraction of links will fail for whole page if one single link cannot be 
> parsed
> --------------------------------------------------------------------------------
>
>                 Key: NUTCH-359
>                 URL: https://issues.apache.org/jira/browse/NUTCH-359
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 0.8
>         Environment: Ubuntu Dapper
>            Reporter: Renaud Richardet
>            Priority: Minor
>             Fix For: 1.0.0
>
>         Attachments: outlink.diff
>
>
> When Nutch parses the outlinks of a fetched page, the process will fail if a 
> single link cannot be parsed (e.g. java.net.MalformedURLException: unknown 
> protocol). The attached patch will keep indexing the remaining links on that 
> page even if one fails.
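
The behaviour described above can be illustrated with a minimal sketch. This is not the attached outlink.diff and does not use Nutch's own classes. The class name OutlinkSketch and the method parseOutlinks are hypothetical, and plain java.net.URL stands in for Nutch's outlink handling. The idea is only that each link gets its own try/catch, so one malformed link is skipped instead of aborting the whole page.

```java
import java.net.MalformedURLException;
import java.net.URL;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not the actual Nutch patch.
public class OutlinkSketch {

    // Parses each raw link independently. A link that cannot be parsed
    // is logged and skipped, and the remaining links are still collected.
    static List<URL> parseOutlinks(List<String> rawLinks) {
        List<URL> outlinks = new ArrayList<>();
        for (String raw : rawLinks) {
            try {
                outlinks.add(new URL(raw));
            } catch (MalformedURLException e) {
                // e.g. "unknown protocol: foo"; skip this link only
                System.err.println("Skipping unparsable link: " + raw
                        + " (" + e.getMessage() + ")");
            }
        }
        return outlinks;
    }

    public static void main(String[] args) {
        List<String> raw = Arrays.asList(
                "http://example.com/a",
                "foo://bad/link",        // unknown protocol, will be skipped
                "http://example.com/b");
        // Two of the three links parse successfully.
        System.out.println(parseOutlinks(raw).size());
    }
}
```

Without the per-link try/catch, the first MalformedURLException would propagate out of the loop and no outlinks from the page would be kept, which is the failure the issue reports.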

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
