[ https://issues.apache.org/jira/browse/NUTCH-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tejas Patil resolved NUTCH-1039.
--------------------------------

    Resolution: Cannot Reproduce

I feel that this item won't make any progress unless we get a real URL on 
which this can be reproduced (that would also indicate whether or not it 
really got fixed by some check-in). Marking it as "cannot reproduce" for now. 
If anyone faces it, please re-open it so that we can work on it.
                
> Fetcher fails for pages without content-length header
> -----------------------------------------------------
>
>                 Key: NUTCH-1039
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1039
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.4
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.7
>
>
> Fetcher fails:
> 2011-07-11 14:45:34,764 ERROR http.Http - org.apache.nutch.protocol.http.api.HttpException: bad content length:
> 2011-07-11 14:45:34,765 ERROR http.Http - at org.apache.nutch.protocol.http.HttpResponse.readPlainContent(HttpResponse.java:218)
> 2011-07-11 14:45:34,765 ERROR http.Http - at org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:158)
> 2011-07-11 14:45:34,765 ERROR http.Http - at org.apache.nutch.protocol.http.Http.getResponse(Http.java:64)
> 2011-07-11 14:45:34,765 ERROR http.Http - at org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:138)
> 2011-07-11 14:45:34,765 ERROR http.Http - at org.apache.nutch.parse.ParserChecker.main(ParserChecker.java:79)
> Both the fetcher and the indexing filter checker fail sometimes. I'm unsure 
> whether this is something in Nutch or whether the remote server only 
> returns content-length intermittently.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
