Hi,

On 7/25/07, Carl Cerecke <[EMAIL PROTECTED]> wrote:
> Hi,
>
> Using nutch 0.9, although I get the same with a more recent nightly build.
>
> I'm getting NPE fetching these two pages:
>
> http://www.absoluteit.co.nz
> and
> http://defence.allmedia.co.nz
>
> I've tracked it down by putting a t.printStackTrace() in the catch
> (Throwable t) of the run() in Fetcher.java:
> java.lang.NullPointerException
>          at org.apache.hadoop.io.Text.encode(Text.java:375)
>          at org.apache.hadoop.io.Text.encode(Text.java:356)
>          at org.apache.hadoop.io.Text.writeString(Text.java:396)
>          at
> org.apache.nutch.protocol.Content.writeCompressed(Content.java:146)
>          at
> org.apache.hadoop.io.CompressedWritable.write(CompressedWritable.java:74)
>          at
> org.apache.nutch.fetcher.FetcherOutput.write(FetcherOutput.java:56)
>          at
> org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:315)
>          at
> org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:343)
>          at
> org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:191)
>
> I'm not sure where to go from here. Any suggestions?

Can you retry with the latest trunk?  Not that I think it will solve
your problem but Content.java has changed recently so I am not sure
what was in line 146. So, if problem reoccurs with latest trunk I can
check exactly which line is failing. Alternatively, you can send that
part of Content.java's code.

>
> Cheers,
> Carl.
>


-- 
Doğacan Güney
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >>  http://get.splunk.com/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to