Hi, On 7/25/07, Carl Cerecke <[EMAIL PROTECTED]> wrote: > Hi, > > Using nutch 0.9, although I get the same with a more recent nightly build. > > I'm getting NPE fetching these two pages: > > http://www.absoluteit.co.nz > and > http://defence.allmedia.co.nz > > I've tracked it down by putting a t.printStackTrace() in the catch > (Throwable t) of the run() in Fetcher.java: > java.lang.NullPointerException > at org.apache.hadoop.io.Text.encode(Text.java:375) > at org.apache.hadoop.io.Text.encode(Text.java:356) > at org.apache.hadoop.io.Text.writeString(Text.java:396) > at > org.apache.nutch.protocol.Content.writeCompressed(Content.java:146) > at > org.apache.hadoop.io.CompressedWritable.write(CompressedWritable.java:74) > at > org.apache.nutch.fetcher.FetcherOutput.write(FetcherOutput.java:56) > at > org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:315) > at > org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:343) > at > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:191) > > I'm not sure where to go from here. Any suggestions?
Can you retry with the latest trunk? Not that I think it will solve your problem but Content.java has changed recently so I am not sure what was in line 146. So, if problem reoccurs with latest trunk I can check exactly which line is failing. Alternatively, you can send that part of Content.java's code. > > Cheers, > Carl. > -- Doğacan Güney ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
