Is anybody else getting NullPointerExceptions fetching either of these two sites (Nutch 0.9 and latest from trunk)?

http://www.absoluteit.co.nz
http://defence.allmedia.co.nz

I am, but I would be grateful if someone else could test whether they work, so that I can rule out Nutch configuration issues.

Cheers,
Carl.

Carl Cerecke wrote:
> Hi,
>
> Using nutch 0.9, although I get the same with a more recent nightly build.
>
> I'm getting an NPE fetching these two pages:
>
> http://www.absoluteit.co.nz
> and
> http://defence.allmedia.co.nz
>
> I've tracked it down by putting a t.printStackTrace() in the
> catch (Throwable t) of run() in Fetcher.java:
>
> java.lang.NullPointerException
>     at org.apache.hadoop.io.Text.encode(Text.java:375)
>     at org.apache.hadoop.io.Text.encode(Text.java:356)
>     at org.apache.hadoop.io.Text.writeString(Text.java:396)
>     at org.apache.nutch.protocol.Content.writeCompressed(Content.java:146)
>     at org.apache.hadoop.io.CompressedWritable.write(CompressedWritable.java:74)
>     at org.apache.nutch.fetcher.FetcherOutput.write(FetcherOutput.java:56)
>     at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:315)
>     at org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:343)
>     at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:191)
>
> I'm not sure where to go from here. Any suggestions?
>
> Cheers,
> Carl.
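One reading of the trace, offered as a guess rather than a diagnosis: the top frames (Text.writeString calling Text.encode) are what you would expect if a null String were handed to the writer from Content.writeCompressed. The trace does not say which field is null, or whether that is really the cause. The sketch below only illustrates the general failure mode and a null guard. It does not use Hadoop or Nutch at all: writeString here is a hypothetical stand-in, and orEmpty, throwsNpe and the contentType variable are made-up names for illustration.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;

public class NullStringWriteSketch {

    // Hypothetical stand-in for a length-prefixed string writer.
    // This is NOT Hadoop's Text.writeString; it only fails in a similar way.
    public static void writeString(DataOutput out, String s) throws IOException {
        byte[] bytes = s.getBytes(StandardCharsets.UTF_8); // NullPointerException if s == null
        out.writeInt(bytes.length);
        out.write(bytes);
    }

    // Guard: substitute an empty string for a missing value before serializing.
    public static String orEmpty(String s) {
        return s == null ? "" : s;
    }

    // Returns true if writing the given value fails with a NullPointerException.
    public static boolean throwsNpe(String s) {
        try {
            writeString(new DataOutputStream(new ByteArrayOutputStream()), s);
            return false;
        } catch (NullPointerException e) {
            return true;
        } catch (IOException e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        // Assumption for illustration: a String field that was never set.
        String contentType = null;
        System.out.println("unguarded write throws NPE: " + throwsNpe(contentType));
        System.out.println("guarded write throws NPE:   " + throwsNpe(orEmpty(contentType)));
    }
}
```

If the cause is along these lines, logging each String field of the fetched content just before it is written for one of the two URLs should show which one is null.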
