[ https://issues.apache.org/jira/browse/NUTCH-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13801603#comment-13801603 ]
Alfonso Nishikawa commented on NUTCH-1477: ------------------------------------------ [~alexmc] : Nutch Persistent classes have been changed by hand. You can find the missing methods at http://svn.apache.org/repos/asf/nutch/tags/release-2.2.1/src/java/org/apache/nutch/storage You must add methods Host.getInt(,), Host.getLong(,) and ProtocolStatus.isSuccess() to the classes resulting from compiling. > NPE when injecting with DataFileAvroStore > ----------------------------------------- > > Key: NUTCH-1477 > URL: https://issues.apache.org/jira/browse/NUTCH-1477 > Project: Nutch > Issue Type: Bug > Components: storage > Affects Versions: 2.1 > Environment: Java 1.6.0_35 > Reporter: Mike Baranczak > Assignee: Julien Nioche > Priority: Critical > Fix For: 2.4 > > Attachments: gora-core-0.2.1.jar, NUTCH-1477.patch, webpage.avsc, > webpage.avsc, webpage.avsc, webpage.avsc > > > Fresh installation of Nutch 2.1, configured to use DataFileAvroStore. > Injection job throws NullPointerException, see below. No error when I switch > to MemStore. > java.lang.NullPointerException > at org.apache.avro.io.BinaryEncoder.writeString(BinaryEncoder.java:133) > at > org.apache.avro.generic.GenericDatumWriter.writeString(GenericDatumWriter.java:176) > at > org.apache.avro.generic.GenericDatumWriter.writeString(GenericDatumWriter.java:171) > at > org.apache.avro.generic.GenericDatumWriter.write(GenericDatumWriter.java:72) > at > org.apache.avro.generic.GenericDatumWriter.writeRecord(GenericDatumWriter.java:89) > at > org.apache.avro.generic.GenericDatumWriter.write(GenericDatumWriter.java:62) > at > org.apache.avro.generic.GenericDatumWriter.write(GenericDatumWriter.java:55) > at org.apache.avro.file.DataFileWriter.append(DataFileWriter.java:245) > at > org.apache.gora.avro.store.DataFileAvroStore.put(DataFileAvroStore.java:54) > at > org.apache.gora.mapreduce.GoraRecordWriter.write(GoraRecordWriter.java:60) > at > org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.write(MapTask.java:639) > at > org.apache.hadoop.mapreduce.TaskInputOutputContext.write(TaskInputOutputContext.java:80) > at > org.apache.nutch.crawl.InjectorJob$UrlMapper.map(InjectorJob.java:185) > at org.apache.nutch.crawl.InjectorJob$UrlMapper.map(InjectorJob.java:85) > at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144) > at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) > at > org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212) -- This message was sent by Atlassian JIRA (v6.1#6144)