Ticket created as https://issues.apache.org/jira/browse/MAHOUT-265. I noticed that I don't get this error with revision 897299, however it shows that it wrote 0 vectors when converting vectors from both the Solr index and the arff converter.
- jerry On 1/20/10 9:22 AM, "Grant Ingersoll" <[email protected]> wrote: Looks like a bug, can you open a JIRA ticket? Note, the ARFF support may not be complete as of yet, although I wonder if this is an issue with the upgrade to a newer version of Hadoop. I'll try to take a look in the next few days. -Grant On Jan 19, 2010, at 4:16 PM, Jerry Ye wrote: > Hi, > I've been trying to convert a simple arff file and I'm getting the following > error: > > -bash-3.1$ java -cp > mahout-core-0.3-SNAPSHOT.jar:mahout-utils-0.3-SNAPSHOT.jar:$(echo > dependency/*.jar . | sed 's/ /:/g') > org.apache.mahout.utils.vectors.arff.Driver -d vehicle.arff -o iris -t > iris/dict.txt > Jan 19, 2010 8:58:36 PM org.slf4j.impl.JCLLoggerAdapter info > INFO: Output Dir: iris > Jan 19, 2010 8:58:36 PM org.slf4j.impl.JCLLoggerAdapter info > INFO: Converting File: vehicle.arff > outfile: iris/vehicle.arff.mvc > Exception in thread "main" java.lang.NullPointerException > at > org.apache.hadoop.io.serializer.SerializationFactory.getSerializer(SerializationFactory.java:73) > at org.apache.hadoop.io.SequenceFile$Writer.init(SequenceFile.java:910) > at > org.apache.hadoop.io.SequenceFile$RecordCompressWriter.<init>(SequenceFile.java:1074) > at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:397) > at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:284) > at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:265) > at > org.apache.mahout.utils.vectors.arff.Driver.getSeqFileWriter(Driver.java:180) > at org.apache.mahout.utils.vectors.arff.Driver.writeFile(Driver.java:167) > at org.apache.mahout.utils.vectors.arff.Driver.main(Driver.java:132) > > My data is: > > @relation iris > > @attribute f1 numeric > @attribute f2 numeric > @attribute f3 numeric > @attribute f4 numeric > > @data > 5.1,3.5,1.4,0.2 > 4.9,3.0,1.4,0.2 > 4.7,3.2,1.3,0.2 > 4.6,3.1,1.5,0.2 > 5.0,3.6,1.4,0.2 > 5.4,3.9,1.7,0.4 > 4.6,3.4,1.4,0.3 > 5.0,3.4,1.5,0.2 > 4.4,2.9,1.4,0.2 > 4.9,3.1,1.5,0.1 > > > Any guidance? Thanks. > > - jerry
