Hi devs,
I am in the middle of a war here trying to figure out how to use
GoraInputFormat from outside GoraLand (i.e. without GoraMapper and friends).
I am getting this exception when using HBase as the backend:
2013-08-31 11:13:36,090 ERROR org.apache.giraph.utils.LogStacktraceCallable: Execution of callable failed
java.lang.NullPointerException
    at org.apache.hadoop.io.serializer.SerializationFactory.getSerializer(SerializationFactory.java:73)
    at org.apache.gora.util.IOUtils.serialize(IOUtils.java:116)
    at org.apache.gora.query.impl.QueryBase.write(QueryBase.java:252)
    at org.apache.gora.query.impl.PartitionQueryImpl.write(PartitionQueryImpl.java:133)
    at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:90)
    at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:77)
    at org.apache.gora.util.IOUtils.serialize(IOUtils.java:121)
    at org.apache.gora.util.IOUtils.serialize(IOUtils.java:150)
    at org.apache.gora.mapreduce.GoraInputSplit.write(GoraInputSplit.java:84)
    at org.apache.giraph.io.GiraphInputFormat.writeInputSplit(GiraphInputFormat.java:73)
    at org.apache.giraph.io.internal.WrappedVertexInputFormat.writeInputSplit(WrappedVertexInputFormat.java:88)
    at org.apache.giraph.master.BspServiceMaster$WriteInputSplit.call(BspServiceMaster.java:2029)
    at org.apache.giraph.master.BspServiceMaster$WriteInputSplit.call(BspServiceMaster.java:1969)
    at org.apache.giraph.utils.LogStacktraceCallable.call(LogStacktraceCallable.java:51)
    at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
    at java.util.concurrent.FutureTask.run(FutureTask.java:138)
    at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
    at java.lang.Thread.run(Thread.java:680)
It seems like we are not persisting the partitioned query properly, but I am
not sure this is really the problem, since we are able to successfully run
other MapReduce jobs, e.g. QueryCounter, Nutch's jobs, and others.
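One possible reading of the top frame, purely as an assumption on my side: a
serializer lookup can end in a NullPointerException when the factory finds no
registered serialization that accepts the class and the caller then
dereferences the null result. The stand-alone sketch below (plain Java, no
Hadoop or Gora on the classpath) only illustrates that pattern. Every name in
it (SerializerLookupSketch, TypeSerialization, Factory, describeLookup) is made
up for illustration and is not the real Hadoop or Gora code.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-ins only; none of these are the real Hadoop or Gora classes.
class SerializerLookupSketch {

    /** Stand-in for a serialization that handles one family of classes. */
    static class TypeSerialization {
        private final Class<?> type;

        TypeSerialization(Class<?> type) {
            this.type = type;
        }

        boolean accept(Class<?> c) {
            return type.isAssignableFrom(c);
        }

        String serializerName() {
            return type.getSimpleName() + "Serializer";
        }
    }

    /** Stand-in for a factory holding whatever serializations were configured. */
    static class Factory {
        private final List<TypeSerialization> registered = new ArrayList<>();

        Factory register(Class<?> type) {
            registered.add(new TypeSerialization(type));
            return this;
        }

        /** First registered serialization that accepts c, or null if none does. */
        TypeSerialization getSerialization(Class<?> c) {
            for (TypeSerialization s : registered) {
                if (s.accept(c)) {
                    return s;
                }
            }
            return null;
        }

        /** Uses the lookup result without a null check, so a miss throws an NPE. */
        String getSerializerName(Class<?> c) {
            return getSerialization(c).serializerName();
        }
    }

    /** Returns the serializer name, or "NullPointerException" if the lookup throws one. */
    static String describeLookup(Factory factory, Class<?> c) {
        try {
            return factory.getSerializerName(c);
        } catch (NullPointerException e) {
            return "NullPointerException";
        }
    }

    public static void main(String[] args) {
        // Only Number-like types are registered, mimicking an incomplete configuration.
        Factory incomplete = new Factory().register(Number.class);
        System.out.println(describeLookup(incomplete, Integer.class));
        System.out.println(describeLookup(incomplete, String.class));

        // Registering a serialization for String makes the same lookup succeed.
        Factory complete = new Factory().register(Number.class).register(String.class);
        System.out.println(describeLookup(complete, String.class));
    }
}
```

If that assumption holds, the thing to check would be which serializations
(the io.serializations property) are listed in the Configuration that reaches
GoraInputSplit.write, since the jobs that go through GoraMapper work fine.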
Any ideas, pointers, or ANYTHING would be highly appreciated (:
Thanks in advance.
Renato M.