Re: Input format problems running Giraph 1.1.0 on Twitter dataset

2015-04-25 Thread Kenrick Fernandes
Hi Roman, Thanks for the quick response. There is no vertex data in this dataset, though, and the vertex IDs posted above would fit in a Long. Would you advise changing the PageRankComputation formats, or working on a new input format? Thanks, Kenrick On Sat, Apr 25, 2015 at 7:40 PM, Roman Shapo

Re: Input format problems running Giraph 1.1.0 on Twitter dataset

2015-04-25 Thread Roman Shaposhnik
One of the slightly annoying things in Giraph is that you have to manually match your input format to your computation. In your case, PageRankComputation requires LongWritable for the vertex ID and DoubleWritable for the vertex data. You may need to hack one of the existing formats slightly. Thanks, Roma
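As an alternative to hacking an input format, the edge list could be preprocessed into a layout that an existing format already reads. The sketch below is only an illustration under several assumptions: that each input line is a "source target" pair, that the target is Giraph's JsonLongDoubleFloatDoubleVertexInputFormat (which, as far as I know, expects one JSON array per line of the form [vertex_id, vertex_value, [[target_id, edge_value], ...]]), and that a default value of 1.0 is acceptable for the missing vertex and edge data. The function name edges_to_json_lines is hypothetical. It groups edges in memory, so it would not scale to the full Twitter dataset as written.

```python
import json
from collections import defaultdict


def edges_to_json_lines(edge_lines, vertex_value=1.0, edge_value=1.0):
    """Turn 'source target' edge pairs into JSON adjacency lines.

    Assumption: the first ID on each line is the source vertex and the
    second is the target. The default values stand in for the vertex and
    edge data that the dataset does not provide.
    """
    adjacency = defaultdict(list)
    for line in edge_lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        source, target = int(parts[0]), int(parts[1])
        adjacency[source].append(target)
        # Make sure vertices that only appear as targets get a line too.
        adjacency.setdefault(target, [])

    lines = []
    for vertex in sorted(adjacency):
        out_edges = [[target, edge_value] for target in adjacency[vertex]]
        lines.append(json.dumps([vertex, vertex_value, out_edges]))
    return lines


if __name__ == "__main__":
    sample = ["61578010 61147436", "61578037 61147436", "61578040 61147436"]
    for json_line in edges_to_json_lines(sample):
        print(json_line)
```

Each printed line describes one vertex with its outgoing edges. Whether the edge direction and default values suit PageRank on this dataset is something to check against the dataset's own documentation.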

Input format problems running Giraph 1.1.0 on Twitter dataset

2015-04-25 Thread Kenrick Fernandes
Hello, I'm trying to get Giraph to read the Twitter dataset as input for the SimplePageRankComputation program. The dataset format looks like this: 61578010 61147436 61578037 61147436 61578040 61147436 (vertex IDs, with pairs representing edges) When I run the command with *-vif org.apache.giraph