So I have to recompile the pagerank example?
Can I pass it as a parameter to the existing jar?

2012/9/19 Thomas Jungblut <[email protected]>

> Hey,
>
> if you read closely:
>
> http://wiki.apache.org/hama/WriteHamaGraphFile#Google_Web_dataset_.28local_mode.2C_pseudo_distributed_cluser.29
>
> You find that there is a property called "hama.graph.repair":
>
>     // hama takes care that the graph is complete
>     pageJob.set("hama.graph.repair", "true");
>
> This basically sends messages along the known edges and adds vertices
> if there aren't any on the "other side".
>
> If this isn't scalable enough for you, then a preprocessing MapReduce job
> is fine: in the mapper you emit the vertex id as key with its complete
> edge list as value, and also each edge target as key with an empty value.
> In the reducer you then get, per vertex, either one or more complete
> lines or only empty values.
> If you get only empty values, you know that this vertex wasn't included
> in the dataset as its own line, and you can repair it by emitting it in
> the reducer as a single line.
>
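
For illustration, the preprocessing idea described above could be sketched
roughly as follows. This is plain Java that simulates the map and reduce
steps in memory; it is not Hama or Hadoop code. The class and method names
are hypothetical, and the input format (one vertex per line, vertex id
followed by tab-separated edge targets) is an assumption. In a real job the
two methods would roughly correspond to a Mapper and a Reducer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical in-memory sketch of the repair job; names are illustrative only.
final class GraphRepairSketch {

    // "Map" step: for each input line emit (vertexId, completeLine) and,
    // for every edge target on that line, (targetId, "").
    // The TreeMap stands in for the shuffle/group-by-key phase.
    static Map<String, List<String>> mapPhase(List<String> lines) {
        Map<String, List<String>> grouped = new TreeMap<>();
        for (String line : lines) {
            if (line.trim().isEmpty()) {
                continue; // skip blank lines
            }
            String[] parts = line.split("\t"); // assumed tab-separated format
            grouped.computeIfAbsent(parts[0], k -> new ArrayList<>()).add(line);
            for (int i = 1; i < parts.length; i++) {
                if (!parts[i].isEmpty()) {
                    grouped.computeIfAbsent(parts[i], k -> new ArrayList<>()).add("");
                }
            }
        }
        return grouped;
    }

    // "Reduce" step: pass complete lines through unchanged; if a key only
    // received empty values, the vertex has no line of its own, so emit
    // the bare vertex id as a single line.
    static List<String> reducePhase(Map<String, List<String>> grouped) {
        List<String> out = new ArrayList<>();
        for (Map.Entry<String, List<String>> entry : grouped.entrySet()) {
            boolean sawCompleteLine = false;
            for (String value : entry.getValue()) {
                if (!value.isEmpty()) {
                    out.add(value);
                    sawCompleteLine = true;
                }
            }
            if (!sawCompleteLine) {
                out.add(entry.getKey());
            }
        }
        return out;
    }

    // Convenience wrapper: run both steps on a list of input lines.
    static List<String> repair(List<String> lines) {
        return reducePhase(mapPhase(lines));
    }
}
```

Calling `GraphRepairSketch.repair(...)` on lines where 111067 only appears
as an edge target would add a line containing just `111067`, so every
referenced vertex ends up with its own entry.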
>
> 2012/9/19 Sandy Ding <[email protected]>:
> > Hi, guys,
> >
> > The web-google dataset seems to be missing some key sites; for example,
> > there is no entry starting with 111067.
> > This leads to a weird NullPointerException. How do you fix this?
> >
> > Cheers,
> > Sandy
>
