Re: configuration needed to run twitter(25GB) dataset

2014-08-01 Thread shijiaxin
When I use fewer partitions (for example, 6), it seems that all the tasks are assigned to the same machine, because that machine has more than 6 cores. But then that machine runs out of memory. How can I set a smaller number of partitions and still use all the machines at the same time?

VertexPartition and ShippableVertexPartition

2014-07-28 Thread shijiaxin
There is a VertexPartition in the EdgePartition, which is created by EdgePartitionBuilder.toEdgePartition, and there is also a ShippableVertexPartition in the VertexRDD. These two partitions have a lot in common, such as the index, data, and BitSet. Why are both necessary?

Where is the PowerGraph abstraction

2014-07-22 Thread shijiaxin
I downloaded Spark 1.0.1, but I cannot find the PowerGraph abstraction mentioned in the GraphX paper. All I can find is the Pregel abstraction. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Where-is-the-PowerGraph-abstraction-tp10457.html