Hi,,

I have  large dataset of elemenst[RDD] and i want to divide it into two
exactly equal sized partitions maintaining order of elements.I tried using
RangePartitioner like  var data= partitionedFile.partitionBy(new
RangePartitioner(2, partitionedFile)).
This doesnt give satisfactory results becoz it divides roughly but not
exactly equal sized maintaining order of elements..
for example
if their are 64 elements ..we use
Rangepartitioner.>>>then it divides in to 31 elements and 33 elements..

I need partitioner such that i get exactly frirst 32 elements in one half
and other half contains second set of 32 elements..
Guys could anyone hlep me by suggestiing how to use customised partitioner
such that I get equally sized two halves...maintaing the order of elements..

Please help me...




--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Regarding-Partitioner-tp4356.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

Reply via email to