Shuffle produces one huge partition

2015-06-16 Thread Al M
this? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Shuffle-produces-one-huge-partition-tp23358.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscri

Re: Shuffle produces one huge partition

2015-06-17 Thread Akhil Das
simpler way to solve this? > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Shuffle-produces-one-huge-partition-tp23358.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > -

Re: Shuffle produces one huge partition and many tiny partitions

2015-06-18 Thread Al M
at are known to appear very often are assigned random partitions instead of using the existing partitioning mechanism. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Shuffle-produces-one-huge-partition-and-many-tiny-partitions-tp23358p23387.html Sent from t

Re: Shuffle produces one huge partition and many tiny partitions

2015-06-18 Thread Du Li
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Shuffle-produces-one-huge-partition-and-many-tiny-partitions-tp23358p23387.html Sent from the Apache Spark User List mailing list archive at Nabble.com. -

Re: Shuffle produces one huge partition and many tiny partitions

2015-06-18 Thread Corey Nolet
eats certain "exception" keys differently. > These keys that are known to appear very often are assigned random > partitions instead of using the existing partitioning mechanism. > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3

Re: Shuffle produces one huge partition and many tiny partitions

2015-06-18 Thread Du Li
itioner that extends HashPartitioner.  It treats certain "exception" keys differently. These keys that are known to appear very often are assigned random partitions instead of using the existing partitioning mechanism. -- View this message in context: http://apache-spark-user-list.1001560.n3.n

Re: Shuffle produces one huge partition and many tiny partitions

2015-06-18 Thread Corey Nolet
n "exception" keys differently. > These keys that are known to appear very often are assigned random > partitions instead of using the existing partitioning mechanism. > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Shuffl