Hi there,
I'm interested if it is possible to get the same behavior as for reduce
function from MR framework. I mean for each key K get list of associated
values ListV.
There is function reduceByKey that works only with separate V from list. Is
it exist any way to get list? Because I have to
You may use groupByKey in this case.
On Aug 7, 2014, at 9:18 PM, Konstantin Kudryavtsev
kudryavtsev.konstan...@gmail.com wrote:
Hi there,
I'm interested if it is possible to get the same behavior as for reduce
function from MR framework. I mean for each key K get list of associated
] to RDD[ String, Seq[String] ]
then, reduceByKey(_ ++ _) on this RDD[ String, Seq[String] ]
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/reduceByKey-to-get-all-associated-values-tp11645p11652.html
Sent from the Apache Spark User List mailing list archive
] to RDD[ String, Seq[String] ]
then, reduceByKey(_ ++ _) on this RDD[ String, Seq[String] ]
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/reduceByKey-to-get-all-associated-values-tp11645p11652.html
Sent from the Apache Spark User List mailing list archive
] ],
we can also do it with reduceByKey:
at first, map RDD[ String, String ] to RDD[ String, Seq[String] ]
then, reduceByKey(_ ++ _) on this RDD[ String, Seq[String] ]
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/reduceByKey-to-get-all-associated