Re: Alternatives to groupByKey

2014-12-03 Thread Xuefeng Wu
Looks good. I am concerned about foldLeftByKey, which seems to break consistency with foldLeft in RDD and aggregateByKey in PairRDD. Yours, Xuefeng Wu 吴雪峰 > On December 4, 2014, at 7:47 AM, Koert Kuipers wrote: > > foldLeftByKey ---
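
For readers following the naming concern, here is a minimal sketch in plain Python (not Spark code) contrasting the two shapes being compared. `aggregate_by_key` mimics the two-function form of `aggregateByKey` (fold within each partition, then merge the partial results). `fold_left_by_key` is only an assumption about what the proposed `foldLeftByKey` might do, since its actual signature does not appear in this excerpt. The function names, the list-of-lists stand-in for partitions, and the sample data are all hypothetical.

```python
def aggregate_by_key(partitions, zero, seq_op, comb_op):
    """Mimic the shape of aggregateByKey: per-partition fold, then merge."""
    partials = []
    for part in partitions:
        acc = {}
        for key, value in part:
            # seq_op folds one value into the partition-local accumulator
            acc[key] = seq_op(acc.get(key, zero), value)
        partials.append(acc)

    merged = {}
    for acc in partials:
        for key, partial in acc.items():
            # comb_op merges accumulators coming from different partitions
            merged[key] = comb_op(merged[key], partial) if key in merged else partial
    return merged


def fold_left_by_key(partitions, zero, op):
    """Hypothetical per-key left fold: a single function, values visited in order."""
    result = {}
    for part in partitions:
        for key, value in part:
            result[key] = op(result.get(key, zero), value)
    return result


# Two toy "partitions" of (key, value) pairs; assumed data for illustration only.
partitions = [[("a", 1), ("b", 2)], [("a", 3), ("a", 4)]]

sums = aggregate_by_key(partitions, 0, lambda acc, v: acc + v, lambda x, y: x + y)
folded = fold_left_by_key(partitions, 0, lambda acc, v: acc + v)
```

For an associative operation such as addition both forms give the same per-key result. The difference is that the single-function fold has no merge step, so it cannot combine partial results computed independently on separate partitions.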

Re: Alternatives to groupByKey

2014-12-03 Thread Koert Kuipers
3. I can't use reduceByKey() or aggregateByKey() as some of my >> aggregations >> need to have a view of the entire window. >> >> Only other way I could think of is partitioning my RDDs into multiple RDDs >> with each RDD representing a window. Is th

Re: Alternatives to groupByKey

2014-12-03 Thread Xuefeng Wu
the RDD functionality at this point instead of the >> iterables. >> 3. I can't use reduceByKey() or aggregateByKey() as some of my aggregations >> need to have a view of the entire window. >> >> Only other way I could think of is partitioning my RDDs into multip

Re: Alternatives to groupByKey

2014-12-03 Thread Nathan Kronenfeld
re some of my > aggregations > need to have a view of the entire window. > > Only other way I could think of is partitioning my RDDs into multiple RDDs > with each RDD representing a window. Is this a sensible approach? Or is > there any other way of going about this? > > >

Alternatives to groupByKey

2014-12-03 Thread ameyc
message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Alternatives-to-groupByKey-tp20293.html