Impact, You can group by the data and then sort it by timestamp and take max to select the oldest value. On Aug 21, 2015 11:15 PM, "Impact" <nat...@skone.org> wrote:
> I am also looking for a way to achieve the reducebykey functionality on > data > frames. In my case I need to select one particular row (the oldest, based > on > a timestamp column value) by key. > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Aggregate-to-array-or-slice-by-key-with-DataFrames-tp23636p24399.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > >