Hi,
I would like to answer the following customized aggregation query on Spark SQL:
1. Group the table by the value of Name.
2. For each group, choose the tuple with the max value of Age (the ages are
distinct for every name).
I am wondering what's the best way to do this in Spark SQL. Should I use a UDAF?
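For what it's worth, the two steps above can probably be written without a UDAF, as a plain SQL join against a per-group MAX subquery. The sketch below is only an illustration under stated assumptions: the table name `people`, the extra `City` column, and the sample rows are all made up, and Python's built-in `sqlite3` is used purely as a local stand-in engine so the query can be run anywhere. The same SQL string should be usable with `sqlContext.sql(...)` once the table is registered in Spark, but it has not been run against Spark here.

```python
import sqlite3

# Hypothetical sample data: the table name `people` and the `City` column
# are assumptions for illustration, not taken from the original question.
rows = [
    ("alice", 30, "NYC"),
    ("alice", 25, "LA"),
    ("bob", 41, "SF"),
    ("bob", 19, "NYC"),
]

# Step 1: the subquery groups by Name and finds the max Age per group.
# Step 2: joining back on (Name, Age) recovers the full tuple.
# Because ages are distinct within a name, each group yields one row.
QUERY = """
SELECT p.Name, p.Age, p.City
FROM people p
JOIN (SELECT Name, MAX(Age) AS MaxAge FROM people GROUP BY Name) m
  ON p.Name = m.Name AND p.Age = m.MaxAge
ORDER BY p.Name
"""


def oldest_per_name(data):
    """Return the max-Age tuple for each Name, sorted by Name."""
    # sqlite3 is only a local stand-in for a SQL engine here.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE people (Name TEXT, Age INTEGER, City TEXT)")
    conn.executemany("INSERT INTO people VALUES (?, ?, ?)", data)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


print(oldest_per_name(rows))
```

The join keeps every column of the winning row, which a bare `GROUP BY Name` with `MAX(Age)` would not. If ages could tie within a name, this query would return all tied rows rather than one.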
On Sat, Apr 25, 2015 at 2:32 PM, Wenlei Xie wenlei@gmail.com wrote:
> Hi,
> I would like to answer the following customized aggregation query on
> Spark SQL
> 1. Group the table by the value of Name
> 2. For each group, choose the tuple with the max value of Age (the ages
> are distinct for every name)
Can you give an example set of data and the desired output?
I am wondering what's the best way to do this in Spark SQL. Should I use a
UDAF? Previously I was doing something like the following