Re: Aggregated column name

2017-03-23 Thread Wen Pei Yu
expr) Yu Wenpei. From: Kevin Mellott <kevin.r.mell...@gmail.com> To: Wen Pei Yu <yuw...@cn.ibm.com> Cc: user <user@spark.apache.org> Date: 03/24/2017 09:48 AM Subject:Re: Aggregated column name I'm not sure of the answer to your question; however, when

Re: Aggregated column name

2017-03-23 Thread Kevin Mellott
I'm not sure of the answer to your question; however, when performing aggregates I find it useful to specify an *alias* for each column. That will give you explicit control over the name of the resulting column. In your example, that would look something like:

Aggregated column name

2017-03-23 Thread Wen Pei Yu
Hi All   I found some spark version(spark 1.4) return upper case aggregated column,  and some return low case. As below code, df.groupby(col("...")).agg(count("number"))  may return   COUNT(number)  -- spark 1,4 count(number) - spark 1.6   Anyone know if there is configure parameter for