Re: Spark Aggregator for array of doubles

2017-01-04 Thread Anton Okolnychyi
Hi, take a look at this pull request that is not merged yet: https://github.com/apache/spark/pull/16329 . It contains examples in Java and Scala that can be helpful. Best regards, Anton Okolnychyi On Jan 4, 2017 23:23, "Anil Langote" wrote: > Hi All, > > I have been

Spark Aggregator for array of doubles

2017-01-04 Thread Anil Langote
Hi All, I have been working on a use case where I have a DF which has 25 columns, 24 columns are of type string and last column is array of doubles. For a given set of columns I have to apply group by and add the array of doubles, I have implemented UDAF which works fine but it's expensive in