Re: pivot over non numerical data

2017-02-02 Thread Darshan Pandya
Thanks Kevin, that worked like a charm. FYI for readers, this is what worked:

    val temp1 = temp.groupBy("reference_id").pivot("char_name").agg(max($"char_value"))

I didn't know I could use 'agg' with max on a string column. I was using it incorrectly, as below:

    temp.groupBy("reference_id").pivot("char_name").max("char_value")

On Wed,
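For readers who want to try the difference between the two calls, here is a minimal sketch. It assumes Spark 2.x running locally; the sample rows, the object name PivotTextSketch, and the helper pivotText are made up for illustration. Only the column names and the two pivot calls come from the messages in this thread. The max(colNames) shortcut on grouped data is documented as applying to numeric columns, so the second call is expected to fail on a string column; the sketch wraps it in Try rather than assuming a particular error message.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, max}
import scala.util.Try

object PivotTextSketch {
  // Hypothetical helper: pivot char_name into columns, keeping the max
  // char_value per cell. col("char_value") is equivalent to $"char_value".
  def pivotText(temp: DataFrame): DataFrame =
    temp.groupBy("reference_id").pivot("char_name").agg(max(col("char_value")))

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("pivot-text-sketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Made-up sample data with a string-valued char_value column.
    val temp = Seq(
      (1, "color", "red"),
      (1, "size", "large"),
      (2, "color", "blue")
    ).toDF("reference_id", "char_name", "char_value")

    // The form that worked in this thread: agg with the max function.
    pivotText(temp).show()

    // The numeric-only shortcut from the thread, applied to a string column.
    val attempt = Try(temp.groupBy("reference_id").pivot("char_name").max("char_value"))
    println(s"max(\"char_value\") shortcut failed: ${attempt.isFailure}")

    spark.stop()
  }
}
```

Cells with no matching row (for example, size for reference_id 2 in the sample data) come back as null.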

Re: pivot over non numerical data

2017-02-01 Thread Kevin Mellott
This should work for non-numerical data as well - can you please elaborate on the error you are getting and provide a code sample? As a preliminary hint, you can "aggregate" text values using *max*.

    df.groupBy("someCol")
      .pivot("anotherCol")
      .agg(max($"textCol"))

Thanks, Kevin On Wed, Feb
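To illustrate what "aggregating" text with max means when a cell has more than one value, here is a small sketch in plain Scala collections, with no Spark involved. The rows and the object name StringMaxPivotSketch are hypothetical. It only emulates the groupBy / pivot / max shape of the hint above: for each (someCol, anotherCol) cell, the greatest string in Scala's default String ordering is kept. Spark's own string comparison should agree with this for plain ASCII values, though that is an assumption rather than something checked here.

```scala
// Plain Scala emulation of groupBy + pivot + max over text values.
object StringMaxPivotSketch {
  // Hypothetical rows: (someCol, anotherCol, textCol)
  val rows: Seq[(String, String, String)] = Seq(
    ("r1", "color", "red"),
    ("r1", "color", "blue"), // second value for the same cell
    ("r1", "size", "large"),
    ("r2", "color", "green")
  )

  // Group key -> (pivot key -> greatest text value in that cell).
  val pivoted: Map[String, Map[String, String]] =
    rows.groupBy(_._1).map { case (key, group) =>
      key -> group.groupBy(_._2).map { case (pivotKey, cell) =>
        pivotKey -> cell.map(_._3).max
      }
    }

  def main(args: Array[String]): Unit =
    pivoted.toSeq.sortBy(_._1).foreach(println)
}
```

When every cell holds exactly one value, as in a plain transpose, max simply returns that value; it only makes a real choice when a cell has duplicates.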

pivot over non numerical data

2017-02-01 Thread Darshan Pandya
Hello, I am trying to transpose some data using groupBy, pivot, and an aggregation, as described in this blog post: https://databricks.com/blog/2016/02/09/reshaping-data-with-pivot-in-apache-spark.html But this seems to work only for numerical data. Any hints for doing the same thing with non-numerical data? -- Sincerely,