RE: Need to user univariate summary stats

2016-02-04 Thread Lohith Samaga M
Hi Arun, you can do df.agg(max(...), min(...)). Best regards / Mit freundlichen Grüßen / Sincères salutations M. Lohith Samaga From: Arunkumar Pillai [mailto:arunkumar1...@gmail.com] Sent: Thursday, February 04, 2016 14.53 To: user@spark.apache.org Subject: Need to user univariate
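
For readers of the archive, the suggestion above can be sketched roughly as follows. This is only an illustration, assuming Spark 1.6 or later (where skewness and kurtosis are available in org.apache.spark.sql.functions) and a numeric column; the helper name univariateSummary and the output aliases are hypothetical and do not come from the thread. The point is that agg can be called directly on the DataFrame, so no groupBy key is needed.

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.{avg, kurtosis, max, min, skewness}

// Hypothetical helper (not from the thread): computes several univariate
// statistics for one numeric column in a single pass over the data.
// Calling agg directly on the DataFrame aggregates over all rows,
// so no grouping key is required.
def univariateSummary(df: DataFrame, column: String): DataFrame =
  df.agg(
    min(column).as("min"),
    max(column).as("max"),
    avg(column).as("mean"),
    skewness(column).as("skewness"),
    kurtosis(column).as("kurtosis")
  )
```

Assuming a DataFrame df with a numeric column "a", univariateSummary(df, "a").show() would print a single row with the five statistics. If only count, mean, stddev, min and max are needed, df.describe("a") is a built-in alternative.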

Need to user univariate summary stats

2016-02-04 Thread Arunkumar Pillai
Hi, I'm currently using the query sqlContext.sql("SELECT MAX(variablesArray) FROM " + tableName) to extract mean, max, and min. Is there a better, more optimized way? In the example I saw df.groupBy("key").agg(skewness("a"), kurtosis("a")), but I don't have a key anywhere in the data. How to extract the