Hi all,I am new in Spark, and I have a question in dealing with RDD.I’ve converted RDD to DataFrame. So there are two DF: DF1 and DF2DF1 contains: userID, time, dataUsage, durationDF2 contains: userIDEach userID has multiple rows in DF1.DF2 has distinct userID, and I would like to compute the average, max and min value of both dataUsage and duration for each userID in DF1?And store the results in a new dataframe.How can I do that?Thanks a lot.BestFrank
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Question-of-RDD-in-calculation-tp25100.html Sent from the Apache Spark User List mailing list archive at Nabble.com.