Question of RDD in calculation

Shepherd Fri, 16 Oct 2015 12:15:07 -0700

Hi all,I am new in Spark, and I have a question in dealing with RDD.I’ve
converted RDD to DataFrame. So there are two DF: DF1 and DF2DF1 contains:
userID, time, dataUsage, durationDF2 contains: userIDEach userID has
multiple rows in DF1.DF2 has distinct userID, and I would like to compute
the average, max and min value of both dataUsage and duration for each
userID in DF1?And store the results in a new dataframe.How can I do
that?Thanks a lot.BestFrank




--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Question-of-RDD-in-calculation-tp25100.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

Question of RDD in calculation

Reply via email to