Have you looked at the new Spark ML library? You can use a DataFrame directly 
with the Spark ML API.

https://spark.apache.org/docs/latest/ml-guide.html


Mohammed

From: Sourav Mazumder [mailto:sourav.mazumde...@gmail.com]
Sent: Monday, July 6, 2015 10:29 AM
To: user
Subject: How to create a LabeledPoint RDD from a Data Frame

Hi,
I have a Dataframe which I want to use for creating a RandomForest model using 
MLLib.
The RandonForest model needs a RDD with LabeledPoints.
Wondering how do I convert the DataFrame to LabeledPointRDD
Regards,
Sourav

Reply via email to