Re: Reg:Reading a csv file with String label into labelepoint

2016-03-16 Thread Yanbo Liang

Actually, it's unnecessary to convert each CSV row to a LabeledPoint, because Spark ML uses DataFrame as the standard data format for training a model. What you should do is convert the double attributes into a Vector column named "feature". Then you can train the ML model by specifying the featureCol and

Reg:Reading a csv file with String label into labelepoint

2016-03-15 Thread Dharmin Siddesh J

Hi, I am trying to read a CSV file with a few double attributes and a String label. How can I convert it to a LabeledPoint RDD so that I can run it with Spark MLlib classification algorithms? I have tried the LabeledPoint constructor (which seems to be available only for regression), but it accepts only a double label.
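
Since LabeledPoint accepts only a double label, the string labels have to be mapped to numeric indices before anything else. The following is a minimal plain-Python sketch of that mapping step, not Spark code: the sample data, the column layout (doubles first, label last), and the helper names `index_labels` and `to_labeled_rows` are all assumptions made for illustration. It assigns index 0.0 to the most frequent label, with ties broken alphabetically.

```python
import csv
import io
from collections import Counter

# Hypothetical sample data: two double attributes followed by a string label.
SAMPLE_CSV = """5.1,3.5,setosa
6.2,2.9,versicolor
4.9,3.0,setosa
"""


def index_labels(labels):
    """Map each distinct string label to a double index, most frequent first.

    Ties are broken alphabetically so the mapping is deterministic.
    """
    counts = Counter(labels)
    ordered = sorted(counts, key=lambda name: (-counts[name], name))
    return {name: float(i) for i, name in enumerate(ordered)}


def to_labeled_rows(csv_text):
    """Return (label_index, [features]) pairs plus the label mapping."""
    # Skip blank lines; assume the label is the last column of every row.
    rows = [r for r in csv.reader(io.StringIO(csv_text)) if r]
    mapping = index_labels(r[-1] for r in rows)
    points = [(mapping[r[-1]], [float(x) for x in r[:-1]]) for r in rows]
    return points, mapping


points, mapping = to_labeled_rows(SAMPLE_CSV)
```

Each resulting pair holds a double label and a list of double features, which is the same information a LabeledPoint carries. In Spark ML itself, the StringIndexer and VectorAssembler transformers cover the label-indexing and feature-assembly steps on a DataFrame; the exact usage depends on the Spark version, so it is worth checking the documentation for the release in use.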