Re: LabeledPoint creation

2016-09-08 Thread 市场部
scala> result.show() +-+-+ |label| features| +-+-+ | 0.0|[1.0,0.0,0.0]| | 1.0|[0.0,0.0,1.0]| | 2.0|[0.0,1.0,0.0]| | 3.0|[1.0,0.0,0.0]| | 4.0|[1.0,0.0,0.0]| | 5.0|[0.0,1.0,0.0]| | 6.0|[0.0,0.0,0.0]| +-+-+ 发件人: Madabhattula Rajesh Kumar mailto:mr

Re: LabeledPoint creation

2016-09-07 Thread Madabhattula Rajesh Kumar
Hi, I have done this in different way. Please correct me, is this approach right ? val df = spark.createDataFrame(Seq( (0, "a"), (1, "b"), (2, "c"), (3, "a"), (4, "a"), (5, "c"), (6, "d"))).toDF("id", "category") val categories: List[String] = List("a

Re: LabeledPoint creation

2016-09-07 Thread aka.fe2s
It has 4 categories a = 1 0 0 b = 0 0 0 c = 0 1 0 d = 0 0 1 -- Oleksiy Dyagilev On Wed, Sep 7, 2016 at 10:42 AM, Madabhattula Rajesh Kumar < mrajaf...@gmail.com> wrote: > Hi, > > Any help on above mail use case ? > > Regards, > Rajesh > > On Tue, Sep 6, 2016 at 5:40 PM, Madabhattula Rajesh Kumar

Re: LabeledPoint creation

2016-09-07 Thread Madabhattula Rajesh Kumar
Hi, Any help on above mail use case ? Regards, Rajesh On Tue, Sep 6, 2016 at 5:40 PM, Madabhattula Rajesh Kumar < mrajaf...@gmail.com> wrote: > Hi, > > I am new to Spark ML, trying to create a LabeledPoint from categorical > dataset(example code from spark). For this, I am using One-hot encodin

LabeledPoint creation

2016-09-06 Thread Madabhattula Rajesh Kumar
Hi, I am new to Spark ML, trying to create a LabeledPoint from categorical dataset(example code from spark). For this, I am using One-hot encoding feature. Below is my code val df = sparkSession.createDataFrame(Seq( (0, "a"), (1, "b"), (2,