Re: LabeledPoint creation

2016-09-08 Thread 市场部
umar <mrajaf...@gmail.com<mailto:mrajaf...@gmail.com>> 日期: 2016年9月8日 星期四 下午2:10 至: "aka.fe2s" <aka.f...@gmail.com<mailto:aka.f...@gmail.com>> 抄送: "user@spark.apache.org<mailto:user@spark.apache.org>" <user@spark.apache.org<mailto:user@spark.apache.org&

Re: LabeledPoint creation

2016-09-08 Thread Madabhattula Rajesh Kumar
Hi, I have done this in different way. Please correct me, is this approach right ? val df = spark.createDataFrame(Seq( (0, "a"), (1, "b"), (2, "c"), (3, "a"), (4, "a"), (5, "c"), (6, "d"))).toDF("id", "category") val categories: List[String] =

Re: LabeledPoint creation

2016-09-07 Thread aka.fe2s
It has 4 categories a = 1 0 0 b = 0 0 0 c = 0 1 0 d = 0 0 1 -- Oleksiy Dyagilev On Wed, Sep 7, 2016 at 10:42 AM, Madabhattula Rajesh Kumar < mrajaf...@gmail.com> wrote: > Hi, > > Any help on above mail use case ? > > Regards, > Rajesh > > On Tue, Sep 6, 2016 at 5:40 PM, Madabhattula Rajesh

Re: LabeledPoint creation

2016-09-07 Thread Madabhattula Rajesh Kumar
Hi, Any help on above mail use case ? Regards, Rajesh On Tue, Sep 6, 2016 at 5:40 PM, Madabhattula Rajesh Kumar < mrajaf...@gmail.com> wrote: > Hi, > > I am new to Spark ML, trying to create a LabeledPoint from categorical > dataset(example code from spark). For this, I am using One-hot

LabeledPoint creation

2016-09-06 Thread Madabhattula Rajesh Kumar
Hi, I am new to Spark ML, trying to create a LabeledPoint from categorical dataset(example code from spark). For this, I am using One-hot encoding feature. Below is my code val df = sparkSession.createDataFrame(Seq( (0, "a"), (1, "b"), (2,