Re: Help in Parsing 'Categorical' type of data

2017-06-23 Thread Yanbo Liang
Please consider using other classification models such as logistic regression or GBT. Naive Bayes usually treats features as counts, which makes it unsuitable for features generated by a one-hot encoder.

Thanks
Yanbo

On Wed, May 31, 2017 at 3:58 PM, Amlan Jyoti wrote:

Help in Parsing 'Categorical' type of data

2017-05-31 Thread Amlan Jyoti
Hi, I am trying to run a Naive Bayes model using the Spark ML libraries in Java. A sample snippet of the dataset is given below: Raw Data - But as the input data needs to be numeric, I am using a one-hot encoder on the Gender field [m->0,1][f->1,0]; and finally the 'features' vector is
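
For reference, the Gender mapping mentioned above (m -> [0,1], f -> [1,0]) can be sketched in plain Java without any Spark dependency. This is only an illustration of the encoding itself, not the Spark ML encoder; the class name GenderOneHot and the method encode are hypothetical names chosen for this sketch.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

public class GenderOneHot {
    // Mapping as described in the message: m -> [0,1], f -> [1,0].
    // The class and method names are assumptions made for this sketch.
    private static final Map<String, double[]> CODES = new LinkedHashMap<>();
    static {
        CODES.put("m", new double[] {0.0, 1.0});
        CODES.put("f", new double[] {1.0, 0.0});
    }

    // Returns a copy of the one-hot vector for a known category,
    // and rejects any value that is not in the mapping.
    static double[] encode(String gender) {
        double[] code = CODES.get(gender);
        if (code == null) {
            throw new IllegalArgumentException("Unknown category: " + gender);
        }
        return code.clone();
    }

    public static void main(String[] args) {
        for (String g : new String[] {"m", "f"}) {
            System.out.println(g + " -> " + Arrays.toString(encode(g)));
        }
    }
}
```

Each encoded vector has exactly one entry set to 1.0, so the values act as indicators of category membership rather than as counts.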