Please consider using another classification model, such as logistic
regression or GBT. Naive Bayes usually treats feature values as counts,
which makes it a poor fit for features generated by a one-hot encoder.
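To illustrate what "features as counts" means, here is a minimal sketch in plain Java of the usual multinomial scoring rule: the log prior plus each feature value times the log of that feature's per-class probability. This is not Spark's implementation; the class name, method name, and all numbers are made up for illustration.

```java
public final class MultinomialScoreSketch {

    // Hypothetical helper: log P(class) + sum_i x[i] * log P(feature i | class).
    // Each x[i] acts as a repeat count for feature i.
    public static double logScore(double logPrior, double[] logTheta, double[] x) {
        double score = logPrior;
        for (int i = 0; i < x.length; i++) {
            score += x[i] * logTheta[i];
        }
        return score;
    }

    public static void main(String[] args) {
        // Assumed per-class feature probabilities, for illustration only.
        double[] logTheta = {Math.log(0.25), Math.log(0.75)};

        // One-hot vector for 'm' under the encoding in the question: [0,1].
        double[] oneHot = {0.0, 1.0};
        // A genuine count vector, e.g. word counts in a document.
        double[] counts = {3.0, 5.0};

        System.out.println(logScore(Math.log(0.5), logTheta, oneHot));
        System.out.println(logScore(Math.log(0.5), logTheta, counts));
    }
}
```

With a one-hot input, each value can only say "this category occurred zero times or once", so the count interpretation adds nothing beyond picking one category.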
Thanks
Yanbo
On Wed, May 31, 2017 at 3:58 PM, Amlan Jyoti wrote:
Hi,
I am trying to run a Naive Bayes model using the Spark ML libraries, in Java.
A sample snippet of the dataset is given below:
Raw Data -
But, as the input data needs to be numeric, I am using a one-hot encoder
on the Gender field [m->0,1][f->1,0]; and finally the 'features' vector
is