Re: SVM in Mahout

2013-12-24 Thread tuku
someone tried to implement SVM in a summer google code but it turns out map reduced version of svm is too difficult to implement and they dropped the project. I bet you can train via libsvm and use just classification part with map reduce but if I have a choice I prefer logistic regression too

Re: Mahout for text classification

2013-12-11 Thread tuku
I am currently using naive bayes for text classification. I prefer NB over SVM because; - SVM has long training time - NB can be incremental - NB can be fully parallel the main decisions you should make while using NB is using tf or tfidf and using binary NB or multinomial if you classify short

theta normalization fo naive bayes is commented out

2013-12-02 Thread tuku
hello; i searched wiki and the web but couldn't find the reason why theta normalization is commented out for naive bayes classification. there is a todo comment on top that states this will be enabled soon. is there any schedule for this? do anyone know the reason not to use theta normalization?