Re: Usage of TF-IDF weights in cbayes Mahout

2010-10-21 Thread Neil Ghosh
Thanks for the clarification, Ted/Robin. I got the point of TF and IDF. Is there any other technique or weight optimization that I can use with Mahout, especially with cbayes?
On Thu, Sep 30, 2010 at 11:30 PM, Ted Dunning wrote:
> That is exactly what it does.
>
> On Thu, Sep 30, 2010 at 8:

Re: Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Ted Dunning
That is exactly what it does.
On Thu, Sep 30, 2010 at 8:37 AM, Neal Richter wrote:
> On Thu, Sep 30, 2010 at 8:37 AM, Neil Ghosh wrote:
> > Does anybody have examples/reference how to use TF-IDF weights in mahout
> > cbayes for particular words and phrases while doing text classification ?
> >

Re: Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Robin Anil
An extra file or an extra line: duplicated instances (to decrease the weights), or a duplicated feature in the same instance to increase the weights (classic tf-idf).
Robin
On Thu, Sep 30, 2010 at 9:50 PM, Neil Ghosh wrote:
> So All I have to do is add an extra file containing
>
> LABELproblemcomplaintprobl
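The effect of duplicating a feature inside one instance can be illustrated with a small, self-contained sketch. It is not Mahout code: the `tf_idf` helper and the toy documents are hypothetical, and it assumes the plain `tf * log(N / df)` weighting, which may differ from the exact formula Mahout applies internally.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document, using raw tf * log(N / df)."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears at least once.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    weights = []
    for tokens in docs:
        tf = Counter(tokens)  # raw term counts within this document
        weights.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return weights


# Hypothetical toy training instances (already tokenized).
docs = [
    ["problem", "with", "delivery"],
    ["great", "service", "with", "smile"],
]
# Same data, but "problem" is duplicated inside the first instance.
boosted_docs = [docs[0] + ["problem"], docs[1]]

base_weight = tf_idf(docs)[0]["problem"]
boosted_weight = tf_idf(boosted_docs)[0]["problem"]
print(base_weight, boosted_weight)
```

Under these assumptions, repeating the term doubles its term frequency while its document frequency stays the same, so its weight in that instance doubles.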

Re: Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Neil Ghosh
So all I have to do is add an extra file containing LABELproblemcomplaintproblemo along with the usual training data in Bayes format?
On Thu, Sep 30, 2010 at 9:44 PM, Robin Anil wrote:
> >>> Or Do I have flexibility to give some other input specific to my problem
>>> ? Such as if words like "

Re: Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Robin Anil
> >
>> Or Do I have flexibility to give some other input specific to my problem ?
>> Such as if words like "Problem", "Complaint" etc are more likely to appear
>> in a text containing grievance.
>>
>>
>> > You can provide a Weight, usually TF-IDF, that often does a good job of
>> factoring in the

Re: Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Neil Ghosh
Thanks for replying, Robin. I am quoting my earlier conversation with Grant. Now I want to know how to implement the second problem.
To be specific my problem is to classify a piece text crawled from web into
> two classes
>
> 1.It is a +ve feedback
> 2.It is

Re: Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Neal Richter
On Thu, Sep 30, 2010 at 8:37 AM, Neil Ghosh wrote:
> Does anybody have examples/reference how to use TF-IDF weights in mahout
> cbayes for particular words and phrases while doing text classification ?
http://people.csail.mit.edu/jrennie/papers/icml03-nb.pdf
- Neal
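For readers who want the idea behind the linked paper in runnable form, here is a minimal sketch of complement naive Bayes for the two-class feedback case discussed in this thread. It is a simplified illustration, not Mahout's implementation: the function names, the toy data, and the smoothing value `alpha=1.0` are assumptions, and the TF-IDF transform and weight normalization steps are left out.

```python
import math
from collections import Counter


def train_cnb(labelled_docs, alpha=1.0):
    """Estimate complement naive Bayes weights.

    For each class c, term counts are taken from every document that is
    NOT labelled c, smoothed by alpha, and stored as log probabilities.
    """
    vocab = {t for _, tokens in labelled_docs for t in tokens}
    labels = {label for label, _ in labelled_docs}
    weights = {}
    for c in labels:
        complement = Counter()
        for label, tokens in labelled_docs:
            if label != c:
                complement.update(tokens)
        total = sum(complement.values()) + alpha * len(vocab)
        weights[c] = {t: math.log((complement[t] + alpha) / total) for t in vocab}
    return weights


def classify(weights, tokens):
    """Pick the class whose complement explains the tokens worst (lowest score)."""
    tf = Counter(tokens)

    def score(c):
        # Terms never seen in training are skipped.
        return sum(n * weights[c][t] for t, n in tf.items() if t in weights[c])

    return min(weights, key=score)


# Hypothetical toy training data for the +ve / -ve feedback problem.
training = [
    ("negative", ["problem", "complaint", "late"]),
    ("positive", ["great", "thanks", "fast"]),
]
model = train_cnb(training)
print(classify(model, ["problem", "late"]))
```

The choice of `min` is the defining step: a document is assigned to the class whose complement (all other classes) fits it least well.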

Re: Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Robin Anil
It does that by default for all words. What else do you have in mind?
On Thu, Sep 30, 2010 at 8:07 PM, Neil Ghosh wrote:
> Does anybody have examples/reference how to use TF-IDF weights in mahout
> cbayes for particular words and phrases while doing text classification ?
>
> --
> Thanks and Rega

Usage of TF-IDF weights in cbayes Mahout

2010-09-30 Thread Neil Ghosh
Does anybody have examples or references showing how to use TF-IDF weights in Mahout cbayes for particular words and phrases when doing text classification?
--
Thanks and Regards
Neil
http://neilghosh.com