non-text NB classifiers?

2012-07-31 Thread Eric Friedman
All of the examples that I've found for training NB classifiers seem to have textual data as input. Is there a way to build a classifier with more general attributes? I found this jira ticket (https://issues.apache.org/jira/browse/MAHOUT-286), but it's been closed:duplicate under

Re: non-text NB classifiers?

2012-07-31 Thread Sean Owen
I don't know this code too much, but, there is simply a step in front I believe that vectorizes text with TF-IDF. The result are simple vectors. You could just inject your vectors (i.e. real-value attributes) at that stage and skip the TF-IDF. It may need a little hacking. On Tue, Jul 31, 2012 at

Re: non-text NB classifiers?

2012-07-31 Thread Robin Anil
You can pass in any vector(not just a tfidf vector). For example the asf-email example script using Vectors generated using the randomized encoding. -- Robin Anil On Tue, Jul 31, 2012 at 12:26 PM, Sean Owen sro...@gmail.com wrote: I don't know this code too much, but, there is simply a

Re: non-text NB classifiers?

2012-07-31 Thread Eric Friedman
Can you point me to the class I should look at to see how this is done? On Tue, Jul 31, 2012 at 10:49 AM, Robin Anil robin.a...@gmail.com wrote: You can pass in any vector(not just a tfidf vector). For example the asf-email example script using Vectors generated using the randomized encoding.

Re: non-text NB classifiers?

2012-07-31 Thread Robin Anil
its EncodedVectorsFromSequenceFiles.java I believe -- Robin Anil On Tue, Jul 31, 2012 at 6:05 PM, Eric Friedman e...@spottedsnake.netwrote: Can you point me to the class I should look at to see how this is done? On Tue, Jul 31, 2012 at 10:49 AM, Robin Anil robin.a...@gmail.com wrote: