Re: text classification using mahout and lucene index

David Rahman Fri, 14 Oct 2011 04:10:18 -0700

Ok, thanks.
Just to make it clear to me: I take the data in the Lucene vectors and
run a training algorithm on them, and this should result in a model. I
don't need any preprocessing steps or anything else?

Another question: your book MiA gives a good explanation and overview of
Mahout. Can you tell me whether more is coming about Mahout+Lucene? I'm
new to this stuff, and I need some more reading.

I did find "Taming Text" but from the abstract I could not determine if this
applies to my problem.

Thanks and regards,
David

take Lucene vectors --> train on them with naive Bayes or another algorithm -->
get a model
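
As a rough illustration of the pipeline summarized above (vectors in, training
step, model out), here is a minimal sketch of multinomial naive Bayes over
sparse term-count vectors. It is plain Python, not the Mahout or Lucene API:
the function names, the dictionary-based model layout, the smoothing value and
the toy data are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(labeled_vectors, alpha=1.0):
    """Train on (label, {term: count}) pairs; alpha is additive smoothing."""
    doc_counts = Counter()              # number of documents per label
    term_counts = defaultdict(Counter)  # per-label term frequencies
    vocab = set()
    for label, vec in labeled_vectors:
        doc_counts[label] += 1
        term_counts[label].update(vec)
        vocab.update(vec)
    total_docs = sum(doc_counts.values())
    return {
        "alpha": alpha,
        "vocab": vocab,
        "term_counts": term_counts,
        "term_totals": {l: sum(c.values()) for l, c in term_counts.items()},
        "log_prior": {l: math.log(n / total_docs) for l, n in doc_counts.items()},
    }


def classify(model, vec):
    """Return the label with the highest log-posterior for a term-count vector."""
    vocab_size = len(model["vocab"])
    alpha = model["alpha"]
    best_label, best_score = None, -math.inf
    for label, score in model["log_prior"].items():
        denom = model["term_totals"][label] + alpha * vocab_size
        for term, count in vec.items():
            if term not in model["vocab"]:
                continue  # ignore terms never seen during training
            numer = model["term_counts"][label][term] + alpha
            score += count * math.log(numer / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy vectors standing in for vectors exported from an index.
training = [
    ("sports", {"match": 2, "goal": 1}),
    ("sports", {"team": 1, "goal": 2}),
    ("tech", {"index": 2, "vector": 1}),
    ("tech", {"vector": 2, "model": 1}),
]
model = train_naive_bayes(training)
print(classify(model, {"goal": 1, "team": 1}))
```

The returned dictionary plays the role of the "model" in the summary: it is
everything the classifier needs at prediction time, and nothing in it depends
on where the vectors originally came from.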

2011/10/13 Ted Dunning <ted.dunn...@gmail.com>

> I just meant that there are separate components to do the different steps.
>  Historically, some glue code was required between them, but I think that
> the gap has been narrowed lately.
>
> On Thu, Oct 13, 2011 at 12:41 PM, David Rahman
> <drahman1...@googlemail.com> wrote:
>
> > @Ted: Could you explain the last part of your response, please? I
> > didn't understand it:
> >
> > >You will need to glue the lucene document vector extraction to the
> > >naive bayes and you may want to adapt it to use feature hashing for the
> > >SGD classifiers.
> >
>
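
For readers unfamiliar with the feature hashing mentioned in the quoted text,
the idea can be sketched as follows: each token is hashed straight into one of
a fixed number of buckets, so no dictionary of terms has to be built first.
This is plain Python and not Mahout's encoder classes; the function name, the
bucket count and the choice of CRC32 as the hash are assumptions made for the
example.

```python
import zlib


def hash_features(tokens, num_buckets=1024):
    """Map tokens to a fixed-size sparse vector {bucket_index: count}."""
    vec = {}
    for token in tokens:
        # CRC32 is deterministic across runs, unlike Python's built-in hash().
        index = zlib.crc32(token.encode("utf-8")) % num_buckets
        vec[index] = vec.get(index, 0) + 1
    return vec


tokens = ["text", "classification", "with", "text", "vectors"]
hashed = hash_features(tokens, num_buckets=16)
print(hashed)
```

Distinct tokens can collide in the same bucket; a larger bucket count makes
that less likely at the cost of a wider vector.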
