Marko, Suneel's answer is much better than mine.
On Wed, Sep 24, 2014 at 10:10 PM, Suneel Marthi <suneel.mar...@gmail.com> wrote: > @Marko, Subject: Streaming KMeans > > See > > http://stackoverflow.com/questions/17272296/how-to-use-mahout-streaming-k-means/18090471#18090471 > for how to invoke Streaming Kmeans > > Also look at examples/bin/cluster-reuters.sh for the Streaming KMeans > option. > > > On Wed, Sep 24, 2014 at 11:34 AM, Marko <marko.di...@nissatech.com> wrote: > > > Hello everyone, > > > > I'm very sorry to bump in like this, I have been added to the mail list > (I > > think), but it seems that I'm somehow unable to ask a question, that is, > I > > asked a question full times and got no answer. I hope this way will work. > > > > I'm new to Mahout and I've been struggling with Streaming K-means for a > > while now. Is there any tutorial or example of how to use it, how to get > > results, how to call clustering function? > > > > Any help would be great, > > Thanks > > > > > > On 24.09.2014. 15:14, Arian Pasquali wrote: > > > >> Yes, > >> I'm studying his work <http://nlp.uned.es/~jperezi/Lucene-BM25/> and > the > >> current mahout's tfidf code. > >> Trying to understand how I would port that to mr. > >> I ll try to share something if I succeed. > >> > >> > >> > >> > >> > >> Arian Pasquali > >> http://about.me/arianpasquali > >> > >> 2014-09-24 5:12 GMT+01:00 Suneel Marthi <suneel.mar...@gmail.com>: > >> > >> Lucene 4.x supports okapi-bm25. So it should be easy to implement. > >>> > >>> On Tue, Sep 23, 2014 at 11:57 PM, Ted Dunning <ted.dunn...@gmail.com> > >>> wrote: > >>> > >>> Should be pretty easy. I haven't heard of anyone doing it. > >>>> > >>>> Sent from my iPhone > >>>> > >>>> On Sep 23, 2014, at 18:53, Arian Pasquali <ar...@arianpasquali.com> > >>>>> > >>>> wrote: > >>>> > >>>>> Hi, > >>>>> I was wondering if would be possible to support bm25 term weighting > >>>>> extending Mahout's tf-idf implementation. > >>>>> > >>>>> I was curious to know if anyone here has already tried to do so. > >>>>> If not, what would be your suggestion for such implementation on > >>>>> > >>>> Mahout? > >>> > >>>> > >>>>> Arian Pasquali > >>>>> http://about.me/arianpasquali > >>>>> > >>>> > > >