Re: [Scikit-learn-general] TFIDF question

2013-11-29 Thread Andreas Hjortgaard Danielsen
On 29 November 2013 14:43, Olivier Grisel wrote:
> 2013/11/29 Andreas Hjortgaard Danielsen:
> > Hi,
> >
> > It might be worth noting that Lucene uses the same implementation:
> > http://lucene.apache.org/core/4_0_0/core/org/apache/lucene/search/similarities/TF

Re: [Scikit-learn-general] TFIDF question

2013-11-29 Thread Andreas Hjortgaard Danielsen
Hi,

It might be worth noting that Lucene uses the same implementation:
http://lucene.apache.org/core/4_0_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html

Gensim also has an option for choosing an additive constant (although the default is 0):
https://github.com/piskvorky/gensim/bl
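
For readers following the thread, here is a minimal sketch of what an additive constant in the IDF term looks like. It assumes the "same implementation" remark refers to adding a constant to the log-scaled inverse document frequency; the function name `idf_with_constant` and its parameters are hypothetical and do not reproduce the API of scikit-learn, Lucene, or Gensim.

```python
import math


def idf_with_constant(n_docs, doc_freq, add=0.0, log_base=math.e):
    """Illustrative IDF: add + log(n_docs / doc_freq).

    `add` is the additive constant discussed in the thread (assumed
    meaning); with add=0.0 a term that occurs in every document gets
    an IDF of exactly 0 and is effectively ignored.
    """
    if doc_freq <= 0 or n_docs <= 0:
        raise ValueError("n_docs and doc_freq must be positive")
    return add + math.log(n_docs / doc_freq, log_base)


# A term present in all 10 documents:
idf_plain = idf_with_constant(10, 10, add=0.0)    # weight vanishes
idf_shifted = idf_with_constant(10, 10, add=1.0)  # weight is kept
```

With `add=1.0`, a term that appears in every document still contributes its raw term frequency instead of being zeroed out, which is the practical effect of the constant.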