Re: [Scikit-learn-general] Updated KMeansCoder now available as gist

2013-12-17 Thread Gael Varoquaux
> I am not sure that K-SVD is any better, but one of the other people in > my group proposed using K-SVD, so I will likely have to implement it. It's a reference algorithm. I'd sure think that it would be a good addition. ---

Re: [Scikit-learn-general] Fwd: libsvm PR

2013-12-17 Thread Joel Nothman
On Wed, Dec 18, 2013 at 12:40 AM, Olivier Grisel wrote: > 2013/12/17 James Bergstra : > > News: Chih-Jen Lin wrote to me a few days ago to let me know that he has > > created a github project for libsvm. > > > > https://github.com/cjlin1/libsvm > > \o/ > > Wow. A first step for the community shoul

Re: [Scikit-learn-general] SGDClassifier(loss='log') vs. LogisticRegression

2013-12-17 Thread Mathieu Blondel
On Tue, Dec 17, 2013 at 9:17 AM, Doug Newman wrote: > > So, my question is two-fold: (1) Why this difference? and (2) Would you > have any recommendations going forward? Is there a better algorithm or > technique I could read up on that would give me a confidence score on a > per-prediction basis

Re: [Scikit-learn-general] Call for reviews - Text Analysis tutorial

2013-12-17 Thread Jaques Grobler
Agg -- new conflicts.. still - review's welcome 2013/12/17 Jaques Grobler > Hi all.. > > If anybody has some time - I'd really appreciate last reviews on this PR > for the Text Analysis tutorial. The PR is available here below: > > https://github.com/scikit-learn/scikit-learn/pull/1971 > > Onli

[Scikit-learn-general] Call for reviews - Text Analysis tutorial

2013-12-17 Thread Jaques Grobler
Hi all.. If anybody has some time - I'd really appreciate last reviews on this PR for the Text Analysis tutorial. The PR is available here below: https://github.com/scikit-learn/scikit-learn/pull/1971 Online build available here: http://jaquesgrobler.github.io/online-sklearn-build/documentation

Re: [Scikit-learn-general] macro and micro average output

2013-12-17 Thread Yuan Luo
Thank you all for helping out! Best, Yuan On Mon, Dec 16, 2013 at 4:58 PM, Lars Buitinck wrote: > 2013/12/16 Joel Nothman : > > In some parts of the codebase, we have near duplicate implementations > with > > different names (e.g. classifiers and regressors), but for metrics we > make > > impl

Re: [Scikit-learn-general] Fwd: libsvm PR

2013-12-17 Thread Olivier Grisel
2013/12/17 James Bergstra : > News: Chih-Jen Lin wrote to me a few days ago to let me know that he has > created a github project for libsvm. > > https://github.com/cjlin1/libsvm \o/ > Anyone want to try a rebase (!?) That would indeed be great to maintain a fork with our patches in a regularly

[Scikit-learn-general] Fwd: libsvm PR

2013-12-17 Thread James Bergstra
News: Chih-Jen Lin wrote to me a few days ago to let me know that he has created a github project for libsvm. https://github.com/cjlin1/libsvm Anyone want to try a rebase (!?) -- Rapidly troubleshoot problems before they

Re: [Scikit-learn-general] SGDClassifier(loss='log') vs. LogisticRegression

2013-12-17 Thread Joel Nothman
I think alpha = 1/2C On Tue, Dec 17, 2013 at 7:46 PM, Olivier Grisel wrote: > 2013/12/17 Doug Newman : > > Hello, > > > > I am relatively new to classification problems in machine learning and > have > > a somewhat general question regarding the behavior of SGDClassifer with > > loss='log' as co

Re: [Scikit-learn-general] finding fitted coefficients using cross_val_score?

2013-12-17 Thread Chris Holdgraf
Ah - thanks very much for the clarification! On Thu, Dec 12, 2013 at 9:26 AM, Gael Varoquaux < gael.varoqu...@normalesup.org> wrote: > > Is there a way for me to access this information (the fitted > > coefficients for each split)? It's clearly being calculated on each > > iteration, so I assume

Re: [Scikit-learn-general] SGDClassifier(loss='log') vs. LogisticRegression

2013-12-17 Thread Olivier Grisel
2013/12/17 Doug Newman : > Hello, > > I am relatively new to classification problems in machine learning and have > a somewhat general question regarding the behavior of SGDClassifer with > loss='log' as compared to LogisticRegression in the sklearn package. In theory they are optimizing the same