Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Olivier Grisel
On 6 April 2012 04:31, Lee Zamparo wrote: > Hi folks, > > I haven't had time to work on and submit a patch to scikit-learn by > this evening, as I'm facing down a conference deadline.  Thanks to > everyone who provided valuable feedback, and hopefully I'll be able to > submit next year. You ca

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Gael Varoquaux
Hi Lee, As you can see on http://wiki.python.org/moin/SummerOfCode/2012, you have another 10 days to meet these expectations. That said, if you do not have time to pursue the GSOC, you shouldn't apply. I stress that it is a full time job. Good luck with your deadline, Gael On Thu, Apr 05, 2012

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Lee Zamparo
Hi folks, I haven't had time to work on and submit a patch to scikit-learn by this evening, as I'm facing down a conference deadline. Thanks to everyone who provided valuable feedback, and hopefully I'll be able to submit next year. Thanks, Lee. On Mon, Apr 2, 2012 at 12:14 PM, Olivier Grisel

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Olivier Grisel
On 5 April 2012 22:58, Vlad Niculae wrote: > Submitted: > http://www.google-melange.com/gsoc/proposal/review/google/gsoc2012/vladn/15002# > > Just need to make some links blue, but the editor is tedious. > > I'm afraid I won't have the time to write up a proposal for the matrix > completion p

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Vlad Niculae
Submitted: http://www.google-melange.com/gsoc/proposal/review/google/gsoc2012/vladn/15002# Just need to make some links blue, but the editor is tedious. I'm afraid I won't have the time to write up a proposal for the matrix completion project. I still remain highly interested in it. Best, Vlad

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Olivier Grisel
On 5 April 2012 14:25, Vlad Niculae wrote: > Hi everyone > > I have updated my proposal thanks to your excellent suggestions. > > I also pointed out the style of optimization that will be applied by linking > to my blog post on optimizing orthogonal matching pursuit code. Unfortunately > this

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Gael Varoquaux
On Thu, Apr 05, 2012 at 03:25:59PM +0300, Vlad Niculae wrote: > I plan to submit tonight. Do: you should be able to edit till the deadline. To all the students: I must apologize, Alex (Gramfort) and I have a conference deadline tonight, and we must worry about our graduate students that are submi

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-05 Thread Vlad Niculae
Hi everyone I have updated my proposal thanks to your excellent suggestions. I also pointed out the style of optimization that will be applied by linking to my blog post on optimizing orthogonal matching pursuit code. Unfortunately this will also flash the bug I introduced before everyone's eye

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-04 Thread Olivier Grisel
On 4 April 2012 20:19, Alexandre Gramfort wrote: > hello vlad, > > hope you're doing better. > > My gut feeling reading the proposal is that you clearly know what you're > talking > about as you know well the code base but I think you should be more specific > about where the low hanging fruit

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-04 Thread Alexandre Gramfort
hello vlad, hope you're doing better. My gut feeling reading the proposal is that you clearly know what you're talking about as you know well the code base but I think you should be more specific about where the low hanging fruits are and which modules deserve some love in terms of speed. Alex

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-04 Thread Vlad Niculae
Hello guys, Unfortunately I have come down with the flu, and therefore missed a good amount of time to work on gsoc 2012 proposals. I know that there's not much time left for review, but here is my pre-proposal for an overall speedup and benchmarking project. https://docs.google.com/document/d/

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-02 Thread Olivier Grisel
On 2 April 2012 18:06, Lee Zamparo wrote: > > Regarding the suggested additions, I'm interested in Olivier's > suggestion of Power Iteration Clustering, and seeing how it fares > against kernel K-means as well as the convex exemplar based clustering > paper suggested by Bertrand.  I'll revise m

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-02 Thread Lee Zamparo
ist in digest mode. Lee. > > Message: 8 > Date: Mon, 2 Apr 2012 16:51:56 +0200 > From: Olivier Grisel > Subject: Re: [Scikit-learn-general] GSoC 2012 pre-application > To: scikit-learn-general@lists.sourceforge.net > Message-ID: >         > Content-Type: text/plain; chars

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-04-02 Thread Olivier Grisel
On 30 March 2012 15:21, Olivier Grisel wrote: > > Lee, what is your github account? Do you have prior experience with > Numpy / Scipy / Cython development? > > Also about kernel k-means: I don't know this algorithm myself. Do you > have practical evidence that this approach is really working a >

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-03-30 Thread Olivier Grisel
On 30 March 2012 07:19, Gael Varoquaux wrote: > Hi Lee, > > Welcome! Thanks for preparing a proposal. My impression looking at it, is > that it seems a bit light for 2.5 months of work. It is pretty much > centered around implementing one algorithm, weighted k-means. One way to complement this

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-03-30 Thread Gael Varoquaux
Our affinity propagation really doesn't scale. I quickly tried to make it work on sparse matrices, but it turned out to be some work. That said, it's a crappy algorithm :$. Gael - Original message - > Hi Lee. > I'd have to have a look at the papers again to judge this better. > Maybe

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-03-30 Thread Andreas
Hi Lee. I'd have to have a look at the papers again to judge this better. Maybe I'll have time on the weekend. What I would also like to see in the clustering module would be a more scalable mean shift and maybe also quickshift. Cheers, Andy On 03/30/2012 06:30 AM, Lee Zamparo wrote: > Hello

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-03-30 Thread Bertrand Thirion
- > From: "Robert Layton" > To: scikit-learn-general@lists.sourceforge.net > Sent: Friday 30 March 2012 07:24:25 > Subject: Re: [Scikit-learn-general] GSoC 2012 pre-application > On 30 March 2012 16:19, Gael Varoquaux < gael.varoqu...@normalesup.org > > wrote: > >

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-03-29 Thread Mathieu Blondel
Could you elaborate your strategies for speeding up kernel k-means? As far as I know, kernel k-means is very expensive. Mathieu

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-03-29 Thread Robert Layton
On 30 March 2012 16:19, Gael Varoquaux wrote: > Hi Lee, > > Welcome! Thanks for preparing a proposal. My impression looking at it, is > that it seems a bit light for 2.5 months of work. It is pretty much > centered around implementing one algorithm, weighted k-means. > > Cheers, > > Gael > > > On

Re: [Scikit-learn-general] GSoC 2012 pre-application

2012-03-29 Thread Gael Varoquaux
Hi Lee, Welcome! Thanks for preparing a proposal. My impression looking at it, is that it seems a bit light for 2.5 months of work. It is pretty much centered around implementing one algorithm, weighted k-means. Cheers, Gael On Fri, Mar 30, 2012 at 12:30:46AM -0400, Lee Zamparo wrote: > Hello

[Scikit-learn-general] GSoC 2012 pre-application

2012-03-29 Thread Lee Zamparo
Hello everyone, I'm a prospective applicant to GSoC 2012, and am drafting a proposal. I would really appreciate if you could spare some time to give me feedback. My proposal is centred around sklearn.cluster, so I would like to ask Andreas Muller, Olivier Grisel or Lars Buitinck if they would con

Re: [Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-11 Thread Shankar Satish
Hello everyone, I have perused the past email archives for discussion on "partial_fit" and the online-learning APIs. As per my understanding, having a "partial_fit" kind of API essentially modifies a supervised learning classifier so that it can accept a continuous stream of data. In that case, m
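To make the "continuous stream of data" reading of "partial_fit" above concrete, here is a minimal sketch in plain Python. It is not scikit-learn code: the class OnlinePerceptron, the helper stream(), and the toy data are all hypothetical names invented for this illustration, and the perceptron update is simply one convenient learner that can be trained incrementally. The only idea borrowed from the thread is that the model is updated one mini-batch at a time without revisiting earlier batches.

```python
class OnlinePerceptron:
    """Toy binary classifier (labels -1/+1) trained one mini-batch at a time.

    Hypothetical illustration only; not part of scikit-learn.
    """

    def __init__(self, n_features):
        self.w = [0.0] * n_features
        self.b = 0.0

    def decision(self, x):
        # Signed score of a single sample.
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b

    def predict(self, X):
        return [1 if self.decision(x) > 0 else -1 for x in X]

    def partial_fit(self, X, y):
        # Update on this batch only; earlier batches are never revisited.
        for x, target in zip(X, y):
            if target * self.decision(x) <= 0:  # misclassified or on the boundary
                self.w = [wi + target * xi for wi, xi in zip(self.w, x)]
                self.b += target
        return self


def stream():
    # Hypothetical data stream: the label is +1 when x[0] > x[1], else -1.
    batches = [
        ([[2.0, 0.0], [0.0, 2.0]], [1, -1]),
        ([[3.0, 1.0], [1.0, 3.0]], [1, -1]),
        ([[1.0, 0.0], [0.0, 1.0]], [1, -1]),
    ]
    for X, y in batches:
        yield X, y


clf = OnlinePerceptron(n_features=2)
for X_batch, y_batch in stream():
    clf.partial_fit(X_batch, y_batch)  # the model is usable after every call
```

The point of the design is that the caller, not the estimator, owns the loop over the data, so the full dataset never has to fit in memory at once.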

Re: [Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-11 Thread Shankar Satish
Hi Andy, An explanation of the Gittins index: Imagine you are in a casino which has N slot machines. Each machine can be played by inserting a 1-dollar coin in it and pulling a lever. Every time you pull a lever, the machine might spit out some amount of money, governed by an underlying (unknown) st
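The casino setting described above (the multi-armed bandit problem) can be simulated in a few lines. The sketch below is an assumption-heavy illustration: it does not compute the Gittins index, but uses the much simpler epsilon-greedy rule instead, and it assumes each machine pays out exactly 1 dollar with a fixed probability. The function name play_bandit and all parameter values are hypothetical.

```python
import random


def play_bandit(payout_probs, n_rounds, epsilon=0.1, seed=0):
    """Play N slot machines with an epsilon-greedy strategy.

    payout_probs are the true win probabilities; they are hidden from the
    player, who only sees the rewards. Assumption: Bernoulli payouts of 0 or 1.
    """
    rng = random.Random(seed)
    n_arms = len(payout_probs)
    counts = [0] * n_arms        # number of pulls per machine
    estimates = [0.0] * n_arms   # running mean payout per machine
    total_reward = 0
    for _ in range(n_rounds):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: pick a random machine
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])  # exploit
        reward = 1 if rng.random() < payout_probs[arm] else 0
        counts[arm] += 1
        # Incremental update of the mean payout observed on this machine.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward
    return counts, estimates, total_reward


counts, estimates, total_reward = play_bandit([0.2, 0.5, 0.8], n_rounds=1000)
```

Each pull both earns money and reveals information about one machine, which is the exploration/exploitation trade-off that index policies such as the Gittins index are designed to handle.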

Re: [Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-11 Thread Andreas Mueller
Hi Shankar. Can you explain to what extent these two ideas are related to online learning? I am not familiar with the Gittins index but value iteration and policy iteration are dynamic programming algorithms that assume a model is given. I would implement those by either giving the full transition mod

[Scikit-learn-general] GSOC-2012 project idea: online learning algorithms.

2012-03-10 Thread Shankar Satish
Hello everyone, I am a prospective GSOC-2012 student. I have some project ideas that I would like to bounce off the community: I would like to add online-learning functionality. To do so, we can implement some reinforcement-learning algorithms. The problem is described in terms of an "agent" that ne

Re: [Scikit-learn-general] GSOC 2012

2012-03-07 Thread David Warde-Farley
On 2012-03-07, at 8:45 AM, Olivier Grisel wrote: > I think what DWF calls sparse coding is the LASSO implemented with > coordinate descent (sparse coding with a fixed dictionary). Indeed, the encoding step rather than the dictionary learning step. When benchmarking unsupervised feature learning m

Re: [Scikit-learn-general] GSOC 2012

2012-03-07 Thread Alexandre Gramfort
> We could indeed merge both proposals in a single GSoC proposal, but I > would like to keep them as 2 separate steps with the two examples: > group lens movie recommendation and out-of-core NMF for topic modeling > on wikipedia text. > > We could also work on making the MiniBatchSparseDirectionary

Re: [Scikit-learn-general] GSOC 2012

2012-03-07 Thread Olivier Grisel
2012/3/6 Mathieu Blondel : > Even if they would be useful, I'd rather avoid projects like > "maintenance" or "speed things up". I think projects with a > well-identified goal are more likely to be accepted by the PSF. > > I like Olivier's proposals for SGD-based low-rank and non-negative > matrix f

Re: [Scikit-learn-general] GSOC 2012

2012-03-07 Thread Olivier Grisel
2012/3/6 Alexandre Gramfort : >> a) sparse coding is about 2 orders of magnitude slower than competing >>   implementations right now, making it kind of useless except in toy >>   1996-sized situations (I'm supposed to find a way to benchmark >>   this for Alex, but I can tell you that the situatio

Re: [Scikit-learn-general] GSOC 2012

2012-03-06 Thread Alexandre Gramfort
> a) sparse coding is about 2 orders of magnitude slower than competing >   implementations right now, making it kind of useless except in toy >   1996-sized situations (I'm supposed to find a way to benchmark >   this for Alex, but I can tell you that the situation is fairly bad >   currently, com

Re: [Scikit-learn-general] GSOC 2012

2012-03-06 Thread Mathieu Blondel
Even if they would be useful, I'd rather avoid projects like "maintenance" or "speed things up". I think projects with a well-identified goal are more likely to be accepted by the PSF. I like Olivier's proposals for SGD-based low-rank and non-negative matrix factorization but I'd rather merge them

Re: [Scikit-learn-general] GSOC 2012

2012-03-06 Thread David Warde-Farley
On Tue, Mar 06, 2012 at 10:05:19PM +0100, Gael Varoquaux wrote: > Hi Vlad and Andy, > > First of all, Vlad, thanks for offering to take up the less sexy work. I > don't think that maintenance can qualify as a GSOC project. A big > refactor could, but I don't think that we need one right now. What

Re: [Scikit-learn-general] GSOC 2012

2012-03-06 Thread Gael Varoquaux
Hi Vlad and Andy, First of all, Vlad, thanks for offering to take up the less sexy work. I don't think that maintenance can qualify as a GSOC project. A big refactor could, but I don't think that we need one right now. With regards to reaching 1.0 release, one thing that I would really like to ac

Re: [Scikit-learn-general] GSOC 2012

2012-03-06 Thread Andreas
Hi Vlad. I would gladly mentor a maintenance-oriented project. I feel Gael could probably do this better, as he has more experience, but I guess he doesn't have too much time on his hands. There is still a lot to do and we should come up with a list of issues to address, preferably by consensus ;)

Re: [Scikit-learn-general] GSOC 2012

2012-03-06 Thread Vlad Niculae
Hello everybody, The two project ideas with Olivier's name are very interesting to me. Notably the NMF project could also include an overhaul of the current implementation, a Cython non-negative least squares solver, addition of KL divergence loss, and as much love as possible for sparse matrices.

[Scikit-learn-general] GSOC 2012

2012-03-06 Thread Gael Varoquaux
Hi team, First of all, I need to apologize for not having been responsive in the discussions concerning the 2012 edition of the GSOC. I am a bit overcommitted, and the discussions were going well, so I didn't feel that I could add something terribly useful to them. The ground-to-earth organization

Re: [Scikit-learn-general] GSoC 2012

2012-01-19 Thread Vincent Michel
Hi list, I'm more than +1 for online learning, it could be a killer feature of the scikit! I also like the first suggestion of Andreas, about Multinomial Logistic regression. I think there is interesting work to do in the junction with Bayesian statistics and priors. Vincent 2012/1/19 Alex

Re: [Scikit-learn-general] GSoC 2012

2012-01-19 Thread Alexandre Gramfort
I've created the wiki page to organize what was suggested and so people can volunteer for mentoring. https://github.com/scikit-learn/scikit-learn/wiki/A-list-of-topics-for-a-google-summer-of-code-%28gsoc%29-2012 Alex On Thu, Jan 19, 2012 at 8:38 AM, Peter Prettenhofer wrote: [..] > - S

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Peter Prettenhofer
>>> [..] - Structured SVM / CRF learning      This is a big one. Not sure what other people think of it.      I think having a structured SVM would be great. >> >>> +100 on this one... >>> >> For this, do we need to have our own SVM solver? This is a naive >> question, I have ne

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Gael Varoquaux
On Thu, Jan 19, 2012 at 12:08:41AM +0100, Andreas wrote: > I have no experience with GSoC and I will totally bow > to your wisdom there. My thinking was that single > algorithms are more "project-like" than doing polishing here and > there. Yes. My point was that I'd like to see projects that help u

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Mathieu Blondel
On Thu, Jan 19, 2012 at 7:44 AM, Gael Varoquaux wrote: > On Wed, Jan 18, 2012 at 11:37:15PM +0100, Andreas wrote: >> Having this feature might get us a LOT of attention. >> But this is really not a simple project. > > Before trying to jump to the super fancy features, I'd rather have a > polished

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Andreas
On 01/18/2012 11:44 PM, Gael Varoquaux wrote: > On Wed, Jan 18, 2012 at 11:37:15PM +0100, Andreas wrote: > >> Having this feature might get us a LOT of attention. >> But this is really not a simple project. >> > Before trying to jump to the super fancy features, I'd rather have a > polish

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Vlad Niculae
On Jan 19, 2012, at 00:23 , Gael Varoquaux wrote: > On Wed, Jan 18, 2012 at 07:37:12PM +0900, Mathieu Blondel wrote: >> It would be nice if you could make a few contributions to scikit-learn >> before the application process starts. This will allow you to >> familiarize with the code base, us to

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Gael Varoquaux
On Wed, Jan 18, 2012 at 11:37:15PM +0100, Andreas wrote: > Having this feature might get us a LOT of attention. > But this is really not a simple project. Before trying to jump to the super fancy features, I'd rather have a polished and versatile version of the scikit. There are many things that I

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Andreas
On 01/18/2012 11:26 PM, Gael Varoquaux wrote: > On Wed, Jan 18, 2012 at 11:28:52AM +0100, Lars Buitinck wrote: > >> 2012/1/18 Andreas: >> >>> - Structured SVM / CRF learning >>> This is a big one. Not sure what other people think of it. >>> I think having a structured SVM would

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Gael Varoquaux
On Wed, Jan 18, 2012 at 11:28:52AM +0100, Lars Buitinck wrote: > 2012/1/18 Andreas : > > - Structured SVM / CRF learning > >     This is a big one. Not sure what other people think of it. > >     I think having a structured SVM would be great. > +100 on this one... For this, do we need to have ou

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Gael Varoquaux
On Wed, Jan 18, 2012 at 07:37:12PM +0900, Mathieu Blondel wrote: > It would be nice if you could make a few contributions to scikit-learn > before the application process starts. This will allow you to > familiarize with the code base, us to evaluate your potential and, if > I remember correctly, t

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Bala Subrahmanyam Varanasi
Hi Jaidev, Well, the two of us do have a busy summer coming up, but a word of > caution - Google hasn't decided yet whether they will hold GSoC this > year. Please join the GSoC mailing list too. > Hm... Let us hope for the best. > > We'll talk more tonight if you are free... > Sure. See you

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Jaidev Deshpande
Hi Bala, Well, the two of us do have a busy summer coming up, but a word of caution - Google hasn't decided yet whether they will hold GSoC this year. Please join the GSoC mailing list too. We'll talk more tonight if you are free... Cheers

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Bala Subrahmanyam Varanasi
Hi :) You might start on this one: > https://github.com/scikit-learn/scikit-learn/issues/559 > It should be fairly easy to do. > Okay... Sure! I'll try to do this.

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Andreas
You might start on this one: https://github.com/scikit-learn/scikit-learn/issues/559 It should be fairly easy to do.

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Mathieu Blondel
On Wed, Jan 18, 2012 at 7:47 PM, Bala Subrahmanyam Varanasi wrote: > Up to now, I pulled two commits regarding the documentation. I hope I could > do more in the coming days. Here are my commits. > > https://github.com/Balu-Varanasi/scikit-learn/commit/36d0adb8c14b8105b9ba690073d0501955bce328 > >

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Bala Subrahmanyam Varanasi
Dear Mathieu, It would be nice if you could make a few contributions to scikit-learn > before the application process starts. This will allow you to > familiarize with the code base, us to evaluate your potential and, if > I remember correctly, this is actually a requirement from the PSF. > I wou

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Mathieu Blondel
On Wed, Jan 18, 2012 at 3:12 PM, Bala Subrahmanyam Varanasi wrote: > Also... I'm attending Stanford's online courses - ML class and NLP class. > I believe this is the right time to discuss. Because, I can learn new things > before the start of GSoC and can work on challenging implementations i

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Lars Buitinck
2012/1/18 Andreas : > - Structured SVM / CRF learning >     This is a big one. Not sure what other people think of it. >     I think having a structured SVM would be great. +100 on this one... >     Designing the interface is also non-trivial. Indeed. I suspect different APIs would be needed for

Re: [Scikit-learn-general] GSoC 2012

2012-01-18 Thread Andreas
Hi Bala. I'm not sure how this usually goes but here is my current wish list. We'd have to discuss whether any of that actually fits into the scikits, though ;) - Multilayer Perceptron and Multinomial Logistic regression I have been working on that so maybe there is not enough left to do

[Scikit-learn-general] GSoC 2012

2012-01-17 Thread Bala Subrahmanyam Varanasi
Dear all, I would like to participate in Google Summer of Code this year. Please let me know the ideas which you would like to implement in scikit-learn in GSoC - 2012. Also... I'm attending Stanford's online courses - ML class and NLP class. I believe this is the right time to discuss. Becaus