[scikit-learn] GSoC 2017 Proposal: Improve online learning for linear models

2017-04-03 Thread Yizheng Zhao
Hi developers, It is excited that I have opportunity work with you! I am Yizheng Zhao, a graduate student at Carnegie Mellon University majoring in Software Engineering and I’ve got my Bachelor’s degree in Math in 2016 at Jilin University. I love python and machine learning and that why I wann

Re: [scikit-learn] GSoC 2017

2017-04-02 Thread Jacob Schreiber
Make sure that you tag your proposal with 'scikit-learn' when you submit it so that we can identify them easily. On Sun, Apr 2, 2017 at 10:47 PM, Jacob Schreiber wrote: > Less than 11 hours left in the application period! If you've asked for > feedback and we haven't gotten back to you, make sur

Re: [scikit-learn] GSoC 2017

2017-04-02 Thread Jacob Schreiber
Less than 11 hours left in the application period! If you've asked for feedback and we haven't gotten back to you, make sure you submit anyway. If you don't get your submission in before the deadline (April 3rd, 9:00am PST) we won't be able to consider you. On Tue, Mar 21, 2017 at 3:27 PM, Jacob S

[scikit-learn] GSoC 2017 Proposal: Improve online learning for linear models

2017-03-30 Thread Yizheng Zhao
Hi developers, It is excited that I have opportunity work with you! I am Yizheng Zhao, a graduate student at Carnegie Mellon University majoring in Software Engineering and I’ve got my Bachelor’s degree in Math in 2016 at Jilin University. I love python and machine learning and that why I wann

Re: [scikit-learn] GSoC 2017 : "Parallel Decision Tree Building"

2017-03-26 Thread Aman Pratik
I will be occupied with my tests for a couple of days, will get back with the changes as soon as possible. In the Gaussian Process parallelization there was an error while using the multiprocessing backend, which couldn't be solved by simple changes in the code. Hence we had to drop the idea for t

Re: [scikit-learn] GSoC 2017 : "Parallel Decision Tree Building"

2017-03-26 Thread Jacob Schreiber
Hi Aman Thanks for the updates, it looks more complete now. I don't see what the benefit is of considering three different parallelism techniques. I'm not sure how you would do sample parallelism given that you need to sort all of the samples-- maybe a merge sort? That doesn't seem the most effic

Re: [scikit-learn] GSoC 2017 : "Parallel Decision Tree Building"

2017-03-26 Thread Aman Pratik
Hello Jacob, This is my second draft for the proposal, Proposal : Second Draft It is incomplete in some places, related to detailing etc. I will need little more time for that. Meanwhile, I await your feedb

Re: [scikit-learn] GSoC 2017 : "Parallel Decision Tree Building"

2017-03-22 Thread Jacob Schreiber
Hi Aman Likely the easiest way to parallelize decision tree building is to parallelize the finding of the best split at each node, as it checks every non-constant feature for the best split. Several other approaches focus on how to parallelize tree building in the streaming or distributed cases, w

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-22 Thread SHUBHAM BHARDWAJ 15BCE0704
Hello Sir, Added benchmarks, kindly let me know further improvements and that whether if its a good idea to consider the next parts listed in the to-do list of my pr for proposal.Thanks. pr: https://github.com/scikit-learn/scikit-learn/pull/8585 Regards Shubham Bhardwaj On Wed, Mar 15, 2017 at 10

[scikit-learn] GSoC 2017 : "Parallel Decision Tree Building"

2017-03-22 Thread Aman Pratik
Hello Developers, This is Aman Pratik. I am currently pursuing my B.Tech from Indian Institute of Technology, Varanasi. After doing some research I have found some material on Decision Trees and Parallelization. Hence, I propose my first draft for the project "Parallel Decision Tree Building" for

[scikit-learn] GSoC 2017

2017-03-21 Thread Jacob Schreiber
Starting yesterday, students were able to submit their proposals on the GSoC website. Please review this site thoroughly before making a submission. We're eager to hear what prospective students have in mind for a

Re: [scikit-learn] [GSoC 2017] First Draft, request for suggestions - Improve Online Learning of Linear Models.

2017-03-16 Thread Karan Desai
Oh, and I forgot to mention. Some of the easily doable callbacks include: 1. Verbose Logs (maybe progress bars ? Saw an issue earlier) 2. Model Checkpoints 3. Early Stopping 4. Learning Rate annealing As a second alternative, we can boil everything down and simply define lea

Re: [scikit-learn] [GSoC 2017] First Draft, request for suggestions - Improve Online Learning of Linear Models.

2017-03-16 Thread Karan Desai
The problem with callbacks is that for callbacks on each iteration to be feasible, they need to be cython functions.> Otherwise they will be too slow. You could do python callbacks, but they could not be called at every iteration, and so they wouldn't be suitable to implement something like adag

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-15 Thread SHUBHAM BHARDWAJ 15BCE0704
Hello Sir, Greetings. I have coded a sequential version of Scalable Kmeans++ (#8585) and have included a test script for testing it in the pr's description. https://github.com/scikit-learn/scikit-learn/pull/8585. Regards Shubham Bhardwaj On Tue, Mar 14, 2017 at 3:59 AM, Shreyas Saligrama chandra

Re: [scikit-learn] [GSoC 2017] First Draft, request for suggestions - Improve Online Learning of Linear Models.

2017-03-15 Thread Andreas Mueller
On 03/15/2017 04:48 AM, Karan Desai wrote: 4. About a tool to anneal learning rate: I suggest a new approach to look at this - as a callback. I searched through the documentation and I could not find this way of handling tidbits during training of models. We should be able to provide a callba

[scikit-learn] [GSoC 2017] First Draft, request for suggestions - Improve Online Learning of Linear Models.

2017-03-15 Thread Karan Desai
Hello developers, I'm Karan Desai, an Electrical Engineering Undergraduate at IIT Roorkee. I was following the community since October and initially planned to work on Pytest Migration idea. But on meticulous discussions, it was concluded that the migration task might be short for a three month w

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-13 Thread Shreyas Saligrama chandrakan
Hi, Is it possible for me to contribute a library to introduce SVM's with tree kernel (like current available one in svmlight) which is currently missing in scikit-learn? Best, Shreyas On 5 Mar 2017 11:03 a.m., "Andreas Mueller" wrote: > There was a PR here: > https://github.com/scikit-learn/s

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-06 Thread SHUBHAM BHARDWAJ 15BCE0704
Hello Sir, Thanks for the reply, I will try to reproduce the claims of the paper and would update about my progress. Regards Shubham On Mon, Mar 6, 2017 at 12:32 AM, Andreas Mueller wrote: > There was a PR here: > https://github.com/scikit-learn/scikit-learn/pull/5530 > > but it didn't seem to

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-05 Thread Andreas Mueller
There was a PR here: https://github.com/scikit-learn/scikit-learn/pull/5530 but it didn't seem to work. Feel free to convince us otherwise ;) On 03/02/2017 08:23 PM, SHUBHAM BHARDWAJ 15BCE0704 wrote: Hello Sir, Very Sorry for the numbers I saw this written in the comments.I assumed -Given the

[scikit-learn] GSoC 2017

2017-03-03 Thread Aman Pratik
Hello Developers, This is Aman Pratik. I am currently pursuing my B.Tech from Indian Institute of Technology, Varanasi. I am a keen software developer and not very new to the open source community. I am interested in your project "*Improve online learning for linear models*" for GSoC 2017. I have

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-02 Thread SHUBHAM BHARDWAJ 15BCE0704
Hello Sir, Very Sorry for the numbers I saw this written in the comments.I assumed -Given the person who suggested the paper might have taken a look into the number of citations.I will make sure to personally check myself. Regards Shubham Bhardwaj On Fri, Mar 3, 2017 at 6:40 AM, Guillaume Lemaîtr

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-02 Thread Guillaume Lemaître
I think that you mean this paper -> Scalable K-Means++ -> 218 citations On 3 March 2017 at 02:00, SHUBHAM BHARDWAJ 15BCE0704 < shubham.bhardwaj2...@vit.ac.in> wrote: > Hello Sir, > > Thanks a lot for the reply. Sorry for not being elaborate about what I was > trying to address. I wanted to implem

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-02 Thread SHUBHAM BHARDWAJ 15BCE0704
Hello Sir, Thanks a lot for the reply. Sorry for not being elaborate about what I was trying to address. I wanted to implement this [ http://ilpubs.stanford.edu:8090/778/1/2006-13.pdf] (1200+citations)- mentioned in comments. This pertains to the stalled issue #4357 .Proposal idea - implementing a

Re: [scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-02 Thread Jacob Schreiber
Hi Shubham Thanks for your interest. I'm eager to see your contributions to sklearn in the future. However, I'm pretty sure kmeans++ is already implemented: http://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html Jacob On Thu, Mar 2, 2017 at 1:07 AM, SHUBHAM BHARDWAJ 15BCE07

[scikit-learn] GSoc, 2017 (proposal idea and intro) .reg

2017-03-02 Thread SHUBHAM BHARDWAJ 15BCE0704
Hello Sir, My introduction : I am a 2nd year student studying Computer Science and engineering from VIT, Vellore. I work in Google Developers Group VIT. All my experience has been about collaborating with a lot of people ,working as a team, building products and learning along the way. Since sciki

Re: [scikit-learn] GSoC, 2017 - Parallel Decision Tree Building

2017-02-28 Thread Aman Dalmia
Hello Sir, Thank you for your response. You made it very clear for me as to what needs to be done. I'll have a careful look at the code for the tree module and would try to start implementing some part of the functionality desired. I'll get back to you if I get stuck and post the link of my propos

Re: [scikit-learn] GSoC, 2017 - Parallel Decision Tree Building

2017-02-28 Thread Jacob Schreiber
Hi Aman I responded to your other email, but I'm not sure if it actually went through. Thanks for your interest in the project, and your current PRs. If you're looking to apply, you should write a gist which follows the format that nelson-liu used here: https://github.com/scikit-learn/scikit-lear

[scikit-learn] GSoC, 2017 - Parallel Decision Tree Building

2017-02-28 Thread Aman Dalmia
Hello everyone, I am a pre-final year student studying Electronics & Communication Engineering at IIT Guwahati. I am a member of Prof. Amit Sethi 's research group where I work on cancer recurrence prediction using deep learning and have also start

Re: [scikit-learn] GSoC 2017

2017-02-27 Thread Pradeep Thalasta
Thanks Joel, will start with the contribution soon. On 27 Feb 2017 2:35 pm, "Joel Nothman" wrote: Hi Pradeep, we would usually only accept candidates who have shown their proficiency and understanding of our package and processes by making some contributions prior to this stage. you are certainl

Re: [scikit-learn] GSoC 2017

2017-02-27 Thread Joel Nothman
Hi Pradeep, we would usually only accept candidates who have shown their proficiency and understanding of our package and processes by making some contributions prior to this stage. you are certainly welcome to aim for GSoC 2018 by beginning to develop your familiarity and rapport now. cheers, Joel

Re: [scikit-learn] GSoC 2017

2017-02-27 Thread Pradeep Thalasta
Hi, I'm new to open source contribution. Can i take part in GSoc as well? On Mon, Feb 27, 2017 at 11:29 AM, Raghav R V wrote: > Or simply a public gist and importantly the link mailed here would do I > think... > > On 27 Feb 2017 8:28 p.m., "Raghav R V" wrote: > >> They can still edit a wiki p

Re: [scikit-learn] GSoC 2017

2017-02-27 Thread Raghav R V
Or simply a public gist and importantly the link mailed here would do I think... On 27 Feb 2017 8:28 p.m., "Raghav R V" wrote: > They can still edit a wiki page from their fork of scikit learn I think. > So I'd suggest doing that and mailing to this thread, the link to their > proposal... > > On

Re: [scikit-learn] GSoC 2017

2017-02-27 Thread Raghav R V
They can still edit a wiki page from their fork of scikit learn I think. So I'd suggest doing that and mailing to this thread, the link to their proposal... On 27 Feb 2017 6:55 p.m., "Nelson Liu" wrote: > In past years students made a page on the wiki with their proposal; this > isn't possible a

Re: [scikit-learn] GSoC 2017

2017-02-27 Thread Nelson Liu
In past years students made a page on the wiki with their proposal; this isn't possible anymore due to GitHub permissions. Perhaps an alternative method for getting feedback should be suggested on the introduction page? Nelson Liu On Mon, Feb 27, 2017 at 2:58 AM, Gael Varoquaux < gael.varoqu...@n

[scikit-learn] GSoC 2017

2017-02-27 Thread Gael Varoquaux
Hi, Students have been inquiring about the GSoC (Google Summer of Code) with scikit-learn, and the core team has been quite silent about team. I am happy to announce that we will be taking part in the scikit-learn again. The reason that we decided to do this is to give a chance to the young, tale

[scikit-learn] GSOC 2017: NumFOCUS will be an umbrella organization

2017-01-16 Thread Max Linke
Hi Organizations can start submitting applications for Google Summer of Code 2017 on January 19 (and the deadline is February 9) https://developers.google.com/open-source/gsoc/timeline?hl=en NumFOCUS will be applying again this year. If you want to work with us please let me know and if you app