Hello All, I have created a proposal for d) Perftest : automated performance tests of algorithms (I am most comfortable with bash scripting and Python)
https://docs.google.com/document/d/1DKWZTWvrvs73GYa1q3XEN5GFo8ALGjLH2DrIfRsJksA/edit?usp=sharing Please share your feedback on the proposal. If someone from the community could mentor, it would be great. Regards, Krishna On Mon, Mar 27, 2017 at 6:07 PM, Krishna Kalyan <krishnakaly...@gmail.com> wrote: > Thanks Nakul, > Replied to the JIRA thread. > > Cheers, > Krishna > > On Mon, Mar 27, 2017 at 2:51 PM, Nakul Jindal <naku...@gmail.com> wrote: > >> Hi Krishna, >> >> We have 2 proposals up : >> https://issues.apache.org/jira/issues/?filter=12339687&jql= >> project%20%3D%20SYSTEMML%20AND%20labels%20%3D%20gsoc2017% >> 20ORDER%20BY%20created%20DESC >> >> Would you be interested in any of these? >> If you are specifically interested in the Python DSL project, we can look >> for more volunteers or I could just volunteer to mentor it. >> >> -Nakul >> >> >> >> On Fri, Mar 24, 2017 at 12:05 PM, Nakul Jindal <naku...@gmail.com> wrote: >> >>> Hi Krishna, >>> >>> We are working on putting together some proposals. I created is for a >>> GPU based project. >>> https://issues.apache.org/jira/browse/SYSTEMML-1436 >>> Be on the lookout for more. >>> >>> Thanks, >>> Nakul >>> >>> >>> >>> >>> On Tue, Mar 21, 2017 at 10:01 AM, Krishna Kalyan < >>> krishnakaly...@gmail.com> wrote: >>> >>>> Hello Adina and Arvind thanks you for your reply, >>>> I am open to writing a proposal with a mentor and would appreciate if we >>>> could take action quickly on this. >>>> >>>> Best Regards, >>>> Krishna >>>> >>>> On Sun, Mar 19, 2017 at 6:02 PM, Adina Crainiceanu <ad...@usna.edu> >>>> wrote: >>>> >>>> > Apache Software Foundation applied and was accepted for GSOC. I >>>> believe >>>> > SystemML could still participate as part of ASF if interested (record >>>> your >>>> > ideas in JIRA and put gsoc2017 as label). See messages on this >>>> subject on >>>> > the community.apache.org mailing list from Ulrich Stark. >>>> > The following page also has useful info, even if it is not updated >>>> for this >>>> > year: http://community.apache.org/gsoc.html - mentors need to >>>> register >>>> > very >>>> > soon. >>>> > >>>> > Best regards, >>>> > Adina >>>> > >>>> > >>>> > On Sun, Mar 19, 2017 at 3:51 PM, Arvind Surve <ac...@yahoo.com.invalid >>>> > >>>> > wrote: >>>> > >>>> > > Thanks Krishna for your interest. >>>> > > Unfortunately we could not submit topic to GSoc on time.However >>>> please >>>> > > feel free to leverage SystemML for your use cases and do possible >>>> > > contribution to SystemML. >>>> > > Please let us know if you have any question. >>>> > > >>>> > > Arvind Surve | Spark Technology Center | http://www.spark.tc/ >>>> > > >>>> > > From: Krishna Kalyan <krishnakaly...@gmail.com> >>>> > > To: dev@systemml.incubator.apache.org >>>> > > Sent: Saturday, March 18, 2017 8:18 AM >>>> > > Subject: Re: GSoc 2017 >>>> > > >>>> > > Hello All, >>>> > > A Gentle ping. Student applications open in a couple of days. I >>>> like to >>>> > > work on 'Support for Python DSLs'. >>>> > > However for now I am not sure on how to proceed. >>>> > > >>>> > > Thank you, >>>> > > Krishna >>>> > > >>>> > > On Thu, Jan 12, 2017 at 6:08 PM, <dusenberr...@gmail.com> wrote: >>>> > > >>>> > > > Yeah helping to build out our Python DSL into a full-out >>>> replacement >>>> > for >>>> > > > the current "DML" language would be great, and we'd be quite >>>> > supportive! >>>> > > > >>>> > > > -Mike >>>> > > > >>>> > > > -- >>>> > > > >>>> > > > Mike Dusenberry >>>> > > > GitHub: github.com/dusenberrymw >>>> > > > LinkedIn: linkedin.com/in/mikedusenberry >>>> > > > >>>> > > > Sent from my iPhone. >>>> > > > >>>> > > > >>>> > > > > On Jan 12, 2017, at 2:58 PM, fschue...@posteo.de wrote: >>>> > > > > >>>> > > > > Hi Krishna, >>>> > > > > >>>> > > > > cool to see that you're interested in SystemML! >>>> > > > > >>>> > > > > From your list I personally think that a) and d) would be well >>>> suited >>>> > > > for projects, especially a good python DSL is a high priority. >>>> > > > > >>>> > > > > We will apply as an organization to GSoC once organization >>>> > applications >>>> > > > are open (Jan. 19th) and I think we will find mentors for at >>>> least a) >>>> > and >>>> > > > d). If you already want to take a look at what is currently >>>> there, I >>>> > > > suggest to look at our python APIs and documentation. If you want >>>> to >>>> > take >>>> > > > on the DSL project it might also be a good idea to look into the >>>> DML >>>> > > > documentation and related papers to see what we need to support. >>>> > > > > >>>> > > > > The proposals will probably circulate on the mailinglist, too, >>>> so >>>> > keep >>>> > > > an eye on that :) >>>> > > > > >>>> > > > > -Felix >>>> > > > > >>>> > > > > Am 12.01.2017 23:13 schrieb Krishna Kalyan: >>>> > > > >> Hello All, >>>> > > > >> Thank you for your wonderful replies. >>>> > > > >> Tasks that I am interested in: >>>> > > > >> a) Support for Python DSLs >>>> > > > >> b) Python wrappers for all existing algorithms >>>> > > > >> c) GPU support >>>> > > > >> d) Perftest : automated performance tests of algorithms >>>> > > > >> I am also willing to work on the tasks that SystemML community >>>> think >>>> > > are >>>> > > > >> important. >>>> > > > >> Regards, >>>> > > > >> Krishna >>>> > > > >> On Fri, Jan 6, 2017 at 10:14 PM, Mike Dusenberry < >>>> > > > dusenberr...@gmail.com> >>>> > > > >> wrote: >>>> > > > >>> Hi Krishna! Welcome, and thanks for your interest! >>>> > > > >>> We would definitely be excited to collaborate with you on a >>>> GSOC >>>> > > > project. >>>> > > > >>> We've started another thread to discuss possible new >>>> proposals, and >>>> > > we >>>> > > > >>> would also be quite interested in any particular proposal >>>> that you >>>> > > > might >>>> > > > >>> like to generate tailored towards your interests. Copied >>>> from the >>>> > > > other >>>> > > > >>> thread, some possible ideas could include: building out a >>>> full ML >>>> > > demo >>>> > > > to >>>> > > > >>> solve a real, large-scale problem that would benefit from a >>>> > > distributed >>>> > > > >>> approach; overall performance improvements that address a full >>>> > class, >>>> > > > or >>>> > > > >>> wider area, of ML algorithms, rather than a single, specific >>>> > script; >>>> > > > >>> infrastructure for [performance] testing, and identification >>>> of >>>> > wide >>>> > > > areas >>>> > > > >>> of improvement; helping with building out fully-featured, >>>> clean, >>>> > > > >>> well-tested DSLs in Python & Scala (we've started, but it >>>> would be >>>> > > > good to >>>> > > > >>> continue stressing them -- we could even aim to replace DML >>>> with >>>> > the >>>> > > > DSLs); >>>> > > > >>> etc. Overall, we want to improve the ability of the user to >>>> work >>>> > on >>>> > > a >>>> > > > wide >>>> > > > >>> range of large-scale, distributed ML problems in a simple and >>>> easy >>>> > > > manner >>>> > > > >>> on top of Spark. >>>> > > > >>> In the meantime, you could explore our recent open issues [1] >>>> and >>>> > > even >>>> > > > >>> begin discussions or contributions on any of the items. You >>>> could >>>> > > also >>>> > > > >>> view our recent roadmap discussion thread on the mailing list, >>>> > > starting >>>> > > > >>> with the first email [2]: >>>> > > > >>> [1]: >>>> > > > >>> https://issues.apache.org/jira/issues/?jql=project%20%3D% >>>> > > > 20SYSTEMML%20AND% >>>> > > > >>> 20resolution%20%3D%20Unresolved%20ORDER%20BY%20updated%20DES >>>> C%2C% >>>> > > > >>> 20priority%20DESC >>>> > > > >>> [2]: >>>> > > > >>> http://mail-archives.apache.org/mod_mbox/incubator- >>>> > > > >>> systemml-dev/201701.mbox/%3C9eb780f0-ff28-c702-117c- >>>> > > > >>> bad740599...@gmail.com%3E >>>> > > > >>> - Mike >>>> > > > >>> -- >>>> > > > >>> Michael W. Dusenberry >>>> > > > >>> GitHub: github.com/dusenberrymw >>>> > > > >>> LinkedIn: linkedin.com/in/mikedusenberry >>>> > > > >>> On Fri, Jan 6, 2017 at 12:34 PM, Luciano Resende < >>>> > > luckbr1...@gmail.com >>>> > > > > >>>> > > > >>> wrote: >>>> > > > >>> > As some folks have described on this thread, it would be >>>> great to >>>> > > > get you >>>> > > > >>> > familiarized with SystemML. >>>> > > > >>> > >>>> > > > >>> > In parallel, I would look for a mentor from the active >>>> committer >>>> > > > list and >>>> > > > >>> > start working on a project proposal which could be based on >>>> the >>>> > > > recent >>>> > > > >>> > Roadmap discussion [1]. >>>> > > > >>> > >>>> > > > >>> > If you are looking for some guidance on how Apache >>>> participate on >>>> > > > GSOC, >>>> > > > >>> > take a look at the following resources [2] and [3], and >>>> don't >>>> > > > hesitate to >>>> > > > >>> > ask questions here. >>>> > > > >>> > >>>> > > > >>> > >>>> > > > >>> > [1] >>>> > > > >>> > https://www.mail-archive.com/d >>>> ev@systemml.incubator.apache.o >>>> > > > >>> > rg/msg01199.html >>>> > > > >>> > [2] http://community.apache.org/gsoc.html >>>> > > > >>> > [3] >>>> > > > >>> > http://www.slideshare.net/luck >>>> br1975/how-mentoring-can-help- >>>> > > > >>> > you-start-contributing-to-open-source >>>> > > > >>> > >>>> > > > >>> > On Thu, Jan 5, 2017 at 3:15 PM, Krishna Kalyan < >>>> > > > krishnakaly...@gmail.com >>>> > > > >>> > >>>> > > > >>> > wrote: >>>> > > > >>> > >>>> > > > >>> > > Hello Developers, >>>> > > > >>> > > I am Krishna, currently a 2nd year Masters student in >>>> (MSc. in >>>> > > Data >>>> > > > >>> > Mining) >>>> > > > >>> > > currently in Barcelona studying at Université >>>> Polytechnique de >>>> > > > >>> Catalogne. >>>> > > > >>> > > I was interested in contributing to SystemML this year >>>> under >>>> > GSoc >>>> > > > >>> > program. >>>> > > > >>> > > Could anyone please guide on how to go about it?. (I >>>> understand >>>> > > > the I >>>> > > > >>> > need >>>> > > > >>> > > to write a proposal) >>>> > > > >>> > > >>>> > > > >>> > > Related Experience: >>>> > > > >>> > > My masters is mostly focussed on data mining techniques. >>>> Before >>>> > > my >>>> > > > >>> > masters, >>>> > > > >>> > > I was a data engineer with IBM (India). I was >>>> responsible for >>>> > > > managing >>>> > > > >>> > 50 >>>> > > > >>> > > node Hadoop Cluster for more than a year. Most of my time >>>> was >>>> > > spent >>>> > > > >>> > > optimising and writing ETL (Apache Pig) jobs. >>>> > > > >>> > > >>>> > > > >>> > > I am the most comfortable with Python followed by R and >>>> Scala. >>>> > > > >>> > > >>>> > > > >>> > > My Webpage >>>> > > > >>> > > kkalyan.in >>>> > > > >>> > > >>>> > > > >>> > > My Spark Pull Requests >>>> > > > >>> > > https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q= >>>> > > > >>> is%3Apr%20author% >>>> > > > >>> > > 3Akrishnakalyan3%20 >>>> > > > >>> > > >>>> > > > >>> > > Thank you so much, >>>> > > > >>> > > Krishna >>>> > > > >>> > > >>>> > > > >>> > >>>> > > > >>> > >>>> > > > >>> > >>>> > > > >>> > -- >>>> > > > >>> > Luciano Resende >>>> > > > >>> > http://twitter.com/lresende1975 >>>> > > > >>> > http://lresende.blogspot.com/ >>>> > > > >>> > >>>> > > > >>>> > > >>>> > > >>>> > > >>>> > >>>> > >>>> > >>>> > -- >>>> > Dr. Adina Crainiceanu >>>> > Associate Professor, Computer Science Department >>>> > United States Naval Academy >>>> > 410-293-6822 >>>> > ad...@usna.edu >>>> > http://www.usna.edu/Users/cs/adina/ >>>> > >>>> >>> >>> >> >