Yeah helping to build out our Python DSL into a full-out replacement for the current "DML" language would be great, and we'd be quite supportive!
-Mike -- Mike Dusenberry GitHub: github.com/dusenberrymw LinkedIn: linkedin.com/in/mikedusenberry Sent from my iPhone. > On Jan 12, 2017, at 2:58 PM, fschue...@posteo.de wrote: > > Hi Krishna, > > cool to see that you're interested in SystemML! > > From your list I personally think that a) and d) would be well suited for > projects, especially a good python DSL is a high priority. > > We will apply as an organization to GSoC once organization applications are > open (Jan. 19th) and I think we will find mentors for at least a) and d). If > you already want to take a look at what is currently there, I suggest to look > at our python APIs and documentation. If you want to take on the DSL project > it might also be a good idea to look into the DML documentation and related > papers to see what we need to support. > > The proposals will probably circulate on the mailinglist, too, so keep an eye > on that :) > > -Felix > > Am 12.01.2017 23:13 schrieb Krishna Kalyan: >> Hello All, >> Thank you for your wonderful replies. >> Tasks that I am interested in: >> a) Support for Python DSLs >> b) Python wrappers for all existing algorithms >> c) GPU support >> d) Perftest : automated performance tests of algorithms >> I am also willing to work on the tasks that SystemML community think are >> important. >> Regards, >> Krishna >> On Fri, Jan 6, 2017 at 10:14 PM, Mike Dusenberry <dusenberr...@gmail.com> >> wrote: >>> Hi Krishna! Welcome, and thanks for your interest! >>> We would definitely be excited to collaborate with you on a GSOC project. >>> We've started another thread to discuss possible new proposals, and we >>> would also be quite interested in any particular proposal that you might >>> like to generate tailored towards your interests. Copied from the other >>> thread, some possible ideas could include: building out a full ML demo to >>> solve a real, large-scale problem that would benefit from a distributed >>> approach; overall performance improvements that address a full class, or >>> wider area, of ML algorithms, rather than a single, specific script; >>> infrastructure for [performance] testing, and identification of wide areas >>> of improvement; helping with building out fully-featured, clean, >>> well-tested DSLs in Python & Scala (we've started, but it would be good to >>> continue stressing them -- we could even aim to replace DML with the DSLs); >>> etc. Overall, we want to improve the ability of the user to work on a wide >>> range of large-scale, distributed ML problems in a simple and easy manner >>> on top of Spark. >>> In the meantime, you could explore our recent open issues [1] and even >>> begin discussions or contributions on any of the items. You could also >>> view our recent roadmap discussion thread on the mailing list, starting >>> with the first email [2]: >>> [1]: >>> https://issues.apache.org/jira/issues/?jql=project%20%3D%20SYSTEMML%20AND% >>> 20resolution%20%3D%20Unresolved%20ORDER%20BY%20updated%20DESC%2C% >>> 20priority%20DESC >>> [2]: >>> http://mail-archives.apache.org/mod_mbox/incubator- >>> systemml-dev/201701.mbox/%3C9eb780f0-ff28-c702-117c- >>> bad740599...@gmail.com%3E >>> - Mike >>> -- >>> Michael W. Dusenberry >>> GitHub: github.com/dusenberrymw >>> LinkedIn: linkedin.com/in/mikedusenberry >>> On Fri, Jan 6, 2017 at 12:34 PM, Luciano Resende <luckbr1...@gmail.com> >>> wrote: >>> > As some folks have described on this thread, it would be great to get you >>> > familiarized with SystemML. >>> > >>> > In parallel, I would look for a mentor from the active committer list and >>> > start working on a project proposal which could be based on the recent >>> > Roadmap discussion [1]. >>> > >>> > If you are looking for some guidance on how Apache participate on GSOC, >>> > take a look at the following resources [2] and [3], and don't hesitate to >>> > ask questions here. >>> > >>> > >>> > [1] >>> > https://www.mail-archive.com/dev@systemml.incubator.apache.o >>> > rg/msg01199.html >>> > [2] http://community.apache.org/gsoc.html >>> > [3] >>> > http://www.slideshare.net/luckbr1975/how-mentoring-can-help- >>> > you-start-contributing-to-open-source >>> > >>> > On Thu, Jan 5, 2017 at 3:15 PM, Krishna Kalyan <krishnakaly...@gmail.com >>> > >>> > wrote: >>> > >>> > > Hello Developers, >>> > > I am Krishna, currently a 2nd year Masters student in (MSc. in Data >>> > Mining) >>> > > currently in Barcelona studying at Université Polytechnique de >>> Catalogne. >>> > > I was interested in contributing to SystemML this year under GSoc >>> > program. >>> > > Could anyone please guide on how to go about it?. (I understand the I >>> > need >>> > > to write a proposal) >>> > > >>> > > Related Experience: >>> > > My masters is mostly focussed on data mining techniques. Before my >>> > masters, >>> > > I was a data engineer with IBM (India). I was responsible for managing >>> > 50 >>> > > node Hadoop Cluster for more than a year. Most of my time was spent >>> > > optimising and writing ETL (Apache Pig) jobs. >>> > > >>> > > I am the most comfortable with Python followed by R and Scala. >>> > > >>> > > My Webpage >>> > > kkalyan.in >>> > > >>> > > My Spark Pull Requests >>> > > https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q= >>> is%3Apr%20author% >>> > > 3Akrishnakalyan3%20 >>> > > >>> > > Thank you so much, >>> > > Krishna >>> > > >>> > >>> > >>> > >>> > -- >>> > Luciano Resende >>> > http://twitter.com/lresende1975 >>> > http://lresende.blogspot.com/ >>> >