Thanks for the links.
On Sun, Apr 6, 2014 at 1:26 AM, Sri <srisat...@0xdata.com> wrote: > Indeed, > Integration work/prototype in-progress is here (it's not ready for review > but it's not being done behind closed doors!) > > https://github.com/tdunning/h2o-matrix > > 100% Open Source Apache v2 > H2O is here - we have developed H2O on a live master past two years :) > > https://github.com/0xdata/h2o > > Thanks & let's let temperatures chill - > > I strongly believe that this is going to work out for the Mahout community > longer term - > > Sri > > On Apr 5, 2014, at 22:48, Ted Dunning <ted.dunn...@gmail.com> wrote: > > > > > The prototype is open to all from my personal GitHub. Indeed the > entirety of h2o is also open source. To assert otherwise is simply to > propagate misinformation. > > > > Sent from my iPhone > > > >> On Apr 6, 2014, at 5:35, Andrew Musselman <andrew.mussel...@gmail.com> > wrote: > >> > >> I agree with this sentiment, that one big drop could be more than > people could/would devote time to, and that small proposals/prototypes > would be more digestible. > >> > >> Also would be easier to steer course as we go. > >> > >>> On Apr 5, 2014, at 8:30 PM, Dmitriy Lyubimov <dlie...@gmail.com> > wrote: > >>> > >>> PS. I personally don't think there would be significant hiccups with > the > >>> review process. There's a very good chance things are either > resolvable or > >>> insignificant enough to be foregone due to "power of do" Apache > principle. > >>> However, please keep in mind the costs of commiters' time -- the best > way > >>> is to do things in smaller steps. We also need some time to collect > some > >>> input from users of Mahout APIs, not just internally in the project -- > if > >>> there's any change to such apis. > >>> > >>> -d > >>> > >>> > >>>> On Sat, Apr 5, 2014 at 8:04 PM, Dmitriy Lyubimov <dlie...@gmail.com> > wrote: > >>>> > >>>> > >>>> > >>>> > >>>>> On Fri, Apr 4, 2014 at 2:13 PM, Ted Dunning <ted.dunn...@gmail.com> > wrote: > >>>>> > >>>>> To add to Sri's comments: > >>>> > >>>>> This code is intended for contribution if the > >>>>> > >>>>> objections of one committer are over-come by the concrete results of > the > >>>>> prototype. > >>>> > >>>> I would like to comment that there are no concerns against making this > >>>> contribution -- not at this point anyway. > >>>> > >>>> There is a technicality concern based solely on vague and very > >>>> non-specific communication of intended contributor. However, since > >>>> prototype is not made available to Mahout community, there's no way to > >>>> either confirm, refute or resolve this -- or any other -- concern at > this > >>>> point. > >>>> > >>>> No physical & tangible contribution -- no concerns. Can't be. > >>>> > >>>> There are of course plenty of cases when closed project becomes open, > but > >>>> usually this either goes through Apache incubation process, or > there's a > >>>> legitimate reason to keep it closed (e.g. novel methodology and > patent or > >>>> publication pending). > >>>> > >>>> If none of this apply, i would respectfully urge the perspective > >>>> contributors to submit their work for early review, assuming everyone > is > >>>> holding Mahout community interests dear first. > >>>> > >>>> The reasons to make prototype and TDD available early include: > >>>> > >>>> -- eliminate all sorts of speculative thinking per above. The sooner > we do > >>>> that, the less speculations we'll produce in waiting. > >>>> -- it is hard for committers to do a quality review on a super-massive > >>>> commit dumps due to time constraints. It is much easier to do so in > steps > >>>> and portions. > >>>> -- failure to engage community into the effort: No coder alone making > any > >>>> changes to Mahout code could reliably assert that they are not > creating > >>>> problems for Mahout and/or outside users, since no one has the entire > >>>> Mahout picture in his or her head. We need the entire community to > assert > >>>> benign nature of Mahout code modifications or additions. > >>>> -- it is also more expensive to resolve architectural problems once > >>>> siginficant amount of changes is made, it would be a bit of "my way of > >>>> highway" way of offering things. > >>>> -- development of intended open software contribution that is > available > >>>> only to corporate entities, is not, well, open by definition. > >>>> > >>>> > >>>> > >>>>> > >>>>> > >>>>> On Fri, Apr 4, 2014 at 6:47 PM, SriSatish Ambati < > srisat...@0xdata.com > >>>>>> wrote: > >>>>> > >>>>>> Grant, > >>>>>> On 0xdata / H2O front: > >>>>>> > >>>>>> We feel very excited at making Apache Mahout the principal platform > for > >>>>>> scalable machine learning and are rapidly prototyping an initial > >>>>>> integration with the Matrix API. Ted (apache.org), Cliff Click ( > >>>>>> acm.org/0xdata), Anand Avati (Redhat) and Michal Malohava (0xdata) > are > >>>>>> heads down on that & making brisk progress. We hope to get the > >>>>> discussions > >>>>>> restarted in the JIRAs and google hangouts as soon as we get past > the > >>>>> first > >>>>>> cut . > >>>>>> > >>>>>> We also chose to have the first level integration with Mahout will > be > >>>>> as a > >>>>>> maven dependency - > >>>>>> That way we can flesh things out without major interruption and the > >>>>> grant > >>>>>> work. > >>>>>> > >>>>>> In parallel, several members and teams have been reworking the core > >>>>>> architecture to get a clean separation on the Algorithms & Core, an > >>>>>> in-memory (mr/task) API and a decent client framework with data > >>>>> read/write. > >>>>>> This will allow Apache Mahout and other ML libraries to use Spark, > >>>>>> Stratosphere or other engines for performance and extensibility. > >>>>>> > >>>>>> This is the state of the union at the moment - > >>>>>> I'm very enthusiastic at making this a win for the ardent Community > of > >>>>>> Machine Learning users and developers. > >>>>>> We are very grateful for the warmth, welcome, attention and > impassionate > >>>>>> reviews we received from the Apache community. Thank you for that. > >>>>>> We should have more to report in the month ahead. > >>>>>> > >>>>>> Looking forward, Sri > >>>>>> > >>>>>> > >>>>>> > >>>>>> On Fri, Apr 4, 2014 at 6:44 AM, Grant Ingersoll < > gsing...@apache.org> > >>>>>> wrote: > >>>>>> > >>>>>>> Can someone summarize the 0xData and the Spark work for me for the > >>>>> board > >>>>>>> report? I've unfortunately been too busy to keep up on the > threads on > >>>>>> it, > >>>>>>> but need to write the board report for this month. > >>>>>>> > >>>>>>> You can either summarize here or add it to the community section at > >>>>> > https://svn.apache.org/repos/asf/mahout/pmc/board-reports/2014/board-report-apr.txt > >>>>>>> > >>>>>>> Also, assuming we are going ahead w/ the 0xData stuff, we likely > need > >>>>> to > >>>>>>> do a software grant for that. > >>>>>>> > >>>>>>> Thanks, > >>>>>>> Grant > >>>>>>> > >>>>>>> -------------------------------------------- > >>>>>>> Grant Ingersoll | @gsingers > >>>>>>> http://www.lucidworks.com > >>>>>> > >>>>>> > >>>>>> -- > >>>>>> ceo & co-founder, 0 <http://www.0xdata.com/>*x*data Inc > >>>>>> +1-408.316.8192 > >>>> > >>>> >