Indeed,
Integration work/prototype in-progress is here (it's not ready for review but 
it's not being done behind closed doors!)

https://github.com/tdunning/h2o-matrix

100% Open Source Apache v2 
H2O is here - we have developed H2O on a live master past two years :)

https://github.com/0xdata/h2o

Thanks & let's let temperatures chill - 

I strongly believe that this is going to work out for the Mahout community 
longer term -

Sri

On Apr 5, 2014, at 22:48, Ted Dunning <[email protected]> wrote:

> 
> The prototype is open to all from my personal GitHub.  Indeed the entirety of 
> h2o is also open source.  To assert otherwise is simply to propagate 
> misinformation.  
> 
> Sent from my iPhone
> 
>> On Apr 6, 2014, at 5:35, Andrew Musselman <[email protected]> wrote:
>> 
>> I agree with this sentiment, that one big drop could be more than people 
>> could/would devote time to, and that small proposals/prototypes would be 
>> more digestible.
>> 
>> Also would be easier to steer course as we go.
>> 
>>> On Apr 5, 2014, at 8:30 PM, Dmitriy Lyubimov <[email protected]> wrote:
>>> 
>>> PS. I personally don't think there would be significant hiccups with the
>>> review process. There's a very good chance things are either resolvable or
>>> insignificant enough to be foregone due to "power of do" Apache principle.
>>> However, please keep in mind the costs of commiters' time -- the best way
>>> is to do things in smaller steps. We also need some time to collect some
>>> input from users of Mahout APIs, not just internally in the project -- if
>>> there's any change to such apis.
>>> 
>>> -d
>>> 
>>> 
>>>> On Sat, Apr 5, 2014 at 8:04 PM, Dmitriy Lyubimov <[email protected]> wrote:
>>>> 
>>>> 
>>>> 
>>>> 
>>>>> On Fri, Apr 4, 2014 at 2:13 PM, Ted Dunning <[email protected]> wrote:
>>>>> 
>>>>> To add to Sri's comments:
>>>> 
>>>>> This code is intended for contribution if the
>>>>> 
>>>>> objections of one committer are over-come by the concrete results of the
>>>>> prototype.
>>>> 
>>>> I would like to comment that there are no concerns against making this
>>>> contribution -- not at this point anyway.
>>>> 
>>>> There is a technicality concern based solely on vague and very
>>>> non-specific communication of intended contributor. However, since
>>>> prototype is not made available to Mahout community, there's no way to
>>>> either confirm, refute or resolve this -- or any other -- concern at this
>>>> point.
>>>> 
>>>> No physical & tangible contribution -- no concerns. Can't be.
>>>> 
>>>> There are of course plenty of cases when closed project becomes open, but
>>>> usually this either goes through Apache incubation process, or there's a
>>>> legitimate reason to keep it closed (e.g. novel methodology and patent or
>>>> publication pending).
>>>> 
>>>> If none of this apply, i would respectfully urge the perspective
>>>> contributors to submit their work for early review, assuming everyone is
>>>> holding Mahout community interests dear first.
>>>> 
>>>> The reasons to make prototype and TDD available early include:
>>>> 
>>>> -- eliminate all sorts of speculative thinking per above. The sooner we do
>>>> that, the less speculations we'll produce in waiting.
>>>> -- it is hard for committers to do a quality review on a super-massive
>>>> commit dumps due to time constraints. It is much easier to do so in steps
>>>> and portions.
>>>> -- failure to engage community into the effort: No coder alone making any
>>>> changes to Mahout code could reliably assert that they are not creating
>>>> problems for Mahout and/or outside users, since no one has the entire
>>>> Mahout picture in his or her head.  We need the entire community to assert
>>>> benign nature of Mahout code modifications or additions.
>>>> -- it is also more expensive to resolve architectural problems once
>>>> siginficant amount of changes is made, it would be a bit of "my way of
>>>> highway" way of offering things.
>>>> -- development of intended open software contribution that is available
>>>> only to corporate entities, is not, well, open by definition.
>>>> 
>>>> 
>>>> 
>>>>> 
>>>>> 
>>>>> On Fri, Apr 4, 2014 at 6:47 PM, SriSatish Ambati <[email protected]
>>>>>> wrote:
>>>>> 
>>>>>> Grant,
>>>>>> On 0xdata / H2O front:
>>>>>> 
>>>>>> We feel very excited at making Apache Mahout the principal platform for
>>>>>> scalable machine learning and are rapidly prototyping an initial
>>>>>> integration with the Matrix API. Ted (apache.org), Cliff Click (
>>>>>> acm.org/0xdata), Anand Avati (Redhat) and Michal Malohava (0xdata) are
>>>>>> heads down on that & making brisk progress. We hope to get the
>>>>> discussions
>>>>>> restarted in the JIRAs and google hangouts as soon as we get past the
>>>>> first
>>>>>> cut .
>>>>>> 
>>>>>> We also chose to have the first level integration with Mahout will be
>>>>> as a
>>>>>> maven dependency -
>>>>>> That way we can flesh things out without major interruption and the
>>>>> grant
>>>>>> work.
>>>>>> 
>>>>>> In parallel, several members and teams have been reworking the core
>>>>>> architecture to get a clean separation on the Algorithms & Core, an
>>>>>> in-memory (mr/task) API and a decent client framework with data
>>>>> read/write.
>>>>>> This will allow Apache Mahout and other ML libraries to use Spark,
>>>>>> Stratosphere or other engines for performance and extensibility.
>>>>>> 
>>>>>> This is the state of the union at the moment -
>>>>>> I'm very enthusiastic at making this a win for the ardent Community of
>>>>>> Machine Learning users and developers.
>>>>>> We are very grateful for the warmth, welcome, attention and impassionate
>>>>>> reviews we received from the Apache community.  Thank you for that.
>>>>>> We should have more to report in the month ahead.
>>>>>> 
>>>>>> Looking forward, Sri
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> On Fri, Apr 4, 2014 at 6:44 AM, Grant Ingersoll <[email protected]>
>>>>>> wrote:
>>>>>> 
>>>>>>> Can someone summarize the 0xData and the Spark work for me for the
>>>>> board
>>>>>>> report?  I've unfortunately been too busy to keep up on the threads on
>>>>>> it,
>>>>>>> but need to write the board report for this month.
>>>>>>> 
>>>>>>> You can either summarize here or add it to the community section at
>>>>> https://svn.apache.org/repos/asf/mahout/pmc/board-reports/2014/board-report-apr.txt
>>>>>>> 
>>>>>>> Also, assuming we are going ahead w/ the 0xData stuff, we likely need
>>>>> to
>>>>>>> do a software grant for that.
>>>>>>> 
>>>>>>> Thanks,
>>>>>>> Grant
>>>>>>> 
>>>>>>> --------------------------------------------
>>>>>>> Grant Ingersoll | @gsingers
>>>>>>> http://www.lucidworks.com
>>>>>> 
>>>>>> 
>>>>>> --
>>>>>> ceo & co-founder, 0 <http://www.0xdata.com/>*x*data Inc
>>>>>> +1-408.316.8192
>>>> 
>>>> 

Reply via email to