I would like to find some way of speeding up the matrix library, i.e. with JNI + C++.

2014-03-04 22:53 GMT+01:00 Frank Scholten <fr...@frankscholten.nl>:

> Yes, I'd like to work on standardizing the code around input formats.
>
>
> On Mon, Mar 3, 2014 at 7:37 PM, Suneel Marthi <suneel_mar...@yahoo.com> wrote:
>
> > To get things moving for 1.0:
> >
> > a) Address the 4 issues that Sean had raised - we have already started
> > looking at Backlog and closing them, started looking at converting old
> > MapReduce to newer MapReduce API.
> >
> > If someone could start looking at standardizing the input/output
> > formats across classifiers, clustering and recommenders that would be
> > great. Guess Frank S. has already started work in that direction.
> >
> > b) Need a better and cleaner serialized form of Vectors to handle names
> > and other kind'a stuff, this is gonna impact everything that's presently
> > implemented.
> >
> > c) Agree with ssc, to start looking at Spark-Mahout integration.
> >
> > d) Need volunteers to QA/address issues with the present
> > classifiers/clustering algorithms. I personally can vouch for how
> > disastrous it is to deploy any of Mahout's classifiers/clustering
> > implementations in an Operations environment. A good example of that is
> > Sean's recent patch for RDF.
> >
> > Naive Bayes code as it is now seems half-baked and is incomplete. Not
> > every code path has been tested on Streaming KMeans.
> >
> > This should go some way in addressing the technical debt that's been
> > piled up over the years.
> >
> >
> > On Monday, March 3, 2014 1:05 PM, Sebastian Schelter <s...@apache.org> wrote:
> >
> > I would like to discuss whether we should start to have some
> > Spark-related code in Mahout.
> >
> > --sebastian
> >
> >
> > On 03/03/2014 06:56 PM, Suneel Marthi wrote:
> > > Grant had set up a Google Hangout for Mahout sometime last year before
> > > the 0.8 release. I had one set up too for the 0.9 release. I definitely
> > > wouldn't want to have a hangout on Saturday or the weekend.
> > >
> > >
> > > On Monday, March 3, 2014 12:52 PM, Ted Dunning <ted.dunn...@gmail.com> wrote:
> > >
> > > Happy to organize a Google Hangout. That has the advantage of allowing
> > > more attendees and supporting YouTube archiving.
> > >
> > > Sent from my iPhone
> > >
> > >
> > >> On Mar 3, 2014, at 9:34, Giorgio Zoppi <giorgio.zo...@gmail.com> wrote:
> > >>
> > >> Hello All,
> > >> Dr. Dunning, could you set up a meeting next Sat morning, so we can chat
> > >> and discuss improvements and what to do by Skype, and identify volunteers
> > >> and tasks?
> > >> Best Regards,
> > >> Giorgio
> > >>
> > >>
> > >> 2014-03-03 18:30 GMT+01:00 peng <pc...@uowmail.edu.au>:
> > >>
> > >>> Me three
> > >>>
> > >>>
> > >>>> On Sun 02 Mar 2014 11:45:33 AM EST, Ted Dunning wrote:
> > >>>>
> > >>>> Ravi,
> > >>>>
> > >>>> Good points.
> > >>>>
> > >>>> On Sun, Mar 2, 2014 at 12:38 AM, Ravi Mummulla <ravi.mummu...@gmail.com>
> > >>>> wrote:
> > >>>>
> > >>>>> - Natively support Windows (guidance, etc. No documentation exists
> > >>>>> today, for instance)
> > >>>> There is a bit of demand for that.
> > >>>>
> > >>>>> - Faster time to first application (from discovery to first application
> > >>>>> currently takes a non-trivial amount of effort; how can we lower the
> > >>>>> bar and reduce the friction for adoption?)
> > >>>> There is huge evidence that this is important.
> > >>>>
> > >>>>
> > >>>>> - Better documenting use cases with working samples/examples
> > >>>>> (Documentation on https://mahout.apache.org/users/basics/algorithms.html
> > >>>>> is spread out and there is too much focus on algorithms as opposed to
> > >>>>> use cases - this is an adoption blocker)
> > >>>> This is also important.
> > >>>>
> > >>>>
> > >>>>> - Uniformity of the API set across all algorithms (are we providing the
> > >>>>> same experience across all APIs?)
> > >>>> And many people have been tripped up by this.
> > >>>>
> > >>>>
> > >>>>> - Measuring/publishing scalability metrics of various algorithms (why
> > >>>>> would we want users to adopt Mahout vs. other frameworks for ML at
> > >>>>> scale?)
> > >>>> I don't see this as being as important as some of your other points,
> > >>>> but it is still useful.
> > >>
> > >>
> > >> --
> > >> I want to be the ray of sunshine that wakes you every day
> > >> to make you breathe and live in me.
> > >> "Favola -Moda".
> > >

--
I want to be the ray of sunshine that wakes you every day
to make you breathe and live in me.
"Favola -Moda".