I probably will also put out a distributed QR (just for completeness) as currently solved for MR SSVD. but we know that actual SSVD can avoid this -- and it will in the new version -- just like in the in-core version.
there are gaps still in the optimizer (i.e. optimizer has some holes for some algorithms and when it choses them at action time, the UnsupportedOperationException is generated, even though the expression formally compiles). E.g. there's a gap for big, graph or no-graph A'A algorithms. I also did not investigate GraphX backed implementations yet, just was trying to make the minimum viable product. But it is enough now to script out distributed SSVD and weighted ALS. On Tue, Mar 4, 2014 at 9:59 AM, Dmitriy Lyubimov <dlie...@gmail.com> wrote: > Yes. I am pretty close to do fairly big commits in linalg department > there. (distributed dsl expression optimizer and scripted-out SSVD). > > We also possibly may want to think about scala script engine to run 3rd > party mahout-math scripts or interactive sessions. > > -d > > > On Mon, Mar 3, 2014 at 10:02 AM, Sebastian Schelter <s...@apache.org>wrote: > >> I would like to discuss whether we should start to have some >> Spark-related code in Mahout. >> >> --sebastian >> >> >> On 03/03/2014 06:56 PM, Suneel Marthi wrote: >> >>> Grant had setup a Google Hangout for Mahout sometime last year before >>> 0.8 release. I had one setup too for 0.9 release. I definitely wouldn't >>> want to have a hangout on Saturday or weekend. >>> >>> >>> >>> >>> >>> On Monday, March 3, 2014 12:52 PM, Ted Dunning <ted.dunn...@gmail.com> >>> wrote: >>> >>> Happy to organize a google hangout. That has the advantage of allowing >>> more attendees and supporting YouTube archiving. >>> >>> Sent from my iPhone >>> >>> >>> On Mar 3, 2014, at 9:34, Giorgio Zoppi <giorgio.zo...@gmail.com> wrote: >>>> >>>> Hello All, >>>> Dr.Dunning could you set a meeting next Sat morning, so we can chat and >>>> discuss by skype improvements and what to do and indentify volunteer and >>>> tasks. >>>> Best Regards, >>>> Giorgio >>>> >>>> >>>> 2014-03-03 18:30 GMT+01:00 peng <pc...@uowmail.edu.au>: >>>> >>>> Me three >>>>> >>>>> >>>>> On Sun 02 Mar 2014 11:45:33 AM EST, Ted Dunning wrote: >>>>>> >>>>>> Ravi, >>>>>> >>>>>> Good points. >>>>>> >>>>>> On Sun, Mar 2, 2014 at 12:38 AM, Ravi Mummulla < >>>>>> ravi.mummu...@gmail.com> >>>>>> wrote: >>>>>> >>>>>> - Natively support Windows (guidance, etc. No documentation exists >>>>>> today, >>>>>> >>>>>>> for instance) >>>>>>> >>>>>> There is a bit of demand for that. >>>>>> >>>>>> - Faster time to first application (from discovery to first >>>>>> application >>>>>> >>>>>> currently takes a non-trivial amount of effort; how can we lower the >>>>>>> bar >>>>>>> and reduce the friction for adoption?) >>>>>>> >>>>>> There is huge evidence that this is important. >>>>>> >>>>>> >>>>>> - Better documenting use cases with working samples/examples >>>>>> >>>>>>> (Documentation >>>>>>> on https://mahout.apache.org/users/basics/algorithms.html is spread >>>>>>> out >>>>>>> and >>>>>>> there is too much focus on algorithms as opposed to use cases - this >>>>>>> is >>>>>>> an >>>>>>> adoption blocker) >>>>>>> >>>>>> This is also important. >>>>>> >>>>>> >>>>>> - Uniformity of the API set across all algorithms (are we providing >>>>>> the >>>>>> >>>>>>> same experience across all APIs?) >>>>>>> >>>>>> And many people have been tripped up by this. >>>>>> >>>>>> >>>>>> - Measuring/publishing scalability metrics of various algorithms >>>>>> (why >>>>>> >>>>>>> would >>>>>>> we want users to adopt Mahout vs. other frameworks for ML at scale?) >>>>>>> >>>>>> I don't see this as important as some of your other points, but is >>>>>> still >>>>>> useful. >>>>>> >>>>> >>>> >>>> -- >>>> Quiero ser el rayo de sol que cada día te despierta >>>> para hacerte respirar y vivir en me. >>>> "Favola -Moda". >>>> >>> >> >