Hi,

On Mon, Mar 25, 2013 at 9:43 AM, Manuel Blechschmidt <manuel.blechschm...@gmx.de> wrote:
> One big piece that is missing for all the algorithms is a complete bundled
> data set that is solving a real business problem and with bundled I mean
> that it is in the Mahout source tree. If no real data is available
> generated data could be used.

Good point. There are a few examples in the "examples" module, relying either on generated data or on data that is easy to download. One problem with bringing data into the project is the licensing of that data. I'm not aware of much that can easily be redistributed under an Apache license.

> I tried to fill this gap for recommendations with my github project:
>
> https://github.com/ManuelB/facebook-recommender-demo

Hmm, I would actually like to list such "complementary" projects prominently somewhere on the Mahout page. What do you think?

> So this is the nature of a framework. If you really want people to get
> started easily you have to provide a full blown example where you can just
> replace the example data with your data.
>
> I don't think that enough manpower can be acquired to create a visual GUI
> for Mahout. Further I don't think that this would help. There are already
> excellent GUIs for ML e.g. Weka (http://www.cs.waikato.ac.nz/ml/weka/)
> and RStudio (http://www.rstudio.com/)

+1

In addition, to my knowledge Mahout itself has already been integrated with a nice graphical ML tool:

<http://rapid-i.com/component/option,com_myblog/show,Big-data-analytics-made-easy-Radoop.html/Itemid,172/>

Isabel