On Tue, Jan 31, 2017 at 3:01 AM, Isabel Drost-Fromm <isa...@apache.org> wrote:
> > Hi, > > > To give some advise to downstream users in the field - what would be your > advise > for people tasked with concrete use cases (stuff like fraud detection, > anomaly > detection, learning search ranking functions, building a recommender > system)? If you are an off-the-shelf practitioner (most of smaller startup companies without a chief scientist), with very few exceptions you might want to look for an off-the-shelf solution where it exists, and most likely it does not exist on Samsara in open domain. Except for a several applied off-the-shelves, Mahout has not (hopefully just yet) developed a comprehensive set of things to use. The off-the-shelves currently are cross-occurrence recommendations (which still require real time serving component taken from elsewhere), svd-pca, some algebra, and Naive/complement Bayes at scale. Most of the bigger companies i worked for never deal with completely the off-the-shelf open source solutions. It always requires more understanding of their problem. (E.g., much as COO recommender is wonderful, i don't think Netflix would entertain taking Mahout's COO run on it verbatim). It is quite common that companies invest in their own specific understanding of their problem and requirements and a specific solution to their problem through iterative experimentation with different methodologies, most of which are either new-ish enough or proprietary enough that public solution does not exist. That latter case was pretty much motivation for Samsara. If you are a practitioner solving numerical problems thru experimentation cycle, Mahout is much more useful than any of the off-the-shelf collections. So the idea, first, is to get R-like platform out for the practitioners, and grow packages (just like with R). The platform obviously needs work which unfortunately is not sufficiently sponsored imo at the moment by industry or academia, compared to other projects. Is there even interest from users in such a use case based > perspective? If so, would there be interest among the Mahout committers to > help > users publicly create docs/examples/modules to support these use cases? > yes > > > Isabel > >