Hi Pat,

Thank you for your reply, I fully understand that core algorithms and data
are 2 different part of the system, this is why we have 2 major idea: "Big
data" and "Machine Learning".

My requirements of Recommenders are just like what Amazon does: Item-based,
but the number of items and users is very big, so there comes to a very
huge matrix. So I am still learning using Mahout to make the matrix
computing on a distributed system. After I am familiar with Mahout, I think
I can have some works on GPU acceleration for Matrix computing and some
other mathematical optimization.
About the data prep, I think we can define an abstraction of
conventions in data
prep, data ingestion, and serving components. Users can following some
conventions to feed data to Mahout.

Steven NASa
2016/05/21

2016-05-21 22:06 GMT+08:00 Pat Ferrel <p...@occamsmachete.com>:

> Hi Stephen,
>
> We have implemented SVD, ALS, and CCO for recommender, but these are only
> core algorithms, not really recommenders as Mahout has done in the past.
> The reason for this is that there are data prep, data ingestion, and
> serving components that, in a modern system, must be supplied also. So far
> Mahout has stayed aways from actually including servers, either for input
> of output.
>
> That said there is plenty of room for algorithm development in Mahout. I
> worked on the CCO algorithm, which uses PredictionIO (proposed for the
> Apache Incubator) to supply the serving components.
>
> Someone with your experience in real-life use of recommenders is certainly
> welcome.
>
> What type of project did you have in mind?
>
>
> On May 20, 2016, at 10:00 AM, Suneel Marthi <smar...@apache.org> wrote:
>
> Welcome to the project Steven!!
>
> On Fri, May 20, 2016 at 10:07 AM, Steven NASa <cj.n...@gmail.com> wrote:
>
> > Hi Folk & Masters,
> >
> > My name is *NASa*. I am now working for an e-commerce B2C company in
> China,
> > dealing with Transaction Process development in C++ & Java on Linux
> > environment.
> >
> > As you know, *Recommender System* is quite valuable and important to an
> > e-commerce online shopping website like Amazon. I was told and required
> to
> > design and implement a Recommender System which can bring some value to
> my
> > Company. Our System is based on C++ codes. So I was searching for an
> robust
> > Machine Learning framework in C++ which can help me to easily implement a
> > Recommender System. I did not find any one which can satisfy my
> > requirements, but only some C++ math libraries.
> >
> > Our system is based on an internal distributed frameworks like RPC and DB
> > access on Linux environment based on C++ programming language. But I find
> > it is really inconvenient to implement a Recommender System in C++ from
> > zero without distributed computing library supporting, like
> > implementing *Collaborative
> > Filtering* with SVD in a distributed computing way. So I am trying to
> find
> > a framework/library with is designed based on Distributed-System. There I
> > come to *Mahout*.
> >
> > I wish I can build a library that can help people easily and quickly
> build
> > up a Recommender System based on Distributed System and also use the
> > Machine Learning Algorithms in distributed way. Apache has many amazing
> > projects which can help people to build up robust distributed system
> > easily. So I am moving to using “Java” environment.
> >
> > I am new to *Mahout* and *Hadoop*, *Spark*, *Scala* and I learned Andrew
> > Ng’s “Machine Learning” from Coursera
> > <https://www.coursera.org/learn/machine-learning/home/welcome>. So I
> have
> > the basic knowledge of Machine Learning, and now I am keeping forward to
> > *Deep
> > Learning* and *Convex Optimization*, some other Mathematical Optimization
> > implementation. I am now still learning and getting famiIiar with
> Mahout. I
> > hope I can contribute some codes to Mahout in the early future with
> > learning by coding and coding by learning.
> > NASa 2016/05/20
> > ​
> >
>
>

Reply via email to