Re: matrix factorization cross validation

2014-10-29 Thread Debasish Das
Is there an example of how to use RankingMetrics ? Let's take the user, document example...we get user x topic and document x topic matrices as the model... Now for each user, we can generate topK document by doing a sort on (1 x topic)dot(topic x document) and picking topK... Is it possible to

Re: How to run tests properly?

2014-10-29 Thread Sean Owen
On Wed, Oct 29, 2014 at 6:02 PM, Niklas Wilcke <1wil...@informatik.uni-hamburg.de> wrote: > The core tests seems to fail because of my german locale. Some tests are > locale dependend like the > UtilsSuite.scala > - "string formatting of time durations" - checks for locale dependend > seperators l

Re: matrix factorization cross validation

2014-10-29 Thread Debasish Das
Makes sense for the binary and ranking problem but for example linear regression for multi-class also optimizes on RMSE but we still measure the prediction efficiency using some measure on confusion matrix...Is not the same idea should hold for ALS as well ? On Wed, Oct 29, 2014 at 12:14 PM, Xian

Re: matrix factorization cross validation

2014-10-29 Thread Xiangrui Meng
Let's narrow the context from matrix factorization to recommendation via ALS. It adds extra complexity if we treat it as a multi-class classification problem. ALS only outputs a single value for each prediction, which is hard to convert to probability distribution over the 5 rating levels. Treating

matrix factorization cross validation

2014-10-29 Thread Debasish Das
Hi, In the current factorization flow, we cross validate on the test dataset using the RMSE number but there are some other measures which are worth looking into. If we consider the problem as a regression problem and the ratings 1-5 are considered as 5 classes, it is possible to generate a confu

Re: How to run tests properly?

2014-10-29 Thread Patrick Wendell
One thing is you need to do a "maven package" before you run tests. The "local-cluster" tests depend on Spark already being packaged. - Patrick On Wed, Oct 29, 2014 at 10:02 AM, Niklas Wilcke <1wil...@informatik.uni-hamburg.de> wrote: > Hi Sean, > > thanks for your reply. The tests still don't wo

Re: How to run tests properly?

2014-10-29 Thread Niklas Wilcke
Hi Sean, thanks for your reply. The tests still don't work. I focused on the mllib and core tests and made some observations. The core tests seems to fail because of my german locale. Some tests are locale dependend like the UtilsSuite.scala - "string formatting of time durations" - checks for l