Re: ALS-WR on Million Song dataset

Sean Owen Mon, 18 Mar 2013 09:47:40 -0700

One word of caution, is that there are at least two papers on ALS and they
define lambda differently. I think you are talking about "Collaborative
Filtering for Implicit Feedback Datasets".

I've been working with some folks who point out that alpha=40 seems to be
too high for most data sets. After running some tests on common data sets,
alpha=1 looks much better. YMMV.

In the end you have to evaluate these two parameters, and the # of
features, across a range to determine what's best.

Is this data set not a bunch of audio features? I am not sure it works for
ALS, not naturally at least.

On Mon, Mar 18, 2013 at 12:39 PM, Han JU <ju.han.fe...@gmail.com> wrote:

> Hi,
>
> I'm wondering has someone tried ParallelALS with implicite feedback job on
> million song dataset? Some pointers on alpha and lambda?
>
> In the paper alpha is 40 and lambda is 150, but I don't know what are their
> r values in the matrix. They said is based on time units that users have
> watched the show, so may be it's big.
>
> Many thanks!
> --
> *JU Han*
>
> UTC   -  Université de Technologie de Compiègne
> *     **GI06 - Fouille de Données et Décisionnel*
>
> +33 0619608888
>

Re: ALS-WR on Million Song dataset

Reply via email to