Thanks for quick responses.

Yes it's that dataset. What I'm using is triplets of "user_id song_id
play_times", of ~ 1m users. No audio things, just plein text triples.

It seems to me that the paper about "implicit feedback" matchs well this
dataset: no explicit ratings, but times of listening to a song.

Thank you Sean for the alpha value, I think they use big numbers is because
their values in the R matrix is big.


2013/3/18 Sebastian Schelter <ssc.o...@googlemail.com>

> JU,
>
> are you refering to this dataset?
>
> http://labrosa.ee.columbia.edu/millionsong/tasteprofile
>
> On 18.03.2013 17:47, Sean Owen wrote:
> > One word of caution, is that there are at least two papers on ALS and
> they
> > define lambda differently. I think you are talking about "Collaborative
> > Filtering for Implicit Feedback Datasets".
> >
> > I've been working with some folks who point out that alpha=40 seems to be
> > too high for most data sets. After running some tests on common data
> sets,
> > alpha=1 looks much better. YMMV.
> >
> > In the end you have to evaluate these two parameters, and the # of
> > features, across a range to determine what's best.
> >
> > Is this data set not a bunch of audio features? I am not sure it works
> for
> > ALS, not naturally at least.
> >
> >
> > On Mon, Mar 18, 2013 at 12:39 PM, Han JU <ju.han.fe...@gmail.com> wrote:
> >
> >> Hi,
> >>
> >> I'm wondering has someone tried ParallelALS with implicite feedback job
> on
> >> million song dataset? Some pointers on alpha and lambda?
> >>
> >> In the paper alpha is 40 and lambda is 150, but I don't know what are
> their
> >> r values in the matrix. They said is based on time units that users have
> >> watched the show, so may be it's big.
> >>
> >> Many thanks!
> >> --
> >> *JU Han*
> >>
> >> UTC   -  Université de Technologie de Compiègne
> >> *     **GI06 - Fouille de Données et Décisionnel*
> >>
> >> +33 0619608888
> >>
> >
>
>


-- 
*JU Han*

Software Engineer Intern @ KXEN Inc.
UTC   -  Université de Technologie de Compiègne
*     **GI06 - Fouille de Données et Décisionnel*

+33 0619608888

Reply via email to