Re: Paper on Mahout's ALS implementation accepted at RecSys'13

2013-07-23 Thread
oo > Music dataset (> 700M ratings) and a synthetic dataset generated from > Netflix with 25M users and more than 5 billion ratings. > > Best, > Sebastian > -- 刘鎏

Re: About QRDecomposition

2012-01-18 Thread
> If you post your exact results, I could comment more specifically. Without
> more details, I really can't answer your question in any specific way.
>
> On Wed, Jan 18, 2012 at 9:39 AM, 刘鎏 wrote:
>
> > Hi,
> > When I run QRDecomposition in mahout, I find the r

Re: MinHash implementation

2011-08-15 Thread
I think that if your input vector represents a set, ele.get() should be used; if instead it is a sparse vector, ele.index() should be used. Please correct me if I'm wrong.

    for (int i = 0; i < numHashFunctions; i++) {
      for (Vector.Element ele : featureVector) {
        /// Shouldn't the foll
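To make the index-versus-value distinction concrete, here is a minimal sketch in plain Java. It does not use Mahout's classes; the class MinHashSketch, the hash family (A, B, PRIME), and the two helper methods are all hypothetical and only illustrate the point, assuming non-negative ids. When the vector is sparse, the index identifies the feature and the stored value is just a weight, so the index is hashed. When the vector stores set members as its values, the value is hashed.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

class MinHashSketch {
    // Hypothetical hash family h_i(x) = (A[i] * x + B[i]) mod PRIME.
    // These are NOT Mahout's hash functions; they exist only for this sketch.
    static final long PRIME = 2147483647L;
    static final long[] A = {3, 5, 7};
    static final long[] B = {11, 13, 17};

    static long hash(int i, long x) {
        return (A[i] * x + B[i]) % PRIME;
    }

    // Sparse-vector case: the index names the feature, the value is only a weight.
    static long[] minHashByIndex(Map<Integer, Double> sparseVector) {
        long[] mins = new long[A.length];
        Arrays.fill(mins, Long.MAX_VALUE);
        for (int i = 0; i < A.length; i++) {
            for (int index : sparseVector.keySet()) {
                mins[i] = Math.min(mins[i], hash(i, index));
            }
        }
        return mins;
    }

    // Set-as-values case: each stored value is itself a member id.
    static long[] minHashByValue(double[] memberIds) {
        long[] mins = new long[A.length];
        Arrays.fill(mins, Long.MAX_VALUE);
        for (int i = 0; i < A.length; i++) {
            for (double id : memberIds) {
                mins[i] = Math.min(mins[i], hash(i, (long) id));
            }
        }
        return mins;
    }

    public static void main(String[] args) {
        Map<Integer, Double> sparse = new LinkedHashMap<>();
        sparse.put(2, 1.0);   // feature 2 with weight 1.0
        sparse.put(9, 4.5);   // feature 9 with weight 4.5
        System.out.println(Arrays.toString(minHashByIndex(sparse)));
        System.out.println(Arrays.toString(minHashByValue(new double[] {2.0, 9.0})));
    }
}
```

Both calls in main describe the same set {2, 9}, so they produce the same signature. Hashing the weights of the sparse vector (1.0 and 4.5) instead would describe a different set.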

A problem about RowSimilarityJob

2011-03-10 Thread
Hi all, I noticed that in the RowWeightMapper of RowSimilarityJob, if the input vector is a sparse vector, only the non-zero columns are written out. If the input matrix is like:

         user1  user2
  item1  1      0
  item2  0      1

then the output of RowWeightMapper should be {user1:item1} and {user2:item2}
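The behaviour described can be sketched in plain Java as follows. This is not Mahout's RowWeightMapper and involves no Hadoop types; the class RowWeightSketch and the method emitNonZero are hypothetical names used only to show what "only non-zero columns are written out" means for the 2x2 example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class RowWeightSketch {
    // For every non-zero cell, emit (column name -> row name); zero cells emit nothing.
    static Map<String, List<String>> emitNonZero(String[] rows, String[] cols, double[][] m) {
        Map<String, List<String>> out = new TreeMap<>();
        for (int r = 0; r < rows.length; r++) {
            for (int c = 0; c < cols.length; c++) {
                if (m[r][c] != 0.0) {
                    out.computeIfAbsent(cols[c], k -> new ArrayList<>()).add(rows[r]);
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String[] rows = {"item1", "item2"};
        String[] cols = {"user1", "user2"};
        double[][] m = {
            {1, 0},   // item1 is rated by user1 only
            {0, 1}    // item2 is rated by user2 only
        };
        System.out.println(emitNonZero(rows, cols, m));
    }
}
```

For the example matrix this yields one entry per user, matching {user1:item1} and {user2:item2}; a column that is zero in every row would not appear in the output at all.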