Re: Any way Mahout overcome the data sparsity problem ?

2012-04-02 Thread ziad kamel
CF suffers from the data sparsity problem, where users only rate a small set of items. That makes the computation of similarity between users imprecise and consequently reduces the accuracy of CF algorithms. http://www.jucs.org/jucs_17_4/a_clustering_approach_for On Sun, Apr 1, 2012 at 1:20 PM,

Re: Any way Mahout overcome the data sparsity problem ?

2012-04-02 Thread Ted Dunning
This problem is much more commonly referred to as the cold start problem and is far smaller than many authors assume. Typically a dozen good interactions is plenty to get good recommendation performance and half a dozen suffices to do pretty well. Obviously if you are using ratings then most

Re: Any way Mahout overcome the data sparsity problem ?

2012-04-02 Thread ziad kamel
By anonymous you mean using Boolean preferences ? What about semi-anonymous model ? very good answer. Thanks Ted On Mon, Apr 2, 2012 at 7:40 PM, Ted Dunning ted.dunn...@gmail.com wrote: This problem is much more commonly referred to as the cold start problem and is far smaller than many

Re: Any way Mahout overcome the data sparsity problem ?

2012-04-02 Thread Ted Dunning
No. By anonymous I mean that when a user first arrives, you should add their actions to their own history and to the history of the anonymous user. This user will always have recent actions because there will always have been previous users who would have populated this user with actions. The

Re: Any way Mahout overcome the data sparsity problem ?

2012-04-01 Thread Ted Dunning
Could you say a bit more about what you mean? Which data sparsity problem? Sent from my iPhone On Apr 1, 2012, at 6:35 AM, ziad kamel ziad.kame...@gmail.com wrote: Hi, Is there any ways that mahout CF can overcome the data sparsity problem? Thanks