CF suffers from the data sparsity problem, where users only rate a
small set of items. That makes the computation of similarity between
users imprecise and consequently reduces the accuracy of CF
algorithms.
http://www.jucs.org/jucs_17_4/a_clustering_approach_for
On Sun, Apr 1, 2012 at 1:20 PM,
This problem is much more commonly referred to as the cold start problem and is
far smaller than many authors assume. Typically a dozen good interactions is
plenty to get good recommendation performance and half a dozen suffices to do
pretty well.
Obviously if you are using ratings then most
By anonymous, do you mean using Boolean preferences? What about a
semi-anonymous model? Very good answer. Thanks, Ted.
On Mon, Apr 2, 2012 at 7:40 PM, Ted Dunning ted.dunn...@gmail.com wrote:
This problem is much more commonly referred to as the cold start problem and
is far smaller than many
No. By anonymous I mean that when a user first arrives, you should add
their actions both to their own history and to the history of a shared
anonymous user. That anonymous user will always have recent actions,
because earlier visitors will already have populated it with theirs.
The
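For readers following along, the bookkeeping described above can be sketched roughly as follows. This is a minimal illustration in plain Python, not Mahout code. The names `InteractionLog`, `ANONYMOUS_USER`, `anonymous_window` and `min_interactions` are all hypothetical. The fallback rule (use the anonymous history until the user has about a dozen interactions of their own) is an assumption based on the figure quoted earlier in the thread, since the message above is cut off before it says how the anonymous history is used.

```python
from collections import defaultdict

# Hypothetical ID for the shared pseudo-user; not a Mahout constant.
ANONYMOUS_USER = "anonymous"


class InteractionLog:
    """Toy in-memory store mapping user ID -> list of item IDs (oldest first)."""

    def __init__(self, anonymous_window=100):
        self.histories = defaultdict(list)
        # How many recent actions the anonymous user keeps (assumed value).
        self.anonymous_window = anonymous_window

    def record(self, user_id, item_id):
        """Add the action to the user's own history and to the anonymous user's."""
        self.histories[user_id].append(item_id)
        anon = self.histories[ANONYMOUS_USER]
        anon.append(item_id)
        # Trim so the anonymous user only reflects recent actions.
        overflow = len(anon) - self.anonymous_window
        if overflow > 0:
            del anon[:overflow]

    def history_for_recommendation(self, user_id, min_interactions=12):
        """Return the user's own history once it is long enough.

        Assumption: below the threshold, fall back to the anonymous history.
        """
        own = self.histories.get(user_id, [])
        if len(own) >= min_interactions:
            return own
        return self.histories[ANONYMOUS_USER]


if __name__ == "__main__":
    log = InteractionLog(anonymous_window=3)
    for user, item in [("u1", "a"), ("u1", "b"), ("u2", "c"), ("u2", "d")]:
        log.record(user, item)
    print(log.history_for_recommendation("brand_new_user"))
```

A brand-new user has no history of their own, so the recommender would be fed the anonymous user's recent actions until enough real interactions accumulate.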
Could you say a bit more about what you mean? Which data sparsity problem?
On Apr 1, 2012, at 6:35 AM, ziad kamel ziad.kame...@gmail.com wrote:
Hi,
Is there any way that Mahout CF can overcome the data sparsity problem?
Thanks