[
https://issues.apache.org/jira/browse/MAHOUT-393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Schelter updated MAHOUT-393:
--------------------------------------
Attachment: MAHOUT-393.patch
> Distributed item similarity functions
> -------------------------------------
>
> Key: MAHOUT-393
> URL: https://issues.apache.org/jira/browse/MAHOUT-393
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Reporter: Sebastian Schelter
> Attachments: MAHOUT-393.patch
>
>
> To complete the work started in MAHOUT-389, I've created a distributed
> version of any item similarity function that is currently already available
> in a non-distributed manner. An additional M/R job was necessary to compute
> the number of all users which is needed by some similarity functions (like
> LogLikelihoodSimilarity for example).
> There is still some optimization potential in the code as not every
> similarity function needs all information that is currently extracted (like
> the number of users e.g.), but the optimization would have made the code much
> less readable so I did not do any work on that.
> I hope you consider this a useful addition.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.