Sebastian Schelter created MAHOUT-1289:
------------------------------------------
Summary: Move downsampling code into RowSimilarityJob
Key: MAHOUT-1289
URL: https://issues.apache.org/jira/browse/MAHOUT-1289
Project: Mahout
Issue Type: Improvement
Components: Math
Reporter: Sebastian Schelter
Assignee: Sebastian Schelter
Fix For: 0.9
When computing similarities with RowSimilarityJob, downsampling highly frequent
things is crucial for performance. At the moment, this is done by the data
preparation code for collaborative filtering.
We should move the downsampling directly into RowSimilarityJob as we've seen a
lot of cases where users want to directly use it.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira