Sebastian Schelter created MAHOUT-1289: ------------------------------------------
Summary: Move downsampling code into RowSimilarityJob Key: MAHOUT-1289 URL: https://issues.apache.org/jira/browse/MAHOUT-1289 Project: Mahout Issue Type: Improvement Components: Math Reporter: Sebastian Schelter Assignee: Sebastian Schelter Fix For: 0.9 When computing similarities with RowSimilarityJob, downsampling highly frequent things is crucial for performance. At the moment, this is done by the data preparation code for collaborative filtering. We should move the downsampling directly into RowSimilarityJob as we've seen a lot of cases where users want to directly use it. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira