Sebastian Schelter created MAHOUT-1289:
------------------------------------------

             Summary: Move downsampling code into RowSimilarityJob
                 Key: MAHOUT-1289
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1289
             Project: Mahout
          Issue Type: Improvement
          Components: Math
            Reporter: Sebastian Schelter
            Assignee: Sebastian Schelter
             Fix For: 0.9


When computing similarities with RowSimilarityJob, downsampling the most
frequent entries is crucial for performance. At the moment, this is done by the
data preparation code for collaborative filtering.

We should move the downsampling directly into RowSimilarityJob, as we've seen
many cases where users want to run RowSimilarityJob directly, without going
through that preparation code.
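
To illustrate the kind of downsampling meant here, a minimal sketch follows. It
is not Mahout code: the class DownsampleSketch, the method downsample and the
parameter maxObservations are hypothetical names, and a sparse row is modelled
as a plain Map from index to value. The assumption is that capping the number
of entries kept per row limits the pairwise work, which tends to grow roughly
quadratically with the number of entries in a row.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Random;

// Illustrative sketch only; not taken from Mahout. All names are hypothetical.
final class DownsampleSketch {

  private DownsampleSketch() {}

  /**
   * Returns a copy of a sparse row (index -> value) that keeps at most
   * maxObservations entries, chosen uniformly at random.
   */
  static Map<Integer, Double> downsample(Map<Integer, Double> row,
                                         int maxObservations,
                                         Random random) {
    if (maxObservations < 0) {
      throw new IllegalArgumentException("maxObservations must be >= 0");
    }
    // Rows that are already small enough are copied unchanged.
    if (row.size() <= maxObservations) {
      return new HashMap<>(row);
    }
    // Shuffle the indexes and keep the first maxObservations of them.
    List<Integer> indexes = new ArrayList<>(row.keySet());
    Collections.shuffle(indexes, random);
    Map<Integer, Double> sampled = new HashMap<>();
    for (Integer index : indexes.subList(0, maxObservations)) {
      sampled.put(index, row.get(index));
    }
    return sampled;
  }
}
```

Passing a seeded Random makes the sample reproducible. How the real job would
pick the cap and apply it is left open here.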

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
