[
https://issues.apache.org/jira/browse/MAHOUT-460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12898751#action_12898751
]
Sean Owen commented on MAHOUT-460:
----------------------------------
I have some general comments and then believe you are welcome to commit.
- Your IDE seems to be reordering imports. I'd leave them as they are as
they're reasonably standard in ordering across the code.
- Some of the changes also seem to be changes in whitespace indentation --
should be 2 spaces per unit of indentation everywhere. For instance see
MaybePruneRowsMapper.countSeen()
- MathHelper: I wouldn't concatenate a string together with '+' and then append
to StringBuffer. Append each piece to take advantage of it.
- Also we should all use StringBuilder, not StringBuffer
- ToItemVectorsReducer: attach the Apache copyright header?
> Add "maxPreferencesPerItemConsidered" option to
> o.a.m.cf.taste.hadoop.similarity.item.ItemSimilarityJob
> -------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-460
> URL: https://issues.apache.org/jira/browse/MAHOUT-460
> Project: Mahout
> Issue Type: Improvement
> Components: Collaborative Filtering
> Reporter: Sebastian Schelter
> Attachments: MAHOUT-460.patch
>
>
> Because "coocurrence algorithms ... scale in the square of the number of
> occurrences most popular item" (Ted wrote that in a recent mail) we should
> offer a parameter to the ItemSimilarity job that makes it limit the number of
> considered preferences per item. RecommenderJob already has such an option.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.