[jira] Commented: (MAHOUT-340) org.apache.mahout.cf.taste.hadoop.cooccurence can not support long as user_id and item_id

Sean Owen (JIRA) Thu, 25 Mar 2010 05:10:53 -0700

    [ https://issues.apache.org/jira/browse/MAHOUT-340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849679#action_12849679 ]


Sean Owen commented on MAHOUT-340:
----------------------------------

That is a good question -- the two implement the same algorithm, but 
'cooccurrence' tries to distribute the matrix-by-user-vector multiplication, 
while 'item' does not. It's not yet clear which is better. You could adapt 
either one's approach to completing this multiplication.
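
For illustration only, here is a minimal single-machine sketch of that multiplication. The class and method names are hypothetical and are not taken from either Mahout package. It assumes a small dense matrix with int indices and a sparse user vector held in a map. Each non-zero entry of the user vector contributes one scaled column of the matrix, and those partial sums are independent of one another, which is what makes the product a candidate for distribution.

```java
import java.util.Map;

/** Hypothetical sketch, not Mahout code: scores = matrix * userVector. */
public class MatrixUserVectorProduct {

    /**
     * Multiplies a square matrix by a sparse user vector.
     * userVector maps an int item index to that user's preference value.
     */
    public static double[] multiply(int[][] matrix, Map<Integer, Double> userVector) {
        int n = matrix.length;
        double[] scores = new double[n];
        // Each preference adds one scaled column of the matrix to the result.
        // The per-column contributions do not depend on each other, so they
        // could be computed separately and summed afterwards.
        for (Map.Entry<Integer, Double> entry : userVector.entrySet()) {
            int column = entry.getKey();
            double preference = entry.getValue();
            for (int row = 0; row < n; row++) {
                scores[row] += matrix[row][column] * preference;
            }
        }
        return scores;
    }
}
```

Note that both the matrix dimensions and the vector indices here are ints.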

'item' handles long IDs as inputs. Doing so requires a long <-> int mapping 
between the original long IDs and the dimensions in the vector or matrix they 
map to -- which must be ints. 'item' collects this information and reverses 
the transformation later. For this reason, if you need long IDs, you may find 
it more natural to adapt 'item', since it already handles this issue.
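
As a rough sketch of the idea, and not the code in 'item', a long <-> int mapping could look like the following. The class name LongIdIndex and its methods are hypothetical. It assumes the whole mapping fits in memory on one machine, which a real Hadoop job would not be able to rely on.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper, not part of Mahout: maps long IDs to dense int indices and back. */
public class LongIdIndex {

    private final Map<Long, Integer> idToIndex = new HashMap<Long, Integer>();
    private final List<Long> indexToId = new ArrayList<Long>();

    /** Returns the int index for an ID, assigning the next free index the first time it is seen. */
    public int toIndex(long id) {
        Integer existing = idToIndex.get(id);
        if (existing != null) {
            return existing;
        }
        int index = indexToId.size();
        idToIndex.put(id, index);
        indexToId.add(id);
        return index;
    }

    /** Reverses the transformation: returns the original long ID for an index. */
    public long toId(int index) {
        return indexToId.get(index);
    }

    /** Number of distinct IDs seen so far. */
    public int size() {
        return indexToId.size();
    }
}
```

The forward direction would be applied when building vectors from the input preferences, and the reverse direction when turning vector dimensions back into the original IDs in the output.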

> org.apache.mahout.cf.taste.hadoop.cooccurence can not support long as user_id 
> and item_id
> -----------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-340
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-340
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.3
>            Reporter: Hui Wen Han
>             Fix For: 0.4
>
>
> I have preference data that uses long values for user_id and item_id,
> and the Hadoop cooccurence algorithm cannot support it

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
