[ 
https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840575#action_12840575
 ] 

Sean Owen commented on MAHOUT-320:
----------------------------------

I can see that at the end now. How do you feel about replacing Bigram with this 
and moving it to a common package? Also how about using Bigram's 
variable-length encoding? It might be more efficient as well as avoid you 
writing routines to serialize an int to a byte stream.

> Modify IntPairWritable in LDA implementation to be binary comparable to 
> improve performance.
> --------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-320
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-320
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Clustering
>    Affects Versions: 0.3
>            Reporter: Drew Farris
>            Assignee: Robin Anil
>            Priority: Minor
>         Attachments: MAHOUT-320.patch
>
>
> Per discussion with Robin, modifying o.a.m.clustering.lda.IntPairWritable to 
> be binary comparable will improve the performance of the comparison 
> operations during a sort because no marshaling will need to occur to compare 
> IntPairWritable instances.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to