[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720784#action_12720784 ]
Ted Dunning commented on MAHOUT-65: ----------------------------------- bq. I've seen this happen a lot with Solr and other XML based apps in that if you control both ends of the pipe, String based approaches, while nice from a readability standpoint, are showstoppers for performance. Having a converter from binary to String-based can then be employed when readability/debugging is required. The natural binary alternatives would include: 1) thrift (or not ... could be complicated due to lots of compatibility concerns) 2) protocol buffers (not really open source) 3) ad hoc hadoop writable format (but writables may be going away in favor of (4)) 4) avro (but it is really new) My tendency would be either (1) or (4). > Add Element Labels to Vectors and Matrices > ------------------------------------------ > > Key: MAHOUT-65 > URL: https://issues.apache.org/jira/browse/MAHOUT-65 > Project: Mahout > Issue Type: New Feature > Components: Matrix > Affects Versions: 0.1 > Reporter: Jeff Eastman > Assignee: Jeff Eastman > Attachments: MAHOUT-65-name.patch, MAHOUT-65-name.patch, > MAHOUT-65-name.patch, MAHOUT-65.patch, MAHOUT-65b.patch, MAHOUT-65c.patch, > MAHOUT-65d.patch > > > Many applications can benefit by accessing elements in vectors and matrices > using String labels in addition to numeric indices. Investigate adding such a > capability. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.