[jira] Commented: (MAHOUT-391) Make vector more space efficient with variable-length encoding, et al

2010-05-08 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865446#action_12865446 ] Robin Anil commented on MAHOUT-391: --- I am sure its just some bug somewhere, conceptually,

Re: [jira] Commented: (MAHOUT-391) Make vector more space efficient with variable-length encoding, et al

2010-05-06 Thread Robin Anil
I am guessing SequenceFile does something. Compressing the two yields similar size with gzip

[jira] Commented: (MAHOUT-391) Make vector more space efficient with variable-length encoding, et al

2010-05-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864851#action_12864851 ] Sean Owen commented on MAHOUT-391: -- Hmm, I got similar results from a crude test, generati

[jira] Commented: (MAHOUT-391) Make vector more space efficient with variable-length encoding, et al

2010-05-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864308#action_12864308 ] Sean Owen commented on MAHOUT-391: -- Oh I get it. My other outstanding patch for MAHOUT-302

[jira] Commented: (MAHOUT-391) Make vector more space efficient with variable-length encoding, et al

2010-05-05 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864304#action_12864304 ] Robin Anil commented on MAHOUT-391: --- import org.apache.mahout.common.MahoutTestCase; Thi

[jira] Commented: (MAHOUT-391) Make vector more space efficient with variable-length encoding, et al

2010-05-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864303#action_12864303 ] Sean Owen commented on MAHOUT-391: -- Bleh, that's not much of a difference at all. Are thes

[jira] Commented: (MAHOUT-391) Make vector more space efficient with variable-length encoding, et al

2010-05-05 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864292#action_12864292 ] Robin Anil commented on MAHOUT-391: --- I am getting build errors. Maybe you are different J