[ https://issues.apache.org/jira/browse/MAHOUT-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13144481#comment-13144481 ]
Grant Ingersoll commented on MAHOUT-873: ---------------------------------------- I've checked in some baseline functionality here. Going to leave this open, as I think we can really take this to an interesting level of capabilities. > Provide MapReduce job for creating Encoded Vectors from sequence files > ---------------------------------------------------------------------- > > Key: MAHOUT-873 > URL: https://issues.apache.org/jira/browse/MAHOUT-873 > Project: Mahout > Issue Type: Improvement > Reporter: Grant Ingersoll > Assignee: Grant Ingersoll > Fix For: 0.6 > > Attachments: MAHOUT-873.patch, MAHOUT-873.patch, MAHOUT-873.patch > > > Similar to SparseVectorsFromSequenceFiles, provide a version that can do > encoded vectors. Start simple by handling basic text, but this could easily > evolve to handle pluggable Vectorizer's that can better deal with features > (numerics, etc.). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira