[ https://issues.apache.org/jira/browse/MAHOUT-560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Isabel Drost updated MAHOUT-560: -------------------------------- Attachment: MAHOUT-560.patch Changes I made - any comments welcome (especially if there is an easier, more obvious way I have over-looked) > Support for more flexible file handling in text to sequence file conversion > --------------------------------------------------------------------------- > > Key: MAHOUT-560 > URL: https://issues.apache.org/jira/browse/MAHOUT-560 > Project: Mahout > Issue Type: Improvement > Components: Utils > Affects Versions: 0.5 > Reporter: Isabel Drost > Assignee: Isabel Drost > Priority: Trivial > Fix For: 0.5 > > Attachments: MAHOUT-560.patch > > > Currently SequenceFilesFromDirectory supports for conversion of texts to > sequence file. The exact file (and potentially text from file) selection is > not configurable. I'd like to re-use most of the conversion logic but change > the exact text selection. (More information on what exactly I want to do: > http://tinyurl.com/35pv8jg ) > I slightly changed SequenceFilesFromDirectory to make that possible. (Added > one additional optional parameter, but by default the current behaviour is > used). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.