[ https://issues.apache.org/jira/browse/MAHOUT-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13982431#comment-13982431 ]
Suneel Marthi commented on MAHOUT-1252:
---------------------------------------

Drew, I am leaning towards moving the existing seq2sparse to Spark and doing away with MR, given that all of the existing MR code has already been moved under 'mrlegacy'.

Question for you: if we were to do seq2sparse all over again, would word2vec be a good candidate to look at? I believe it was either you (or someone else from Booz Allen) who gave a presentation on word2vec at the DC Big Data Conf in March.

> Add support for Finite State Transducers (FST) as a DictionaryType.
> -------------------------------------------------------------------
>
>                 Key: MAHOUT-1252
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1252
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Integration
>   Affects Versions: 0.7
>            Reporter: Suneel Marthi
>            Assignee: Suneel Marthi
>             Fix For: 1.0
>
>
> Add support for Finite State Transducers (FST) as a DictionaryType; this
> should result in an order of magnitude speedup of seq2sparse.

--
This message was sent by Atlassian JIRA
(v6.2#6252)