[ https://issues.apache.org/jira/browse/MAHOUT-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13975083#comment-13975083 ]
Sebastian Schelter commented on MAHOUT-1518:
--------------------------------------------

[~pferrel] the attached patch contains a preprocessor that should be able to handle your input. Have a look at the PreprocessorExample in the patch to see how it is applied.

Once we're happy with this, we can marry it with MAHOUT-1464.

> Preprocessing for collaborative filtering with the Scala DSL
> ------------------------------------------------------------
>
>                 Key: MAHOUT-1518
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1518
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: Sebastian Schelter
>            Assignee: Sebastian Schelter
>             Fix For: 1.0
>
>         Attachments: MAHOUT-1518.patch
>
>
> The aim here is to provide some easy-to-use machinery to enable the usage of
> the new Cooccurrence Analysis code from MAHOUT-1464 with datasets represented
> as follows in a CSV file with the schema _timestamp, userId, itemId, action_,
> e.g.
> {code}
> timestamp1, userIdString1, itemIdString1, "view"
> timestamp2, userIdString2, itemIdString1, "like"
> {code}

--
This message was sent by Atlassian JIRA
(v6.2#6252)
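
For readers without the patch at hand, the following is a minimal sketch of the kind of preprocessing the issue description implies: parse rows with the schema _timestamp, userId, itemId, action_, map the string IDs to consecutive integer indices, and collect the (user, item) cells per action. It is not the code from MAHOUT-1518.patch and it does not use the Mahout Scala DSL; it relies only on plain Scala collections. All names here (Interaction, parseLine, buildDictionary, cellsByAction) are hypothetical, and the parser assumes that fields never contain commas.

```scala
// Hypothetical sketch: none of these names come from MAHOUT-1518.patch.
object InteractionPreprocessingSketch {

  // One parsed CSV row with the schema: timestamp, userId, itemId, action.
  case class Interaction(timestamp: String, userId: String, itemId: String, action: String)

  // Split on commas, trim whitespace, and strip optional double quotes around a field.
  // Assumption: fields never contain commas themselves.
  def parseLine(line: String): Interaction = {
    val fields = line.split(",").map(_.trim.stripPrefix("\"").stripSuffix("\""))
    require(fields.length == 4, s"expected 4 fields but got ${fields.length}: $line")
    Interaction(fields(0), fields(1), fields(2), fields(3))
  }

  // Assign consecutive integer indices to string IDs in order of first appearance.
  def buildDictionary(ids: Seq[String]): Map[String, Int] =
    ids.distinct.zipWithIndex.toMap

  // For each action, collect the distinct (userIndex, itemIndex) pairs, i.e. the
  // non-zero cells of that action's user-by-item interaction matrix.
  def cellsByAction(interactions: Seq[Interaction],
                    users: Map[String, Int],
                    items: Map[String, Int]): Map[String, Set[(Int, Int)]] =
    interactions
      .groupBy(_.action)
      .map { case (action, rows) =>
        val cells: Set[(Int, Int)] =
          rows.map(r => (users(r.userId), items(r.itemId))).toSet
        (action, cells)
      }

  def main(args: Array[String]): Unit = {
    // The two sample rows from the issue description.
    val lines = Seq(
      "timestamp1, userIdString1, itemIdString1, \"view\"",
      "timestamp2, userIdString2, itemIdString1, \"like\""
    )
    val interactions = lines.map(parseLine)
    val users = buildDictionary(interactions.map(_.userId))
    val items = buildDictionary(interactions.map(_.itemId))

    cellsByAction(interactions, users, items).foreach { case (action, cells) =>
      println(s"$action -> $cells")
    }
  }
}
```

Keeping one set of cells per action means each action can later be turned into its own user-by-item matrix while sharing the same user and item dictionaries, which is presumably what a cooccurrence analysis over several action types would need.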