[ https://issues.apache.org/jira/browse/MAHOUT-212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ted Dunning updated MAHOUT-212:
-------------------------------

    Attachment: MAHOUT-212.patch

Hmm... didn't get asked for where the patch file was when marking the bug as patch available.

> Need random sampler for use in reducers
> ---------------------------------------
>
>                 Key: MAHOUT-212
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-212
>             Project: Mahout
>          Issue Type: Bug
>          Components: Utils
>    Affects Versions: 0.2
>            Reporter: Ted Dunning
>            Assignee: Ted Dunning
>             Fix For: 0.3
>
>         Attachments: MAHOUT-212.patch
>
>
> For a variety of mining algorithms, it helps to have a uniform way to only
> process a sub-set of the records in a reducer.
> As such, I have written a simple generic sampler that filters an Iterator
> returning a fair sample of at most a specified size.
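
For readers without the attachment at hand, here is a minimal sketch of one common way to draw a fair sample of at most a fixed size from an Iterator in a single pass: reservoir sampling (Algorithm R). This is an assumption about the general technique, not a copy of what MAHOUT-212.patch does; the class name ReservoirSampler and the method sample are hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Random;

/** Hypothetical illustration only; not the class shipped in MAHOUT-212.patch. */
final class ReservoirSampler {
  private ReservoirSampler() {}

  /**
   * Returns a uniform random sample of at most maxSize items from source,
   * reading the iterator once and holding at most maxSize items in memory.
   */
  static <T> List<T> sample(Iterator<T> source, int maxSize, Random random) {
    if (maxSize < 0) {
      throw new IllegalArgumentException("maxSize must be non-negative");
    }
    List<T> reservoir = new ArrayList<>();
    long seen = 0; // number of items read so far
    while (source.hasNext()) {
      T item = source.next();
      seen++;
      if (reservoir.size() < maxSize) {
        // Fill phase: the first maxSize items are always kept.
        reservoir.add(item);
      } else if (maxSize > 0) {
        // Replacement phase: pick a slot in [0, seen); the new item survives
        // only if that slot falls inside the reservoir.
        long slot = (long) (random.nextDouble() * seen);
        if (slot < maxSize) {
          reservoir.set((int) slot, item);
        }
      }
    }
    return reservoir;
  }
}
```

In a reducer, the values iterator for a key could be passed as source, so that only the returned list is processed further. If the input holds fewer than maxSize records, all of them come back in their original order.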