[ https://issues.apache.org/jira/browse/MAHOUT-676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lance Norskog updated MAHOUT-676:
---------------------------------
Attachment: MAHOUT-676.patch
I got interested again :)
This patch includes full unit tests and a new sampler. The sampler interface
has changed: you can now add samples, iterate over the current list, and check
whether a sample would be dropped. That check advances the state machine
inside the sampler.
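To make the interface described above concrete, here is a minimal sketch. The patch itself is not quoted in this message, so every name here (SamplerSketch, ReservoirSketch, addSample, isSampled) is hypothetical, and reservoir sampling is only an assumed stand-in for whatever policy the real samplers use. The point it illustrates is that asking whether a sample would be kept advances the sampler's internal state, just as adding one does.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Random;

// Hypothetical interface: method names are illustrative, not taken from the patch.
interface SamplerSketch<T> extends Iterable<T> {
  /** Offer a sample; the sampler may keep it or drop it. */
  void addSample(T sample);

  /** Report whether this sample would be kept. Advances internal state. */
  boolean isSampled(T sample);
}

// Reservoir sampling (Algorithm R) is used here only as an assumed policy.
class ReservoirSketch<T> implements SamplerSketch<T> {
  private final List<T> kept = new ArrayList<>();
  private final int capacity;
  private final Random rnd;
  private int seen = 0; // the "state machine": how many samples were offered

  ReservoirSketch(int capacity, Random rnd) {
    if (capacity <= 0) {
      throw new IllegalArgumentException("capacity must be positive");
    }
    this.capacity = capacity;
    this.rnd = rnd;
  }

  // Advance the state by one offer; return the slot to fill, or -1 to drop.
  private int step() {
    seen++;
    if (kept.size() < capacity) {
      return kept.size(); // still filling: append at the end
    }
    int slot = rnd.nextInt(seen); // uniform in [0, seen)
    return slot < capacity ? slot : -1;
  }

  @Override
  public void addSample(T sample) {
    int slot = step();
    if (slot == kept.size()) {
      kept.add(sample);
    } else if (slot >= 0) {
      kept.set(slot, sample);
    }
  }

  @Override
  public boolean isSampled(T sample) {
    // This policy ignores the sample's value; only the offer count matters.
    return step() >= 0;
  }

  @Override
  public Iterator<T> iterator() {
    return kept.iterator();
  }
}
```

In this sketch a caller can stream items through addSample and then iterate the sampler to read the current list. Calling isSampled consumes one "turn", so it should be used instead of addSample for a given item, not in addition to it.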
The major point of interest is a brute-force implementation of "Slice
Sampling": you supply a function on your samples, and the sampler keeps samples
based on the "area" under the function. Example: beyond 20 movies, it doesn't
matter how many movies a user has watched. So the function on a sample returns
the number of movies that user watched.
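One possible reading of the "area under the function" idea can be sketched as follows. This is an assumption about the intent, not the code in the patch: the class name AreaSamplerSketch, the ceiling parameter, and the keep rule are all hypothetical. The sketch uses plain rejection sampling: draw a uniform height between 0 and a ceiling, and keep the sample if that height falls under the function's value. With a ceiling of 20, any user with 20 or more movies is always kept, and users with fewer are kept in proportion to their count.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;
import java.util.function.ToDoubleFunction;

// Hypothetical sketch: keeps a sample when a uniform draw lands under f(sample).
class AreaSamplerSketch<T> {
  private final ToDoubleFunction<T> height; // user-supplied function on samples
  private final double ceiling;             // values above this all count the same
  private final Random rnd;
  private final List<T> kept = new ArrayList<>();

  AreaSamplerSketch(ToDoubleFunction<T> height, double ceiling, Random rnd) {
    if (ceiling <= 0) {
      throw new IllegalArgumentException("ceiling must be positive");
    }
    this.height = height;
    this.ceiling = ceiling;
    this.rnd = rnd;
  }

  /** Offer a sample; returns true if it was kept. */
  boolean addSample(T sample) {
    // Clamp the function value into [0, ceiling].
    double h = Math.min(Math.max(height.applyAsDouble(sample), 0.0), ceiling);
    // Uniform draw in [0, ceiling); keep the sample if it falls under h.
    double u = rnd.nextDouble() * ceiling;
    if (u < h) {
      kept.add(sample);
      return true;
    }
    return false;
  }

  List<T> getSamples() {
    return kept;
  }
}
```

For the movie example, the function would return each user's movie count and the ceiling would be 20, so a user with 5 movies is kept about a quarter of the time while a user with 200 movies is treated the same as one with 20.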
> Random samplers in a modular library
> ------------------------------------
>
> Key: MAHOUT-676
> URL: https://issues.apache.org/jira/browse/MAHOUT-676
> Project: Mahout
> Issue Type: New Feature
> Components: Math
> Reporter: Lance Norskog
> Priority: Minor
> Attachments: MAHOUT-676.patch, Sampler.patch
>
>
> This is a modular suite of samplers.