Because of its non-deterministic nature, Dirichlet is darn hard to test. The 2-d tests offer the option of plotting out the points and the models and eyeballing the result (http://cwiki.apache.org/MAHOUT/dirichlet-process-clustering.html) but more rigorous testing and higher order problems in general are needed. There was a student on this list last summer who offered some pointed suggestions but he did not follow up and I've been under water in a startup.

Ted Dunning wrote:
Because the unit tests were 2-dimensional examples.

On Wed, Jan 13, 2010 at 3:34 PM, Jake Mannix <[email protected]> wrote:

Ack, this is bad - why have we not caught this in unit tests?





Reply via email to