Because of its non-deterministic nature, Dirichlet is darn hard to test.
The 2-d tests offer the option of plotting out the points and the models
and eyeballing the result
(http://cwiki.apache.org/MAHOUT/dirichlet-process-clustering.html) but
more rigorous testing and higher order problems in general are needed.
There was a student on this list last summer who offered some pointed
suggestions but he did not follow up and I've been under water in a startup.
Ted Dunning wrote:
Because the unit tests were 2-dimensional examples.
On Wed, Jan 13, 2010 at 3:34 PM, Jake Mannix <[email protected]> wrote:
Ack, this is bad - why have we not caught this in unit tests?