This is absolutely necessary, if not for just showing off with the project, then
certainly for verification of correctness of algorithms inside it.
I will certainly hop in to such a subtask to the extent of my current available
time resources (not much, sadly).
D.
Grant Ingersoll wrote:
Now that we have some code in place for clustering, I think it would be
cool to put together some examples/demos of real world problems. Things
like clustering text (perhaps we can use the wikipedia download or the
reuters download that Lucene contrib/benchmark uses) or clustering other
pieces of data.
We could setup a demo area of code and use Lucene's analysis code to
create document vectors.
Ideas and/or thoughts or volunteers?
Cheers,
Grant