This is absolutely necessary: if not just for showing off the project, then certainly for verifying the correctness of the algorithms in it.

I will certainly join in on such a subtask as far as my currently available time allows (not much, sadly).

D.

Grant Ingersoll wrote:
Now that we have some code in place for clustering, I think it would be cool to put together some examples/demos of real-world problems. Things like clustering text (perhaps we can use the Wikipedia download or the Reuters download that Lucene contrib/benchmark uses) or clustering other pieces of data.

We could set up a demo area of code and use Lucene's analysis code to create document vectors.

Ideas and/or thoughts or volunteers?

Cheers,
Grant
