Incremental Clustering from Text Data

2014-01-16 Thread John White
Hello, I use seq2sparse with -wt tfidf option and execute the kmeans pipeline. If new data comes at a later date, should I decide which cluster it belongs using Listing 9.4 News clustering using canopy generation and k-means clustering in Mahout in Action, or is there a better/more generic (i.e.

Re: Incremental Clustering from Text Data

2014-01-16 Thread John White
/16 John White devilgr...@gmail.com Hello, I use seq2sparse with -wt tfidf option and execute the kmeans pipeline. If new data comes at a later date, should I decide which cluster it belongs using Listing 9.4 News clustering using canopy generation and k-means clustering in Mahout in Action