Hey All, I intend to build a KMeans clustering for my documents but does not want to use the tf/ tf-idf based vectors. I have a map of <term,weight> associated with the document. I want to use these weights in place of the term counts for my computation. Was having a look at DocumentProcessor class, it is primarily driven by the term count. wondering whether I need to do something on my own or is there an inbuilt support for this.
-- Thanks, Sharath
