Hi Ted, We've been discussing this on and off, and I'd like to pick up the thread again. Currently we have AbstractVectorClassifier (in pkg classifier) and VectorModelClassifier (in pkg clustering). Together these allow any set of cluster models (List<Model<VectorWritable>>) to function as a classifier. In your last email you indicated that this was a step in the right direction. What else is needed?
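To make the "set of models as a classifier" idea concrete, here is a rough, self-contained sketch. It is not the actual AbstractVectorClassifier or VectorModelClassifier code: the Model interface, SphericalModel, ModelListClassifier, and the pdf()/classify()/mostLikely() methods are all hypothetical names I made up for illustration, and plain double[] stands in for VectorWritable. The only point is that scoring a vector against each model and normalizing the scores yields a classifier-style probability vector.

```java
import java.util.List;

// Hypothetical stand-in for a cluster model: it scores how well a point fits.
interface Model {
    double pdf(double[] point);
}

// Hypothetical spherical cluster whose score decays with squared distance
// from its center. Assumes points have the same dimension as the center.
final class SphericalModel implements Model {
    private final double[] center;

    SphericalModel(double... center) {
        this.center = center.clone();
    }

    @Override
    public double pdf(double[] point) {
        double squaredDistance = 0.0;
        for (int i = 0; i < center.length; i++) {
            double diff = point[i] - center[i];
            squaredDistance += diff * diff;
        }
        return Math.exp(-squaredDistance);
    }
}

// Treats a list of cluster models as a classifier: one score per model,
// normalized so that the scores sum to 1.
final class ModelListClassifier {
    private final List<Model> models;

    ModelListClassifier(List<Model> models) {
        this.models = models;
    }

    double[] classify(double[] point) {
        double[] scores = new double[models.size()];
        double total = 0.0;
        for (int i = 0; i < scores.length; i++) {
            scores[i] = models.get(i).pdf(point);
            total += scores[i];
        }
        for (int i = 0; i < scores.length; i++) {
            // Fall back to a uniform distribution if every score underflowed to 0.
            scores[i] = total > 0.0 ? scores[i] / total : 1.0 / scores.length;
        }
        return scores;
    }

    // Index of the model with the highest normalized score.
    int mostLikely(double[] point) {
        double[] scores = classify(point);
        int best = 0;
        for (int i = 1; i < scores.length; i++) {
            if (scores[i] > scores[best]) {
                best = i;
            }
        }
        return best;
    }
}
```

With two models centered at (0, 0) and (5, 5), a point near the origin gets most of its probability mass from the first model, which is exactly what a trained classifier would report for that "category".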
One thought I've had is this: most clustering algorithms (the older ones, anyway) have static Driver methods "buildClusters()" and "clusterData()". Would it help the convergence of clustering and classification if these were simply renamed to "trainClusters()" and "classifyData()" (or something similar), respectively? I know it took me a while to see the isomorphism between clustering and classification, so perhaps something simple like this would be an improvement.
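For what it's worth, the rename could be done without breaking callers by keeping the old names as deprecated wrappers. The sketch below is only an illustration under heavy assumptions: ExampleClusterDriver is a hypothetical class, the argument lists are simplified placeholders rather than the real Driver signatures, and the "training" step (taking the first k points as centers) is a dummy, not any actual algorithm of ours.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical driver; the real Driver methods take different arguments.
final class ExampleClusterDriver {
    private ExampleClusterDriver() {}

    // Proposed name: "training" produces cluster centers from the input points.
    static List<double[]> trainClusters(List<double[]> points, int k) {
        // Placeholder training step: take the first k points as centers.
        List<double[]> centers = new ArrayList<>();
        for (int i = 0; i < Math.min(k, points.size()); i++) {
            centers.add(points.get(i).clone());
        }
        return centers;
    }

    // Proposed name: "classification" assigns each point to its nearest center.
    static int[] classifyData(List<double[]> points, List<double[]> centers) {
        if (centers.isEmpty()) {
            throw new IllegalArgumentException("at least one center is required");
        }
        int[] labels = new int[points.size()];
        for (int p = 0; p < points.size(); p++) {
            double bestDistance = Double.POSITIVE_INFINITY;
            for (int c = 0; c < centers.size(); c++) {
                double distance = squaredDistance(points.get(p), centers.get(c));
                if (distance < bestDistance) {
                    bestDistance = distance;
                    labels[p] = c;
                }
            }
        }
        return labels;
    }

    // Old name kept as a thin wrapper so existing callers keep compiling.
    @Deprecated
    static List<double[]> buildClusters(List<double[]> points, int k) {
        return trainClusters(points, k);
    }

    // Old name kept as a thin wrapper so existing callers keep compiling.
    @Deprecated
    static int[] clusterData(List<double[]> points, List<double[]> centers) {
        return classifyData(points, centers);
    }

    private static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double diff = a[i] - b[i];
            sum += diff * diff;
        }
        return sum;
    }
}
```

The old and new names behave identically here, so the change would be purely one of vocabulary, which is the whole point of the suggestion.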
