Hi Tharindu, Rupert
On 12/03/14 09:07, Rupert Westenthaler wrote:
I like the general architecture consisting of a
* TopicClassifier
* TrainingSet
as this allows different implementations for managing the
training set (e.g. in Solr, an RDF triple store, a database or simple
files in a file system) and for TopicClassifiers (Solr, OpenNLP, Mahout,
...). Note also that the TrainingSet part is optional and only required
for TopicClassifiers that can dynamically update their classification
models.
Regarding this, it may be worth bringing back into the discussion a new
version of the CMS Adapter and ContentHub, which could easily feed the
training API.
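
To make the split a bit more concrete, here is a minimal Java sketch of how
the two parts could relate. Only the names TopicClassifier and TrainingSet
come from this thread; every method signature, the in-memory store and the
toy word-overlap classifier are assumptions for discussion, not the actual
Stanbol API. The point is only that a feeder (for example a CMS Adapter)
would need to know nothing beyond TrainingSet.addExample, and that a
classifier without a training set remains valid.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.Set;

public class TopicSketch {

    /** Hypothetical storage abstraction; could be backed by Solr, an RDF store, a database or files. */
    public interface TrainingSet {
        void addExample(String topic, String text);
        Map<String, List<String>> examplesByTopic();
    }

    /** Hypothetical classifier abstraction; the training set is optional. */
    public interface TopicClassifier {
        Optional<String> classify(String text);
        Optional<TrainingSet> trainingSet();
    }

    /** Simplest possible TrainingSet: keeps everything in memory. */
    public static class InMemoryTrainingSet implements TrainingSet {
        private final Map<String, List<String>> examples = new LinkedHashMap<>();

        @Override
        public void addExample(String topic, String text) {
            examples.computeIfAbsent(topic, t -> new ArrayList<>()).add(text);
        }

        @Override
        public Map<String, List<String>> examplesByTopic() {
            return Collections.unmodifiableMap(examples);
        }
    }

    /** Toy classifier (an assumption, for illustration only): picks the topic
     *  whose training texts share the most words with the input. It reads the
     *  training set on every call, so newly added examples take effect at once. */
    public static class WordOverlapClassifier implements TopicClassifier {
        private final TrainingSet trainingSet; // may be null: training is optional

        public WordOverlapClassifier(TrainingSet trainingSet) {
            this.trainingSet = trainingSet;
        }

        @Override
        public Optional<TrainingSet> trainingSet() {
            return Optional.ofNullable(trainingSet);
        }

        @Override
        public Optional<String> classify(String text) {
            if (trainingSet == null) {
                return Optional.empty();
            }
            Set<String> words = new HashSet<>(Arrays.asList(text.toLowerCase().split("\\s+")));
            String best = null;
            int bestScore = 0;
            for (Map.Entry<String, List<String>> e : trainingSet.examplesByTopic().entrySet()) {
                int score = 0;
                for (String example : e.getValue()) {
                    for (String w : example.toLowerCase().split("\\s+")) {
                        if (words.contains(w)) {
                            score++;
                        }
                    }
                }
                if (score > bestScore) { // strict '>' keeps the first topic on ties
                    bestScore = score;
                    best = e.getKey();
                }
            }
            return Optional.ofNullable(best);
        }
    }
}
```

A feeder would then just call addExample(topic, text) for each categorised
content item, whatever storage sits behind the TrainingSet.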
Cheers, Rafa