On Tue, Jul 28, 2009 at 12:18 AM, Robin Anil <[email protected]> wrote:

> ... We need modules
> to convert data in databases (flat files, XML dumps, MySQL, different
> formats on HDFS, HBase) into an intermediate form (say, vectors).


Yes.  We do need that.


> Ever considered having a workflow where we select an InputformatReader job and
> an algorithm to perform (classification, clustering, itemset mining),
> where the first process converts the different sources into the vector
> format and then launches the algorithm?


That is an intriguing thought.  How many algorithms have the same shape
(as in, one input, one output, one algorithm, one input format)?
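
To make the "same shape" question concrete, here is a minimal Python sketch of
such a workflow: one reader turns a source into vectors, and one algorithm
consumes those vectors.  Every name in it (read_csv_lines, centroid, READERS,
ALGORITHMS, run_workflow) is hypothetical and chosen only for illustration;
none of this is an existing Mahout or Hadoop API, and a real version would
presumably run as jobs over HDFS rather than in memory.

```python
from typing import Callable, Dict, Iterable, List

Vector = List[float]


def read_csv_lines(lines: Iterable[str]) -> List[Vector]:
    """Hypothetical reader: parse comma-separated numbers into vectors."""
    return [[float(x) for x in line.split(",")] for line in lines if line.strip()]


def centroid(vectors: List[Vector]) -> Vector:
    """Hypothetical stand-in algorithm: component-wise mean of the vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


# Registries so that a reader and an algorithm can be selected by name.
READERS: Dict[str, Callable[[Iterable[str]], List[Vector]]] = {
    "csv": read_csv_lines,
}
ALGORITHMS: Dict[str, Callable[[List[Vector]], Vector]] = {
    "centroid": centroid,
}


def run_workflow(reader_name: str, algorithm_name: str, source: Iterable[str]) -> Vector:
    """One input, one input format, one algorithm, one output."""
    vectors = READERS[reader_name](source)      # step 1: source -> vectors
    return ALGORITHMS[algorithm_name](vectors)  # step 2: vectors -> result
```

Anything that fits the run_workflow signature has the shape I am asking about;
algorithms that need several inputs, or that produce a model plus assignments,
would not fit without extending it.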
