Re: Question about data warehousing and mining through Mahout

Ted Dunning Tue, 31 Aug 2010 15:49:09 -0700

For categorization, there are several different answers to the integration
problem, but exporting a sampled and curated data file as text is a pretty
typical data path.

The on-line sequential classifiers are a bit more flexible and would allow
different input formats at the cost of coding on your part.
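To illustrate why a sequential learner is flexible about input: it only
needs one (features, label) pair at a time, so any reader that can yield
such pairs will do. The following is a minimal sketch in Python, not
Mahout's API. The class OnlineLogisticRegression and the helper
examples_from_csv_lines are hypothetical names made up for this example,
and the input is assumed to be numeric columns with a 0/1 label last.

```python
import math
from typing import Iterable, Iterator, List, Tuple


class OnlineLogisticRegression:
    """Toy binary classifier trained one example at a time.

    Hypothetical illustration only; this is not Mahout's classifier API.
    """

    def __init__(self, num_features: int, learning_rate: float = 0.1) -> None:
        self.weights = [0.0] * num_features
        self.bias = 0.0
        self.learning_rate = learning_rate

    def predict_proba(self, features: List[float]) -> float:
        score = self.bias + sum(w * x for w, x in zip(self.weights, features))
        # Clamp the score so math.exp cannot overflow on extreme inputs.
        score = max(-30.0, min(30.0, score))
        return 1.0 / (1.0 + math.exp(-score))

    def train(self, features: List[float], label: int) -> None:
        # One stochastic gradient step on the log-loss for a single example.
        error = label - self.predict_proba(features)
        for i, x in enumerate(features):
            self.weights[i] += self.learning_rate * error * x
        self.bias += self.learning_rate * error


def examples_from_csv_lines(lines: Iterable[str]) -> Iterator[Tuple[List[float], int]]:
    """Assumed input layout: numeric feature columns, then a 0/1 label."""
    for line in lines:
        *values, label = line.strip().split(",")
        yield [float(v) for v in values], int(label)
```

Supporting another input format then means writing another small generator
like examples_from_csv_lines; the training loop itself does not change.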

Keep in mind that Hive keeps flat files in HDFS anyway.  Adding an
additional input format so that you don't have to copy a Hive output file
one extra time isn't hard, but neither is it hard to have Hive pop out
something like comma-separated values.
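As a rough sketch of the export path: Hive's default text storage separates
fields with Ctrl-A (\x01) and writes NULL as \N, so turning an output file
into a sampled CSV takes only a few lines. The function names, the
sample_rate parameter, and the choice to map NULL to an empty field are
assumptions made for this example.

```python
import csv
import random
from typing import Iterable, Iterator, List, TextIO

HIVE_FIELD_DELIMITER = "\x01"  # Hive's default field separator (Ctrl-A)
HIVE_NULL = "\\N"              # Hive's default text representation of NULL


def hive_rows(lines: Iterable[str]) -> Iterator[List[str]]:
    """Split Hive default-delimited text lines into lists of fields."""
    for line in lines:
        fields = line.rstrip("\n").split(HIVE_FIELD_DELIMITER)
        # Assumption: represent NULL as an empty CSV field.
        yield ["" if field == HIVE_NULL else field for field in fields]


def write_sampled_csv(lines: Iterable[str], out: TextIO,
                      sample_rate: float, seed: int = 0) -> int:
    """Write a random sample of the rows as CSV; return the rows written."""
    rng = random.Random(seed)  # fixed seed keeps the sample reproducible
    writer = csv.writer(out, lineterminator="\n")
    written = 0
    for row in hive_rows(lines):
        if rng.random() < sample_rate:
            writer.writerow(row)
            written += 1
    return written
```

Typical use would be to open a file copied out of HDFS and call
write_sampled_csv(source_file, target_file, sample_rate=0.1). Going through
the csv module means fields that themselves contain commas are quoted.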

On Tue, Aug 31, 2010 at 3:41 PM, hdev ml <[email protected]> wrote:

> Now can Mahout integrate with Hive to make use of this data for learning,
> mining etc.? Or do I have to export the Hive data into text files hosted
> on Hadoop/HDFS, which Mahout can later use for data mining?
>
> In short, can the data warehousing part be done by Hive and the data
> mining part then be done by Mahout on this Hive data?
>
