Thanks Ted. That again validates my path. Thanks Ted, Chris and Sean for your valuable inputs.
Community Rocks!!!! Off the topic - A few years back, I was on the JavaCC mailing list. There were 2 guys - one from New Zealand and the other one from France - replying to my problems. I was literally getting round-the-clock support. More power to the community!!!! -H On Tue, Aug 31, 2010 at 3:48 PM, Ted Dunning <[email protected]> wrote: > For categorization, there are several different answers to the integration > problem, but text > export of a sampled and curated data file is pretty typical as a data path. > > The on-line sequential classifiers are a bit more flexible and would allow > different input > formats at the cost of coding on your part. > > Keep in mind that Hive is keeping flat files in HDFS anyway. Adding an > additional format > so that you don't have to copy a Hive output file one extra time isn't > hard, > but neither is > it hard to have Hive pop out something like comma separated values. > > On Tue, Aug 31, 2010 at 3:41 PM, hdev ml <[email protected]> wrote: > > > Now can Mahout integrate with Hive to make use of this data for learning, > > mining etc.? or do I have to export the hive data into text files which > can > > be hosted by Haddop/HDFS which later on Mahout can use for data mining. > > > > In short, can data warehousing part be done by Hive and then can data > > mining > > part be done by Mahout on this hive data? > > >
