I see what you are saying. Yes, I am going into the Hive direction for now. Currently, installing/configuring the packages.
I think after a couple of months when things settle down with Hadoop and Hive, I will take the Mahout course. Thanks -H On Tue, Aug 31, 2010 at 5:34 PM, Ted Dunning <[email protected]> wrote: > I think that Chris was actually recommending stuff that is too simple to > call data-mining. > > Basically this stuff is simpler than any machine learning algorithm so > there > isn't anything really > to write. > > An example for recommendations is to simply recommend the most popular > items > to everybody, > possibly with a bit of dithering so it doesn't look so static. This *is* > actually a recommendation > algorithm just like random selection is. Both of these provide interesting > baseline levels for > clicks and engagement. You *might* want to use Mahout to implement these, > but it is probably > better to get the rest of the framework in place first. > > On Tue, Aug 31, 2010 at 4:03 PM, hdev ml <[email protected]> wrote: > > > 3. Hhhmm..That seems like a very good suggestion. I am not averse to the > > idea of writing my own implementation of mining algorithms. I am just > > worried about their accuracy and stability. So summary is basically do > the > > transformation and statistical part first. When it comes to data mining, > > write your own algorithms or use Mahout (if at all hive integration is > > possible, or maybe reuse the raw text files or output dump of Hive > tables) > > >
