Re: GSoC 2009-Discussion

2009-03-24 Thread deneche abdelhakim
e Samuel would be interested in the first...but of course if actually the community need them both :) --- En date de : Mar 24.3.09, Ted Dunning a écrit : > De: Ted Dunning > Objet: Re: GSoC 2009-Discussion > À: mahout-dev@lucene.apache.org > Date: Mardi 24 Mars 2009, 0h07 > There

Re: GSoC 2009-Discussion

2009-03-23 Thread Ted Dunning
There are other algorithms of serious interest. Bayesian Additive Regression Trees (BART) would make a very interesting complement to Random Forests. I don't know how important it is to get a normal decision tree algorithm going because the cost to build these is often not that high. Boosted deci

Re: GSoC 2009-Discussion

2009-03-23 Thread Otis Gospodnetic
2009 7:17:11 PM > Subject: GSoC 2009-Discussion > > Hi, > I just browsed through the idea list in GSoC 2009 and I'm interested > to work in Apache Mahout. > Currently, I'm doing my master project in my university related to > machine learning + information retrieva

Re: GSoC 2009-Discussion

2009-03-23 Thread Dawid Weiss
> [snip] a web crawler. By doing this, a crawler, for instance, can use the output of the classification to only follow certain links that lie on informative content parts. Is this interesting & make sense for you guys? Hi Samuel. This would be of great interest for the Nutch folks, I thi

GSoC 2009-Discussion

2009-03-22 Thread Samuel Louvan
Hi, I just browsed through the idea list in GSoC 2009 and I'm interested to work in Apache Mahout. Currently, I'm doing my master project in my university related to machine learning + information retrieval. More specifically it's about how to discover informative content in a web page by using mac