Utilizing multiple, boolean pref data models in a single recommender

2013-04-09 Thread robert.x.campbell
I'd like to expand on this question and answer: http://stackoverflow.com/questions/15707013/utilizing-multiple-weighed-data-models-for-a-mahout-recommender Besides ParallelALSFactorizationJob and Myrrix, are there any other ways in which you can feed multiple boolean pref data models into a

Re: Does Mahout decision forest support regression?

2013-04-09 Thread Ted Dunning
Not at this time. On Fri, Apr 5, 2013 at 4:48 PM, S. Zhou myx...@yahoo.com wrote: I am using Mahout 0.7. Thanks

Re: In-memory kmeans clustering

2013-04-09 Thread Ted Dunning
This seems surprising. I don't think we removed it. Does anybody know better than I? On Mon, Apr 8, 2013 at 2:16 PM, Ahmet Ylmaz ahmetyilmazefe...@yahoo.comwrote: Hi, It seems to be that in-memory kmeans clustering is removed from Mahout 0.7. Does this mean that it is no longer possible

Re: In-memory kmeans clustering

2013-04-09 Thread Dan Filimon
Apologies for not getting back to you more quickly! You can use Mahout KMeansDriver and have it run locally (so not as a MapReduce, but locally). There's a static method KMeansDriver.run() whose last argument is runSequential. You need to set this to true. The thing is it will still read and

Re: Does Mahout decision forest support regression?

2013-04-09 Thread Dan Filimon
Though, if you want to try Andy's experimental repo: https://github.com/andytwigg/mahout Andy, is it usable?

Re: Does Mahout decision forest support regression?

2013-04-09 Thread Andy Twigg
no, it's not.

RE: Does Mahout decision forest support regression?

2013-04-09 Thread Daniel Donahoe
Andy, This youtube presentation may help some members understand the big picture. ASME Utah/ University of Utah hosted Prof. Adele Cutler (Utah State University), co-creator of Random Forests recently: http://www.youtube.com/watch?v=ldKCz1-SEAsfeature=youtu.be; reference:

User Based recommender - strange behaviour of Pearson

2013-04-09 Thread yamo93
Hi, I use a user based recommender. I've just discovered a strange behaviour of Pearson when a user has the same ratings for all rated items. The system don't recommend anything in this case for this user. I try an explanation : it is due to centered data (centeredSumX2 equals 0 in this

Re: User Based recommender - strange behaviour of Pearson

2013-04-09 Thread Tevfik Aytekin
You are correct, since centeredSumX2 equals zero, the Pearson similarity will be undefined (because of division by zero in the Pearson formula). If you do not center the data that will be cosine similarity which is another common similarity metric used in recommender systems and it will not be

Re: In-memory kmeans clustering

2013-04-09 Thread Johannes Schulte
Hi, this worked for me without having to fiddle with map reduce classes ListCluster initialClusters = new ArrayListCluster(); IterableVector dataPoints = Lists.newArrayList(); ClusterClassifier prior = new ClusterClassifier(initialClusters,

Re: In-memory kmeans clustering

2013-04-09 Thread Johannes Schulte
dataPoints can be in memory or from disk, and you can sample the dataPoints for initialClusters. On Tue, Apr 9, 2013 at 6:16 PM, Johannes Schulte johannes.schu...@gmail.com wrote: Hi, this worked for me without having to fiddle with map reduce classes ListCluster initialClusters = new

Could OpenNLP use Mahout for classification?

2013-04-09 Thread Jörn Kottmann
Hi all, I am currently putting a proposal for Apache OpenNLP together to make the Machine Learning pluggable, which lead me to the question if Mahout would be suitable candidate to provide Machine Learning capabilities. In OpenNLP we currently only do classification, the Mahout algorithm

JobConf and ClassPath

2013-04-09 Thread Cyril Bogus
So I am trying to run a Canopy Clustering on a small data imported in the hdfs. My Program is running on an IDE with Java and I have all the tools in my build path I setup JobConf with my own core-site.xml / hdfs-site.xml and mapred-site.xml Whenever I run the Canopy on the clustering I get the

Re: JobConf and ClassPath

2013-04-09 Thread Dominik Hübner
Do you run hadoop in pseudo mode or on a real cluster? If hadoop is running on a cluster, the classes in your class path are of course not reachable. I usually let maven include the mahout jars into my project's package. This way your applications becomes independent on what kind of hadoop

Re: JobConf and ClassPath

2013-04-09 Thread Suneel Marthi
U seem to be missing google collections in ur IDE class path Sent from my iPhone On Apr 9, 2013, at 4:50 PM, Cyril Bogus cyrilbo...@gmail.com wrote: So I am trying to run a Canopy Clustering on a small data imported in the hdfs. My Program is running on an IDE with Java and I have all the

Re: JobConf and ClassPath

2013-04-09 Thread Cyril Bogus
To Suneel, I just ran some code using the google collection class and it is working fine so I know it is included. To Dominik, You might be right. That would explain why it works in pseudo mode but when I try on the cluster it does not know where to look anymore. On Tue, Apr 9, 2013 at 5:30

Re: JobConf and ClassPath

2013-04-09 Thread Dominik Hübner
Try adding this to your pom file build plugins plugin groupIdorg.apache.maven.plugins/groupId artifactIdmaven-assembly-plugin/artifactId executions execution