Re: How to make normal Text suitable for Kmeans using mahout

2013-01-25 Thread Mahesh Balija
Hi Siddharth, Your question is not very clear on what (either keywords/documents) you want to cluster. BTW if you are looking for document clustering its straight approach using the term/keyword weights and you can find documentation in Mahout in Action or some other

Re: Boolean preferences and evaluation

2013-01-25 Thread Koobas
Great suggestion! Will do. On Fri, Jan 25, 2013 at 1:10 AM, Sean Owen sro...@gmail.com wrote: Why not test both the original and pruned data set? The low-rating data may still help, even when the rating is forgotten. I would not base the decision just on whether you can make recommendations

Newbie Question

2013-01-25 Thread Robby Stamper
I have successfully run the Breiman example from https://cwiki.apache.org/confluence/display/MAHOUT/Breiman+Example How do I view the tree? Do I need to write a program that instantiates ForestVisualizer.java? Is there another program for visualizing the results of Mahout output? Many

Re: Precision question

2013-01-25 Thread Sean Owen
The way I do it is to set x different for each user, to the number of items in the user's test set -- you ask for x recommendations. This makes precision == recall, note. It dodges this problem though. Otherwise, if you fix x, the condition you need is stronger, really: each user needs = x *test

Re: Precision question

2013-01-25 Thread Zia mel
Interesting. Using IRStatistics stats = evaluator.evaluate(recommenderBuilder, null, model, null, 5, GenericRecommenderIRStatsEvaluator.CHOOSE_THRESHOLD, 1.0); Can it be adjusted to each user ? In other

Re: Precision question

2013-01-25 Thread Sean Owen
No, it takes a fixed at value. You can modify it to do whatever you want. You will see it doesn't bother with users with little data, like 2*at data points. On Fri, Jan 25, 2013 at 6:23 PM, Zia mel ziad.kame...@gmail.com wrote: Interesting. Using IRStatistics stats =