Re: Persisting trained models in Mahout

2011-12-09 Thread Jens Grivolla
On 12/08/2011 03:19 PM, Sebastian Schelter wrote: [...] A model for recommenders that use matrix factorization consists of the user and item feature vectors. You can use a FilePersistenceStrategy with any SVDRecommender to read and write these. In the future we could also support loading the

Re: Persisting trained models in Mahout

2011-12-09 Thread Sebastian Schelter
Yes, you describe it perfectly. I think the only reason this has not been done yet is that the model computation is not very fast on Hadoop because of its iterative nature. Would you like to work on integrating the SVD recommenders? --sebastian On 09.12.2011 11:17, Jens Grivolla wrote: On

Re: mahout exception (lucene.vector)

2011-12-09 Thread Grant Ingersoll
The Lucene Driver class has a clause that keeps track of how many docs don't have term vectors and exits if the threshold is reached. You can control the threshold using the maxPercentErrorDocs input argument. The argument is a percentage, expressed as a number between 0 and 1.
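To make the threshold behaviour described above concrete, here is a minimal, self-contained Java sketch. It is not the Mahout Driver code: the class name ErrorDocThreshold, both method names, the rounding (floor), and the exact comparison are assumptions made for illustration. Only the idea comes from the message: count docs without term vectors and stop once a limit derived from a 0-to-1 fraction is reached.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not the actual Mahout lucene.vector Driver.
final class ErrorDocThreshold {

    // Converts the 0..1 fraction into an absolute number of docs.
    // Rounding down is an assumption of this sketch.
    static int maxErrorDocs(double maxPercentErrorDocs, int numDocs) {
        if (maxPercentErrorDocs < 0.0 || maxPercentErrorDocs > 1.0) {
            throw new IllegalArgumentException(
                "maxPercentErrorDocs must be between 0 and 1, got " + maxPercentErrorDocs);
        }
        return (int) Math.floor(maxPercentErrorDocs * numDocs);
    }

    // Each list entry says whether the doc at that position has a term vector.
    // Returns the number of usable docs, or throws once the number of docs
    // without a term vector reaches the limit.
    static int countUsableDocs(List<Boolean> hasTermVector, double maxPercentErrorDocs) {
        int limit = maxErrorDocs(maxPercentErrorDocs, hasTermVector.size());
        int errorDocs = 0;
        int usableDocs = 0;
        for (boolean present : hasTermVector) {
            if (present) {
                usableDocs++;
                continue;
            }
            errorDocs++;
            if (errorDocs >= limit) {
                throw new IllegalStateException(
                    "Too many docs without term vectors: " + errorDocs
                        + " (limit " + limit + ")");
            }
        }
        return usableDocs;
    }

    public static void main(String[] args) {
        List<Boolean> docs = Arrays.asList(true, false, true, true);
        // One of four docs lacks a term vector; a fraction of 0.5 tolerates it.
        System.out.println(countUsableDocs(docs, 0.5));
    }
}
```

In this sketch a fraction of 0 means the first doc without a term vector aborts the run, so an index with even a few such docs needs a small positive value.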

PLEASE HELP! - MAHOUT CLASSIFICATION

2011-12-09 Thread Sam Cunningham
I really need help. I am working on a project: I have a cron job that collects RSS feeds from news sites (Reuters and Associated Press). I need to classify these news data based on their content (just like 20news example). The categories are business, entertainment, health, politics, scitech, and

Re: PLEASE HELP! - MAHOUT CLASSIFICATION

2011-12-09 Thread Suneel Marthi
Which classifier are you running? From: Sam Cunningham sam_cun...@yahoo.com To: user@mahout.apache.org Sent: Friday, December 9, 2011 2:57 PM Subject: PLEASE HELP! - MAHOUT CLASSIFICATION I really need help. I am working on a project: I have a cron job that

Re: PLEASE HELP! - MAHOUT CLASSIFICATION

2011-12-09 Thread Dmitriy Lyubimov
Sam, the list wouldn't let attachments through. On Fri, Dec 9, 2011 at 11:57 AM, Sam Cunningham sam_cun...@yahoo.com wrote: I really need help. I am working on a project: I have a cron job that collects RSS feeds from news sites (Reuters and Associated Press). I need to classify these news data based

Re: PLEASE HELP! - MAHOUT CLASSIFICATION

2011-12-09 Thread Sam Cunningham
Suneel Marthi suneel_marthi at yahoo.com writes: Which classifier are you running? Hi Suneel, I am running cbayes. Here are the command options for the trainer: $MAHOUT_HOME/bin/mahout trainclassifier -i /user/sayhan/articles-train -o /user/sayhan/articles-model -type cbayes -ng 1 -source

Re: PLEASE HELP! - MAHOUT CLASSIFICATION

2011-12-09 Thread Suneel Marthi
Hi Sam, I am assuming that you are running the latest code from the Mahout 0.6 trunk. Did you try running your dataset through the SGD classifier for both training and testing? Suneel From: Sam Cunningham sam_cun...@yahoo.com To: user@mahout.apache.org Sent:

Re: PLEASE HELP! - MAHOUT CLASSIFICATION

2011-12-09 Thread Sam Cunningham
Dmitriy Lyubimov dlieu.7 at gmail.com writes: Sam, the list wouldn't let attachments through. Hi Dmitriy, Here is the link to the attachments along with the same message content: http://lucene.472066.n3.nabble.com/PLEASE-HELP-MAHOUT-CLASSIFICATION-td3573905.html Thank you, Sam

Re: PLEASE HELP! - MAHOUT CLASSIFICATION

2011-12-09 Thread Sam Cunningham
Dmitriy Lyubimov dlieu.7 at gmail.com writes: Sam, the list wouldn't let attachments through. Hi Dmitriy, Here is the link to the attachments along with the same message content. Please let me know if you can't get the attachments. Thank you for your help,