Re: mapreduce ItemSimilarity input optimization

2014-08-22 Thread Serega Sheypak
Ok, I got it. Is it Ted's book? http://www.amazon.com/Mahout-Action-Sean-Owen/dp/1935182684/ref=la_B00EHXC1NK_1_1?s=booksie=UTF8qid=1408689021sr=1-1 I've read this one:

Mahout LDA issue: small word probability in each topic

2014-08-22 Thread Wei Li
Hi All: I have successfully compiled the Mahout 0.9 on Hadoop and submit the LDA CVB model, most of the parameters are set to default values and the --maxIter is set to 25. After we got the model, we found that the word probability in each topic is quite small, most of them are about 0.1

Re: Mahout LDA issue: small word probability in each topic

2014-08-22 Thread vaibhav srivastava
What is your dictionary size. Lot of things depend on it. When we calculate final probability On 22 Aug 2014 14:27, Wei Li wei.le...@gmail.com wrote: Hi All: I have successfully compiled the Mahout 0.9 on Hadoop and submit the LDA CVB model, most of the parameters are set to default

RE: any pointer to run wikipedia bayes example

2014-08-22 Thread Andrew Palumbo
(1) I guess the NB wiki example doesn't work with the Mahout 0.9 release. So if I need a rather stable Mahout release (because I have made some changes to my local Mahout 0.9 download to tailor some of my own requirements) ,would it be better that I just apply some patch (e.g., the patch

Re: mapreduce ItemSimilarity input optimization

2014-08-22 Thread Ted Dunning
No. Go for this more recent (and much shorter) one: http://www.mapr.com/practical-machine-learning And if you like it, leave a review on Amazon: http://www.amazon.com/Practical-Machine-Learning-Innovations-Recommendation-ebook/dp/B00JRHVNT4 On Thu, Aug 21, 2014 at 11:31 PM, Serega

RE: any pointer to run wikipedia bayes example

2014-08-22 Thread Wei Zhang
Thank you very much Andrew for the such detailed explanations! I got all my answers. I will try to migrate my code towards the latest Mahout trunk. Thanks for keeping an eye on my other posts (i.e., Hadoop versions) I'll use neither exactMatch nor all for BN example on Wikipedia. And thanks

Extremely Slow ALS Recommender in 0.8, but faster in 0.9.

2014-08-22 Thread jay vyas
Hi mahout. Im finding that in 0.8, my ALS recommender goes extremely slow, but it goes very fast in 0.9 . - At the time the jobs slows down, there is virtually no disk io. - The CPU cycles fluctuate up and down during this time, but they arent at 100% the whole time. - Mapper percentages