Re: Question about Pearson Correlation in non-Taste mode

2013-11-27 Thread Sebastian Schelter
Hi Amit, You are right, the non-corated items are not filtered out in the distributed implementation. --sebastian On 26.11.2013 20:51, Amit Nithian wrote: Hi all, Apologies if this is a repeat question as I just joined the list but I have a question about the way that metrics like Cosine

Re: Question about Pearson Correlation in non-Taste mode

2013-11-27 Thread Amit Nithian
Thanks Sebastian! Is there a particular reason for that? On Nov 27, 2013 7:47 AM, Sebastian Schelter ssc.o...@googlemail.com wrote: Hi Amit, You are right, the non-corated items are not filtered out in the distributed implementation. --sebastian On 26.11.2013 20:51, Amit Nithian wrote:

Re: Question about Pearson Correlation in non-Taste mode

2013-11-27 Thread Sebastian Schelter
Yes, it is due to the parallel algorithm which only looks at co-ratings from a given user. On 27.11.2013 15:02, Amit Nithian wrote: Thanks Sebastian! Is there a particular reason for that? On Nov 27, 2013 7:47 AM, Sebastian Schelter ssc.o...@googlemail.com wrote: Hi Amit, You are right,

Re: Detecting high bias and variance in AdaptiveLogisticRegression classification

2013-11-27 Thread Vishal Santoshi
Hell Ted, Are we to assume that SGD is still a work in progress and implementations ( Cross Fold, Online, Adaptive ) are too flawed to be realistically used ? The evolutionary algorithm seems to be the core of OnlineLogisticRegression, which in turn builds up to Adaptive/Cross Fold. b) for truly

Re: Detecting high bias and variance in AdaptiveLogisticRegression classification

2013-11-27 Thread Vishal Santoshi
Sorry to spam, I never meant the Hello to come out as Hell. Given a little disappointment in the mail, I figure I rather spam than be misunderstood, On Wed, Nov 27, 2013 at 10:07 AM, Vishal Santoshi vishal.santo...@gmail.com wrote: Hell Ted, Are we to assume that SGD is still a work in

Re: Question about Pearson Correlation in non-Taste mode

2013-11-27 Thread Amit Nithian
Comparing this against the non distributed (taste) gives different answers for item item similarity as of course the non distributed looks only at corated items. I was more wondering if this difference in practice mattered or not. Also I'm confused on how you can compute the Pearson similarity

Re: Question about Pearson Correlation in non-Taste mode

2013-11-27 Thread Sebastian Schelter
Hi Amit, Yes, it gives different results. However in practice, most people don't do rating prediction with Pearson coefficient, but use count-based measures like the loglikelihood ratio test. The distributed code doesn't look at vectors of different lengths, but simply assumes non-existent

Re: Question about Pearson Correlation in non-Taste mode

2013-11-27 Thread Amit Nithian
Hey Sebastian, Thanks again. Actually I'm glad that I am talking to you as it's your paper and presentation I have questions with! :-) So to clarify my question further, looking at this presentation ( http://isabel-drost.de/hadoop/slides/collabMahout.pdf) you have the following user x item

Re: Detecting high bias and variance in AdaptiveLogisticRegression classification

2013-11-27 Thread Ted Dunning
No problem at all. Kind of funny. On Wed, Nov 27, 2013 at 7:08 AM, Vishal Santoshi vishal.santo...@gmail.comwrote: Sorry to spam, I never meant the Hello to come out as Hell. Given a little disappointment in the mail, I figure I rather spam than be misunderstood, On Wed, Nov 27, 2013

Re: Detecting high bias and variance in AdaptiveLogisticRegression classification

2013-11-27 Thread Ted Dunning
On Wed, Nov 27, 2013 at 7:07 AM, Vishal Santoshi vishal.santo...@gmail.com Are we to assume that SGD is still a work in progress and implementations ( Cross Fold, Online, Adaptive ) are too flawed to be realistically used ? They are too raw to be accepted uncritically, for sure. They have

Condition for equality of vectors

2013-11-27 Thread Tharindu Rusira
Hello, What are the necessary conditions for two vectors(any vector that extends DenseVector) to be equal? I know both values and cardinality should be equal. Is there something else? Thanks, Tharindu.

Re: Condition for equality of vectors

2013-11-27 Thread j.barrett Strausser
https://builds.apache.org/job/Mahout-Quality/ws/trunk/math/target/site/apidocs/org/apache/mahout/math/DenseVector.html#equals%28java.lang.Object%29 On Wed, Nov 27, 2013 at 10:54 PM, Tharindu Rusira tharindurus...@gmail.comwrote: Hello, What are the necessary conditions for two vectors(any

java.lang.NoClassDefFoundError: com/google/common/base/Preconditions

2013-11-27 Thread Tharindu Rusira
Hi all, I'm working on Mahout 0.9-SNAPSHOT version checked out from the svn trunk. The following code generates java.lang.NoClassDefFoundError: com/google/common/base/Preconditions where b is supposed to be true (because both cardinality and the values are same in the two vectors dv and sv) //

Re: java.lang.NoClassDefFoundError: com/google/common/base/Preconditions

2013-11-27 Thread Suneel Marthi
you r missing Google Guava library which has these classes.  R u running a mvn build on Mahout snapshot? On Thursday, November 28, 2013 1:56 AM, Tharindu Rusira tharindurus...@gmail.com wrote: Hi all, I'm working on Mahout 0.9-SNAPSHOT version checked out from the svn trunk. The following

Re: java.lang.NoClassDefFoundError: com/google/common/base/Preconditions

2013-11-27 Thread Tharindu Rusira
Yes that's the exact issue Suneel, it was a careless mistake while adding projects to Eclipse that I missed those .jars. I even messed with the code to find a workaround so that it does not require these Precondition checks. (I've attached a patch if you are interested) :) Thanks a lot. -Tharindu

Re: java.lang.NoClassDefFoundError: com/google/common/base/Preconditions

2013-11-27 Thread Suneel Marthi
Good. (No I am not interested in the patch, Thanks :) ) On Thursday, November 28, 2013 2:54 AM, Tharindu Rusira tharindurus...@gmail.com wrote: Yes that's the exact issue Suneel, it was a careless mistake while adding projects to Eclipse that I missed those .jars.  I even messed with the