Logistic Regression - Questions

2013-02-21 Thread Anbarasan Murthy
I am trying to understand the logistic regression implemented in mahout In org.apache.mahout.classifier.sgd.AbstractOnlineLogisticRegression What is the role of Protected Matrix beta(numCategories-1) x numFeatures How where this beta matrix is updated and How where the local

GenericUserBasedRecommender vs GenericItemBasedRecommender

2013-02-21 Thread Koobas
In the GenericUserBasedRecommender the concept of a neighborhood seems to be fundamental. I.e., it is a classic implementation of the kNN algorithm. But it is not the case with the GenericItemBasedRecommender. I understand that the two approaches are not meant to be completely symmetric, but

Re: GenericUserBasedRecommender vs GenericItemBasedRecommender

2013-02-21 Thread Sean Owen
It's also valid, yes. The difference is partly due to asymmetry, but also just historical (i.e. no great reason). The item-item system uses a different strategy for picking candidates based on CandidateItemStrategy. On Thu, Feb 21, 2013 at 2:37 PM, Koobas koo...@gmail.com wrote: In the

Re: GenericUserBasedRecommender vs GenericItemBasedRecommender

2013-02-21 Thread Koobas
On Thu, Feb 21, 2013 at 9:39 AM, Sean Owen sro...@gmail.com wrote: It's also valid, yes. The difference is partly due to asymmetry, but also just historical (i.e. no great reason).* The item-item system uses a different strategy for picking candidates based on CandidateItemStrategy.* Where

LDA Convergence

2013-02-21 Thread David LaBarbera
I've been running some performance test with the LDA algorithm and I'm unsure how to gauge them. I ran 10 iterations each time and collected the perplexity value every 2 iterations with test fraction set to 0.1. These were all run on an AWS cluster with 10 nodes (70 mapper, 30 reducers). I'm

Re: GenericUserBasedRecommender vs GenericItemBasedRecommender

2013-02-21 Thread Julian Ortega
The javadoc should be a nice start https://builds.apache.org/job/Mahout-Quality/javadoc/org/apache/mahout/cf/taste/impl/recommender/AbstractCandidateItemsStrategy.html Apart from that, I'd say you should have a look around the code. On Thu, Feb 21, 2013 at 3:42 PM, Koobas koo...@gmail.com

Cross recommendation

2013-02-21 Thread Pat Ferrel
I am quite interested in trying this but have a few questions. To use/abuse mahout to do this: A and B can be thought of as having the same size, in other words they must be constructed to have the same dimension definitions (userID for rows, itemID for columns) as well as row and column rank.

Re: LDA Convergence

2013-02-21 Thread Jake Mannix
On Thu, Feb 21, 2013 at 11:48 AM, David LaBarbera davidlabarb...@localresponse.com wrote: Is there a rule of thumb for determining leveling off of perplexity? Is this value controlled by the convergence delta? The value of where the driver will automatically stop issuing new iterations is

Re: Network Traffic and Security Analysis

2013-02-21 Thread Mahesh Balija
Hi Ted, My apologizes for the delay to reply, as I was brushing up my networking skills before I can discuss. Few of those topics which I want to start with will be, 1) Deep packet inspection - Can be useful for Intrusion detection (NIDS) by doing the