Suggestion: enhance examples/bin/classify-20newsgroups.sh to allow
using this to generate the model, along with the online program.

Lance

On Mon, Dec 12, 2011 at 4:06 AM, jirapos...@reviews.apache.org
(Commented) (JIRA) <j...@apache.org> wrote:
>
>    [ 
> https://issues.apache.org/jira/browse/MAHOUT-918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167462#comment-13167462
>  ]
>
> jirapos...@reviews.apache.org commented on MAHOUT-918:
> ------------------------------------------------------
>
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveMapper.java,
>  line 36
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63195#file63195line36>
> bq.  >
> bq.  >     Needs a comment about how this works.
>
> Added comments.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveMapper.java,
>  lines 67-75
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63195#file63195line67>
> bq.  >
> bq.  >     This really need a comment.  What is the purpose here?
>
> Added comments.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveMapper.java,
>  lines 98-111
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63195#file63195line98>
> bq.  >
> bq.  >     What is this intended to do?  Why?
>
> Added comments.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveReducer.java,
>  line 30
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63196#file63196line30>
> bq.  >
> bq.  >     Typo.
> bq.  >
> bq.  >     Also, this doesn't say how this works or why it is the way it is.
>
> Fixed the typo and added comments.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveReducer.java,
>  line 32
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63196#file63196line32>
> bq.  >
> bq.  >     Shouldn't there be a combiner as well?
>
> A combiner isn't needed because each map task submits one value overall.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveReducer.java,
>  line 53
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63196#file63196line53>
> bq.  >
> bq.  >     A comment here about what this weight is would be nice.  Also, how 
> can a double be a key?  That is tantamount to comparing doubles which is bad.
>
> Added comments. it is not the weight of the classifier but the weight of the 
> weighted average.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/SGDDriver.java,
>  line 99
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63197#file63197line99>
> bq.  >
> bq.  >     Where does the InterruptedException come from?
>
> It comes from runIteration function.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/SGDDriver.java,
>  lines 110-111
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63197#file63197line110>
> bq.  >
> bq.  >     Use brackets
>
> Added brackets.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/test/java/org/apache/mahout/classifier/sgd/mapreduce/AdaptiveLogisticRegressionMapReduceTest.java,
>  line 35
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63198#file63198line35>
> bq.  >
> bq.  >     Should not throw Exception
>
> Added IO Exception and Interrupted Exception.
>
>
> bq.  On 2011-12-08 07:04:49, Ted Dunning wrote:
> bq.  > 
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveMapper.java,
>  lines 53-56
> bq.  > <https://reviews.apache.org/r/3072/diff/2/?file=63195#file63195line53>
> bq.  >
> bq.  >     This is nearly duplicated code.  The mapper and reducer should 
> share some code to avoid inconsistent defaults.
>
> Created a base class which shares the same initialization code.
>
>
> - issei
>
>
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/3072/#review3734
> -----------------------------------------------------------
>
>
> On 2011-12-12 11:51:59, issei yoshida wrote:
> bq.
> bq.  -----------------------------------------------------------
> bq.  This is an automatically generated e-mail. To reply, visit:
> bq.  https://reviews.apache.org/r/3072/
> bq.  -----------------------------------------------------------
> bq.
> bq.  (Updated 2011-12-12 11:51:59)
> bq.
> bq.
> bq.  Review request for mahout.
> bq.
> bq.
> bq.  Summary
> bq.  -------
> bq.
> bq.  MAHOUT-918 Parallelized SGD in MapReduce
> bq.
> bq.
> bq.  This addresses bug MAHOUT-918.
> bq.      https://issues.apache.org/jira/browse/MAHOUT-918
> bq.
> bq.
> bq.  Diffs
> bq.  -----
> bq.
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/PassiveAggressive.java
>  1213193
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/AdaptiveLogisticRegressionDriver.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/AdaptiveLogisticRegressionMapper.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/AdaptiveLogisticRegressionReducer.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/LogisticRegressionDriver.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/LogisticRegressionMapper.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/LogisticRegressionReducer.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveDriver.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveMapper.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveReducer.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/SGDDriver.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/SGDMapper.java
>  PRE-CREATION
> bq.    
> trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/SGDReducer.java
>  PRE-CREATION
> bq.    
> trunk/core/src/test/java/org/apache/mahout/classifier/sgd/mapreduce/AdaptiveLogisticRegressionMapReduceTest.java
>  PRE-CREATION
> bq.    
> trunk/core/src/test/java/org/apache/mahout/classifier/sgd/mapreduce/LogisticRegressionMapReduceTest.java
>  PRE-CREATION
> bq.    
> trunk/core/src/test/java/org/apache/mahout/classifier/sgd/mapreduce/PassiveAggressiveMapReduceTest.java
>  PRE-CREATION
> bq.    
> trunk/core/src/test/java/org/apache/mahout/classifier/sgd/mapreduce/SGDMapReduceTest.java
>  PRE-CREATION
> bq.
> bq.  Diff: https://reviews.apache.org/r/3072/diff
> bq.
> bq.
> bq.  Testing
> bq.  -------
> bq.
> bq.
> bq.  Thanks,
> bq.
> bq.  issei
> bq.
> bq.
>
>
>
>> Implement SGD based classifiers using MapReduce
>> -----------------------------------------------
>>
>>                 Key: MAHOUT-918
>>                 URL: https://issues.apache.org/jira/browse/MAHOUT-918
>>             Project: Mahout
>>          Issue Type: New Feature
>>          Components: Classification
>>    Affects Versions: 0.6
>>            Reporter: issei yoshida
>>         Attachments: MAHOUT-918.patch, design.pdf
>>
>>
>> Implement SGD based classifiers (Logistic Regression, Adaptive Logistic 
>> regression and Passive-Aggressive) using MapReduce.
>> They are implemented using Iterative Parameter Mixtures algorithm which is 
>> referred to in the following papers.
>> http://research.google.com/pubs/pub36948.html
>> http://aclweb.org/anthology-new/N/N10/N10-1069.pdf
>> http://books.nips.cc/papers/files/nips22/NIPS2009_0345.pdf
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA 
> administrators: 
> https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>
>



-- 
Lance Norskog
goks...@gmail.com

Reply via email to