[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13170003#comment-13170003 ] Anatoliy Kats commented on MAHOUT-906: You've actually convinced me to change the es

Jenkins build is still unstable: Mahout-Quality #1254

2011-12-14 Thread Apache Jenkins Server
See

Re: Review Request: Row mean job for PCA

2011-12-14 Thread Ted Dunning
This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/3147/#review3916 Looks much better from the trivial formatting standpoint. - Ted On

[jira] [Commented] (MAHOUT-923) Row mean job for PCA

2011-12-14 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169868#comment-13169868 ] jirapos...@reviews.apache.org commented on MAHOUT-923:

Re: Review Requests

2011-12-14 Thread Isabel Drost
On 14.12.2011 Jake Mannix wrote:
> Exactly. Lowers barrier to entry for joining in the review process.
Having Ted review more patches certainly is a +1 for having review board. Just two wishes from my side: I can easily parse all our stuff (comments on JIRA, commits and regular mails) by read

Re: [jira] [Commented] (MAHOUT-904) SplitInput should support randomizing the input

2011-12-14 Thread Ted Dunning
On Wed, Dec 14, 2011 at 1:01 PM, Raphael Cendrillon <cendrillon1...@gmail.com> wrote:
> Thanks Lance. If I understand you correctly you're proposing the following:
>
> Map: (K1,V1) -> (K2,V2)
> V2 = V1
> K2 = hashcode(K1)
>
Preserving K1 may be important. In that case you may prefer
> emit

Re: [jira] [Commented] (MAHOUT-904) SplitInput should support randomizing the input

2011-12-14 Thread Raphael Cendrillon
Thanks Lance. If I understand you correctly you're proposing the following:
Map: (K1,V1) -> (K2,V2)
    V2 = V1
    K2 = hashcode(K1)
    emit(K2,V2)
Combine: (K2,V2) -> (K3,V3) (e.g. if we want to keep 10% of samples)
    if ( K2 % 10 == 0 ) {
        V3 = V2
        K3 = K2
        emit(K3, V3)
    }
Reduce: (K3,V3) S
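The map and combine steps in the message above can be sketched in a few lines. This is only an illustration in plain Python, not Hadoop or Mahout code: the function names, the use of MD5 as the hash, and the `keep_one_in` parameter are all assumptions made for the example. The original key K1 is carried along in the value, since the reply in this thread notes that preserving K1 may be important.

```python
import hashlib


def stable_hash(key: str) -> int:
    """Deterministic integer hash of a key (MD5 is an illustrative choice)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def map_phase(records):
    """Map: (K1, V1) -> (K2, (K1, V1)) with K2 = hash(K1); K1 is preserved."""
    for k1, v1 in records:
        yield stable_hash(k1), (k1, v1)


def combine_phase(pairs, keep_one_in=10):
    """Combine: emit a pair only when K2 % keep_one_in == 0 (about 10% by default)."""
    for k2, value in pairs:
        if k2 % keep_one_in == 0:
            yield k2, value


def sample(records, keep_one_in=10):
    """Run map then combine and return the surviving (K1, V1) records."""
    return [value for _, value in combine_phase(map_phase(records), keep_one_in)]
```

Because the decision depends only on the hash of the key, the same records are kept on every run, so this gives a repeatable subset rather than a fresh random draw each time.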

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169485#comment-13169485 ] Sean Owen commented on MAHOUT-906: No I think it's as simple as factoring out this secti

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169423#comment-13169423 ] Anatoliy Kats commented on MAHOUT-906: I have to head out, let me ask you a question

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169389#comment-13169389 ] Anatoliy Kats commented on MAHOUT-906: I hope this test gives a more realistic resul

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169361#comment-13169361 ] Sean Owen commented on MAHOUT-906: Sure, you can do that. I am not sure that gives you a

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169354#comment-13169354 ] Anatoliy Kats commented on MAHOUT-906: You're right, if you have preference values,

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169331#comment-13169331 ] Anatoliy Kats commented on MAHOUT-906: Or maybe we can somehow split the AbstractDif

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169332#comment-13169332 ] Sean Owen commented on MAHOUT-906: For the IR precision/recall evaluation, if you *do* h

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169310#comment-13169310 ] Anatoliy Kats commented on MAHOUT-906: I guess it's a hybrid of some sort between es

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169280#comment-13169280 ] Sean Owen commented on MAHOUT-906: OK. I think we're speaking about the estimation test,

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169279#comment-13169279 ] Anatoliy Kats commented on MAHOUT-906: The IR tests make recommendations for one use

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169249#comment-13169249 ] Sean Owen commented on MAHOUT-906: Are we talking about the IR tests, estimation test or

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169246#comment-13169246 ] Anatoliy Kats commented on MAHOUT-906: We will be sorting preferences by time for AL

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169241#comment-13169241 ] Sean Owen commented on MAHOUT-906: Yes, the lightest-touch approach is to pull them out

[jira] [Commented] (MAHOUT-918) Implement SGD based classifiers using MapReduce

2011-12-14 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169223#comment-13169223 ] jirapos...@reviews.apache.org commented on MAHOUT-918: bq. On 201

Re: Review Request: MAHOUT-918 Parallelized SGD in MapReduce

2011-12-14 Thread issei yoshida
> On 2011-12-13 13:24:28, Ted Dunning wrote:
> > trunk/core/src/main/java/org/apache/mahout/classifier/sgd/mapreduce/AdaptiveLogisticRegressionDriver.java, lines 36-41
> >
> > Direct and exact quotes from the pape

[jira] [Commented] (MAHOUT-918) Implement SGD based classifiers using MapReduce

2011-12-14 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169213#comment-13169213 ] jirapos...@reviews.apache.org commented on MAHOUT-918:

Re: Review Request: MAHOUT-918 Parallelized SGD in MapReduce

2011-12-14 Thread issei yoshida
This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/3072/ (Updated 2011-12-14 08:59:29.074032) Review request for mahout. Summary

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Anatoliy Kats (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169182#comment-13169182 ] Anatoliy Kats commented on MAHOUT-906: I am beginning to write an evaluator that tak