Welcome Pat Ferrel as new committer on Mahout

2014-04-24 Thread Sebastian Schelter

Hi,

this is to announce that the Project Management Committee (PMC) for 
Apache Mahout has asked Pat Ferrel to become committer and we are 
pleased to announce that he has accepted.


Being a committer enables easier contribution to the project since in 
addition to posting patches on JIRA it also gives write access to the 
code repository. That also means that now we have yet another person who 
can commit patches submitted by others to our repo *wink*


Pat, we look forward to working with you in the future. Welcome! It 
would be great if you could introduce yourself with a few words.


-s


Re: Build failed in Jenkins: Mahout-Quality #2588

2014-04-24 Thread Sebastian Schelter
Grant told me he will have a look, he should have access. Would be a 
good thing if you could also ask for access with infra.


-s

On 04/24/2014 02:36 AM, Andrew Musselman wrote:

Infra escalation in order?  I hate broken builds.


On Tue, Apr 22, 2014 at 12:33 PM, Sean Owen sro...@gmail.com wrote:


(I have some access but don't seem to have permission to change the job
config.)

On Tue, Apr 22, 2014 at 8:02 PM, Sebastian Schelter s...@apache.org
wrote:


Who has access to the jenkins build server and can fix the failure? It
looks as if some paths need to be modified after the renaming of core to
mrlegacy.

--sebastian











Re: Welcome Pat Ferrel as new committer on Mahout

2014-04-24 Thread tuxdna
Congratulations Pat!

Best regards,
Saleem


On Thu, Apr 24, 2014 at 3:49 PM, Sebastian Schelter s...@apache.org wrote:
 Hi,

 this is to announce that the Project Management Committee (PMC) for Apache
 Mahout has asked Pat Ferrel to become committer and we are pleased to
 announce that he has accepted.

 Being a committer enables easier contribution to the project since in
 addition to posting patches on JIRA it also gives write access to the code
 repository. That also means that now we have yet another person who can
 commit patches submitted by others to our repo *wink*

 Pat, we look forward to working with you in the future. Welcome! It would be
 great if you could introduce yourself with a few words.

 -s


Re: Welcome Pat Ferrel as new committer on Mahout

2014-04-24 Thread Frank Scholten
Congratulations Pat! :-)

On Apr 24, 2014, at 12:19, Sebastian Schelter s...@apache.org wrote:

 Hi,
 
 this is to announce that the Project Management Committee (PMC) for Apache 
 Mahout has asked Pat Ferrel to become committer and we are pleased to 
 announce that he has accepted.
 
 Being a committer enables easier contribution to the project since in 
 addition to posting patches on JIRA it also gives write access to the code 
 repository. That also means that now we have yet another person who can 
 commit patches submitted by others to our repo *wink*
 
 Pat, we look forward to working with you in the future. Welcome! It would be 
 great if you could introduce yourself with a few words.
 
 -s


Re: Welcome Pat Ferrel as new committer on Mahout

2014-04-24 Thread Martin, Nick
Awesome Pat congrats!!! Very well deserved.

Sent from my iPhone

On Apr 24, 2014, at 6:20 AM, Sebastian Schelter s...@apache.org wrote:

 Hi,
 
 this is to announce that the Project Management Committee (PMC) for Apache 
 Mahout has asked Pat Ferrel to become committer and we are pleased to 
 announce that he has accepted.
 
 Being a committer enables easier contribution to the project since in 
 addition to posting patches on JIRA it also gives write access to the code 
 repository. That also means that now we have yet another person who can 
 commit patches submitted by others to our repo *wink*
 
 Pat, we look forward to working with you in the future. Welcome! It would be 
 great if you could introduce yourself with a few words.
 
 -s


Re: Welcome Pat Ferrel as new committer on Mahout

2014-04-24 Thread Shannon Quinn
Congratulations Pat! Been enjoying your discussions so far. Looking 
forward to working with you.


On 4/24/14, 6:22 AM, Frank Scholten wrote:

Congratulations Pat! :-)

On Apr 24, 2014, at 12:19, Sebastian Schelter s...@apache.org wrote:


Hi,

this is to announce that the Project Management Committee (PMC) for Apache 
Mahout has asked Pat Ferrel to become committer and we are pleased to announce 
that he has accepted.

Being a committer enables easier contribution to the project since in addition 
to posting patches on JIRA it also gives write access to the code repository. 
That also means that now we have yet another person who can commit patches 
submitted by others to our repo *wink*

Pat, we look forward to working with you in the future. Welcome! It would be 
great if you could introduce yourself with a few words.

-s




[jira] [Commented] (MAHOUT-157) Frequent Pattern Mining using Parallel FP-Growth

2014-04-24 Thread Amanpreet Singh Saini (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13979725#comment-13979725
 ] 

Amanpreet Singh Saini commented on MAHOUT-157:
--

Hey Robin, Regarding your comment dated 07/Sep/09 06:32 
I know its been a long time since the post, but i actually wanted to use 
pfpgrowth on mahout without closing of the itemsets.
For it, I downloaded your source and tried to interpret it and initially i 
thought that the issue was in FPGrowth.generateSinglePathPatterns() as you are 
adding the entire conditional FP Tree in the frequentPatterns. 
For eg: if the conditional fp tree had [f:3 c:3 a:3 m:3], then this method 
makes the pattern [f,c,a,m | 3] and inserts it into the FrequentPatternMaxHeap. 
I thought the problem was not generating all the combinations here but it seems 
that i couldn't able to understand the code correctly.
Could you please just lead me to the point(s) or code snippet(s) where the 
closure is being performed so that i could test on how to remove this closure.

Thanks,
Aman

 Frequent Pattern Mining using Parallel FP-Growth
 

 Key: MAHOUT-157
 URL: https://issues.apache.org/jira/browse/MAHOUT-157
 Project: Mahout
  Issue Type: New Feature
  Components: Frequent Itemset/Association Rule Mining
Affects Versions: 0.2
Reporter: Robin Anil
Assignee: Robin Anil
 Fix For: 0.2

 Attachments: MAHOUT-157-August-17.patch, MAHOUT-157-August-24.patch, 
 MAHOUT-157-August-31.patch, MAHOUT-157-August-6.patch, 
 MAHOUT-157-Combinations-BSD-License.patch, 
 MAHOUT-157-Combinations-BSD-License.patch, 
 MAHOUT-157-CompactTransactionMapperFormat.patch, MAHOUT-157-Oct-1.patch, 
 MAHOUT-157-Oct-10.pfpgrowth.patch, 
 MAHOUT-157-Oct-8.TestedMapReducePipeline.patch, 
 MAHOUT-157-Oct-8.pfpgrowth.patch, 
 MAHOUT-157-Oct-9.StreamingDBRead-Inprogress.patch, 
 MAHOUT-157-September-10.patch, MAHOUT-157-September-18.patch, 
 MAHOUT-157-September-5.patch, MAHOUT-157-codecleanup-javadocs.patch, 
 MAHOUT-157-final.patch, MAHOUT-157-inProgress-August-5.patch


 Implement: http://infolab.stanford.edu/~echang/recsys08-69.pdf



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Re: Welcome Pat Ferrel as new committer on Mahout

2014-04-24 Thread Andrew Musselman
Great news, welcome Pat!

 On Apr 24, 2014, at 3:19 AM, Sebastian Schelter s...@apache.org wrote:
 
 Hi,
 
 this is to announce that the Project Management Committee (PMC) for Apache 
 Mahout has asked Pat Ferrel to become committer and we are pleased to 
 announce that he has accepted.
 
 Being a committer enables easier contribution to the project since in 
 addition to posting patches on JIRA it also gives write access to the code 
 repository. That also means that now we have yet another person who can 
 commit patches submitted by others to our repo *wink*
 
 Pat, we look forward to working with you in the future. Welcome! It would be 
 great if you could introduce yourself with a few words.
 
 -s


RE: Welcome Pat Ferrel as new committer on Mahout

2014-04-24 Thread Andrew Palumbo
Congratulations Pat!

 Subject: Re: Welcome Pat Ferrel as new committer on Mahout
 From: andrew.mussel...@gmail.com
 Date: Thu, 24 Apr 2014 06:44:43 -0700
 CC: u...@mahout.apache.org
 To: dev@mahout.apache.org
 
 Great news, welcome Pat!
 
  On Apr 24, 2014, at 3:19 AM, Sebastian Schelter s...@apache.org wrote:
  
  Hi,
  
  this is to announce that the Project Management Committee (PMC) for Apache 
  Mahout has asked Pat Ferrel to become committer and we are pleased to 
  announce that he has accepted.
  
  Being a committer enables easier contribution to the project since in 
  addition to posting patches on JIRA it also gives write access to the code 
  repository. That also means that now we have yet another person who can 
  commit patches submitted by others to our repo *wink*
  
  Pat, we look forward to working with you in the future. Welcome! It would 
  be great if you could introduce yourself with a few words.
  
  -s
  

[jira] [Commented] (MAHOUT-1485) Clean up Recommender Overview page

2014-04-24 Thread Yash Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13980045#comment-13980045
 ] 

Yash Sharma commented on MAHOUT-1485:
-

Have been reading about solr based recommenders. 
Should the new overview page be based on the code here by Pat : 
https://github.com/pferrel/solr-recommender/.
Also are there any plans of merging this code in mahout as well?
Am creating a sample document here and would keep on documenting my findings 
here:
https://docs.google.com/document/d/1iwzcvIbXudSCEZ6n5J5lAhoGGf78Qo9LQsaAvChzMV4/edit

I also agree that we should retain the current overview page as well because 
its a popular implementation and lot of people are using this approach. We can 
have a new page talking about online recommenders powered with search engines.


 Clean up Recommender Overview page
 --

 Key: MAHOUT-1485
 URL: https://issues.apache.org/jira/browse/MAHOUT-1485
 Project: Mahout
  Issue Type: Improvement
  Components: Documentation
Reporter: Sebastian Schelter
Assignee: Sebastian Schelter
 Fix For: 1.0


 Clean up the recommender overview page, remove outdated content and make sure 
 the examples work.
 https://mahout.apache.org/users/recommender/recommender-documentation.html



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (MAHOUT-1525) train/validateAdaptiveLogistic

2014-04-24 Thread Richard Scharrer (JIRA)
Richard Scharrer created MAHOUT-1525:


 Summary: train/validateAdaptiveLogistic
 Key: MAHOUT-1525
 URL: https://issues.apache.org/jira/browse/MAHOUT-1525
 Project: Mahout
  Issue Type: Question
  Components: Classification
Affects Versions: 0.7
Reporter: Richard Scharrer


Hi,
I tried to use train- and validateAdaptiveLogistic on my data which is like:
category, id, var1, var2, ...var72 (all numeric)

I used the following settings:
mahout trainAdaptiveLogistic --input resource/trainingData \
--output ./model \
--target category --categories 9 \
--predictors a0 a1 a2 a3 a4 a5 a6 a7 a8 a9 a10 a11 .
--types numeric \
--passes 100 \
--showperf \

mahout validateAdaptiveLogistic --input resource/testData --model model 
--confusion --defaultCategory none

The output of validateAdaptiveLogistic is:
Log-likelihood:Min=-5.54, Max=-0.04, Mean=-1.58, Median=-1.33

===
Confusion Matrix
---
a   b   d   e   f   g   h   i   --Classified as
14  0   0   0   0   0   0   0|  14  
a = projekt
0   18  0   0   0   0   0   0|  18  
b = news/aktuelles/presse
0   0   24  0   0   0   0   0|  24  
d = lehrveranstaltung
0   0   0   19  0   0   0   0|  19  
e = publikation
0   0   0   0   20  0   0   0|  20  
f = event
0   0   0   0   0   14  0   0|  14  
g = mitarbeiter/person
0   0   0   0   0   0   44  0|  44  
h = übersicht
0   0   0   0   0   0   0   13   |  13  
i = institut


(in case you were wondering, the categories a in german)

My problem is that this is impossible. I always get a perfect classification 
even with just a little amount of training data. It doesnt even matter how many 
features I use I tried it with all 72 and with only one. Am I missing something?

Regards,
Richard



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1525) train/validateAdaptiveLogistic

2014-04-24 Thread Suneel Marthi (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13980395#comment-13980395
 ] 

Suneel Marthi commented on MAHOUT-1525:
---

You shouldn't be using 0.7, 0.7 has long been retired and is unsupported. 
Please upgrade to 0.9 or work off of trunk.

 train/validateAdaptiveLogistic
 --

 Key: MAHOUT-1525
 URL: https://issues.apache.org/jira/browse/MAHOUT-1525
 Project: Mahout
  Issue Type: Question
  Components: Classification
Affects Versions: 0.7
Reporter: Richard Scharrer
  Labels: adaptiveLogisticRegression,, newbie

 Hi,
 I tried to use train- and validateAdaptiveLogistic on my data which is like:
 category, id, var1, var2, ...var72 (all numeric)
 I used the following settings:
 mahout trainAdaptiveLogistic --input resource/trainingData \
 --output ./model \
 --target category --categories 9 \
 --predictors a0 a1 a2 a3 a4 a5 a6 a7 a8 a9 a10 a11 .
 --types numeric \
 --passes 100 \
 --showperf \
 mahout validateAdaptiveLogistic --input resource/testData --model model 
 --confusion --defaultCategory none
 The output of validateAdaptiveLogistic is:
 Log-likelihood:Min=-5.54, Max=-0.04, Mean=-1.58, Median=-1.33
 ===
 Confusion Matrix
 ---
 a b   d   e   f   g   h   i   --Classified as
 140   0   0   0   0   0   0|  14  
 a = projekt
 0 18  0   0   0   0   0   0|  18  
 b = news/aktuelles/presse
 0 0   24  0   0   0   0   0|  24  
 d = lehrveranstaltung
 0 0   0   19  0   0   0   0|  19  
 e = publikation
 0 0   0   0   20  0   0   0|  20  
 f = event
 0 0   0   0   0   14  0   0|  14  
 g = mitarbeiter/person
 0 0   0   0   0   0   44  0|  44  
 h = übersicht
 0 0   0   0   0   0   0   13   |  13  
 i = institut
 (in case you were wondering, the categories a in german)
 My problem is that this is impossible. I always get a perfect classification 
 even with just a little amount of training data. It doesnt even matter how 
 many features I use I tried it with all 72 and with only one. Am I missing 
 something?
 Regards,
 Richard



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Build failed in Jenkins: Mahout-Quality #2592

2014-04-24 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/2592/

--
[...truncated 8573 lines...]

36196 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings.drm.decompositions.DQR$  - R=
{
  0  = {0:27.313000567495326,1:42.54384270701014}
  1  = {1:0.14645040518827934}
}

Q=
{
  0  = {0:0.40273861426601687,1:-0.9153150324187648}
  1  = {0:0.9153150324227656,1:0.40273861426427493}
}
- C = A %*% B mapBlock {}
- C = A %*% B incompatible B keys
39177 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings.blas.AtB$  - A and B for A'B are not 
identically partitioned, performing inner join.
- C = At %*% B , join
40867 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings.blas.AtB$  - A and B for A'B are not 
identically partitioned, performing inner join.
- C = At %*% B , join, String-keyed
42502 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings.blas.AtB$  - A and B for A'B are identically 
distributed, performing row-wise zip.
- C = At %*% B , zippable, String-keyed
{
  2  = {0:62.0,1:86.0,3:132.0,2:115.0}
  1  = {0:50.0,1:69.0,3:105.0,2:92.0}
  3  = {0:74.0,1:103.0,3:159.0,2:138.0}
  0  = {0:26.0,1:35.0,3:51.0,2:46.0}
}
- C = A %*% inCoreB
{
  0  = {0:26.0,1:35.0,2:46.0,3:51.0}
  1  = {0:50.0,1:69.0,2:92.0,3:105.0}
  2  = {0:62.0,1:86.0,2:115.0,3:132.0}
  3  = {0:74.0,1:103.0,2:138.0,3:159.0}
}
- C = inCoreA %*%: B
46942 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings.blas.AtA$  - Applying slim A'A.
- C = A.t %*% A
48486 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings.blas.AtA$  - Applying non-slim non-graph A'A.
80788 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings  - test done.
- C = A.t %*% A fat non-graph
82049 [ScalaTest-main-running-RLikeDrmOpsSuite] DEBUG 
org.apache.mahout.sparkbindings.blas.AtA$  - Applying slim A'A.
- C = A.t %*% A non-int key
- C = A + B
- C = A + B side test 1
- C = A + B side test 2
- C = A + B side test 3
ArrayBuffer(0, 1, 2, 3, 4)
ArrayBuffer(0, 1, 2, 3, 4)
- general side
- Ax
- A'x
- colSums, colMeans
Run completed in 1 minute, 43 seconds.
Total number of tests run: 38
Suites: completed 9, aborted 0
Tests: succeeded 38, failed 0, canceled 0, ignored 0, pending 0
All tests passed.
[INFO] 
[INFO] --- build-helper-maven-plugin:1.8:remove-project-artifact 
(remove-old-mahout-artifacts) @ mahout-spark ---
[INFO] /home/jenkins/.m2/repository/org/apache/mahout/mahout-spark removed.
[INFO] 
[INFO] --- maven-jar-plugin:2.4:jar (default-jar) @ mahout-spark ---
[INFO] Building jar: 
https://builds.apache.org/job/Mahout-Quality/2592/artifact/trunk/spark/target/mahout-spark-1.0-SNAPSHOT.jar
[INFO] 
[INFO] --- maven-jar-plugin:2.4:test-jar (default) @ mahout-spark ---
[INFO] Building jar: 
https://builds.apache.org/job/Mahout-Quality/2592/artifact/trunk/spark/target/mahout-spark-1.0-SNAPSHOT-tests.jar
[INFO] 
[INFO] --- maven-source-plugin:2.2.1:jar-no-fork (attach-sources) @ 
mahout-spark ---
[INFO] Building jar: 
https://builds.apache.org/job/Mahout-Quality/2592/artifact/trunk/spark/target/mahout-spark-1.0-SNAPSHOT-sources.jar
[INFO] 
[INFO] --- maven-install-plugin:2.5.1:install (default-install) @ mahout-spark 
---
[INFO] Installing 
https://builds.apache.org/job/Mahout-Quality/2592/artifact/trunk/spark/target/mahout-spark-1.0-SNAPSHOT.jar
 to 
/home/jenkins/.m2/repository/org/apache/mahout/mahout-spark/1.0-SNAPSHOT/mahout-spark-1.0-SNAPSHOT.jar
[INFO] Installing 
https://builds.apache.org/job/Mahout-Quality/ws/trunk/spark/pom.xml to 
/home/jenkins/.m2/repository/org/apache/mahout/mahout-spark/1.0-SNAPSHOT/mahout-spark-1.0-SNAPSHOT.pom
[INFO] Installing 
https://builds.apache.org/job/Mahout-Quality/2592/artifact/trunk/spark/target/mahout-spark-1.0-SNAPSHOT-tests.jar
 to 
/home/jenkins/.m2/repository/org/apache/mahout/mahout-spark/1.0-SNAPSHOT/mahout-spark-1.0-SNAPSHOT-tests.jar
[INFO] Installing 
https://builds.apache.org/job/Mahout-Quality/2592/artifact/trunk/spark/target/mahout-spark-1.0-SNAPSHOT-sources.jar
 to 
/home/jenkins/.m2/repository/org/apache/mahout/mahout-spark/1.0-SNAPSHOT/mahout-spark-1.0-SNAPSHOT-sources.jar
[INFO] 
[INFO]  maven-javadoc-plugin:2.9.1:javadoc (default-cli) @ mahout-spark 
[INFO] 
[INFO] --- build-helper-maven-plugin:1.8:add-source (add-source) @ mahout-spark 
---
[INFO] Source directory: 
https://builds.apache.org/job/Mahout-Quality/ws/trunk/spark/target/generated-sources/mahout
 added.
[INFO] 
[INFO] --- build-helper-maven-plugin:1.8:add-test-source (add-test-source) @ 
mahout-spark ---
[INFO] Test Source directory: 
https://builds.apache.org/job/Mahout-Quality/ws/trunk/spark/target/generated-test-sources/mahout