[jira] [Created] (MAHOUT-1611) Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob

2014-08-28 Thread Haishou Ma (JIRA)
Haishou Ma created MAHOUT-1611:
--

 Summary: Preconditions.checkArgument in 
org.apache.mahout.utils.ConcatenateVectorsJob
 Key: MAHOUT-1611
 URL: https://issues.apache.org/jira/browse/MAHOUT-1611
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
Reporter: Haishou Ma
Priority: Minor


In org.apache.mahout.utils.ConcatenateVectorsJob line 115
Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
should be a directory);
paths.length == 0 should be paths.length  0



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAHOUT-1611) Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob

2014-08-28 Thread Haishou Ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Haishou Ma updated MAHOUT-1611:
---

Description: 
In org.apache.mahout.utils.ConcatenateVectorsJob line 111
Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
should be a directory);
paths.length == 0 should be paths.length  0

  was:
In org.apache.mahout.utils.ConcatenateVectorsJob line 115
Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
should be a directory);
paths.length == 0 should be paths.length  0


 Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob
 

 Key: MAHOUT-1611
 URL: https://issues.apache.org/jira/browse/MAHOUT-1611
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
Reporter: Haishou Ma
Priority: Minor

 In org.apache.mahout.utils.ConcatenateVectorsJob line 111
 Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
 should be a directory);
 paths.length == 0 should be paths.length  0



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1611) Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob

2014-08-28 Thread Suneel Marthi (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113501#comment-14113501
 ] 

Suneel Marthi commented on MAHOUT-1611:
---

Please open a PR with the proposed fix.

 Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob
 

 Key: MAHOUT-1611
 URL: https://issues.apache.org/jira/browse/MAHOUT-1611
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
Reporter: Haishou Ma
Priority: Minor

 In org.apache.mahout.utils.ConcatenateVectorsJob line 111
 Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
 should be a directory);
 paths.length == 0 should be paths.length  0



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1611) Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob

2014-08-28 Thread Haishou Ma (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113523#comment-14113523
 ] 

Haishou Ma commented on MAHOUT-1611:


Sorry, what does a PR mean?

 Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob
 

 Key: MAHOUT-1611
 URL: https://issues.apache.org/jira/browse/MAHOUT-1611
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
Reporter: Haishou Ma
Priority: Minor

 In org.apache.mahout.utils.ConcatenateVectorsJob line 111
 Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
 should be a directory);
 paths.length == 0 should be paths.length  0



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Comment Edited] (MAHOUT-1611) Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob

2014-08-28 Thread Haishou Ma (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113523#comment-14113523
 ] 

Haishou Ma edited comment on MAHOUT-1611 at 8/28/14 8:10 AM:
-

Sorry, what does a PR mean?Pull Request in https://github.com/apache/mahout?


was (Author: mahaishou):
Sorry, what does a PR mean?

 Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob
 

 Key: MAHOUT-1611
 URL: https://issues.apache.org/jira/browse/MAHOUT-1611
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
Reporter: Haishou Ma
Priority: Minor

 In org.apache.mahout.utils.ConcatenateVectorsJob line 111
 Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
 should be a directory);
 paths.length == 0 should be paths.length  0



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1611) Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob

2014-08-28 Thread Frank Rosner (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113541#comment-14113541
 ] 

Frank Rosner commented on MAHOUT-1611:
--

See [How to 
Contribute|https://mahout.apache.org/developers/how-to-contribute.html]/Making 
Changes.

Pull requests allow to review your code and to discuss it.

 Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob
 

 Key: MAHOUT-1611
 URL: https://issues.apache.org/jira/browse/MAHOUT-1611
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
Reporter: Haishou Ma
Priority: Minor

 In org.apache.mahout.utils.ConcatenateVectorsJob line 111
 Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
 should be a directory);
 paths.length == 0 should be paths.length  0



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1611) Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113636#comment-14113636
 ] 

ASF GitHub Bot commented on MAHOUT-1611:


GitHub user smarthi opened a pull request:

https://github.com/apache/mahout/pull/49

MAHOUT-1611: Preconditions.checkArgument in 
org.apache.mahout.utils.ConcatenateVectorsJob



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/smarthi/mahout suneel

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/mahout/pull/49.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #49


commit 7ef9086c1bd4409646f19a1439ea159f0d660a9b
Author: Suneel Marthi suneel.mar...@gmail.com
Date:   2014-08-28T10:18:11Z

MAHOUT-1611: Preconditions.checkArgument in 
org.apache.mahout.utils.ConcatenateVectorsJob




 Preconditions.checkArgument in org.apache.mahout.utils.ConcatenateVectorsJob
 

 Key: MAHOUT-1611
 URL: https://issues.apache.org/jira/browse/MAHOUT-1611
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
Reporter: Haishou Ma
Priority: Minor

 In org.apache.mahout.utils.ConcatenateVectorsJob line 111
 Preconditions.checkArgument(paths.length == 0, path.getName() +  is a file, 
 should be a directory);
 paths.length == 0 should be paths.length  0



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAHOUT-1595) iterateNonZero() is broken in MatrixVectorView

2014-08-28 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi resolved MAHOUT-1595.
---

   Resolution: Fixed
Fix Version/s: 1.0

Resolved on 21 July 2014

 iterateNonZero() is broken in MatrixVectorView
 --

 Key: MAHOUT-1595
 URL: https://issues.apache.org/jira/browse/MAHOUT-1595
 Project: Mahout
  Issue Type: Bug
  Components: Math
Reporter: Anand Avati
 Fix For: 1.0


 iterateNonZero() in MatrixVectorView is broken and behaves like a normal 
 iterator, over all elements.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113724#comment-14113724
 ] 

ASF GitHub Bot commented on MAHOUT-1610:


Github user srowen commented on the pull request:

https://github.com/apache/mahout/pull/46#issuecomment-53716190
  
I may still have the commit bit for ASF git, but can't merge the pull 
request myself. (I also realize I'm not yet sure if there's another step? will 
asfbot merge back to ASF git if merged here?)

Anyone who can merge this is welcome to do so!


 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests

 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113907#comment-14113907
 ] 

ASF GitHub Bot commented on MAHOUT-1610:


Github user dlyubimov commented on the pull request:

https://github.com/apache/mahout/pull/46#issuecomment-53745490
  
 may still have the commit bit for ASF git, but can't merge the pull 
request myself
Thanks, Sean. 

Yes, you can merge. A bit exploded beyond what's needed IMO but still 
useful [1]. Also, it works best if master is first merged to the PR branch and 
conflicts, if any, resolved there, so when you `--squash` stuff to master, you 
don't have to worry about conflicts on top of everything else. Hope this helps.

[1] http://mahout.apache.org/developers/github.html




 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests

 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113913#comment-14113913
 ] 

ASF GitHub Bot commented on MAHOUT-1610:


Github user srowen commented on the pull request:

https://github.com/apache/mahout/pull/46#issuecomment-53746682
  
Ah right, should have RTFM. Thanks! When you say beyond what's needed 
were you commenting on the PR, or on the docs? Just checking whether you meant 
you wanted to discuss the change more. 


 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests

 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113918#comment-14113918
 ] 

ASF GitHub Bot commented on MAHOUT-1610:


Github user dlyubimov commented on the pull request:

https://github.com/apache/mahout/pull/46#issuecomment-53748236
  
I meant the PR doc. Strictly in my opinion, since original versions  stuff
has been added that was not strictly to the point and makes read longer and
therefore harder than it needs be.


On Thu, Aug 28, 2014 at 9:14 AM, Sean Owen notificati...@github.com wrote:

 Ah right, should have RTFM. Thanks! When you say beyond what's needed
 were you commenting on the PR, or on the docs? Just checking whether you
 meant you wanted to discuss the change more.

 —
 Reply to this email directly or view it on GitHub
 https://github.com/apache/mahout/pull/46#issuecomment-53746682.



 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests

 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread Sean Owen (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen resolved MAHOUT-1610.
---

   Resolution: Fixed
Fix Version/s: 1.0

 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests
 Fix For: 1.0


 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113938#comment-14113938
 ] 

ASF GitHub Bot commented on MAHOUT-1610:


Github user asfgit closed the pull request at:

https://github.com/apache/mahout/pull/46


 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests
 Fix For: 1.0


 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Re: [jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread Ted Dunning
On Thu, Aug 28, 2014 at 6:04 AM, ASF GitHub Bot (JIRA) j...@apache.org
wrote:

 Github user srowen commented on the pull request:

 https://github.com/apache/mahout/pull/46#issuecomment-53716190

 I may still have the commit bit for ASF git, but can't merge the pull
 request myself. (I also realize I'm not yet sure if there's another step?
 will asfbot merge back to ASF git if merged here?)


If you do the commit with the github note closes #xx, then github does
the right thing.  Your commit does the merge.


Re: [jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread Dmitriy Lyubimov
On Thu, Aug 28, 2014 at 9:48 AM, Ted Dunning ted.dunn...@gmail.com wrote:


 If you do the commit with the github note closes #xx, then github does
 the right thing.  Your commit does the merge.


This is a bad advice.  Please always use github repo specified. it needs to
say closes apache/mahout #xx since just closes #xx also closess ALL PRs
in all collaborator's branches that just happen to have the same ordinal
number. as experienced first hand.


[jira] [Commented] (MAHOUT-1604) Create a RowSimilarity for Spark

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113952#comment-14113952
 ] 

ASF GitHub Bot commented on MAHOUT-1604:


Github user pferrel commented on the pull request:

https://github.com/apache/mahout/pull/47#issuecomment-53754239
  
Heads up, this changes the classname for CooccurrenceAnalysis to 
SimilarityAnalysis. There don't seem to be any tests in h2o so I don't think is 
should break anything. The spark tests have been updated naturally.


 Create a RowSimilarity for Spark
 

 Key: MAHOUT-1604
 URL: https://issues.apache.org/jira/browse/MAHOUT-1604
 Project: Mahout
  Issue Type: Bug
  Components: CLI
Affects Versions: 1.0
 Environment: Spark
Reporter: Pat Ferrel
Assignee: Pat Ferrel

 Using CooccurrenceAnalysis.cooccurrence create a driver that reads a text DRM 
 or two and produces LLR similarity/cross-similarity matrices.
 This will produce the same results as ItemSimilarity but take a Drm as input 
 instead of individual cells.
 The first version will only support LLR, other similarity measures will need 
 to be in separate Jiras



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Re: [jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread Dmitriy Lyubimov
Including PRs that may happen on private company github branches to private
code once public master is merged to them. Such is unfortunate workings of
github. So please don't.


On Thu, Aug 28, 2014 at 9:52 AM, Dmitriy Lyubimov dlie...@gmail.com wrote:




 On Thu, Aug 28, 2014 at 9:48 AM, Ted Dunning ted.dunn...@gmail.com
 wrote:


 If you do the commit with the github note closes #xx, then github does
 the right thing.  Your commit does the merge.


 This is a bad advice.  Please always use github repo specified. it needs
 to say closes apache/mahout #xx since just closes #xx also closess ALL
 PRs in all collaborator's branches that just happen to have the same
 ordinal number. as experienced first hand.




[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14113985#comment-14113985
 ] 

Hudson commented on MAHOUT-1610:


SUCCESS: Integrated in Mahout-Quality #2766 (See 
[https://builds.apache.org/job/Mahout-Quality/2766/])
MAHOUT-1610 Update tests to pass in Java 8; closes apache/mahout#46 (srowen: 
rev 91f15ecfe5d26de36d8e9bcd6cf109b307882a32)
* 
mrlegacy/src/test/java/org/apache/mahout/classifier/df/tools/VisualizerTest.java
* math/src/test/java/org/apache/mahout/math/random/MultinomialTest.java
* mrlegacy/src/test/java/org/apache/mahout/clustering/TestClusterInterface.java
* mrlegacy/src/test/java/org/apache/mahout/common/StringUtilsTest.java
* mrlegacy/src/main/java/org/apache/mahout/clustering/ClusteringUtils.java
* integration/src/test/resources/expected-arff-schema-2.json
* integration/src/test/resources/expected-arff-dictionary-2.csv
* mrlegacy/src/test/java/org/apache/mahout/classifier/df/DecisionForestTest.java
* mrlegacy/src/test/java/org/apache/mahout/classifier/df/data/DatasetTest.java
* integration/src/test/java/org/apache/mahout/utils/vectors/arff/DriverTest.java


 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests
 Fix For: 1.0


 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1604) Create a RowSimilarity for Spark

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114008#comment-14114008
 ] 

ASF GitHub Bot commented on MAHOUT-1604:


Github user andrewpalumbo commented on the pull request:

https://github.com/apache/mahout/pull/47#issuecomment-53761366
  
That shouldn't break anything in h2o


 Create a RowSimilarity for Spark
 

 Key: MAHOUT-1604
 URL: https://issues.apache.org/jira/browse/MAHOUT-1604
 Project: Mahout
  Issue Type: Bug
  Components: CLI
Affects Versions: 1.0
 Environment: Spark
Reporter: Pat Ferrel
Assignee: Pat Ferrel

 Using CooccurrenceAnalysis.cooccurrence create a driver that reads a text DRM 
 or two and produces LLR similarity/cross-similarity matrices.
 This will produce the same results as ItemSimilarity but take a Drm as input 
 instead of individual cells.
 The first version will only support LLR, other similarity measures will need 
 to be in separate Jiras



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Can't push

2014-08-28 Thread Pat Ferrel
I’m no longer able to git push

The first time I tried I got an error, then it started asking for a sign-in. 
We’re using keys, right? Do I need to reset my public key? Can someone remind 
me where that is done?

Re: Can't push

2014-08-28 Thread Tony Stevenson
No we're not using keys. User / pass combo. Can you login to 
https://id.apache.org if not that is why. It is possible your account is now 
locked so you should perform a reset if that is the case. 


Cheers,
Tony

Sent from my iPhone - Please excuse any brevity or typos. 



 On 28 Aug 2014, at 18:55, Pat Ferrel p...@occamsmachete.com wrote:
 
 I’m no longer able to git push
 
 The first time I tried I got an error, then it started asking for a sign-in. 
 We’re using keys, right? Do I need to reset my public key? Can someone remind 
 me where that is done?


[jira] [Commented] (MAHOUT-1604) Create a RowSimilarity for Spark

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114064#comment-14114064
 ] 

ASF GitHub Bot commented on MAHOUT-1604:


Github user asfgit closed the pull request at:

https://github.com/apache/mahout/pull/47


 Create a RowSimilarity for Spark
 

 Key: MAHOUT-1604
 URL: https://issues.apache.org/jira/browse/MAHOUT-1604
 Project: Mahout
  Issue Type: Bug
  Components: CLI
Affects Versions: 1.0
 Environment: Spark
Reporter: Pat Ferrel
Assignee: Pat Ferrel

 Using CooccurrenceAnalysis.cooccurrence create a driver that reads a text DRM 
 or two and produces LLR similarity/cross-similarity matrices.
 This will produce the same results as ItemSimilarity but take a Drm as input 
 instead of individual cells.
 The first version will only support LLR, other similarity measures will need 
 to be in separate Jiras



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1500) H2O integration

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114117#comment-14114117
 ] 

ASF GitHub Bot commented on MAHOUT-1500:


Github user avati commented on the pull request:

https://github.com/apache/mahout/pull/48#issuecomment-53772200
  
@andrewpalumbo - does this look good?


 H2O integration
 ---

 Key: MAHOUT-1500
 URL: https://issues.apache.org/jira/browse/MAHOUT-1500
 Project: Mahout
  Issue Type: Improvement
Reporter: Anand Avati
Assignee: Andrew Palumbo
 Fix For: 1.0


 Provide H2O backend for the Mahout DSL



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1568) Build an I/O model that can replace sequence files for import/export

2014-08-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114126#comment-14114126
 ] 

Hudson commented on MAHOUT-1568:


SUCCESS: Integrated in Mahout-Quality #2768 (See 
[https://builds.apache.org/job/Mahout-Quality/2768/])
MAHOUT-1604 add a CLI and associated code for spark-rowsimilarity, also cleans 
up some things in MAHOUT-1568 and MAHOUT-1569, closes apache/mahout#47 (pat: 
rev 149c98592fe447c98dfb5afc67b5809725cc3056)
* spark/pom.xml
* spark/src/main/scala/org/apache/mahout/drivers/RowSimilarityDriver.scala
* CHANGELOG
* spark/src/main/scala/org/apache/mahout/drivers/ReaderWriter.scala
* spark/src/main/scala/org/apache/mahout/drivers/TextDelimitedReaderWriter.scala
* spark/src/main/scala/org/apache/mahout/drivers/MahoutDriver.scala
* math-scala/src/main/scala/org/apache/mahout/math/scalabindings/MatrixOps.scala
* spark/src/main/scala/org/apache/mahout/drivers/FileSysUtils.scala
* spark/src/test/scala/org/apache/mahout/cf/CooccurrenceAnalysisSuite.scala
* spark/src/test/scala/org/apache/mahout/drivers/RowSimilarityDriverSuite.scala
* math-scala/src/main/scala/org/apache/mahout/math/drm/RLikeDrmOps.scala
* math-scala/src/main/scala/org/apache/mahout/math/cf/CooccurrenceAnalysis.scala
* bin/mahout
* spark/src/main/scala/org/apache/mahout/sparkbindings/SparkEngine.scala
* spark/src/main/scala/org/apache/mahout/drivers/MahoutOptionParser.scala
* spark/src/main/scala/org/apache/mahout/drivers/IndexedDataset.scala
* spark/src/main/scala/org/apache/mahout/drivers/Schema.scala
* spark/src/main/scala/org/apache/mahout/drivers/ItemSimilarityDriver.scala
* math-scala/src/main/scala/org/apache/mahout/math/cf/SimilarityAnalysis.scala
* 
math-scala/src/test/scala/org/apache/mahout/math/scalabindings/MatrixOpsSuite.scala
* spark/src/test/scala/org/apache/mahout/drivers/ItemSimilarityDriverSuite.scala


 Build an I/O model that can replace sequence files for import/export
 

 Key: MAHOUT-1568
 URL: https://issues.apache.org/jira/browse/MAHOUT-1568
 Project: Mahout
  Issue Type: New Feature
  Components: CLI
 Environment: Scala, Spark
Reporter: Pat Ferrel
Assignee: Pat Ferrel

 Implement mechanisms to read and write data from/to flexible stores. These 
 will support tuples streams and drms but with extensions that allow keeping 
 user defined values for IDs. The mechanism in some sense can replace Sequence 
 Files for import/export and will make the operation much easier for the user. 
 In many cases directly consuming their input files.
 Start with text delimited files for input/output in the Spark version of 
 ItemSimilarity
 A proposal is running with ItemSimilarity on Spark and is documented on the 
 github wiki here: https://github.com/pferrel/harness/wiki
 Comments are appreciated



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1569) Create CLI driver that supports Spark jobs

2014-08-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114127#comment-14114127
 ] 

Hudson commented on MAHOUT-1569:


SUCCESS: Integrated in Mahout-Quality #2768 (See 
[https://builds.apache.org/job/Mahout-Quality/2768/])
MAHOUT-1604 add a CLI and associated code for spark-rowsimilarity, also cleans 
up some things in MAHOUT-1568 and MAHOUT-1569, closes apache/mahout#47 (pat: 
rev 149c98592fe447c98dfb5afc67b5809725cc3056)
* spark/pom.xml
* spark/src/main/scala/org/apache/mahout/drivers/RowSimilarityDriver.scala
* CHANGELOG
* spark/src/main/scala/org/apache/mahout/drivers/ReaderWriter.scala
* spark/src/main/scala/org/apache/mahout/drivers/TextDelimitedReaderWriter.scala
* spark/src/main/scala/org/apache/mahout/drivers/MahoutDriver.scala
* math-scala/src/main/scala/org/apache/mahout/math/scalabindings/MatrixOps.scala
* spark/src/main/scala/org/apache/mahout/drivers/FileSysUtils.scala
* spark/src/test/scala/org/apache/mahout/cf/CooccurrenceAnalysisSuite.scala
* spark/src/test/scala/org/apache/mahout/drivers/RowSimilarityDriverSuite.scala
* math-scala/src/main/scala/org/apache/mahout/math/drm/RLikeDrmOps.scala
* math-scala/src/main/scala/org/apache/mahout/math/cf/CooccurrenceAnalysis.scala
* bin/mahout
* spark/src/main/scala/org/apache/mahout/sparkbindings/SparkEngine.scala
* spark/src/main/scala/org/apache/mahout/drivers/MahoutOptionParser.scala
* spark/src/main/scala/org/apache/mahout/drivers/IndexedDataset.scala
* spark/src/main/scala/org/apache/mahout/drivers/Schema.scala
* spark/src/main/scala/org/apache/mahout/drivers/ItemSimilarityDriver.scala
* math-scala/src/main/scala/org/apache/mahout/math/cf/SimilarityAnalysis.scala
* 
math-scala/src/test/scala/org/apache/mahout/math/scalabindings/MatrixOpsSuite.scala
* spark/src/test/scala/org/apache/mahout/drivers/ItemSimilarityDriverSuite.scala


 Create CLI driver that supports Spark jobs
 --

 Key: MAHOUT-1569
 URL: https://issues.apache.org/jira/browse/MAHOUT-1569
 Project: Mahout
  Issue Type: New Feature
  Components: CLI
 Environment: Scala, Spark
Reporter: Pat Ferrel
Assignee: Pat Ferrel

 Create a design for CLI drivers, including an option parser, base 
 MahoutDriver for Spark, that uses a text file I/O mechanism MAHOUT-1568
 A version of the proposal is implemented and running for ItemSimilarity on 
 Spark. MAHOUT-1541
 A proposal is running with ItemSimilarity on Spark and is documented on the 
 github wiki here: https://github.com/pferrel/harness/wiki
 Comments are appreciated



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1604) Create a RowSimilarity for Spark

2014-08-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114128#comment-14114128
 ] 

Hudson commented on MAHOUT-1604:


SUCCESS: Integrated in Mahout-Quality #2768 (See 
[https://builds.apache.org/job/Mahout-Quality/2768/])
MAHOUT-1604 add a CLI and associated code for spark-rowsimilarity, also cleans 
up some things in MAHOUT-1568 and MAHOUT-1569, closes apache/mahout#47 (pat: 
rev 149c98592fe447c98dfb5afc67b5809725cc3056)
* spark/pom.xml
* spark/src/main/scala/org/apache/mahout/drivers/RowSimilarityDriver.scala
* CHANGELOG
* spark/src/main/scala/org/apache/mahout/drivers/ReaderWriter.scala
* spark/src/main/scala/org/apache/mahout/drivers/TextDelimitedReaderWriter.scala
* spark/src/main/scala/org/apache/mahout/drivers/MahoutDriver.scala
* math-scala/src/main/scala/org/apache/mahout/math/scalabindings/MatrixOps.scala
* spark/src/main/scala/org/apache/mahout/drivers/FileSysUtils.scala
* spark/src/test/scala/org/apache/mahout/cf/CooccurrenceAnalysisSuite.scala
* spark/src/test/scala/org/apache/mahout/drivers/RowSimilarityDriverSuite.scala
* math-scala/src/main/scala/org/apache/mahout/math/drm/RLikeDrmOps.scala
* math-scala/src/main/scala/org/apache/mahout/math/cf/CooccurrenceAnalysis.scala
* bin/mahout
* spark/src/main/scala/org/apache/mahout/sparkbindings/SparkEngine.scala
* spark/src/main/scala/org/apache/mahout/drivers/MahoutOptionParser.scala
* spark/src/main/scala/org/apache/mahout/drivers/IndexedDataset.scala
* spark/src/main/scala/org/apache/mahout/drivers/Schema.scala
* spark/src/main/scala/org/apache/mahout/drivers/ItemSimilarityDriver.scala
* math-scala/src/main/scala/org/apache/mahout/math/cf/SimilarityAnalysis.scala
* 
math-scala/src/test/scala/org/apache/mahout/math/scalabindings/MatrixOpsSuite.scala
* spark/src/test/scala/org/apache/mahout/drivers/ItemSimilarityDriverSuite.scala


 Create a RowSimilarity for Spark
 

 Key: MAHOUT-1604
 URL: https://issues.apache.org/jira/browse/MAHOUT-1604
 Project: Mahout
  Issue Type: Bug
  Components: CLI
Affects Versions: 1.0
 Environment: Spark
Reporter: Pat Ferrel
Assignee: Pat Ferrel

 Using CooccurrenceAnalysis.cooccurrence create a driver that reads a text DRM 
 or two and produces LLR similarity/cross-similarity matrices.
 This will produce the same results as ItemSimilarity but take a Drm as input 
 instead of individual cells.
 The first version will only support LLR, other similarity measures will need 
 to be in separate Jiras



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1500) H2O integration

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114252#comment-14114252
 ] 

ASF GitHub Bot commented on MAHOUT-1500:


Github user andrewpalumbo commented on the pull request:

https://github.com/apache/mahout/pull/48#issuecomment-53788269
  
Looks good anand, thanks.


 H2O integration
 ---

 Key: MAHOUT-1500
 URL: https://issues.apache.org/jira/browse/MAHOUT-1500
 Project: Mahout
  Issue Type: Improvement
Reporter: Anand Avati
Assignee: Andrew Palumbo
 Fix For: 1.0


 Provide H2O backend for the Mahout DSL



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1500) H2O integration

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114256#comment-14114256
 ] 

ASF GitHub Bot commented on MAHOUT-1500:


Github user asfgit closed the pull request at:

https://github.com/apache/mahout/pull/48


 H2O integration
 ---

 Key: MAHOUT-1500
 URL: https://issues.apache.org/jira/browse/MAHOUT-1500
 Project: Mahout
  Issue Type: Improvement
Reporter: Anand Avati
Assignee: Andrew Palumbo
 Fix For: 1.0


 Provide H2O backend for the Mahout DSL



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Assigned] (MAHOUT-1608) Add Option WikipediaToSequenceFile to remove Category Labels from Documents

2014-08-28 Thread Andrew Palumbo (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Palumbo reassigned MAHOUT-1608:
--

Assignee: Andrew Palumbo

 Add Option WikipediaToSequenceFile to remove Category Labels from Documents
 ---

 Key: MAHOUT-1608
 URL: https://issues.apache.org/jira/browse/MAHOUT-1608
 Project: Mahout
  Issue Type: Improvement
Affects Versions: 0.9
Reporter: Andrew Palumbo
Assignee: Andrew Palumbo
Priority: Minor
 Fix For: 1.0


 Currently WikipediaMapper job extracts Category labels from the text of the 
 Wikipedia documents and leaves the label as [[Category:label]] in the 
 document.  Add in an option to WikipediaToSequenceFile.java to remove 
 [[Category:label]] from the text after extracting the label.   



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Assigned] (MAHOUT-1559) Add documentation for and clean up the wikipedia classifier example

2014-08-28 Thread Andrew Palumbo (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Palumbo reassigned MAHOUT-1559:
--

Assignee: Andrew Palumbo

 Add documentation for and clean up the wikipedia classifier example
 ---

 Key: MAHOUT-1559
 URL: https://issues.apache.org/jira/browse/MAHOUT-1559
 Project: Mahout
  Issue Type: Improvement
  Components: Documentation, Examples
Affects Versions: 1.0
Reporter: Andrew Palumbo
Assignee: Andrew Palumbo
Priority: Minor
 Fix For: 1.0


 Add documentation for the wikipedia classifer example. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (MAHOUT-1559) Add documentation for and clean up the wikipedia classifier example

2014-08-28 Thread Andrew Palumbo (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Palumbo updated MAHOUT-1559:
---

Summary: Add documentation for and clean up the wikipedia classifier 
example  (was: Add documentation for the wikipedia classifier example)

 Add documentation for and clean up the wikipedia classifier example
 ---

 Key: MAHOUT-1559
 URL: https://issues.apache.org/jira/browse/MAHOUT-1559
 Project: Mahout
  Issue Type: Improvement
  Components: Documentation, Examples
Affects Versions: 1.0
Reporter: Andrew Palumbo
Priority: Minor
 Fix For: 1.0


 Add documentation for the wikipedia classifer example. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1500) H2O integration

2014-08-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114316#comment-14114316
 ] 

Hudson commented on MAHOUT-1500:


SUCCESS: Integrated in Mahout-Quality #2769 (See 
[https://builds.apache.org/job/Mahout-Quality/2769/])
MAHOUT-1500: H2O Integration - more style revisions closes apache/mahout#48 
(ap.dev: rev 03a5bb61ed56daccd207d7a255956e21612cf995)
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/TimesRightMatrix.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/RowRange.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/At.java
* h2o/src/main/scala/org/apache/mahout/h2obindings/H2OEngine.scala
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/AewB.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/AtB.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/AtA.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/Cbind.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/Ax.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/ABt.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/Atx.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/AewScalar.java
* h2o/src/main/java/org/apache/mahout/h2obindings/ops/Rbind.java


 H2O integration
 ---

 Key: MAHOUT-1500
 URL: https://issues.apache.org/jira/browse/MAHOUT-1500
 Project: Mahout
  Issue Type: Improvement
Reporter: Anand Avati
Assignee: Andrew Palumbo
 Fix For: 1.0


 Provide H2O backend for the Mahout DSL



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1500) H2O integration

2014-08-28 Thread Andrew Palumbo (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114383#comment-14114383
 ] 

Andrew Palumbo commented on MAHOUT-1500:


[~avati]- Looks good!  Thanks for cleaning that up.  I think that the only 
thing left to do is add in some java/scaladoc comments and we can close this up.

 H2O integration
 ---

 Key: MAHOUT-1500
 URL: https://issues.apache.org/jira/browse/MAHOUT-1500
 Project: Mahout
  Issue Type: Improvement
Reporter: Anand Avati
Assignee: Andrew Palumbo
 Fix For: 1.0


 Provide H2O backend for the Mahout DSL



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Jenkins build is back to normal : mahout-nightly #1666

2014-08-28 Thread Apache Jenkins Server
See https://builds.apache.org/job/mahout-nightly/1666/changes



Jenkins build is back to normal : mahout-nightly » Mahout Build Tools #1666

2014-08-28 Thread Apache Jenkins Server
See 
https://builds.apache.org/job/mahout-nightly/org.apache.mahout$mahout-buildtools/1666/



[jira] [Commented] (MAHOUT-1608) Add Option WikipediaToSequenceFile to remove Category Labels from Documents

2014-08-28 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114642#comment-14114642
 ] 

ASF GitHub Bot commented on MAHOUT-1608:


Github user asfgit closed the pull request at:

https://github.com/apache/mahout/pull/45


 Add Option WikipediaToSequenceFile to remove Category Labels from Documents
 ---

 Key: MAHOUT-1608
 URL: https://issues.apache.org/jira/browse/MAHOUT-1608
 Project: Mahout
  Issue Type: Improvement
Affects Versions: 0.9
Reporter: Andrew Palumbo
Assignee: Andrew Palumbo
Priority: Minor
 Fix For: 1.0


 Currently WikipediaMapper job extracts Category labels from the text of the 
 Wikipedia documents and leaves the label as [[Category:label]] in the 
 document.  Add in an option to WikipediaToSequenceFile.java to remove 
 [[Category:label]] from the text after extracting the label.   



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (MAHOUT-1608) Add Option WikipediaToSequenceFile to remove Category Labels from Documents

2014-08-28 Thread Andrew Palumbo (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Palumbo resolved MAHOUT-1608.


Resolution: Fixed

 Add Option WikipediaToSequenceFile to remove Category Labels from Documents
 ---

 Key: MAHOUT-1608
 URL: https://issues.apache.org/jira/browse/MAHOUT-1608
 Project: Mahout
  Issue Type: Improvement
Affects Versions: 0.9
Reporter: Andrew Palumbo
Assignee: Andrew Palumbo
Priority: Minor
 Fix For: 1.0


 Currently WikipediaMapper job extracts Category labels from the text of the 
 Wikipedia documents and leaves the label as [[Category:label]] in the 
 document.  Add in an option to WikipediaToSequenceFile.java to remove 
 [[Category:label]] from the text after extracting the label.   



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1608) Add Option WikipediaToSequenceFile to remove Category Labels from Documents

2014-08-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114676#comment-14114676
 ] 

Hudson commented on MAHOUT-1608:


SUCCESS: Integrated in Mahout-Quality #2770 (See 
[https://builds.apache.org/job/Mahout-Quality/2770/])
MAHOUT-1608: Add option in WikipediaToSequenceFile to remove category labels 
from documents. Remove redundant call to findMatchingCategory.  closes 
apache/mahout#45 (ap.dev: rev 6dd0c92ddb9dad2627e9ca5e28118865d6fba159)
* CHANGELOG
* integration/src/main/java/org/apache/mahout/text/WikipediaToSequenceFile.java
* 
integration/src/main/java/org/apache/mahout/text/wikipedia/WikipediaMapper.java


 Add Option WikipediaToSequenceFile to remove Category Labels from Documents
 ---

 Key: MAHOUT-1608
 URL: https://issues.apache.org/jira/browse/MAHOUT-1608
 Project: Mahout
  Issue Type: Improvement
Affects Versions: 0.9
Reporter: Andrew Palumbo
Assignee: Andrew Palumbo
Priority: Minor
 Fix For: 1.0


 Currently WikipediaMapper job extracts Category labels from the text of the 
 Wikipedia documents and leaves the label as [[Category:label]] in the 
 document.  Add in an option to WikipediaToSequenceFile.java to remove 
 [[Category:label]] from the text after extracting the label.   



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Build failed in Jenkins: Mahout-Quality #2771

2014-08-28 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/2771/changes

Changes:

[srowen] MAHOUT-1610 addendum: add CHANGELOG note

--
[...truncated 7667 lines...]
pcaControl:
{
  0  = 
{0:-5.668224460908154,1:-6.133214511308873,2:0.779825278384518,3:-1.0626643480581115,4:-0.06538841105740897,5:-0.04349879929171554,6:-0.06507643146171325,7:-0.006672372115154449,8:-0.0019160303748368523,9:7.147168739488035E-4}
  1  = 
{0:14.746441059414165,1:0.1238449432155428,2:0.509150191469107,3:0.4137388528291847,4:0.4602304546866354,5:-0.030092835495961597,6:-0.056985559374213894,7:-0.005383768497924209,8:-0.0030665235947086983,9:-0.003185941529074587}
  2  = 
{0:-1.0990711930828203,1:2.112004016731812,2:0.6369936971603877,3:0.40066673113765894,4:-0.09407717241766621,5:0.16713018353526032,6:0.016204859925633024,7:0.023855732691034304,8:-0.0016098234173380218,9:9.122665238331369E-4}
  3  = 
{0:-2.343738208181828,1:1.728589380524,2:1.0496211749048339,3:0.3391338800120744,4:0.42118942185569147,5:-0.058002321673436046,6:-0.06279289708931955,7:-0.005985509935939571,8:-0.001577611924415433,9:6.435206497681367E-4}
  4  = 
{0:-2.8258135063306025,1:2.5456315939021112,2:-2.7761375304054297,3:0.18896801583102982,4:-0.08909300750068946,5:-0.04768178986720063,6:0.005996445601075281,7:-0.004688932648084842,8:-0.0019988695090549114,9:5.368932160626223E-4}
  5  = 
{0:14.783534397198967,1:0.29501509257269437,2:0.7769477989297868,3:-1.1500481296888652,4:-0.09462906519128594,5:0.18156858452961613,6:0.017776821093516578,7:-0.003623965976588348,8:0.007827898130456826,9:7.230575194194227E-4}
  6  = 
{0:9.171538465556496,1:-7.175558039374563,2:-3.635190529531252,3:0.26468851672819754,4:-0.10959537162291529,5:-0.029728456598475363,6:0.020325484403651273,7:-0.008415615578830245,8:0.006282408064286262,9:0.0010658814415176934}
  7  = 
{0:-3.0041815916415446,1:1.942663933478009,2:0.5119422239977843,3:0.2763405492176308,4:-0.16727903392370685,5:-0.03560463803468493,6:-0.0636262914744333,7:-0.005132709375403085,8:-0.0020968858498234227,9:6.459293841668969E-4}
  8  = 
{0:-1.4367848261801806,1:2.9643681587815434,2:-2.73136079842,3:0.04120009370991556,4:-0.1492391527326176,5:-0.025079506195922904,6:0.006004013139264973,7:-0.004597335572305195,8:0.007223888432389089,9:7.344779171772158E-4}
  9  = 
{0:-1.7283784683760932,1:2.758066793169005,2:-2.7878134378042914,3:0.20694927779394884,4:-0.15124172553064372,5:0.17362751994048398,6:0.009930210366534378,7:-0.005660577591472821,8:-0.002278641733622213,9:7.533068782920091E-4}
}
- dspca
spectrum:{0:300.0,1:110.3638323514327,2:40.60058497098381,3:14.936120510359183,4:5.49469120254,5:2.02138409972564,6:0.743625652076,7:0.2735645896663549,8:0.10063878837075356,9:0.037022941226003865,10:0.013619978928745453,11:0.005010510237073698,12:0.001843263705998463,13:0.0010,14:0.0010,15:0.0010,16:0.0010,17:0.0010,18:0.0010,19:0.0010,20:0.0010,21:0.0010,22:0.0010,23:0.0010,24:0.0010,25:0.0010,26:0.0010,27:0.0010,28:0.0010,29:0.0010,30:0.0010,31:0.0010,32:0.0010,33:0.0010,34:0.0010,35:0.0010,36:0.0010,37:0.0010,38:0.0010,39:0.0010}
Control block:
{
  0  = {0:0.3947476722883563,1:-0.08695028358267716,2:-1.0574297632219802}
  1  = {0:0.4076559804271818,1:0.013563509240543453,2:-0.6050700722864573}
  2  = {0:0.15935325307337903,1:0.07468219465060774,2:-0.37963073350622206}
}
ALS factorized approximation block:
{
  0  = {0:0.39473834081223236,1:-0.08694970637335711,2:-1.0574244769841608}
  1  = {0:0.40763342202551156,1:0.013569874413426178,2:-0.6050658217564977}
  2  = {0:0.15932771034936122,1:0.07468368649827967,2:-0.3796283428829573}
}
norm of residuals 0.005133
train iteration rmses: List(1.499530739399157E-7, 4.780852923458485E-8, 
5.270138513995659E-8)
- dals
DrmLikeOpsSuite:
{
  0  = {0:2.0,1:3.0,2:4.0}
  1  = {0:3.0,1:4.0,2:5.0}
  2  = {0:4.0,1:5.0,2:6.0}
  3  = {0:5.0,1:6.0,2:7.0}
}
- mapBlock
{
  0  = {0:2.0,1:3.0}
  1  = {0:3.0,1:4.0}
  2  = {0:4.0,1:5.0}
  3  = {0:5.0,1:6.0}
}
- col range
{
  0  = {0:2.0,1:3.0,2:4.0}
  1  = {0:3.0,1:4.0,2:5.0}
}
- row range
{
  0  = {0:3.0,1:4.0}
  1  = {0:4.0,1:5.0}
}
- col, row range
- exact, min and auto ||
ItemSimilarityDriverSuite:
- ItemSimilarityDriver, non-full-spec CSV
- ItemSimilarityDriver TSV 
- ItemSimilarityDriver log-ish files
- ItemSimilarityDriver legacy supported file format
- ItemSimilarityDriver write search engine output
- ItemSimilarityDriver recursive file discovery using filename patterns
- ItemSimilarityDriver, two input paths
- ItemSimilarityDriver, two inputs of different dimensions
- ItemSimilarityDriver cross similarity two separate items spaces
- A.t %*% B after changing row cardinality of A
- ItemSimilarityDriver cross similarity two separate items spaces, missing 
rows in B
BlasSuite:
AB' num partitions = 2.
{
  2  = 

[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114885#comment-14114885
 ] 

Hudson commented on MAHOUT-1610:


FAILURE: Integrated in Mahout-Quality #2771 (See 
[https://builds.apache.org/job/Mahout-Quality/2771/])
MAHOUT-1610 addendum: add CHANGELOG note (srowen: rev 
d9e26c64d620d723c94386e1a5ef9e70588c31a3)
* CHANGELOG


 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests
 Fix For: 1.0


 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1610) Tests can be made more robust to pass in Java 8

2014-08-28 Thread Sean Owen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14114893#comment-14114893
 ] 

Sean Owen commented on MAHOUT-1610:
---

I forgot to modify CHANGELOG, so fixed that in an extra commit. The original 
Hudson tests passed, so I can only assume this change to just the CHANGELOG 
file can't have broken something, and that this is a spurious failure.

 Tests can be made more robust to pass in Java 8
 ---

 Key: MAHOUT-1610
 URL: https://issues.apache.org/jira/browse/MAHOUT-1610
 Project: Mahout
  Issue Type: Bug
  Components: Integration
Affects Versions: 0.9
 Environment: Java 1.8.0_11 OS X 10.9.4
Reporter: Sean Owen
Assignee: Sean Owen
Priority: Minor
  Labels: java8, tests
 Fix For: 1.0


 Right now, several tests don't seem to pass when run with Java 8 (at least on 
 Java 8). The failures are benign, and just due to tests looking for 
 too-specific values or expecting things like a certain ordering of hashmaps. 
 The tests can easily be made to pass both Java 8 and Java 6/7 at the same 
 time by either relaxing the tests in a principled way, or accepting either 
 output of two equally valid ones as correct.
 (There's also one curious compilation failure in Java 8, related to generics. 
 It is fixable by changing to a more explicit declaration that should be 
 equivalent. It should be entirely equivalent at compile time, and of course, 
 at run time. I am not sure it's not just a javac bug, but, might as well work 
 around when it's so easy.)



--
This message was sent by Atlassian JIRA
(v6.2#6252)