[jira] [Created] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass
zhubin created MAHOUT-1651: -- Summary: [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass Key: MAHOUT-1651 URL: https://issues.apache.org/jira/browse/MAHOUT-1651 Project: Mahout Issue Type: Bug Components: Math Affects Versions: 0.9 Reporter: zhubin Priority: Minor Running org.apache.mahout.math.random.MultinomialTest Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.371 sec FAILURE! - in org.apache.mahout.math.random.MultinomialTest testPrime(org.apache.mahout.math.random.MultinomialTest) Time elapsed: 0.039 sec FAILURE! java.lang.AssertionError: expected:16 but was:17 at __randomizedtesting.SeedInfo.seed([AAD9BA068467B3DA:9D6D3992EDADE21A]:0) at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.mahout.math.random.MultinomialTest.testPrime(MultinomialTest.java:143) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass
[ https://issues.apache.org/jira/browse/MAHOUT-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14370948#comment-14370948 ] zhubin commented on MAHOUT-1651: I see the trunk has changed the org/apache/mahout/math/random/MultinomialTest.java to work around this issue. However, I'm not sure if the JDK 8 itself introduced the issue or it was a test code issue. [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass -- Key: MAHOUT-1651 URL: https://issues.apache.org/jira/browse/MAHOUT-1651 Project: Mahout Issue Type: Bug Components: Math Affects Versions: 0.9 Reporter: zhubin Priority: Minor Running org.apache.mahout.math.random.MultinomialTest Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.371 sec FAILURE! - in org.apache.mahout.math.random.MultinomialTest testPrime(org.apache.mahout.math.random.MultinomialTest) Time elapsed: 0.039 sec FAILURE! java.lang.AssertionError: expected:16 but was:17 at __randomizedtesting.SeedInfo.seed([AAD9BA068467B3DA:9D6D3992EDADE21A]:0) at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.mahout.math.random.MultinomialTest.testPrime(MultinomialTest.java:143) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass
[ https://issues.apache.org/jira/browse/MAHOUT-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi resolved MAHOUT-1651. --- Resolution: Not a Problem Fix Version/s: 0.10.0 Assignee: Suneel Marthi [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass -- Key: MAHOUT-1651 URL: https://issues.apache.org/jira/browse/MAHOUT-1651 Project: Mahout Issue Type: Bug Components: Math Affects Versions: 0.9 Reporter: zhubin Assignee: Suneel Marthi Priority: Minor Fix For: 0.10.0 Running org.apache.mahout.math.random.MultinomialTest Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.371 sec FAILURE! - in org.apache.mahout.math.random.MultinomialTest testPrime(org.apache.mahout.math.random.MultinomialTest) Time elapsed: 0.039 sec FAILURE! java.lang.AssertionError: expected:16 but was:17 at __randomizedtesting.SeedInfo.seed([AAD9BA068467B3DA:9D6D3992EDADE21A]:0) at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.mahout.math.random.MultinomialTest.testPrime(MultinomialTest.java:143) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass
[ https://issues.apache.org/jira/browse/MAHOUT-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14371381#comment-14371381 ] Suneel Marthi commented on MAHOUT-1651: --- This is nothing to do with JDK versions. Its a random test that needed to be loosened up, closing this Jira as its a non-issue. [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass -- Key: MAHOUT-1651 URL: https://issues.apache.org/jira/browse/MAHOUT-1651 Project: Mahout Issue Type: Bug Components: Math Affects Versions: 0.9 Reporter: zhubin Priority: Minor Fix For: 0.10.0 Running org.apache.mahout.math.random.MultinomialTest Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.371 sec FAILURE! - in org.apache.mahout.math.random.MultinomialTest testPrime(org.apache.mahout.math.random.MultinomialTest) Time elapsed: 0.039 sec FAILURE! java.lang.AssertionError: expected:16 but was:17 at __randomizedtesting.SeedInfo.seed([AAD9BA068467B3DA:9D6D3992EDADE21A]:0) at org.junit.Assert.fail(Assert.java:88) at org.junit.Assert.failNotEquals(Assert.java:743) at org.junit.Assert.assertEquals(Assert.java:118) at org.junit.Assert.assertEquals(Assert.java:555) at org.junit.Assert.assertEquals(Assert.java:542) at org.apache.mahout.math.random.MultinomialTest.testPrime(MultinomialTest.java:143) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1431. Resolution: Cannot Reproduce Closing this as we didn't get another answer for 6 weeks. Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13967797#comment-13967797 ] Sebastian Schelter commented on MAHOUT-1431: Any progess here? Otherwise I'll close the ticket soon. Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13924818#comment-13924818 ] Suneel Marthi commented on MAHOUT-1431: --- yannis, could u provide explicit details as to the task name. KMeans has 2 mapper tasks and a reducer task. Which mapper is taking longer time? I would expect ClusterClassificationMapper to be slower due to the additional overhead of calculating the vector distance from the centroid and converting the vector to a named vector. Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13924841#comment-13924841 ] yannis ats commented on MAHOUT-1431: how can i distinguish which mapper is slower? by the logs? i would try to see the logs here the time was defined approximately by the console of emr Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922707#comment-13922707 ] Suneel Marthi commented on MAHOUT-1431: --- hi yannis, any updates on this? Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922734#comment-13922734 ] yannis ats commented on MAHOUT-1431: HI i will report you in few hours since i will be able to inspect manually how much time is consumed on mapper/reducers kind regards yannis Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923048#comment-13923048 ] yannis ats commented on MAHOUT-1431: From manual inspection it seems that the reducer takes twice the time that a mapper needs to complete the task.I did experiments with 6 large machines as data nodes. Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Created] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
yannis ats created MAHOUT-1431: -- Summary: Comparison of Mahout 0.8 vs mahout 0.9 in EMR Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.9, 0.8 Reporter: yannis ats Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am buffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yannis ats updated MAHOUT-1431: --- Description: Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. was: Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am buffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919260#comment-13919260 ] Sebastian Schelter commented on MAHOUT-1431: That is really strange, I don't think we changed something in our k-Means implementation (can someone verify this?). Could it be that some change in our vector code causes this behaviour? Btw: k-Means should have an option to fix the random seed for the initialization to enable repeatable experiments Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919267#comment-13919267 ] Suneel Marthi commented on MAHOUT-1431: --- Could u provide CODE snapshots of where u believe that the iterations r taking longer? the only change that was done to address Mahout-1030 was convert all vectors to Named Vectors to store the Vector Ids and vector distances from cluster centers. The code changes for that are in ClusterClassificationDriver (for Sequential mode) and ClusterClassificationMapper (for MR mode) which are post processing steps after clustering is done. Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919271#comment-13919271 ] Suneel Marthi commented on MAHOUT-1431: --- The other change in 0.9 that comes to mind and could have potentially impacted the vector behavious is Mahout-1242. Mere late night speculation and evidence not verified?? Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919273#comment-13919273 ] yannis ats commented on MAHOUT-1431: The notion of time for every iteration is part of what is recorded when the job(iteration) finishes and was found from the console of EMR. i have not inspected the code,i just did experiments with a large dataset because when i did the same experiment from 0.7 to 0.8 i found that 0.8 was much faster(in terms of the number of iterations,i dont remember how much time every iteration was taking) Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919293#comment-13919293 ] Suneel Marthi commented on MAHOUT-1431: --- Comparing 0.7 to 0.8 is comparing apples-oranges. The clustering code was redone for 0.7 and wasn't functioning right until 0.8. Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919308#comment-13919308 ] Sebastian Schelter commented on MAHOUT-1431: Could you see where the additional time was spent? (e.g. in the mapper or in the reducer)? Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR
[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919314#comment-13919314 ] yannis ats commented on MAHOUT-1431: I am pretty sure that the reducer takes more time than the mapper,but if i remember well probably both mapper and reducer take more time in 0.9 than 0.8 but i think it took more time on the mapper but i am not very confident (my memory is not very good) but i cannot answer this question right now, probably i have to restart the processes and check manually how long it takes Comparison of Mahout 0.8 vs mahout 0.9 in EMR - Key: MAHOUT-1431 URL: https://issues.apache.org/jira/browse/MAHOUT-1431 Project: Mahout Issue Type: Question Components: Clustering Affects Versions: 0.8, 0.9 Reporter: yannis ats Labels: performance Hi all, i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and i performed kmeans experiments with both versions in amazon EMR. What i found is that mahout 0.8 is faster than mahout 0.9 in particular i observed that mahout 0.8 is performing less iterations and every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 0.8 is twice as fast as that of 0.9 the hadoop version was 1.0.x and the input of the data was roughly 2 million datapoints with dimensionality of 1800. The input parameters in both experiments were exactly the same,modulo the initialization which was random in both cases and i can understand that this may affect the convergence(the amount of iterations),but i am baffled by the fact that every iteration takes almost twice the time in 0.9 vs 0.8 Is this normal?is this expected? thank you in advance for your time. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: Solr-recommender for Mahout 0.9
On Sat, Feb 22, 2014 at 4:50 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: *Ted*, if you have any code you could donate for this example from your and Ellen's book I'd love to be able to re-use it. I do. I will try to open up access to that sometime today. Pat's work on the cross recommender job is also important.
Re: Solr-recommender for Mahout 0.9
*Pat*, I opened a ticket(M-1420) for putting a new script in examples/ that uses the solr-recommender. Seems there's another, related ticket from Suneel in M-1288. Did the work described in the thread below make it into 0.9, and/or how much more is needed on it? *Ted*, if you have any code you could donate for this example from your and Ellen's book I'd love to be able to re-use it. Thanks Andrew On Sun, Nov 17, 2013 at 3:36 PM, Pat Ferrel p...@occamsmachete.com wrote: Eventually I'd like to get MAP built into the solr-recommender. Used it at a client who had good data. It was very helpful for exploring what data was useful and what wasn't. We'd run map with and without detail-view data for instance and take the MAP as a measure of how predictive the data was. In our case the MAP@ numbers went down with purchase and detail-view mixed together. That was why I got interested in the cross-action recommender--as a way to scrub less predictive actions. Didn't finish it before I lost access to the data unfortunately. What form of precision calc will you use? Obviously we used mean average precision at different numbers of recommendations, which had the effect of producing a fall-off curve. The curve, we took, as a measure of how well our ranking was working. On Nov 17, 2013, at 10:47 AM, Ken Krugler kkrugler_li...@transpac.com wrote: Hi Pat, On Nov 13, 2013, at 4:43pm, Pat Ferrel pat.fer...@gmail.com wrote: Ever done an offline precision calc? No, sorry. I do (finally) have one client with some data that could be used to calculate precision, and a willingness to pay for the work, so I'm hoping to include details on that in my next blog post about text feature selection. -- Ken On Nov 13, 2013, at 1:39 PM, Ken Krugler kkrugler_li...@transpac.com wrote: Hi Pat, On Nov 13, 2013, at 9:21am, Pat Ferrel p...@occamsmachete.com wrote: A version is now checked in that uses mahout 0.9. Haven't tested it on a cluster yet, only locally. I have to upgrade my cluster to Hadoop 1.2.1, which takes some time. Saw the Strata slides from Ted touting dithering of results, which I'll implement. Ken, did you have anything specific for And usually I just use Solr to generate a candidate list, then I do more specific scoring to find the N best form N*4 candidates? If I'm looking for the top N best matches, I'll do a Solr query with rows=N*4. Then I use all of the data from these potential matches, and calculate a more sophisticated similarity score (e.g. adding a weighting based on the user's activity level) between my target and these candidates. Regards, -- Ken Was planning to try boosting by something like genre/category in the recs query. For instance, in the demo data, each item will soon have a set of tags (actually genre names) so these could be a field being queried along with the item-item links. The query for recs would then include the user history against the item-item links, and the average genre tags preferred by the user against item genre tags. This would return recs skewed towards the user's genre preference. Another way this could be used is when showing similar items. You'd have the tags for the item being viewed and so could use them to skew towards items with similar tags. I think this works but would turn similar items from a lookup (they are pre-calculated by Mahout) into another Solr query. On Nov 8, 2013, at 1:27 PM, Pat Ferrel p...@occamsmachete.com wrote: Not planning to do anything with weights at present. An ORed query should suffice for the time being and Solr weights. There are a good list of ways to do this later if it warrants an experiment. Thanks. Have, similar items as input, recommendations from user likes, and just got recs from recently viewed working. Once you have online recs from the pre-calculated model experimenting is super easy. The next step will be to get more metadata ingested so we can try boosting by context genre, or recent genre viewed, which is sort of in line with more specific scoring to find the N best from N*4 candidates. Also want to do what Ted calls dithering to vary the choices you see. On Nov 8, 2013, at 10:10 AM, Ken Krugler kkrugler_li...@transpac.com wrote: One other thing I should have mentioned is that if you care about setting weights on incoming terms, you can boost them using the ^value syntax. E.g. the_kings_speech^1.5 OR skyfalll^0.5 OR looper^3.0... If you want to account for weights of terms in the index, it's a bit harder. You can do simple boosting by replicating terms, or you can use payload-based boosting, or you could code up your own Similarity class that takes advantage of side-channel data. But in my experience the gain from applying weights to terms int he index isn't very significant. And usually I just Solr to generate a candidate list, then I do more specific scoring to find the N best
Re: Mahout 0.9 Release Notes - First Draft
Could someone please point me to the URL for adding Mahout release notes? On Monday, February 17, 2014 3:27 PM, Ellen Friedman b.ellen.fried...@gmail.com wrote: Hi Suneel, Thanks for notes. I'm inquiring about status of the notes and update to the website to announce 0.9: Ted has reviewed the release notes - were you waiting for additional input or are they ready to go on the website? Are you the one who updates the site? I've been asked to write a short blog on the release but wanted to wait until the site is updated. Thanks much Ellen On Tue, Feb 11, 2014 at 10:06 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's a draft of the Release Notes for Mahout 0.9, Please review the same. -- The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries focused primarily in the areas of collaborative filtering (recommenders), clustering and classification (known collectively as the 3Cs), as well as the necessary infrastructure to support those implementations including, but not limited to, math packages for statistics, linear algebra and others as well as Java primitive collections, local and distributed vector and matrix classes and a variety of integrative code to work with popular packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache Cassandra and much more. The 0.9 release is mainly a clean up release in preparation for an upcoming 1.0 release targeted for first half of 2014, but there are a few significant new features, which are highlighted below. To get started with Apache Mahout 0.9, download the release artifacts and signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central Maven repository. As with any release, we wish to thank all of the users and contributors to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for individual credits, as there are too many to list here. GETTING STARTED In the release package, the examples directory contains several working examples of the core functionality available in Mahout. These can be run via scripts in the examples/bin directory and will prompt you for more information to help you try things out. Most examples do not need a Hadoop cluster in order to run. RELEASE HIGHLIGHTS The highlights of the Apache Mahout 0.9 release include, but are not limited to the list below. For further information, see the included CHANGELOG[1] file. - MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra. See http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html - MAHOUT-1288: Recommenders as a Search. See https://github.com/pferrel/solr-recommender - MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1 - MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 1-dimensional Clustering See https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf for the details. - MAHOUT-1265: MultiLayer Perceptron (MLP) classifier This is an early implementation of MLP to solicit user feedback, needs to be integrated into Mahout’s processing pipeline to work with Mahout’s vectors. - Removed Deprecated algorithms as they have been either replaced by better performing algorithms or lacked user support and maintenance. - the usual bug fixes. See [2] for more information on the 0.9 release. A total of 113 separate JIRA issues were addressed in this release. The following algorithms that were marked deprecated in 0.8 have been removed in 0.9: - From Clustering: Switched LDA implementation from using Dirtichlet to Collapsed Variational Bayes (CVB) Meanshift MinHash - removed due to poor performance, lack of support and lack of usage - From Classification (both are sequential implementations) Winnow - lack of actual usage and support Perceptron - lack of actual usage and support - Collaborative Filtering SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and org.apache.mahout.cf.taste.impl.recommender.slopeone Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender - Mahout Math Hadoop entropy stuff in org.apache.mahout.math.stats.entropy CONTRIBUTING Mahout is always looking for contributions focused on the 3Cs. If you are interested in contributing, please see our contribution page http://mahout.apache.org/developers/how-to-contribute.html or contact us via email at dev@mahout.apache.org. As the project moves towards a 1.0 release, the community will be focused on key algorithms that are proven to scale in production and have seen wide-spread adoption. [1] http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markuppathrev=1563661 [2] https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20
Apache Mahout 0.9 released
The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries focused primarily in the areas of collaborative filtering (recommenders), clustering and classification (known collectively as the 3Cs), as well as the necessary infrastructure to support those implementations including, but not limited to, math packages for statistics, linear algebra and others as well as Java primitive collections, local and distributed vector and matrix classes and a variety of integrative code to work with popular packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache Cassandra and much more. The 0.9 release is mainly a clean up release in preparation for an upcoming 1.0 release targeted for first half of 2014, but there are a few significant new features, which are highlighted below. To get started with Apache Mahout 0.9, download the release artifacts and signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central Maven repository. As with any release, we wish to thank all of the users and contributors to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for individual credits, as there are too many to list here. GETTING STARTED In the release package, the examples directory contains several working examples of the core functionality available in Mahout. These can be run via scripts in the examples/bin directory and will prompt you for more information to help you try things out. Most examples do not need a Hadoop cluster in order to run. RELEASE HIGHLIGHTS The highlights of the Apache Mahout 0.9 release include, but are not limited to the list below. For further information, see the included CHANGELOG[1] file. - MAHOUT-1245: A new and improved Mahout website based on Apache CMS - MAHOUT-1265: MultiLayer Perceptron (MLP) classifier This is an early implementation of MLP to solicit user feedback, needs to be integrated into Mahout’s processing pipeline to work with Mahout’s vectors. - MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra. See http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html - MAHOUT-1288: Recommenders as a Search. See https://github.com/pferrel/solr-recommender - MAHOUT-1300: Suport for easy functional Matrix views and derivatives - MAHOUT-1343: JSON output format for ClusterDumper - MAHOUT-1345: Enable randomised testing for all Mahout modules using Carrot RandomizedRunner. - MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 1-dimensional Clustering. See https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf for the details. - MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1 - Removed Deprecated algorithms as they have been either replaced by better performing algorithms or lacked user support and maintenance. - the usual bug fixes. See [2] for more information on the 0.9 release. A total of 113 separate JIRA issues were addressed in this release. The following algorithms that were marked deprecated in 0.8 have been removed in 0.9: - From Clustering: Switched LDA implementation from using Gibbs Sampling to Collapsed Variational Bayes (CVB) Meanshift MinHash - removed due to poor performance, lack of support and lack of usage - From Classification (both are sequential implementations) Winnow - lack of actual usage and support Perceptron - lack of actual usage and support - Collaborative Filtering SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and org.apache.mahout.cf.taste.impl.recommender.slopeone Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender - Mahout Math Hadoop entropy stuff in org.apache.mahout.math.stats.entropy CONTRIBUTING Mahout is always looking for contributions focused on the 3Cs. If you are interested in contributing, please see our contribution page http://mahout.apache.org/developers/how-to-contribute.html or contact us via email at dev@mahout.apache.org. As the project moves towards a 1.0 release, the community will be focused on key algorithms that are proven to scale in production and have seen wide-spread adoption. [1] http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markuppathrev=1563661 [2] https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22
Re: Mahout 0.9 Release Notes - First Draft
Below r the release notes, not sure where they should be going on the website. If someone could point me to a location I will go ahead and update the same. = The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries focused primarily in the areas of collaborative filtering (recommenders), clustering and classification (known collectively as the 3Cs), as well as the necessary infrastructure to support those implementations including, but not limited to, math packages for statistics, linear algebra and others as well as Java primitive collections, local and distributed vector and matrix classes and a variety of integrative code to work with popular packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache Cassandra and much more. The 0.9 release is mainly a clean up release in preparation for an upcoming 1.0 release targeted for first half of 2014, but there are a few significant new features, which are highlighted below. To get started with Apache Mahout 0.9, download the release artifacts and signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central Maven repository. As with any release, we wish to thank all of the users and contributors to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for individual credits, as there are too many to list here. GETTING STARTED In the release package, the examples directory contains several working examples of the core functionality available in Mahout. These can be run via scripts in the examples/bin directory and will prompt you for more information to help you try things out. Most examples do not need a Hadoop cluster in order to run. RELEASE HIGHLIGHTS The highlights of the Apache Mahout 0.9 release include, but are not limited to the list below. For further information, see the included CHANGELOG[1] file. - MAHOUT-1245: A new and improved Mahout website based on Apache CMS - MAHOUT-1265: MultiLayer Perceptron (MLP) classifier This is an early implementation of MLP to solicit user feedback, needs to be integrated into Mahout’s processing pipeline to work with Mahout’s vectors. - MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra. See http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html - MAHOUT-1288: Recommenders as a Search. See https://github.com/pferrel/solr-recommender - MAHOUT-1300: Suport for easy functional Matrix views and derivatives - MAHOUT-1343: JSON output format for ClusterDumper - MAHOUT-1345: Enable randomised testing for all Mahout modules using Carrot RandomizedRunner. - MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 1-dimensional Clustering. See https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf for the details. - MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1 - Removed Deprecated algorithms as they have been either replaced by better performing algorithms or lacked user support and maintenance. - the usual bug fixes. See [2] for more information on the 0.9 release. A total of 113 separate JIRA issues were addressed in this release. The following algorithms that were marked deprecated in 0.8 have been removed in 0.9: - From Clustering: Switched LDA implementation from using Gibbs Sampling to Collapsed Variational Bayes (CVB) Meanshift MinHash - removed due to poor performance, lack of support and lack of usage - From Classification (both are sequential implementations) Winnow - lack of actual usage and support Perceptron - lack of actual usage and support - Collaborative Filtering SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and org.apache.mahout.cf.taste.impl.recommender.slopeone Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender - Mahout Math Hadoop entropy stuff in org.apache.mahout.math.stats.entropy CONTRIBUTING Mahout is always looking for contributions focused on the 3Cs. If you are interested in contributing, please see our contribution page http://mahout.apache.org/developers/how-to-contribute.html or contact us via email at dev@mahout.apache.org. As the project moves towards a 1.0 release, the community will be focused on key algorithms that are proven to scale in production and have seen wide-spread adoption. [1] http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markuppathrev=1563661 [2] https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22 On Monday, February 17, 2014 3:27 PM, Ellen Friedman b.ellen.fried...@gmail.com wrote: Hi Suneel, Thanks for notes. I'm inquiring about status of the notes and update to the website to announce 0.9: Ted has reviewed the release notes - were you waiting
Re: Mahout 0.9 Release Notes - First Draft
On Tue, Feb 11, 2014 at 10:06 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Switched LDA implementation from using Dirtichlet to Collapsed Variational Bayes (CVB) This line should read: Switched LDA implementation from using Gibb's sampling to Collapsed Variational Bayes (CVB) Otherwise, it looks pretty good.
Re: Mahout 0.9 Release Notes - First Draft
Hi Suneel, Thanks for notes. I'm inquiring about status of the notes and update to the website to announce 0.9: Ted has reviewed the release notes - were you waiting for additional input or are they ready to go on the website? Are you the one who updates the site? I've been asked to write a short blog on the release but wanted to wait until the site is updated. Thanks much Ellen On Tue, Feb 11, 2014 at 10:06 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Here's a draft of the Release Notes for Mahout 0.9, Please review the same. -- The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries focused primarily in the areas of collaborative filtering (recommenders), clustering and classification (known collectively as the 3Cs), as well as the necessary infrastructure to support those implementations including, but not limited to, math packages for statistics, linear algebra and others as well as Java primitive collections, local and distributed vector and matrix classes and a variety of integrative code to work with popular packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache Cassandra and much more. The 0.9 release is mainly a clean up release in preparation for an upcoming 1.0 release targeted for first half of 2014, but there are a few significant new features, which are highlighted below. To get started with Apache Mahout 0.9, download the release artifacts and signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central Maven repository. As with any release, we wish to thank all of the users and contributors to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for individual credits, as there are too many to list here. GETTING STARTED In the release package, the examples directory contains several working examples of the core functionality available in Mahout. These can be run via scripts in the examples/bin directory and will prompt you for more information to help you try things out. Most examples do not need a Hadoop cluster in order to run. RELEASE HIGHLIGHTS The highlights of the Apache Mahout 0.9 release include, but are not limited to the list below. For further information, see the included CHANGELOG[1] file. - MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra. See http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html - MAHOUT-1288: Recommenders as a Search. See https://github.com/pferrel/solr-recommender - MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1 - MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 1-dimensional Clustering See https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdffor the details. - MAHOUT-1265: MultiLayer Perceptron (MLP) classifier This is an early implementation of MLP to solicit user feedback, needs to be integrated into Mahout's processing pipeline to work with Mahout's vectors. - Removed Deprecated algorithms as they have been either replaced by better performing algorithms or lacked user support and maintenance. - the usual bug fixes. See [2] for more information on the 0.9 release. A total of 113 separate JIRA issues were addressed in this release. The following algorithms that were marked deprecated in 0.8 have been removed in 0.9: - From Clustering: Switched LDA implementation from using Dirtichlet to Collapsed Variational Bayes (CVB) Meanshift MinHash - removed due to poor performance, lack of support and lack of usage - From Classification (both are sequential implementations) Winnow - lack of actual usage and support Perceptron - lack of actual usage and support - Collaborative Filtering SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and org.apache.mahout.cf.taste.impl.recommender.slopeone Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender - Mahout Math Hadoop entropy stuff in org.apache.mahout.math.stats.entropy CONTRIBUTING Mahout is always looking for contributions focused on the 3Cs. If you are interested in contributing, please see our contribution page http://mahout.apache.org/developers/how-to-contribute.html or contact us via email at dev@mahout.apache.org. As the project moves towards a 1.0 release, the community will be focused on key algorithms that are proven to scale in production and have seen wide-spread adoption. [1] http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markuppathrev=1563661 [2] https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22 On Monday, December 23, 2013 7:41 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: On Sun, Dec 22, 2013
Re: Mahout 0.9 Release Notes - First Draft
Here's a draft of the Release Notes for Mahout 0.9, Please review the same. -- The Apache Mahout PMC is pleased to announce the release of Mahout 0.9. Mahout's goal is to build scalable machine learning libraries focused primarily in the areas of collaborative filtering (recommenders), clustering and classification (known collectively as the 3Cs), as well as the necessary infrastructure to support those implementations including, but not limited to, math packages for statistics, linear algebra and others as well as Java primitive collections, local and distributed vector and matrix classes and a variety of integrative code to work with popular packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache Cassandra and much more. The 0.9 release is mainly a clean up release in preparation for an upcoming 1.0 release targeted for first half of 2014, but there are a few significant new features, which are highlighted below. To get started with Apache Mahout 0.9, download the release artifacts and signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central Maven repository. As with any release, we wish to thank all of the users and contributors to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for individual credits, as there are too many to list here. GETTING STARTED In the release package, the examples directory contains several working examples of the core functionality available in Mahout. These can be run via scripts in the examples/bin directory and will prompt you for more information to help you try things out. Most examples do not need a Hadoop cluster in order to run. RELEASE HIGHLIGHTS The highlights of the Apache Mahout 0.9 release include, but are not limited to the list below. For further information, see the included CHANGELOG[1] file. - MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra. See http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html - MAHOUT-1288: Recommenders as a Search. See https://github.com/pferrel/solr-recommender - MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1 - MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 1-dimensional Clustering See https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf for the details. - MAHOUT-1265: MultiLayer Perceptron (MLP) classifier This is an early implementation of MLP to solicit user feedback, needs to be integrated into Mahout’s processing pipeline to work with Mahout’s vectors. - Removed Deprecated algorithms as they have been either replaced by better performing algorithms or lacked user support and maintenance. - the usual bug fixes. See [2] for more information on the 0.9 release. A total of 113 separate JIRA issues were addressed in this release. The following algorithms that were marked deprecated in 0.8 have been removed in 0.9: - From Clustering: Switched LDA implementation from using Dirtichlet to Collapsed Variational Bayes (CVB) Meanshift MinHash - removed due to poor performance, lack of support and lack of usage - From Classification (both are sequential implementations) Winnow - lack of actual usage and support Perceptron - lack of actual usage and support - Collaborative Filtering SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and org.apache.mahout.cf.taste.impl.recommender.slopeone Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender - Mahout Math Hadoop entropy stuff in org.apache.mahout.math.stats.entropy CONTRIBUTING Mahout is always looking for contributions focused on the 3Cs. If you are interested in contributing, please see our contribution page http://mahout.apache.org/developers/how-to-contribute.html or contact us via email at dev@mahout.apache.org. As the project moves towards a 1.0 release, the community will be focused on key algorithms that are proven to scale in production and have seen wide-spread adoption. [1] http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markuppathrev=1563661 [2] https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22 On Monday, December 23, 2013 7:41 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: On Sun, Dec 22, 2013 at 11:21 AM, Sebastian Schelter ssc.o...@googlemail.com wrote: - Mahout Math Lanczos in favour of SSVD IIRC, we agreed to not remove Lanczos, although it was initially deprecated. We should undeprecate it. Some folks like Lanczos in Mahout (for reasons not really clear to me, aside from accuracy when computing svd of a random noise, there are actually 0 reasons to use Lanczos instead). I agree we don't necessarily want to cull it out -- but IMO there should be a clear steer posted in favor of SSVD
Re: Mahout 0.9 Release
Mahout 0.9 has been pushed to the mirrors and is available for download at http://www.apache.org/dyn/closer.cgi/mahout/ On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The release has passed with the required votes from PMC, will be pushing 0.9 to the mirrors and updating the release notes over the next day or two. On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote: +1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
Yes, thanks Suneel! On Feb 2, 2014, at 1:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Mahout 0.9 has been pushed to the mirrors and is available for download at http://www.apache.org/dyn/closer.cgi/mahout/ On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The release has passed with the required votes from PMC, will be pushing 0.9 to the mirrors and updating the release notes over the next day or two. On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote: +1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
I just checked and the release has propagated to French mirrors. On Sun, Feb 2, 2014 at 1:22 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Mahout 0.9 has been pushed to the mirrors and is available for download at http://www.apache.org/dyn/closer.cgi/mahout/ On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The release has passed with the required votes from PMC, will be pushing 0.9 to the mirrors and updating the release notes over the next day or two.
Re: Mahout 0.9 Release
The release has passed with the required votes from PMC, will be pushing 0.9 to the mirrors and updating the release notes over the next day or two. On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote: +1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
+1, all steps passed On Thu, Jan 30, 2014 at 9:16 AM, Stevo Slavić ssla...@gmail.com wrote: +1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: Mahout 0.9 Release
Voting for this closes tomorrow, thus far there's been only a single +1 from PMC. I'll go ahead and release the artifacts tomorrow regardless of whether we succeed/fail to muster the minimum 3 +1 votes. Thanks On Thursday, January 30, 2014 9:45 AM, Sergey Svinarchuk ssvinarc...@hortonworks.com wrote: +1, all steps passed On Thu, Jan 30, 2014 at 9:16 AM, Stevo Slavić ssla...@gmail.com wrote: +1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: Mahout 0.9 Release
+1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
+1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
+1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
+1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Mahout 0.9 Release
Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
RE: Mahout 0.9 Release
a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
I'll be rolling back the 0.9 Release today that's presently in staging in light of the issues that have been reported in the last 2 days and need to be fixed as part of the Release. Please hold off from committing any new code to trunk meanwhile. Thanks. On Friday, January 24, 2014 7:36 PM, Ted Dunning ted.dunn...@gmail.com wrote: My schedule has opened up a bit and I can review as well. On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Rolled back trunk to 0.9-SNAPSHOT, please go ahead and commit any changes. On Saturday, January 25, 2014 4:19 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: I'll be rolling back the 0.9 Release today that's presently in staging in light of the issues that have been reported in the last 2 days and need to be fixed as part of the Release. Please hold off from committing any new code to trunk meanwhile. Thanks. On Friday, January 24, 2014 7:36 PM, Ted Dunning ted.dunn...@gmail.com wrote: My schedule has opened up a bit and I can review as well. On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
My schedule has opened up a bit and I can review as well. On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
+1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com
Re: MAHOUT 0.9 Release - New URL
I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.orgwrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
RE: MAHOUT 0.9 Release - New URL
a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.orgwrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
I did a) b) c) and d) without noting any problem so far. +1 from me. --sebastian On 01/22/2014 11:55 PM, Suneel Marthi wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
Same here. I did a), b), c) and d) too and all tests pass. Here's my +1, if my vote counts. On Wednesday, January 22, 2014 7:11 PM, Sebastian Schelter s...@apache.org wrote: I did a) b) c) and d) without noting any problem so far. +1 from me. --sebastian On 01/22/2014 11:55 PM, Suneel Marthi wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
Likewise, a) through d) work on an Amazon AMI and Ubuntu 12.04. +1 On Wed, Jan 22, 2014 at 6:38 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Same here. I did a), b), c) and d) too and all tests pass. Here's my +1, if my vote counts. On Wednesday, January 22, 2014 7:11 PM, Sebastian Schelter s...@apache.org wrote: I did a) b) c) and d) without noting any problem so far. +1 from me. --sebastian On 01/22/2014 11:55 PM, Suneel Marthi wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u...@mahout.apache.org; dev@mahout.apache.org Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u...@mahout.apache.org; dev@mahout.apache.org Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
RE: MAHOUT 0.9 Release - New URL
from the asf-email-examples.sh script: # You will need to download or otherwise obtain some or all of the Amazon ASF Em ail Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script. # To obtain a full copy you will need to launch an EC2 instance and mount the da taset to download it, otherwise you can get a sample of it at # http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout It looks like the: http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout link is down. Is there somewhere else that we can get a subset of the ASF emails? Date: Tue, 21 Jan 2014 09:48:06 -0800 Subject: Re: MAHOUT 0.9 Release - New URL From: andrew.mussel...@gmail.com To: dev@mahout.apache.org Sure thing; continuing to smoke test the other examples tonight On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0 : [distance-squared=0.9509142993214911]: /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor = [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048, 4419:0.076, 4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048, 7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077, 7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081, 10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087, 12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061, 14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124, 20346:0.159, 21233:0.075, 23657:0.089, 23939:0.078, 23974:0.105, 23998:0.146, 24962:0.122, 25003:0.093, 25084:0.151, 25128:0.052, 29143:0.095, 30459:0.046, 30806:0.075, 31559:0.046, 31727:0.104, 31895:0.105, 31900:0.153, 32149:0.079, 32993:0.069, 33112:0.177, 33208:0.101, 33351:0.089, 33533:0.079, 33638:0.042, 35795:0.066, 36189:0.078, 36491
Re: MAHOUT 0.9 Release - New URL
Thanks Andrew for reporting that. I rolled back the release to fix this and few other issues. We have removed asf-examples*.sh from trunk as the sample file at the url mentioned in ur email is not available. This is something we need to fix and restore in 1.0. On Tuesday, January 21, 2014 3:21 PM, Andrew Palumbo ap@outlook.com wrote: from the asf-email-examples.sh script: # You will need to download or otherwise obtain some or all of the Amazon ASF Em ail Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script. # To obtain a full copy you will need to launch an EC2 instance and mount the da taset to download it, otherwise you can get a sample of it at # http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout It looks like the: http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout link is down. Is there somewhere else that we can get a subset of the ASF emails? Date: Tue, 21 Jan 2014 09:48:06 -0800 Subject: Re: MAHOUT 0.9 Release - New URL From: andrew.mussel...@gmail.com To: dev@mahout.apache.org Sure thing; continuing to smoke test the other examples tonight On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0 : [distance-squared=0.9509142993214911]: /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor = [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048, 4419:0.076, 4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048, 7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077, 7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081, 10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087, 12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061, 14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124, 20346
Re: MAHOUT 0.9 Release - New URL
75% Reliability70.6238% Reliability (standard deviation)0.2187 Log-likelihoodmean :-1.1182 25%-ile :-1.6911 75%-ile :-0.0803 Jan 21, 2014 9:46:39 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 10783 ms (Minutes: 0.17971) On Tue, Jan 21, 2014 at 1:08 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew for reporting that. I rolled back the release to fix this and few other issues. We have removed asf-examples*.sh from trunk as the sample file at the url mentioned in ur email is not available. This is something we need to fix and restore in 1.0. On Tuesday, January 21, 2014 3:21 PM, Andrew Palumbo ap@outlook.com wrote: from the asf-email-examples.sh script: # You will need to download or otherwise obtain some or all of the Amazon ASF Em ail Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script. # To obtain a full copy you will need to launch an EC2 instance and mount the da taset to download it, otherwise you can get a sample of it at # http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout It looks like the: http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout link is down. Is there somewhere else that we can get a subset of the ASF emails? Date: Tue, 21 Jan 2014 09:48:06 -0800 Subject: Re: MAHOUT 0.9 Release - New URL From: andrew.mussel...@gmail.com To: dev@mahout.apache.org Sure thing; continuing to smoke test the other examples tonight On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0
Re: MAHOUT 0.9 Release - New URL
Hmmm... that's an issue. Since both Dirichlet and Meanshift clustering have been removed from 0.9, cluster-syntheticcontrol.sh options 4,5 are not gonna work and should have been removed for 0.9. To PMC, - rollback the release, fix this issue (and other patches that were submitted in the last few days) and put out another release ? On Monday, January 20, 2014 12:33 AM, Andrew Palumbo ap@outlook.com wrote: I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u
Re: MAHOUT 0.9 Release - New URL
This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging. Thanks for reporting this Andrew P. On Monday, January 20, 2014 12:34 AM, Andrew Palumbo ap@outlook.com wrote: I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u...@mahout.apache.org; dev@mahout.apache.org Third time's a Charm!!! Here's the new URL for Mahout 0.9
Re: MAHOUT 0.9 Release - New URL
: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u...@mahout.apache.org; dev@mahout.apache.org Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
, 2014 at 6:00 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging. Thanks for reporting this Andrew P. On Monday, January 20, 2014 12:34 AM, Andrew Palumbo ap@outlook.com wrote: I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9
Re: MAHOUT 0.9 Release - New URL
Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
-1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.aschttps://people.apache.org/keys/committer/pwendell.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
OK, running in MR mode now. On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
When I run in MR mode I get the same problem. See http://pastebin.com/TXJ5mQmt On Sun, Jan 19, 2014 at 5:31 PM, Frank Scholten fr...@frankscholten.nlwrote: OK, running in MR mode now. On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
It works when both MAHOUT_LOCAL=true and '-xm sequential' option are set. Guess will have to cut a release again with '-xm sequential' option set. On Sunday, January 19, 2014 11:31 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
The reason u r seeing the error is because there are were no sequence files in HDFS in MR mode to begin with = hence no term vectors generated = and hence no vectors to cluster. MR mode: 1. Set HADOOP_HOME 2. unset MAHOUT_LOCAL 3. clean up ur local /tmp/mahout-work-x directory 4. run ./examples/bin/cluster-reuters.sh = option 4 Sequential Mode: - 1. set MAHOUT_LOCAL=true 2. Add -xm sequential flag to cluster-reuters.sh script 3. run ./examples/bin/cluster-reuters.sh = option 4 On Sunday, January 19, 2014 12:22 PM, Frank Scholten fr...@frankscholten.nl wrote: When I run in MR mode I get the same problem. See http://pastebin.com/TXJ5mQmt On Sun, Jan 19, 2014 at 5:31 PM, Frank Scholten fr...@frankscholten.nl wrote: OK, running in MR mode now. On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run
Re: MAHOUT 0.9 Release - New URL
I'll try to test out soon
Re: MAHOUT 0.9 Release - New URL
+1 (binding) On Sun, Jan 19, 2014 at 7:49 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: I'll try to test out soon
Re: MAHOUT 0.9 Release - New URL
Stevo, could u test streaming kmeans? Sent from my iPhone On Jan 19, 2014, at 8:10 PM, Stevo Slavić ssla...@gmail.com wrote: +1 (binding) On Sun, Jan 19, 2014 at 7:49 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: I'll try to test out soon
RE: MAHOUT 0.9 Release - New URL
I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u...@mahout.apache.org; dev@mahout.apache.org Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through
Re: Mahout 0.9 Release - Call for Volunteers
Sorry, I've been traveling. Looking at it this AM -Grant On Jan 18, 2014, at 3:07 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks to Andrew, Sotiris and others for helping test the release. Need votes of fellow Mahout Committers and PMC members for the release to be finalized. On Friday, January 17, 2014 5:27 AM, Andrew Psaltis andrew.psal...@webtrends.com wrote: Suneel, I removed all Lucene entries from my local maven repo and indeed the tests that were failing due to that succeed. Now I just get the good ole: Unable to load realm mapping info from SCDynamicStore and the subsequently expected KrbException Thanks, Andrew From: Suneel Marthi suneel_mar...@yahoo.com Reply-To: Suneel Marthi suneel_mar...@yahoo.com Date: Thursday, January 16, 2014 11:24 AM To: dev@mahout.apache.org dev@mahout.apache.org, Andrew Psaltis andrew.psal...@webtrends.com, i...@eprice.gr i...@eprice.gr Cc: u...@mahout.apache.org u...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers This is not a maven issue. Andrew, r u on Mac OS 10.8? If so u would be seeing these errors. These errors being spewed by Carrot RandomizedRunner and per the conversation in Mahout-1345 this happens on Mac OS X due to an issue in Lucene 4.3.1 and below that was fixed in later Lucene releases. Do u have an older version of Lucene ( 4.4) somewhere on ur system, that could be causing this? Mahout 0.9 Release Lucene version is 4.6.0. On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.gr wrote: Hi Andrew, Could please check if you have the latest maven? http://maven.apache.org/download.cgi Or what version of Maven you are using, I believe this is the issue. Regards Sotiris -Original Message- From: Andrew Psaltis [mailto:andrew.psal...@webtrends.com] Sent: Thursday, January 16, 2014 8:01 PM To: dev@mahout.apache.org; i...@eprice.gr Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Tests for Mahout Core fail on OS X 10.8.5 (12F45) java version 1.7.0_17 Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM) 64-Bit Server VM (build 23.7-b01, mixed mode) Attached is the output from running: mvn clean test I have not started to look at the test cases that failed in a debugger, but would be glad to. Thanks, Andrew On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.com wrote: Tested on my mac and a server with ubuntu 12.04 LTS. All tests passed. [INFO] --- - [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [1.964s] [INFO] Apache Mahout . SUCCESS [0.400s] [INFO] Mahout Math ... SUCCESS [1:53.067s] [INFO] Mahout Core ... SUCCESS [9:09.716s] [INFO] Mahout Integration SUCCESS [1:04.662s] [INFO] Mahout Examples ... SUCCESS [3.331s] [INFO] Mahout Release Package SUCCESS [0.000s] [INFO] Mahout Math/Scala wrappers SUCCESS [11.356s] [INFO] --- - [INFO] BUILD SUCCESS [INFO] --- - Regards, Yexi 2014/1/16 Sotiris Salloumis i...@eprice.gr From unix you should try the following with wget or curl, make sure during copy the email client will not wrap it up http://repository.apache.org/content/repositories/orgapachemahout-1002/ org /a pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar. gz Above link via Google url shortener for easy copy/paste http://goo.gl/gX6xGz Regards Sotiris -Original Message- From: Yexi Jiang [mailto:yexiji...@gmail.com] Sent: Thursday, January 16, 2014 5:59 PM To: mahout Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Got the same error. Regards, Yexi 2014/1/16 Chameera Wijebandara chameerawijeband...@gmail.com Hi Suneel, Still it getting 404 error. Thanks, Chameera On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout- 100 1/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under
Re: MAHOUT 0.9 Release - New URL
Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: Mahout 0.9 Release - Call for Volunteers
Suneel, I removed all Lucene entries from my local maven repo and indeed the tests that were failing due to that succeed. Now I just get the good ole: Unable to load realm mapping info from SCDynamicStore and the subsequently expected KrbException Thanks, Andrew From: Suneel Marthi suneel_mar...@yahoo.commailto:suneel_mar...@yahoo.com Reply-To: Suneel Marthi suneel_mar...@yahoo.commailto:suneel_mar...@yahoo.com Date: Thursday, January 16, 2014 11:24 AM To: dev@mahout.apache.orgmailto:dev@mahout.apache.org dev@mahout.apache.orgmailto:dev@mahout.apache.org, Andrew Psaltis andrew.psal...@webtrends.commailto:andrew.psal...@webtrends.com, i...@eprice.grmailto:i...@eprice.gr i...@eprice.grmailto:i...@eprice.gr Cc: u...@mahout.apache.orgmailto:u...@mahout.apache.org u...@mahout.apache.orgmailto:u...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers This is not a maven issue. Andrew, r u on Mac OS 10.8? If so u would be seeing these errors. These errors being spewed by Carrot RandomizedRunner and per the conversation in Mahout-1345 this happens on Mac OS X due to an issue in Lucene 4.3.1 and below that was fixed in later Lucene releases. Do u have an older version of Lucene ( 4.4) somewhere on ur system, that could be causing this? Mahout 0.9 Release Lucene version is 4.6.0. On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.grmailto:i...@eprice.gr wrote: Hi Andrew, Could please check if you have the latest maven? http://maven.apache.org/download.cgi Or what version of Maven you are using, I believe this is the issue. Regards Sotiris -Original Message- From: Andrew Psaltis [mailto:andrew.psal...@webtrends.commailto:andrew.psal...@webtrends.com] Sent: Thursday, January 16, 2014 8:01 PM To: dev@mahout.apache.orgmailto:dev@mahout.apache.org; i...@eprice.grmailto:i...@eprice.gr Cc: Suneel Marthi; u...@mahout.apache.orgmailto:u...@mahout.apache.org; priv...@mahout.apache.orgmailto:priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Tests for Mahout Core fail on OS X 10.8.5 (12F45) java version 1.7.0_17 Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM) 64-Bit Server VM (build 23.7-b01, mixed mode) Attached is the output from running: mvn clean test I have not started to look at the test cases that failed in a debugger, but would be glad to. Thanks, Andrew On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.commailto:yexiji...@gmail.com wrote: Tested on my mac and a server with ubuntu 12.04 LTS. All tests passed. [INFO] --- - [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [1.964s] [INFO] Apache Mahout . SUCCESS [0.400s] [INFO] Mahout Math ... SUCCESS [1:53.067s] [INFO] Mahout Core ... SUCCESS [9:09.716s] [INFO] Mahout Integration SUCCESS [1:04.662s] [INFO] Mahout Examples ... SUCCESS [3.331s] [INFO] Mahout Release Package SUCCESS [0.000s] [INFO] Mahout Math/Scala wrappers SUCCESS [11.356s] [INFO] --- - [INFO] BUILD SUCCESS [INFO] --- - Regards, Yexi 2014/1/16 Sotiris Salloumis i...@eprice.grmailto:i...@eprice.gr From unix you should try the following with wget or curl, make sure during copy the email client will not wrap it up http://repository.apache.org/content/repositories/orgapachemahout-1002/ org /a pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar. gz Above link via Google url shortener for easy copy/paste http://goo.gl/gX6xGz Regards Sotiris -Original Message- From: Yexi Jiang [mailto:yexiji...@gmail.commailto:yexiji...@gmail.com] Sent: Thursday, January 16, 2014 5:59 PM To: mahout Cc: Suneel Marthi; u...@mahout.apache.orgmailto:u...@mahout.apache.org; priv...@mahout.apache.orgmailto:priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Got the same error. Regards, Yexi 2014/1/16 Chameera Wijebandara chameerawijeband...@gmail.commailto:chameerawijeband...@gmail.com Hi Suneel, Still it getting 404 error. Thanks, Chameera On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi suneel_mar...@yahoo.commailto:suneel_mar...@yahoo.com wrote: Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout- 100 1/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c
Re: Mahout 0.9 Release - Call for Volunteers
Hi, Windows 7 Home Premium Service Pack 1 64-bit java version 1.7.0_45 Java(TM) SE Runtime Environment (build 1.7.0_45-b18) Java HotSpot(TM) 64-Bit Server VM (build 24.45-b08, mixed mode) I have download the zip file. https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.zip Unpack it - success mvn compile - success mvn install - build was success some test are failed. I have attached the result of the test run with this mail. Is this the way to test and bubit the test results. please update and correct me if i made something wrong. Thanks, Chameera. On Fri, Jan 17, 2014 at 3:57 PM, Andrew Psaltis andrew.psal...@webtrends.com wrote: Suneel, I removed all Lucene entries from my local maven repo and indeed the tests that were failing due to that succeed. Now I just get the good ole: Unable to load realm mapping info from SCDynamicStore and the subsequently expected KrbException Thanks, Andrew From: Suneel Marthi suneel_mar...@yahoo.commailto: suneel_mar...@yahoo.com Reply-To: Suneel Marthi suneel_mar...@yahoo.commailto: suneel_mar...@yahoo.com Date: Thursday, January 16, 2014 11:24 AM To: dev@mahout.apache.orgmailto:dev@mahout.apache.org dev@mahout.apache.orgmailto:dev@mahout.apache.org, Andrew Psaltis andrew.psal...@webtrends.commailto:andrew.psal...@webtrends.com, i...@eprice.grmailto:i...@eprice.gr i...@eprice.grmailto: i...@eprice.gr Cc: u...@mahout.apache.orgmailto:u...@mahout.apache.org u...@mahout.apache.orgmailto:u...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers This is not a maven issue. Andrew, r u on Mac OS 10.8? If so u would be seeing these errors. These errors being spewed by Carrot RandomizedRunner and per the conversation in Mahout-1345 this happens on Mac OS X due to an issue in Lucene 4.3.1 and below that was fixed in later Lucene releases. Do u have an older version of Lucene ( 4.4) somewhere on ur system, that could be causing this? Mahout 0.9 Release Lucene version is 4.6.0. On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.gr mailto:i...@eprice.gr wrote: Hi Andrew, Could please check if you have the latest maven? http://maven.apache.org/download.cgi Or what version of Maven you are using, I believe this is the issue. Regards Sotiris -Original Message- From: Andrew Psaltis [mailto:andrew.psal...@webtrends.commailto: andrew.psal...@webtrends.com] Sent: Thursday, January 16, 2014 8:01 PM To: dev@mahout.apache.orgmailto:dev@mahout.apache.org; i...@eprice.gr mailto:i...@eprice.gr Cc: Suneel Marthi; u...@mahout.apache.orgmailto:u...@mahout.apache.org; priv...@mahout.apache.orgmailto:priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Tests for Mahout Core fail on OS X 10.8.5 (12F45) java version 1.7.0_17 Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM) 64-Bit Server VM (build 23.7-b01, mixed mode) Attached is the output from running: mvn clean test I have not started to look at the test cases that failed in a debugger, but would be glad to. Thanks, Andrew On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.commailto: yexiji...@gmail.com wrote: Tested on my mac and a server with ubuntu 12.04 LTS. All tests passed. [INFO] --- - [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [1.964s] [INFO] Apache Mahout . SUCCESS [0.400s] [INFO] Mahout Math ... SUCCESS [1:53.067s] [INFO] Mahout Core ... SUCCESS [9:09.716s] [INFO] Mahout Integration SUCCESS [1:04.662s] [INFO] Mahout Examples ... SUCCESS [3.331s] [INFO] Mahout Release Package SUCCESS [0.000s] [INFO] Mahout Math/Scala wrappers SUCCESS [11.356s] [INFO] --- - [INFO] BUILD SUCCESS [INFO] --- - Regards, Yexi 2014/1/16 Sotiris Salloumis i...@eprice.grmailto:i...@eprice.gr From unix you should try the following with wget or curl, make sure during copy the email client will not wrap it up http://repository.apache.org/content/repositories/orgapachemahout-1002/ org /a pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar. gz Above link via Google url shortener for easy copy/paste http://goo.gl/gX6xGz Regards Sotiris -Original Message- From: Yexi Jiang [mailto:yexiji...@gmail.commailto:yexiji...@gmail.com ] Sent: Thursday, January 16, 2014 5:59 PM To: mahout Cc
Re: Mahout 0.9 Release - Call for Volunteers
I think all tests have error with setting some permission to directory. And mahout 0.9 not supported Windows. On Fri, Jan 17, 2014 at 2:04 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Hi, Windows 7 Home Premium Service Pack 1 64-bit java version 1.7.0_45 Java(TM) SE Runtime Environment (build 1.7.0_45-b18) Java HotSpot(TM) 64-Bit Server VM (build 24.45-b08, mixed mode) I have download the zip file. https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.zip Unpack it - success mvn compile - success mvn install - build was success some test are failed. I have attached the result of the test run with this mail. Is this the way to test and bubit the test results. please update and correct me if i made something wrong. Thanks, Chameera. On Fri, Jan 17, 2014 at 3:57 PM, Andrew Psaltis andrew.psal...@webtrends.com wrote: Suneel, I removed all Lucene entries from my local maven repo and indeed the tests that were failing due to that succeed. Now I just get the good ole: Unable to load realm mapping info from SCDynamicStore and the subsequently expected KrbException Thanks, Andrew From: Suneel Marthi suneel_mar...@yahoo.commailto: suneel_mar...@yahoo.com Reply-To: Suneel Marthi suneel_mar...@yahoo.commailto: suneel_mar...@yahoo.com Date: Thursday, January 16, 2014 11:24 AM To: dev@mahout.apache.orgmailto:dev@mahout.apache.org dev@mahout.apache.orgmailto:dev@mahout.apache.org, Andrew Psaltis andrew.psal...@webtrends.commailto:andrew.psal...@webtrends.com, i...@eprice.grmailto:i...@eprice.gr i...@eprice.grmailto: i...@eprice.gr Cc: u...@mahout.apache.orgmailto:u...@mahout.apache.org u...@mahout.apache.orgmailto:u...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers This is not a maven issue. Andrew, r u on Mac OS 10.8? If so u would be seeing these errors. These errors being spewed by Carrot RandomizedRunner and per the conversation in Mahout-1345 this happens on Mac OS X due to an issue in Lucene 4.3.1 and below that was fixed in later Lucene releases. Do u have an older version of Lucene ( 4.4) somewhere on ur system, that could be causing this? Mahout 0.9 Release Lucene version is 4.6.0. On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.gr mailto:i...@eprice.gr wrote: Hi Andrew, Could please check if you have the latest maven? http://maven.apache.org/download.cgi Or what version of Maven you are using, I believe this is the issue. Regards Sotiris -Original Message- From: Andrew Psaltis [mailto:andrew.psal...@webtrends.commailto: andrew.psal...@webtrends.com] Sent: Thursday, January 16, 2014 8:01 PM To: dev@mahout.apache.orgmailto:dev@mahout.apache.org; i...@eprice.gr mailto:i...@eprice.gr Cc: Suneel Marthi; u...@mahout.apache.orgmailto:u...@mahout.apache.org; priv...@mahout.apache.orgmailto:priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Tests for Mahout Core fail on OS X 10.8.5 (12F45) java version 1.7.0_17 Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM) 64-Bit Server VM (build 23.7-b01, mixed mode) Attached is the output from running: mvn clean test I have not started to look at the test cases that failed in a debugger, but would be glad to. Thanks, Andrew On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.commailto: yexiji...@gmail.com wrote: Tested on my mac and a server with ubuntu 12.04 LTS. All tests passed. [INFO] --- - [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [1.964s] [INFO] Apache Mahout . SUCCESS [0.400s] [INFO] Mahout Math ... SUCCESS [1:53.067s] [INFO] Mahout Core ... SUCCESS [9:09.716s] [INFO] Mahout Integration SUCCESS [1:04.662s] [INFO] Mahout Examples ... SUCCESS [3.331s] [INFO] Mahout Release Package SUCCESS [0.000s] [INFO] Mahout Math/Scala wrappers SUCCESS [11.356s] [INFO] --- - [INFO] BUILD SUCCESS [INFO] --- - Regards, Yexi 2014/1/16 Sotiris Salloumis i...@eprice.grmailto:i...@eprice.gr From unix you should try the following with wget or curl, make sure during copy the email client will not wrap it up http://repository.apache.org/content/repositories/orgapachemahout-1002/ org /a pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar. gz Above link via Google url shortener for easy
Re: MAHOUT 0.9 Release - New URL
On Thu, Jan 16, 2014 at 7:35 AM, Sotiris Salloumis i...@eprice.gr wrote: c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] ?! Was that seconds? Or really milliseconds?
RE: MAHOUT 0.9 Release - New URL
Sorry my mistake milliseconds was the last test … below the full results ~/mahout/apache-maven-3.1.1/bin/mvn -DskipTests clean install [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 52.312s [INFO] Finished at: Sat Jan 18 02:04:29 CET 2014 [INFO] Final Memory: 46M/305M [INFO] ~/mahout/apache-maven-3.1.1/bin/mvn clean test [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [1.166s] [INFO] Apache Mahout . SUCCESS [0.264s] [INFO] Mahout Math ... SUCCESS [58.639s] [INFO] Mahout Core ... SUCCESS [4:01.640s] [INFO] Mahout Integration SUCCESS [21.481s] [INFO] Mahout Examples ... SUCCESS [1.980s] [INFO] Mahout Release Package SUCCESS [0.003s] [INFO] Mahout Math/Scala wrappers SUCCESS [14.149s] [INFO] [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 5:39.563s [INFO] Finished at: Sat Jan 18 02:10:53 CET 2014 [INFO] Final Memory: 51M/1068M [INFO] From: Ted Dunning [mailto:ted.dunn...@gmail.com] Sent: Saturday, January 18, 2014 2:50 AM To: Mahout Dev List; Sotiris Salloumis Cc: Suneel Marthi; u...@mahout.apache.org Subject: Re: MAHOUT 0.9 Release - New URL On Thu, Jan 16, 2014 at 7:35 AM, Sotiris Salloumis i...@eprice.gr mailto:i...@eprice.gr wrote: c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] ?! Was that seconds? Or really milliseconds?
Re: Mahout 0.9 Release Candidate - VOTE
It would be .tar.gz file and you would find it under mahout/distribution. On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Ok let's see after fixed the URL Thank you Tharindu for your cooperation. :) On Thu, Jan 16, 2014 at 9:55 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Thu, Jan 16, 2014 at 9:31 AM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Tharindu, There is no .tar file in the given link (befor give the 404 error). Well, what about a .zip file? As I can remember, it was a .zip file. Thanks, Chameera On Thu, Jan 16, 2014 at 9:27 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Wed, Jan 15, 2014 at 6:48 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Hi Tharindu, Chameera, sorry for the late reply. I'm having issues with my personal computer these days :) Still I could not able to download the artifacts. Could you please hep me to test the Release Have you figured out a way to proceed? I think the given URL is down as Suneel mentioned. Once the fixed release candidate is posted, you can download the source tar and check it as any other Mahout release. Regards, Thanks Chameera On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Tharindu. On Tuesday, January 14, 2014 11:30 PM, Tharindu Rusira tharindurus...@gmail.com wrote: Hi Suneel, I tested the installation process with unit tests and everything went well. (Ubuntu 12.10 32bit, Java 1.7.0_40). Please note that I did not clean my local maven repository before the installation so I assumed maven dependencies are all available . On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the link to Release artifacts for Mahout 0.9: https://repository.apache.org/content/repositories/orgapachemahout-1000/ For those volunteering to test this, some of the stuff to look out for: a) Verify u can unpack the Release tar. Verified b) Verify u are able to compile the distribution Verified [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [4.380s] [INFO] Apache Mahout . SUCCESS [0.965s] [INFO] Mahout Math ... SUCCESS [2:07.687s] [INFO] Mahout Core ... SUCCESS [10:34.651s] [INFO] Mahout Integration SUCCESS [1:03.250s] [INFO] Mahout Examples ... SUCCESS [16.607s] [INFO] Mahout Release Package SUCCESS [0.469s] [INFO] Mahout Math/Scala wrappers SUCCESS [35.562s] [INFO] [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 14:44.158s [INFO] Finished at: Wed Jan 15 09:06:26 IST 2014 [INFO] Final Memory: 41M/252M [INFO] c) Run through the unit tests: mvn clean test Verified. d) Run the example scripts under $MAHOUT_HOME/examples/bin. I'm yet to test the example scripts and I will give an update soon. Regards, See http://incubator.apache.org/guides/releasemanagement.html#check-list for more details. On Tuesday, January 14, 2014 8:26 AM, spa...@gmail.com spa...@gmail.com wrote: I want to volunteer to test this release. What is the procedure/steps to get started and what pre-reqs I need to have? Cheers .S On Tue, Jan 14, 2014 at 6:52 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Calling for volunteers to test this Release. On Friday, January 10, 2014 7:39 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Pushed the Mahout 0.9 Release candidate. See https://repository.apache.org/content/repositories/orgapachemahout-1000/ This is a call for Vote. -- http://spawgi.wordpress.com We can do it and do it better. -- M.P. Tharindu Rusira Kumara Department of Computer
Re: Mahout 0.9 Release Candidate - VOTE
Thanks Suneel. On Thu, Jan 16, 2014 at 7:04 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: It would be .tar.gz file and you would find it under mahout/distribution. On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Ok let's see after fixed the URL Thank you Tharindu for your cooperation. :) On Thu, Jan 16, 2014 at 9:55 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Thu, Jan 16, 2014 at 9:31 AM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Tharindu, There is no .tar file in the given link (befor give the 404 error). Well, what about a .zip file? As I can remember, it was a .zip file. Thanks, Chameera On Thu, Jan 16, 2014 at 9:27 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Wed, Jan 15, 2014 at 6:48 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Hi Tharindu, Chameera, sorry for the late reply. I'm having issues with my personal computer these days :) Still I could not able to download the artifacts. Could you please hep me to test the Release Have you figured out a way to proceed? I think the given URL is down as Suneel mentioned. Once the fixed release candidate is posted, you can download the source tar and check it as any other Mahout release. Regards, Thanks Chameera On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Tharindu. On Tuesday, January 14, 2014 11:30 PM, Tharindu Rusira tharindurus...@gmail.com wrote: Hi Suneel, I tested the installation process with unit tests and everything went well. (Ubuntu 12.10 32bit, Java 1.7.0_40). Please note that I did not clean my local maven repository before the installation so I assumed maven dependencies are all available . On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the link to Release artifacts for Mahout 0.9: https://repository.apache.org/content/repositories/orgapachemahout-1000/ For those volunteering to test this, some of the stuff to look out for: a) Verify u can unpack the Release tar. Verified b) Verify u are able to compile the distribution Verified [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [4.380s] [INFO] Apache Mahout . SUCCESS [0.965s] [INFO] Mahout Math ... SUCCESS [2:07.687s] [INFO] Mahout Core ... SUCCESS [10:34.651s] [INFO] Mahout Integration SUCCESS [1:03.250s] [INFO] Mahout Examples ... SUCCESS [16.607s] [INFO] Mahout Release Package SUCCESS [0.469s] [INFO] Mahout Math/Scala wrappers SUCCESS [35.562s] [INFO] [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 14:44.158s [INFO] Finished at: Wed Jan 15 09:06:26 IST 2014 [INFO] Final Memory: 41M/252M [INFO] c) Run through the unit tests: mvn clean test Verified. d) Run the example scripts under $MAHOUT_HOME/examples/bin. I'm yet to test the example scripts and I will give an update soon. Regards, See http://incubator.apache.org/guides/releasemanagement.html#check-list for more details. On Tuesday, January 14, 2014 8:26 AM, spa...@gmail.com spa...@gmail.com wrote: I want to volunteer to test this release. What is the procedure/steps to get started and what pre-reqs I need to have? Cheers .S On Tue, Jan 14, 2014 at 6:52 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Calling for volunteers to test this Release. On Friday, January 10, 2014 7:39 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Pushed the Mahout 0.9 Release candidate. See https
Mahout 0.9 Release - Call for Volunteers
Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1001/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need atleast 3 +1 votes from this group for the Release to pass. Thanks and Regards.
Re: Mahout 0.9 Release Candidate - VOTE
The link - https://repository.apache.org/content/repositories/orgapachemahout-1000/ still returns a 404. Is the latest release candidate made available for 0.9? On Thu, Jan 16, 2014 at 7:06 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Thanks Suneel. On Thu, Jan 16, 2014 at 7:04 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: It would be .tar.gz file and you would find it under mahout/distribution. On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Ok let's see after fixed the URL Thank you Tharindu for your cooperation. :) On Thu, Jan 16, 2014 at 9:55 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Thu, Jan 16, 2014 at 9:31 AM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Tharindu, There is no .tar file in the given link (befor give the 404 error). Well, what about a .zip file? As I can remember, it was a .zip file. Thanks, Chameera On Thu, Jan 16, 2014 at 9:27 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Wed, Jan 15, 2014 at 6:48 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Hi Tharindu, Chameera, sorry for the late reply. I'm having issues with my personal computer these days :) Still I could not able to download the artifacts. Could you please hep me to test the Release Have you figured out a way to proceed? I think the given URL is down as Suneel mentioned. Once the fixed release candidate is posted, you can download the source tar and check it as any other Mahout release. Regards, Thanks Chameera On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Tharindu. On Tuesday, January 14, 2014 11:30 PM, Tharindu Rusira tharindurus...@gmail.com wrote: Hi Suneel, I tested the installation process with unit tests and everything went well. (Ubuntu 12.10 32bit, Java 1.7.0_40). Please note that I did not clean my local maven repository before the installation so I assumed maven dependencies are all available . On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the link to Release artifacts for Mahout 0.9: https://repository.apache.org/content/repositories/orgapachemahout-1000/ For those volunteering to test this, some of the stuff to look out for: a) Verify u can unpack the Release tar. Verified b) Verify u are able to compile the distribution Verified [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [4.380s] [INFO] Apache Mahout . SUCCESS [0.965s] [INFO] Mahout Math ... SUCCESS [2:07.687s] [INFO] Mahout Core ... SUCCESS [10:34.651s] [INFO] Mahout Integration SUCCESS [1:03.250s] [INFO] Mahout Examples ... SUCCESS [16.607s] [INFO] Mahout Release Package SUCCESS [0.469s] [INFO] Mahout Math/Scala wrappers SUCCESS [35.562s] [INFO] [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 14:44.158s [INFO] Finished at: Wed Jan 15 09:06:26 IST 2014 [INFO] Final Memory: 41M/252M [INFO] c) Run through the unit tests: mvn clean test Verified. d) Run the example scripts under $MAHOUT_HOME/examples/bin. I'm yet to test the example scripts and I will give an update soon. Regards, See http://incubator.apache.org/guides/releasemanagement.html#check-list for more details. On Tuesday, January 14, 2014 8:26 AM, spa...@gmail.com spa...@gmail.com wrote: I want to volunteer to test this release. What is the procedure/steps to get started and what pre-reqs I need
Re: Mahout 0.9 Release Candidate - VOTE
Sorry, sent little too early :). Got email from Suneel. On Thu, Jan 16, 2014 at 7:16 PM, spa...@gmail.com wrote: The link - https://repository.apache.org/content/repositories/orgapachemahout-1000/ still returns a 404. Is the latest release candidate made available for 0.9? On Thu, Jan 16, 2014 at 7:06 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Thanks Suneel. On Thu, Jan 16, 2014 at 7:04 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: It would be .tar.gz file and you would find it under mahout/distribution. On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Ok let's see after fixed the URL Thank you Tharindu for your cooperation. :) On Thu, Jan 16, 2014 at 9:55 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Thu, Jan 16, 2014 at 9:31 AM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Tharindu, There is no .tar file in the given link (befor give the 404 error). Well, what about a .zip file? As I can remember, it was a .zip file. Thanks, Chameera On Thu, Jan 16, 2014 at 9:27 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Wed, Jan 15, 2014 at 6:48 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Hi Tharindu, Chameera, sorry for the late reply. I'm having issues with my personal computer these days :) Still I could not able to download the artifacts. Could you please hep me to test the Release Have you figured out a way to proceed? I think the given URL is down as Suneel mentioned. Once the fixed release candidate is posted, you can download the source tar and check it as any other Mahout release. Regards, Thanks Chameera On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Tharindu. On Tuesday, January 14, 2014 11:30 PM, Tharindu Rusira tharindurus...@gmail.com wrote: Hi Suneel, I tested the installation process with unit tests and everything went well. (Ubuntu 12.10 32bit, Java 1.7.0_40). Please note that I did not clean my local maven repository before the installation so I assumed maven dependencies are all available . On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the link to Release artifacts for Mahout 0.9: https://repository.apache.org/content/repositories/orgapachemahout-1000/ For those volunteering to test this, some of the stuff to look out for: a) Verify u can unpack the Release tar. Verified b) Verify u are able to compile the distribution Verified [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [4.380s] [INFO] Apache Mahout . SUCCESS [0.965s] [INFO] Mahout Math ... SUCCESS [2:07.687s] [INFO] Mahout Core ... SUCCESS [10:34.651s] [INFO] Mahout Integration SUCCESS [1:03.250s] [INFO] Mahout Examples ... SUCCESS [16.607s] [INFO] Mahout Release Package SUCCESS [0.469s] [INFO] Mahout Math/Scala wrappers SUCCESS [35.562s] [INFO] [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 14:44.158s [INFO] Finished at: Wed Jan 15 09:06:26 IST 2014 [INFO] Final Memory: 41M/252M [INFO] c) Run through the unit tests: mvn clean test Verified. d) Run the example scripts under $MAHOUT_HOME/examples/bin. I'm yet to test the example scripts and I will give an update soon. Regards, See http://incubator.apache.org/guides/releasemanagement.html#check-list for more details. On Tuesday, January 14, 2014 8:26 AM, spa...@gmail.com spa...@gmail.com wrote: I
Re: Mahout 0.9 Release Candidate - VOTE
Please hold off on this, screwed up the future development version#. Have to redo this again. Sorry about that. On Thursday, January 16, 2014 8:47 AM, spa...@gmail.com spa...@gmail.com wrote: Sorry, sent little too early :). Got email from Suneel. On Thu, Jan 16, 2014 at 7:16 PM, spa...@gmail.com wrote: The link - https://repository.apache.org/content/repositories/orgapachemahout-1000/ still returns a 404. Is the latest release candidate made available for 0.9? On Thu, Jan 16, 2014 at 7:06 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Thanks Suneel. On Thu, Jan 16, 2014 at 7:04 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: It would be .tar.gz file and you would find it under mahout/distribution. On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Ok let's see after fixed the URL Thank you Tharindu for your cooperation. :) On Thu, Jan 16, 2014 at 9:55 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Thu, Jan 16, 2014 at 9:31 AM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Tharindu, There is no .tar file in the given link (befor give the 404 error). Well, what about a .zip file? As I can remember, it was a .zip file. Thanks, Chameera On Thu, Jan 16, 2014 at 9:27 AM, Tharindu Rusira tharindurus...@gmail.comwrote: On Wed, Jan 15, 2014 at 6:48 PM, Chameera Wijebandara chameerawijeband...@gmail.com wrote: Hi Tharindu, Chameera, sorry for the late reply. I'm having issues with my personal computer these days :) Still I could not able to download the artifacts. Could you please hep me to test the Release Have you figured out a way to proceed? I think the given URL is down as Suneel mentioned. Once the fixed release candidate is posted, you can download the source tar and check it as any other Mahout release. Regards, Thanks Chameera On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Tharindu. On Tuesday, January 14, 2014 11:30 PM, Tharindu Rusira tharindurus...@gmail.com wrote: Hi Suneel, I tested the installation process with unit tests and everything went well. (Ubuntu 12.10 32bit, Java 1.7.0_40). Please note that I did not clean my local maven repository before the installation so I assumed maven dependencies are all available . On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the link to Release artifacts for Mahout 0.9: https://repository.apache.org/content/repositories/orgapachemahout-1000/ For those volunteering to test this, some of the stuff to look out for: a) Verify u can unpack the Release tar. Verified b) Verify u are able to compile the distribution Verified [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [4.380s] [INFO] Apache Mahout . SUCCESS [0.965s] [INFO] Mahout Math ... SUCCESS [2:07.687s] [INFO] Mahout Core ... SUCCESS [10:34.651s] [INFO] Mahout Integration SUCCESS [1:03.250s] [INFO] Mahout Examples ... SUCCESS [16.607s] [INFO] Mahout Release Package SUCCESS [0.469s] [INFO] Mahout Math/Scala wrappers SUCCESS [35.562s] [INFO] [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 14:44.158s [INFO] Finished at: Wed Jan 15 09:06:26 IST 2014 [INFO] Final Memory: 41M/252M [INFO] c) Run through the unit tests: mvn clean test Verified. d) Run the example scripts under $MAHOUT_HOME/examples/bin. I'm yet to test the example scripts and I will give an update soon. Regards, See http://incubator.apache.org/guides/releasemanagement.html#check-list for more details. On Tuesday, January 14, 2014 8:26 AM, spa...@gmail.com spa...@gmail.com wrote: I
Re: Mahout 0.9 Release - Call for Volunteers
Hi Suneel, Still it getting 404 error. Thanks, Chameera On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1001/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need atleast 3 +1 votes from this group for the Release to pass. Thanks and Regards. -- Thanks, Chameera
MAHOUT 0.9 Release - New URL
Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
RE: MAHOUT 0.9 Release - New URL
Hi Suneel, Below first round of tests, Environment: SMP Debian 3.2.51-1 x86_64 Machine: Intel(R) Core(TM) i7 CPU 950 @ 3.07GHz stepping 05 12GB RAM OpenJDK: javac 1.6.0_27 a) Verify that u can unpack the release (tar or zip) [ Passed: tar -zxvf ] b) Verify u r able to compile the distro [ Passed: With OpenJDK, Latest Maven on LatestDebian ] c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. [Ongoing will update later] Regards Sotiris -Original Message- From: Suneel Marthi [mailto:suneel_mar...@yahoo.com] Sent: Thursday, January 16, 2014 4:41 PM To: u...@mahout.apache.org; mahout Subject: MAHOUT 0.9 Release - New URL Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/ apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
a), b), and c) all pass for me. Don't have the setup yet at work to go through d), will wait for others to verify. On 1/16/14, 9:41 AM, Suneel Marthi wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
I tested mahout 0.9 on Ubuntu 12.04 64bit, java version 1.6.0_27 a) Verify that u can unpack the release (tar or zip) - passed b) Verify u r able to compile the distro - passed c) Run through the unit tests: mvn clean test -passed d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. - will update later On Thu, Jan 16, 2014 at 5:35 PM, Sotiris Salloumis i...@eprice.gr wrote: Hi Suneel, Below first round of tests, Environment: SMP Debian 3.2.51-1 x86_64 Machine: Intel(R) Core(TM) i7 CPU 950 @ 3.07GHz stepping 05 12GB RAM OpenJDK: javac 1.6.0_27 a) Verify that u can unpack the release (tar or zip) [ Passed: tar -zxvf ] b) Verify u r able to compile the distro [ Passed: With OpenJDK, Latest Maven on LatestDebian ] c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. [Ongoing will update later] Regards Sotiris -Original Message- From: Suneel Marthi [mailto:suneel_mar...@yahoo.com] Sent: Thursday, January 16, 2014 4:41 PM To: u...@mahout.apache.org; mahout Subject: MAHOUT 0.9 Release - New URL Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/ apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: Mahout 0.9 Release - Call for Volunteers
Got the same error. Regards, Yexi 2014/1/16 Chameera Wijebandara chameerawijeband...@gmail.com Hi Suneel, Still it getting 404 error. Thanks, Chameera On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1001/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need atleast 3 +1 votes from this group for the Release to pass. Thanks and Regards. -- Thanks, Chameera -- -- Yexi Jiang, ECS 251, yjian...@cs.fiu.edu School of Computer and Information Science, Florida International University Homepage: http://users.cis.fiu.edu/~yjian004/
RE: Mahout 0.9 Release - Call for Volunteers
From unix you should try the following with wget or curl, make sure during copy the email client will not wrap it up http://repository.apache.org/content/repositories/orgapachemahout-1002/org/a pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.gz Above link via Google url shortener for easy copy/paste http://goo.gl/gX6xGz Regards Sotiris -Original Message- From: Yexi Jiang [mailto:yexiji...@gmail.com] Sent: Thursday, January 16, 2014 5:59 PM To: mahout Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Got the same error. Regards, Yexi 2014/1/16 Chameera Wijebandara chameerawijeband...@gmail.com Hi Suneel, Still it getting 404 error. Thanks, Chameera On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-100 1/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need atleast 3 +1 votes from this group for the Release to pass. Thanks and Regards. -- Thanks, Chameera -- -- Yexi Jiang, ECS 251, yjian...@cs.fiu.edu School of Computer and Information Science, Florida International University Homepage: http://users.cis.fiu.edu/~yjian004/
Re: Mahout 0.9 Release - Call for Volunteers
Tested on my mac and a server with ubuntu 12.04 LTS. All tests passed. [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [1.964s] [INFO] Apache Mahout . SUCCESS [0.400s] [INFO] Mahout Math ... SUCCESS [1:53.067s] [INFO] Mahout Core ... SUCCESS [9:09.716s] [INFO] Mahout Integration SUCCESS [1:04.662s] [INFO] Mahout Examples ... SUCCESS [3.331s] [INFO] Mahout Release Package SUCCESS [0.000s] [INFO] Mahout Math/Scala wrappers SUCCESS [11.356s] [INFO] [INFO] BUILD SUCCESS [INFO] Regards, Yexi 2014/1/16 Sotiris Salloumis i...@eprice.gr From unix you should try the following with wget or curl, make sure during copy the email client will not wrap it up http://repository.apache.org/content/repositories/orgapachemahout-1002/org/a pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.gz Above link via Google url shortener for easy copy/paste http://goo.gl/gX6xGz Regards Sotiris -Original Message- From: Yexi Jiang [mailto:yexiji...@gmail.com] Sent: Thursday, January 16, 2014 5:59 PM To: mahout Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org Subject: Re: Mahout 0.9 Release - Call for Volunteers Got the same error. Regards, Yexi 2014/1/16 Chameera Wijebandara chameerawijeband...@gmail.com Hi Suneel, Still it getting 404 error. Thanks, Chameera On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-100 1/org/apache/mahout/mahout-buildtools/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need atleast 3 +1 votes from this group for the Release to pass. Thanks and Regards. -- Thanks, Chameera -- -- Yexi Jiang, ECS 251, yjian...@cs.fiu.edu School of Computer and Information Science, Florida International University Homepage: http://users.cis.fiu.edu/~yjian004/ -- -- Yexi Jiang, ECS 251, yjian...@cs.fiu.edu School of Computer and Information Science, Florida International University Homepage: http://users.cis.fiu.edu/~yjian004/