[jira] [Created] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass

2015-03-20 Thread zhubin (JIRA)
zhubin created MAHOUT-1651:
--

 Summary: [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass
 Key: MAHOUT-1651
 URL: https://issues.apache.org/jira/browse/MAHOUT-1651
 Project: Mahout
  Issue Type: Bug
  Components: Math
Affects Versions: 0.9
Reporter: zhubin
Priority: Minor


Running org.apache.mahout.math.random.MultinomialTest
Tests run: 8, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.371 sec <<< FAILURE! - in org.apache.mahout.math.random.MultinomialTest
testPrime(org.apache.mahout.math.random.MultinomialTest)  Time elapsed: 0.039 sec <<< FAILURE!
java.lang.AssertionError: expected:<16> but was:<17>
    at __randomizedtesting.SeedInfo.seed([AAD9BA068467B3DA:9D6D3992EDADE21A]:0)
    at org.junit.Assert.fail(Assert.java:88)
    at org.junit.Assert.failNotEquals(Assert.java:743)
    at org.junit.Assert.assertEquals(Assert.java:118)
    at org.junit.Assert.assertEquals(Assert.java:555)
    at org.junit.Assert.assertEquals(Assert.java:542)
    at org.apache.mahout.math.random.MultinomialTest.testPrime(MultinomialTest.java:143)




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass

2015-03-20 Thread zhubin (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14370948#comment-14370948 ] 

zhubin commented on MAHOUT-1651:


I see that trunk has changed 
org/apache/mahout/math/random/MultinomialTest.java to work around this issue. 
However, I'm not sure whether JDK 8 itself introduced the issue or whether it 
was a problem in the test code.



[jira] [Resolved] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass

2015-03-20 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi resolved MAHOUT-1651.
---
   Resolution: Not a Problem
Fix Version/s: 0.10.0
 Assignee: Suneel Marthi



[jira] [Commented] (MAHOUT-1651) [JDK8] maven test failed for Mahout 0.9 with JDK 8, and JDK 7 can pass

2015-03-20 Thread Suneel Marthi (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371381#comment-14371381 ] 

Suneel Marthi commented on MAHOUT-1651:
---

This has nothing to do with JDK versions. It's a randomized test that needed 
to be loosened up. Closing this Jira as it's a non-issue.
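
To illustrate what "loosening" a randomized test can look like, here is a minimal sketch in plain Java. It is not the actual MultinomialTest code or the change made on trunk; the class and method names (TolerantSamplingCheck, sampleCounts, countsWithinTolerance) are hypothetical. Instead of asserting that a sampled count equals one exact value, the check accepts any count within a few binomial standard deviations of its expectation.

```java
import java.util.Random;

/** Hypothetical illustration only; not taken from the Mahout code base. */
final class TolerantSamplingCheck {

    /** Draws n samples from categories 0..weights.length-1 in proportion to the weights. */
    static int[] sampleCounts(double[] weights, int n, long seed) {
        double total = 0;
        for (double w : weights) {
            total += w;
        }
        Random rng = new Random(seed); // fixed seed keeps the run repeatable
        int[] counts = new int[weights.length];
        for (int i = 0; i < n; i++) {
            double u = rng.nextDouble() * total;
            int k = 0;
            // Walk the cumulative weights until u falls inside category k.
            while (k < weights.length - 1 && u >= weights[k]) {
                u -= weights[k];
                k++;
            }
            counts[k]++;
        }
        return counts;
    }

    /** True if every count is within `sigmas` binomial standard deviations of its expected value. */
    static boolean countsWithinTolerance(int[] counts, double[] weights, double sigmas) {
        double total = 0;
        for (double w : weights) {
            total += w;
        }
        int n = 0;
        for (int c : counts) {
            n += c;
        }
        for (int k = 0; k < counts.length; k++) {
            double p = weights[k] / total;
            double expected = n * p;
            double sd = Math.sqrt(n * p * (1 - p));
            if (Math.abs(counts[k] - expected) > sigmas * sd) {
                return false;
            }
        }
        return true;
    }
}
```

A test written this way would call sampleCounts and then assert countsWithinTolerance with a generous bound, so that a change in the random sequence does not by itself turn a correct sampler into a failing build.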



[jira] [Resolved] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-04-20 Thread Sebastian Schelter (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Schelter resolved MAHOUT-1431.


Resolution: Cannot Reproduce

Closing this as we didn't get another answer for 6 weeks.

 Comparison of Mahout 0.8 vs mahout 0.9 in EMR
 -

 Key: MAHOUT-1431
 URL: https://issues.apache.org/jira/browse/MAHOUT-1431
 Project: Mahout
  Issue Type: Question
  Components: Clustering
Affects Versions: 0.8, 0.9
Reporter: yannis ats
  Labels: performance

 Hi all,
 I tested Mahout 0.8 and 0.9 on Amazon EMR, running k-means experiments with 
 both versions on a large dataset.
 What I found is that Mahout 0.8 is faster than Mahout 0.9. In particular, 
 Mahout 0.8 performs fewer iterations, and every k-means iteration is faster 
 than in Mahout 0.9: each iteration in 0.8 is twice as fast as in 0.9.
 The Hadoop version was 1.0.x and the input was roughly 2 million data points 
 with a dimensionality of 1800.
 The input parameters in both experiments were exactly the same, modulo the 
 initialization, which was random in both cases. I can understand that this 
 may affect convergence (the number of iterations), but I am baffled by the 
 fact that every iteration takes almost twice the time in 0.9 vs 0.8.
 Is this normal? Is this expected?
 Thank you in advance for your time.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-04-13 Thread Sebastian Schelter (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13967797#comment-13967797 ] 

Sebastian Schelter commented on MAHOUT-1431:


Any progress here? Otherwise I'll close the ticket soon.



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-08 Thread Suneel Marthi (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13924818#comment-13924818 ] 

Suneel Marthi commented on MAHOUT-1431:
---

Yannis, could you provide explicit details as to the task name? KMeans has 2 
mapper tasks and a reducer task. Which mapper is taking longer? I would 
expect ClusterClassificationMapper to be slower due to the additional overhead 
of calculating the vector's distance from the centroid and converting the 
vector to a named vector.



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-08 Thread yannis ats (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13924841#comment-13924841 ] 

yannis ats commented on MAHOUT-1431:


How can I distinguish which mapper is slower? By the logs?
I will try to look at the logs.
The times reported here were taken approximately from the EMR console.
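
One way to compare phases, sketched below in plain Java, is to note the start and finish time of each task by hand and average the durations per phase. This is only a sketch under assumptions: the TaskTiming record and its fields are hypothetical and are not a Hadoop or Mahout type, and the phase labels are whatever names the job history shows.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper; the record layout is an assumption, not a Hadoop API. */
final class TaskTimingSummary {

    /** One task attempt, with times in milliseconds as copied from the job history. */
    static final class TaskTiming {
        final String phase;
        final long startMillis;
        final long finishMillis;

        TaskTiming(String phase, long startMillis, long finishMillis) {
            this.phase = phase;
            this.startMillis = startMillis;
            this.finishMillis = finishMillis;
        }
    }

    /** Returns the mean task duration in seconds for each phase, in first-seen order. */
    static Map<String, Double> meanSecondsByPhase(List<TaskTiming> timings) {
        // phase -> {total duration in ms, number of tasks}
        Map<String, long[]> acc = new LinkedHashMap<>();
        for (TaskTiming t : timings) {
            long[] a = acc.computeIfAbsent(t.phase, p -> new long[2]);
            a[0] += t.finishMillis - t.startMillis;
            a[1]++;
        }
        Map<String, Double> means = new LinkedHashMap<>();
        for (Map.Entry<String, long[]> e : acc.entrySet()) {
            long[] a = e.getValue();
            means.put(e.getKey(), a[0] / 1000.0 / a[1]);
        }
        return means;
    }
}
```

Running the same summary over a 0.8 job and a 0.9 job with identical inputs would show whether the extra time sits in one mapper, in the reducer, or in both.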




[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-06 Thread Suneel Marthi (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13922707#comment-13922707 ] 

Suneel Marthi commented on MAHOUT-1431:
---

Hi Yannis, any updates on this?



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-06 Thread yannis ats (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13922734#comment-13922734 ] 

yannis ats commented on MAHOUT-1431:


Hi,
I will report back in a few hours, once I am able to inspect manually how 
much time is consumed by the mappers/reducers.

Kind regards,
Yannis



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-06 Thread yannis ats (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13923048#comment-13923048 ] 

yannis ats commented on MAHOUT-1431:


From manual inspection, it seems that the reducer takes twice the time that a 
mapper needs to complete its task. I ran the experiments with 6 large machines 
as data nodes.




[jira] [Created] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread yannis ats (JIRA)
yannis ats created MAHOUT-1431:
--

 Summary: Comparison of Mahout 0.8 vs mahout 0.9 in EMR
 Key: MAHOUT-1431
 URL: https://issues.apache.org/jira/browse/MAHOUT-1431
 Project: Mahout
  Issue Type: Question
  Components: Clustering
Affects Versions: 0.9, 0.8
Reporter: yannis ats


Hi all,
i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and 
i performed kmeans experiments with both versions in amazon EMR.
What i found is that mahout 0.8 is faster than mahout 0.9
in particular i observed that mahout 0.8 is performing less iterations and 
every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 
0.8 is twice as fast as that of 0.9
the hadoop version was 1.0.x and the input of the data was roughly 2 million 
datapoints with dimensionality of 1800.
The input parameters in both experiments were exactly the same,modulo the 
initialization which was random in both cases and i can understand that this 
may affect the convergence(the amount of iterations),but i am buffled by the 
fact that every iteration takes almost twice the time in 0.9 vs 0.8

Is this normal?is this  expected?

thank you in advance for your time.






[jira] [Updated] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread yannis ats (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

yannis ats updated MAHOUT-1431:
---

Description: 
Hi all,
i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and 
i performed kmeans experiments with both versions in amazon EMR.
What i found is that mahout 0.8 is faster than mahout 0.9
in particular i observed that mahout 0.8 is performing less iterations and 
every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 
0.8 is twice as fast as that of 0.9
the hadoop version was 1.0.x and the input of the data was roughly 2 million 
datapoints with dimensionality of 1800.
The input parameters in both experiments were exactly the same,modulo the 
initialization which was random in both cases and i can understand that this 
may affect the convergence(the amount of iterations),but i am baffled by the 
fact that every iteration takes almost twice the time in 0.9 vs 0.8

Is this normal?is this  expected?

thank you in advance for your time.


  was:
Hi all,
i tested mahout 0.8 and 0.9 in mahout emr with a large dataset as input and 
i performed kmeans experiments with both versions in amazon EMR.
What i found is that mahout 0.8 is faster than mahout 0.9
in particular i observed that mahout 0.8 is performing less iterations and 
every iteration of kmeans is faster than mahout 0.9.Every iteration in mahout 
0.8 is twice as fast as that of 0.9
the hadoop version was 1.0.x and the input of the data was roughly 2 million 
datapoints with dimensionality of 1800.
The input parameters in both experiments were exactly the same,modulo the 
initialization which was random in both cases and i can understand that this 
may affect the convergence(the amount of iterations),but i am buffled by the 
fact that every iteration takes almost twice the time in 0.9 vs 0.8

Is this normal?is this  expected?

thank you in advance for your time.





[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread Sebastian Schelter (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919260#comment-13919260 ] 

Sebastian Schelter commented on MAHOUT-1431:


That is really strange. I don't think we changed anything in our k-Means 
implementation (can someone verify this?). 

Could it be that some change in our vector code causes this behaviour?

Btw: k-Means should have an option to fix the random seed for the 
initialization to enable repeatable experiments.
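
As a sketch of what a fixed-seed initialization could look like, the plain-Java snippet below picks k distinct starting points out of n using a seeded java.util.Random. It is not Mahout's implementation and does not correspond to an existing Mahout option; the class name SeededInit and the method chooseInitialCentroids are hypothetical.

```java
import java.util.Arrays;
import java.util.Random;

/** Hypothetical illustration of repeatable random initialization. */
final class SeededInit {

    /**
     * Picks k distinct indices in [0, n) with a partial Fisher-Yates shuffle.
     * The same (n, k, seed) always yields the same indices.
     */
    static int[] chooseInitialCentroids(int n, int k, long seed) {
        if (k < 0 || k > n) {
            throw new IllegalArgumentException("need 0 <= k <= n");
        }
        int[] idx = new int[n];
        for (int i = 0; i < n; i++) {
            idx[i] = i;
        }
        Random rng = new Random(seed);
        for (int i = 0; i < k; i++) {
            // Swap position i with a random position in [i, n).
            int j = i + rng.nextInt(n - i);
            int tmp = idx[i];
            idx[i] = idx[j];
            idx[j] = tmp;
        }
        return Arrays.copyOf(idx, k);
    }
}
```

With the seed held constant across two runs, both versions would start from the same centroids, so a difference in iteration count could no longer be attributed to the random start.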



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread Suneel Marthi (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919267#comment-13919267 ] 

Suneel Marthi commented on MAHOUT-1431:
---

Could you provide code snapshots of where you believe the iterations are 
taking longer?

The only change that was made, to address MAHOUT-1030, was to convert all 
vectors to named vectors in order to store the vector IDs and the vector 
distances from cluster centers. The code changes for that are in 
ClusterClassificationDriver (for sequential mode) and 
ClusterClassificationMapper (for MR mode), which are post-processing steps 
after clustering is done. 





[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread Suneel Marthi (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919271#comment-13919271 ] 

Suneel Marthi commented on MAHOUT-1431:
---

The other change in 0.9 that comes to mind, and that could potentially have 
affected vector behaviour, is MAHOUT-1242. This is mere late-night 
speculation; the evidence has not been verified.



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread yannis ats (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919273#comment-13919273 ] 

yannis ats commented on MAHOUT-1431:


The time for every iteration is what is recorded when the job (iteration) 
finishes, and it was read from the EMR console.
I have not inspected the code; I just ran experiments with a large dataset, 
because when I did the same experiment going from 0.7 to 0.8 I found that 0.8 
was much faster (in terms of the number of iterations; I don't remember how 
much time every iteration was taking).




[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread Suneel Marthi (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919293#comment-13919293 ] 

Suneel Marthi commented on MAHOUT-1431:
---

Comparing 0.7 to 0.8 is comparing apples to oranges. The clustering code was 
redone for 0.7 and wasn't functioning right until 0.8.



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread Sebastian Schelter (JIRA)

[ https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919308#comment-13919308 ] 

Sebastian Schelter commented on MAHOUT-1431:


Could you see where the additional time was spent (e.g. in the mapper or in 
the reducer)?



[jira] [Commented] (MAHOUT-1431) Comparison of Mahout 0.8 vs mahout 0.9 in EMR

2014-03-04 Thread yannis ats (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919314#comment-13919314
 ] 

yannis ats commented on MAHOUT-1431:


I am fairly sure that the reducer takes more time than the mapper. If I 
remember correctly, both the mapper and the reducer probably take more time in 
0.9 than in 0.8, and I think more of that time went to the mapper, but I am 
not very confident (my memory is not very good).
I cannot answer this question right now; I will probably have to rerun the 
jobs and check manually how long each phase takes.



Re: Solr-recommender for Mahout 0.9

2014-02-24 Thread Ted Dunning
On Sat, Feb 22, 2014 at 4:50 PM, Andrew Musselman 
andrew.mussel...@gmail.com wrote:

 *Ted*, if you have any code you could donate for this example from your and
 Ellen's book I'd love to be able to re-use it.


I do.  I will try to open up access to that sometime today.

Pat's work on the cross recommender job is also important.


Re: Solr-recommender for Mahout 0.9

2014-02-22 Thread Andrew Musselman
*Pat*, I opened a ticket (M-1420) for putting a new script in examples/ that
uses the solr-recommender.  There seems to be another, related ticket from
Suneel, M-1288.

Did the work described in the thread below make it into 0.9, and/or how
much more is needed on it?

*Ted*, if you have any code you could donate for this example from your and
Ellen's book I'd love to be able to re-use it.

Thanks
Andrew

On Sun, Nov 17, 2013 at 3:36 PM, Pat Ferrel p...@occamsmachete.com wrote:

 Eventually I'd like to get MAP built into the solr-recommender. I used it at
 a client who had good data. It was very helpful for exploring what data was
 useful and what wasn't. We'd run MAP with and without detail-view data, for
 instance, and take the MAP as a measure of how predictive the data was. In
 our case the MAP@ numbers went down with purchase and detail-view mixed
 together. That was why I got interested in the cross-action recommender, as
 a way to scrub less predictive actions. Unfortunately, I didn't finish it
 before I lost access to the data.

 What form of precision calc will you use? Obviously we used mean average
 precision at different numbers of recommendations, which had the effect of
 producing a fall-off curve. We took the curve as a measure of how well
 our ranking was working.
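
The fall-off curve described above can be sketched as follows. This is a minimal illustration, not code from the solr-recommender: the function names, the layout of the held-out data, and the choice to normalize by min(number of relevant items, k) are all assumptions made for the example.

```python
from typing import Dict, List, Sequence, Set


def average_precision_at_k(recommended: Sequence[str],
                           relevant: Set[str], k: int) -> float:
    """Average precision over the top-k recommendations for one user.

    Assumes `recommended` is ranked best-first and contains no duplicates.
    """
    if k <= 0 or not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(recommended[:k], start=1):
        if item in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this cut-off
    # Assumed normalization: the best achievable number of hits within k.
    return precision_sum / min(len(relevant), k)


def map_at_k(recs: Dict[str, List[str]],
             held_out: Dict[str, Set[str]], k: int) -> float:
    """Mean of per-user average precision, over users with held-out items."""
    users = [u for u, items in held_out.items() if items]
    if not users:
        return 0.0
    total = sum(average_precision_at_k(recs.get(u, []), held_out[u], k)
                for u in users)
    return total / len(users)


# Hypothetical toy data: ranked recommendations and held-out "true" items.
recs = {"u1": ["a", "b", "c", "d"], "u2": ["x", "y", "z", "w"]}
held_out = {"u1": {"a", "c"}, "u2": {"y"}}

# MAP at several cut-offs gives the points of the fall-off curve.
curve = {k: map_at_k(recs, held_out, k) for k in (1, 2, 4)}
```

Running the same computation with and without a given action type (for example detail views) in the training data, and comparing the two curves, is the kind of comparison the message describes.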

 On Nov 17, 2013, at 10:47 AM, Ken Krugler kkrugler_li...@transpac.com
 wrote:

 Hi Pat,

 On Nov 13, 2013, at 4:43pm, Pat Ferrel pat.fer...@gmail.com wrote:

  Ever done an offline precision calc?

 No, sorry.

 I do (finally) have one client with some data that could be used to
 calculate precision, and a willingness to pay for the work, so I'm hoping
 to include details on that in my next blog post about text feature
 selection.

 -- Ken


  On Nov 13, 2013, at 1:39 PM, Ken Krugler kkrugler_li...@transpac.com
 wrote:
 
  Hi Pat,
 
  On Nov 13, 2013, at 9:21am, Pat Ferrel p...@occamsmachete.com wrote:
 
  A version is now checked in that uses mahout 0.9. Haven't tested it on
 a cluster yet, only locally. I have to upgrade my cluster to Hadoop 1.2.1,
 which takes some time.
 
  Saw the Strata slides from Ted touting dithering of results, which
 I'll implement.
 
  Ken, did you have anything specific for "And usually I just use Solr
 to generate a candidate list, then I do more specific scoring to find the N
 best from N*4 candidates"?
 
  If I'm looking for the top N best matches, I'll do a Solr query with
 rows=N*4.
 
  Then I use all of the data from these potential matches, and calculate
 a more sophisticated similarity score (e.g. adding a weighting based on the
 user's activity level) between my target and these candidates.
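
A minimal sketch of the two-stage approach described above: over-fetch N*4 candidates, then re-score them and keep the best N. The candidate records, the field names `solr_score` and `activity`, and the log-based weighting are hypothetical stand-ins; the thread does not say which scoring function was actually used.

```python
import math
from typing import Callable, List


def top_n_after_rescoring(candidates: List[dict], n: int,
                          rescore: Callable[[dict], float]) -> List[dict]:
    """Re-rank an over-fetched candidate list and keep the n best."""
    ranked = sorted(candidates, key=rescore, reverse=True)
    return ranked[:n]


def activity_weighted(candidate: dict) -> float:
    """Assumed rule: weight the search score by log(1 + activity level)."""
    return candidate["solr_score"] * math.log1p(candidate["activity"])


# Stand-in for the rows a search query would return with rows = N * 4.
n = 2
candidates = [
    {"id": "u1", "solr_score": 9.0, "activity": 1},
    {"id": "u2", "solr_score": 8.0, "activity": 50},
    {"id": "u3", "solr_score": 7.5, "activity": 3},
    {"id": "u4", "solr_score": 7.0, "activity": 200},
    {"id": "u5", "solr_score": 6.0, "activity": 10},
    {"id": "u6", "solr_score": 5.5, "activity": 0},
    {"id": "u7", "solr_score": 5.0, "activity": 20},
    {"id": "u8", "solr_score": 4.0, "activity": 5},
]
best = top_n_after_rescoring(candidates, n, activity_weighted)
```

The factor of four trades a larger result set for a better chance that the re-scored top N were among the candidates at all.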
 
  Regards,
 
  -- Ken
 
 
  Was planning to try boosting by something like genre/category in the
 recs query. For instance, in the demo data, each item will soon have a set
 of tags (actually genre names) so these could be a field being queried
 along with the item-item links. The query for recs would then include the
 user history against the item-item links, and the average genre tags
 preferred by the user against item genre tags. This would return recs
 skewed towards the user's genre preference.
 
  Another way this could be used is when showing similar items. You'd
 have the tags for the item being viewed and so could use them to skew
 towards items with similar tags. I think this works but would turn similar
 items from a lookup (they are pre-calculated by Mahout) into another Solr
 query.
 
 
 
  On Nov 8, 2013, at 1:27 PM, Pat Ferrel p...@occamsmachete.com wrote:
 
  Not planning to do anything with weights at present. An ORed query and
 Solr's own weights should suffice for the time being. There is a good list
 of ways to do this later if it warrants an experiment. Thanks.
 
  Have similar items as input, recommendations from user likes, and
 just got recs from recently viewed working. Once you have online recs from
 the pre-calculated model, experimenting is super easy. The next step will be
 to get more metadata ingested so we can try boosting by context genre, or
 recent genre viewed, which is sort of in line with "more specific scoring
 to find the N best from N*4 candidates". Also want to do what Ted calls
 dithering to vary the choices you see.
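
The thread does not spell out how dithering works. A formulation often described in this context re-sorts the results by log(rank) plus Gaussian noise; the sketch below assumes that formulation, and the function name and the `epsilon` parameter are illustrative only.

```python
import math
import random
from typing import List, Optional, Sequence


def dither(ranked: Sequence[str], epsilon: float = 2.0,
           rng: Optional[random.Random] = None) -> List[str]:
    """Return a noisy re-ordering that mostly preserves the top of the list.

    Each item gets the score log(rank) + Normal(0, log(epsilon)). Because
    log(rank) values get closer together further down the list, deep results
    are shuffled more than the first few. epsilon must be >= 1, and
    epsilon == 1 leaves the order unchanged.
    """
    if epsilon < 1.0:
        raise ValueError("epsilon must be >= 1")
    rng = rng or random.Random()
    sigma = math.log(epsilon)
    scored = [(math.log(rank) + rng.gauss(0.0, sigma), item)
              for rank, item in enumerate(ranked, start=1)]
    scored.sort(key=lambda pair: pair[0])  # smaller noisy score ranks first
    return [item for _, item in scored]


items = ["a", "b", "c", "d", "e", "f", "g", "h"]
shown = dither(items, epsilon=2.0, rng=random.Random(42))
```

Passing a seeded `random.Random` keeps one user's view stable within a session while still varying results across sessions.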
 
  On Nov 8, 2013, at 10:10 AM, Ken Krugler kkrugler_li...@transpac.com
 wrote:
 
  One other thing I should have mentioned is that if you care about
 setting weights on incoming terms, you can boost them using the ^value
 syntax.
 
  E.g. the_kings_speech^1.5 OR skyfall^0.5 OR looper^3.0...
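
A query string of this shape can be assembled from a term-to-weight mapping. The helper below is a small sketch with a hypothetical name; it only builds the string and assumes the terms are plain tokens that need no escaping.

```python
from typing import Mapping


def boosted_or_query(boosts: Mapping[str, float]) -> str:
    """Join terms into an OR query, attaching each weight as term^boost.

    Assumes the terms are plain tokens; anything containing spaces or
    query-syntax characters would need escaping first.
    """
    return " OR ".join(f"{term}^{boost}" for term, boost in boosts.items())


query = boosted_or_query(
    {"the_kings_speech": 1.5, "skyfall": 0.5, "looper": 3.0})
```

The resulting string would then be sent as the query against whichever field holds the item-item links.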
 
  If you want to account for weights of terms in the index, it's a bit
 harder. You can do simple boosting by replicating terms, or you can use
 payload-based boosting, or you could code up your own Similarity class that
 takes advantage of side-channel data.
 
  But in my experience the gain from applying weights to terms in the
 index isn't very significant.
 
  And usually I just use Solr to generate a candidate list, then I do more
 specific scoring to find the N best

Re: Mahout 0.9 Release Notes - First Draft

2014-02-18 Thread Suneel Marthi
Could someone please point me to the URL for adding Mahout release notes?  




On Monday, February 17, 2014 3:27 PM, Ellen Friedman 
b.ellen.fried...@gmail.com wrote:
 

Hi Suneel,

Thanks for notes. I'm inquiring about status of the notes and update to the 
website to announce 0.9: Ted has reviewed the release notes - were you waiting 
for additional input or are they ready to go on the website? Are you the one 
who updates the site?

I've been asked to write a short blog on the release but wanted to wait until 
the site is updated.

Thanks much
Ellen





On Tue, Feb 11, 2014 at 10:06 AM, Suneel Marthi suneel_mar...@yahoo.com wrote:

Here's a draft of the Release Notes for Mahout 0.9, Please review the same.

--



The Apache Mahout PMC is pleased to announce the release of Mahout 0.9.
Mahout's goal is to build scalable machine learning libraries focused
primarily in the areas of collaborative filtering (recommenders),
clustering and classification (known collectively as the 3Cs), as well as the
necessary infrastructure to support those implementations including, but
not limited to, math packages for statistics, linear algebra and others
as well as Java primitive collections, local and distributed vector and
matrix classes and a variety of integrative code to work with popular
packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache
Cassandra and much more. The 0.9 release is mainly a clean up release in
preparation for an upcoming 1.0 release targeted for first half of 2014, but 
there are a few
significant new features, which are highlighted below.

To get started with Apache Mahout 0.9, download the release artifacts and 
signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central 
Maven repository.


As with any release, we wish to thank all of the users and contributors
to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for
individual credits, as there are too many to list here.

GETTING STARTED

In the release package, the examples directory contains several working 
examples of the core
functionality available in Mahout. These can be run via scripts in the 
examples/bin
directory and will prompt you for more information to help you try things out.
Most examples do not need a Hadoop cluster in order to run.

RELEASE HIGHLIGHTS

The highlights of the Apache Mahout 0.9 release include, but are not
limited to the list below. For further information, see the included 
CHANGELOG[1] file.

-  MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra.
   See 
http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html
-  MAHOUT-1288: Recommenders as a Search.  See 
https://github.com/pferrel/solr-recommender
-  MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1

-  MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 
1-dimensional Clustering
  See 
https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf
 for the details.
-  MAHOUT-1265: MultiLayer Perceptron (MLP) classifier
   This is an early implementation of MLP to solicit user feedback, needs to 
be integrated into Mahout’s processing pipeline to work with Mahout’s vectors.

- Removed Deprecated algorithms as they have been either replaced by better 
performing algorithms or lacked user support and maintenance.

- the usual bug fixes. See [2] for more information on the 0.9 release.

A total of 113 separate JIRA issues were addressed in this release.


The following algorithms that were marked deprecated in 0.8 have been removed 
in 0.9:

- From Clustering:
   Switched LDA implementation from using Dirtichlet to Collapsed Variational 
Bayes (CVB)

  Meanshift

  MinHash - removed due to poor performance,  lack of support and lack of usage


- From Classification (both are sequential implementations)

  Winnow - lack of actual usage and support

  Perceptron - lack of actual usage and support

- Collaborative Filtering

SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and 
org.apache.mahout.cf.taste.impl.recommender.slopeone
    Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo
    TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender

- Mahout Math

    Hadoop entropy stuff in org.apache.mahout.math.stats.entropy


CONTRIBUTING

Mahout is always looking for contributions focused on the 3Cs. If you are
interested in contributing, please see our contribution page 
http://mahout.apache.org/developers/how-to-contribute.html or contact us via 
email at dev@mahout.apache.org.


As the project moves towards a 1.0 release, the community will be focused on 
key algorithms that are proven to scale in production and have seen 
wide-spread adoption.

[1] 
http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markup&pathrev=1563661
[2] 
https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20

Apache Mahout 0.9 released

2014-02-18 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.9.
Mahout's goal is to build scalable machine learning libraries focused
primarily in the areas of collaborative filtering (recommenders),
clustering and classification (known collectively as the 3Cs), as well as the
necessary infrastructure to support those implementations including, but
not limited to, math packages for statistics, linear algebra and others
as well as Java primitive collections, local and distributed vector and
matrix classes and a variety of integrative code to work with popular
packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache
Cassandra and much more. The 0.9 release is mainly a clean up release in
preparation for an upcoming 1.0 release targeted for the first half of 2014, but
there are a few significant new features, which are highlighted below.

To get started with Apache Mahout 0.9, download the release artifacts and 
signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central 
Maven repository.

As with any release, we wish to thank all of the users and contributors
to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for
individual credits, as there are too many to list here.

GETTING STARTED

In the release package, the examples directory contains several working
examples of the core functionality available in Mahout. These can be run via
scripts in the examples/bin directory and will prompt you for more information
to help you try things out. Most examples do not need a Hadoop cluster in
order to run.

RELEASE HIGHLIGHTS

The highlights of the Apache Mahout 0.9 release include, but are not
limited to the list below. For further information, see the included 
CHANGELOG[1] file.

-  MAHOUT-1245: A new and improved Mahout website based on Apache CMS
-  MAHOUT-1265: MultiLayer Perceptron (MLP) classifier 
   This is an early implementation of MLP to solicit user feedback; it needs to 
be integrated into Mahout’s processing pipeline to work with Mahout’s vectors.
-  MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra.  See 
http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html
-  MAHOUT-1288: Recommenders as a Search.  See 
https://github.com/pferrel/solr-recommender
-  MAHOUT-1300: Support for easy functional Matrix views and derivatives
-  MAHOUT-1343: JSON output format for ClusterDumper
-  MAHOUT-1345: Enable randomised testing for all Mahout modules using Carrot 
RandomizedRunner. 
-  MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 
1-dimensional Clustering.  See 
https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf
 for the details.
-  MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1


- Removed Deprecated algorithms as they have been either replaced by better 
performing algorithms or lacked user support and maintenance.

- the usual bug fixes. See [2] for more information on the 0.9 release.

A total of 113 separate JIRA issues were addressed in this release.

The following algorithms that were marked deprecated in 0.8 have been removed 
in 0.9:

- From Clustering:
   Switched LDA implementation from using Gibbs Sampling to Collapsed 
Variational Bayes (CVB)

  Meanshift

  MinHash - removed due to poor performance,  lack of support and lack of usage

- From Classification (both are sequential implementations)

  Winnow - lack of actual usage and support

  Perceptron - lack of actual usage and support

- Collaborative Filtering
SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and 
org.apache.mahout.cf.taste.impl.recommender.slopeone
    Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo
    TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender

- Mahout Math
    Hadoop entropy stuff in org.apache.mahout.math.stats.entropy


CONTRIBUTING

Mahout is always looking for contributions focused on the 3Cs. If you are
interested in contributing, please see our contribution page 
http://mahout.apache.org/developers/how-to-contribute.html or contact us via 
email at dev@mahout.apache.org.


As the project moves towards a 1.0 release, the community will be focused on 
key algorithms that are proven to scale in production and have seen wide-spread 
adoption. 

[1] 
http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markup&pathrev=1563661
[2] 
https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22

Re: Mahout 0.9 Release Notes - First Draft

2014-02-18 Thread Suneel Marthi
Below are the release notes; I'm not sure where they should go on the website. 
If someone could point me to a location, I will go ahead and update it.

=

The Apache Mahout PMC is pleased to announce the release of Mahout 0.9.
Mahout's goal is to build scalable machine learning libraries focused
primarily in the areas of collaborative filtering (recommenders),
clustering and classification (known collectively as the 3Cs), as well as the
necessary infrastructure to support those implementations including, but
not limited to, math packages for statistics, linear algebra and others
as well as Java primitive collections, local and distributed vector and
matrix classes and a variety of integrative code to work with popular
packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache
Cassandra and much more. The 0.9 release is mainly a clean up release in
preparation for an upcoming 1.0 release targeted for first half of 2014, but 
there are a few
significant new features, which are highlighted below.

To get started with Apache Mahout 0.9, download the release artifacts and 
signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central 
Maven repository.

As with any release, we wish to thank all of the users and contributors
to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for
individual credits, as there are too many to list here.

GETTING STARTED

In the release package, the examples directory contains several working 
examples of the core
functionality available in Mahout. These can be run via scripts in the 
examples/bin
directory and will prompt you for more information to help you try things out. 
Most examples do not need a Hadoop cluster in order to run.

RELEASE HIGHLIGHTS

The highlights of the Apache Mahout 0.9 release include, but are not
limited to the list below. For further information, see the included 
CHANGELOG[1] file.

-  MAHOUT-1245: A new and improved Mahout website based on Apache CMS
-  MAHOUT-1265: MultiLayer Perceptron (MLP) classifier 
   This is an early implementation of MLP to solicit user feedback; it needs to 
be integrated into Mahout’s processing pipeline to work with Mahout’s vectors.
-  MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra.  See 
http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html
-  MAHOUT-1288: Recommenders as a Search.  See 
https://github.com/pferrel/solr-recommender
-  MAHOUT-1300: Support for easy functional Matrix views and derivatives
-  MAHOUT-1343: JSON output format for ClusterDumper
-  MAHOUT-1345: Enable randomised testing for all Mahout modules using Carrot 
RandomizedRunner. 
-  MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 
1-dimensional Clustering.  See 
https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf
 for the details.
-  MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1


- Removed Deprecated algorithms as they have been either replaced by better 
performing algorithms or lacked user support and maintenance.

- the usual bug fixes. See [2] for more information on the 0.9 release.

A total of 113 separate JIRA issues were addressed in this release.

The following algorithms that were marked deprecated in 0.8 have been removed 
in 0.9:

- From Clustering:
   Switched LDA implementation from using Gibbs Sampling to Collapsed 
Variational Bayes (CVB)

  Meanshift

  MinHash - removed due to poor performance,  lack of support and lack of usage

- From Classification (both are sequential implementations)

  Winnow - lack of actual usage and support

  Perceptron - lack of actual usage and support

- Collaborative Filtering
SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and 
org.apache.mahout.cf.taste.impl.recommender.slopeone
    Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo
    TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender

- Mahout Math
    Hadoop entropy stuff in org.apache.mahout.math.stats.entropy


CONTRIBUTING

Mahout is always looking for contributions focused on the 3Cs. If you are
interested in contributing, please see our contribution page 
http://mahout.apache.org/developers/how-to-contribute.html or contact us via 
email at dev@mahout.apache.org.


As the project moves towards a 1.0 release, the community will be focused on 
key algorithms that are proven to scale in production and have seen wide-spread 
adoption. 

[1] 
http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markup&pathrev=1563661
[2] 
https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22





On Monday, February 17, 2014 3:27 PM, Ellen Friedman 
b.ellen.fried...@gmail.com wrote:
 

Hi Suneel,

Thanks for notes. I'm inquiring about status of the notes and update to the 
website to announce 0.9: Ted has reviewed the release notes - were you waiting

Re: Mahout 0.9 Release Notes - First Draft

2014-02-17 Thread Ted Dunning
On Tue, Feb 11, 2014 at 10:06 AM, Suneel Marthi suneel_mar...@yahoo.com wrote:

Switched LDA implementation from using Dirtichlet to Collapsed
 Variational Bayes (CVB)


This line should read:

Switched LDA implementation from using Gibbs sampling to Collapsed
Variational Bayes (CVB)


Otherwise, it looks pretty good.


Re: Mahout 0.9 Release Notes - First Draft

2014-02-17 Thread Ellen Friedman
Hi Suneel,

Thanks for notes. I'm inquiring about status of the notes and update to the
website to announce 0.9: Ted has reviewed the release notes - were you
waiting for additional input or are they ready to go on the website? Are
you the one who updates the site?

I've been asked to write a short blog on the release but wanted to wait
until the site is updated.

Thanks much
Ellen



On Tue, Feb 11, 2014 at 10:06 AM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 Here's a draft of the Release Notes for Mahout 0.9, Please review the same.

 --


 The Apache Mahout PMC is pleased to announce the release of Mahout 0.9.
 Mahout's goal is to build scalable machine learning libraries focused
 primarily in the areas of collaborative filtering (recommenders),
 clustering and classification (known collectively as the 3Cs), as well
 as the
 necessary infrastructure to support those implementations including, but
 not limited to, math packages for statistics, linear algebra and others
 as well as Java primitive collections, local and distributed vector and
 matrix classes and a variety of integrative code to work with popular
 packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache
 Cassandra and much more. The 0.9 release is mainly a clean up release in
 preparation for an upcoming 1.0 release targeted for first half of 2014,
 but there are a few
 significant new features, which are highlighted below.

 To get started with Apache Mahout 0.9, download the release artifacts and
 signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the
 central Maven repository.

 As with any release, we wish to thank all of the users and contributors
 to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for
 individual credits, as there are too many to list here.

 GETTING STARTED

 In the release package, the examples directory contains several working
 examples of the core
 functionality available in Mahout. These can be run via scripts in the
 examples/bin
 directory and will prompt you for more information to help you try things
 out.
 Most examples do not need a Hadoop cluster in order to run.

 RELEASE HIGHLIGHTS

 The highlights of the Apache Mahout 0.9 release include, but are not
 limited to the list below. For further information, see the included
 CHANGELOG[1] file.

 -  MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra.
See
 http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html
 -  MAHOUT-1288: Recommenders as a Search.  See
 https://github.com/pferrel/solr-recommender
 -  MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1
 -  MAHOUT-1361: Online Algorithm for computing accurate Quantiles using
 1-dimensional Clustering
   See
 https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf
  for the details.
 -  MAHOUT-1265: MultiLayer Perceptron (MLP) classifier
This is an early implementation of MLP to solicit user feedback, needs
 to be integrated into Mahout's processing pipeline to work with Mahout's
 vectors.

 - Removed Deprecated algorithms as they have been either replaced by
 better performing algorithms or lacked user support and maintenance.

 - the usual bug fixes. See [2] for more information on the 0.9 release.

 A total of 113 separate JIRA issues were addressed in this release.

 The following algorithms that were marked deprecated in 0.8 have been
 removed in 0.9:

 - From Clustering:
Switched LDA implementation from using Dirtichlet to Collapsed
 Variational Bayes (CVB)

   Meanshift

   MinHash - removed due to poor performance,  lack of support and lack of
 usage

 - From Classification (both are sequential implementations)

   Winnow - lack of actual usage and support

   Perceptron - lack of actual usage and support

 - Collaborative Filtering
 SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone
 and org.apache.mahout.cf.taste.impl.recommender.slopeone
 Distributed pseudo recommender in
 org.apache.mahout.cf.taste.hadoop.pseudo
 TreeClusteringRecommender in
 org.apache.mahout.cf.taste.impl.recommender

 - Mahout Math
 Hadoop entropy stuff in org.apache.mahout.math.stats.entropy

 CONTRIBUTING

 Mahout is always looking for contributions focused on the 3Cs. If you are
 interested in contributing, please see our contribution page
 http://mahout.apache.org/developers/how-to-contribute.html or contact us
 via email at dev@mahout.apache.org.

 As the project moves towards a 1.0 release, the community will be focused
 on key algorithms that are proven to scale in production and have seen
 wide-spread adoption.

 [1]
 http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markup&pathrev=1563661
 [2]
 https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22








 On Monday, December 23, 2013 7:41 PM, Dmitriy Lyubimov dlie...@gmail.com
 wrote:

 On Sun, Dec 22, 2013

Re: Mahout 0.9 Release Notes - First Draft

2014-02-11 Thread Suneel Marthi
Here's a draft of the Release Notes for Mahout 0.9. Please review it.

--


The Apache Mahout PMC is pleased to announce the release of Mahout 0.9.
Mahout's goal is to build scalable machine learning libraries focused
primarily in the areas of collaborative filtering (recommenders),
clustering and classification (known collectively as the 3Cs), as well as the
necessary infrastructure to support those implementations including, but
not limited to, math packages for statistics, linear algebra and others
as well as Java primitive collections, local and distributed vector and
matrix classes and a variety of integrative code to work with popular
packages like Apache Hadoop, Apache Lucene, Apache HBase, Apache
Cassandra and much more. The 0.9 release is mainly a clean up release in
preparation for an upcoming 1.0 release targeted for first half of 2014, but 
there are a few
significant new features, which are highlighted below.

To get started with Apache Mahout 0.9, download the release artifacts and 
signatures at http://www.apache.org/dyn/closer.cgi/mahout or visit the central 
Maven repository.

As with any release, we wish to thank all of the users and contributors
to Mahout. Please see the CHANGELOG [1] and JIRA Release Notes [2] for
individual credits, as there are too many to list here.

GETTING STARTED

In the release package, the examples directory contains several working 
examples of the core
functionality available in Mahout. These can be run via scripts in the 
examples/bin
directory and will prompt you for more information to help you try things out. 
Most examples do not need a Hadoop cluster in order to run.

RELEASE HIGHLIGHTS

The highlights of the Apache Mahout 0.9 release include, but are not
limited to the list below. For further information, see the included 
CHANGELOG[1] file.

-  MAHOUT-1297: Scala DSL Bindings for Mahout Math Linear Algebra.
   See 
http://weatheringthrutechdays.blogspot.com/2013/07/scala-dsl-for-mahout-in-core-linear.html
-  MAHOUT-1288: Recommenders as a Search.  See 
https://github.com/pferrel/solr-recommender
-  MAHOUT-1364: Upgrade Mahout to Lucene 4.6.1
-  MAHOUT-1361: Online Algorithm for computing accurate Quantiles using 
1-dimensional Clustering
  See 
https://github.com/tdunning/t-digest/blob/master/docs/theory/t-digest-paper/histo.pdf
 for the details.
-  MAHOUT-1265: MultiLayer Perceptron (MLP) classifier 
   This is an early implementation of MLP to solicit user feedback, needs to be 
integrated into Mahout’s processing pipeline to work with Mahout’s vectors.

- Removed Deprecated algorithms as they have been either replaced by better 
performing algorithms or lacked user support and maintenance.

- the usual bug fixes. See [2] for more information on the 0.9 release.

A total of 113 separate JIRA issues were addressed in this release.

The following algorithms that were marked deprecated in 0.8 have been removed 
in 0.9:

- From Clustering:
    Switched LDA implementation from using Dirichlet to Collapsed Variational
    Bayes (CVB)
    Meanshift
    MinHash - removed due to poor performance, lack of support and lack of usage

- From Classification (both are sequential implementations):
    Winnow - lack of actual usage and support
    Perceptron - lack of actual usage and support

- From Collaborative Filtering:
    SlopeOne implementations in org.apache.mahout.cf.taste.hadoop.slopeone and
    org.apache.mahout.cf.taste.impl.recommender.slopeone
    Distributed pseudo recommender in org.apache.mahout.cf.taste.hadoop.pseudo
    TreeClusteringRecommender in org.apache.mahout.cf.taste.impl.recommender

- From Mahout Math:
    Hadoop entropy stuff in org.apache.mahout.math.stats.entropy

CONTRIBUTING

Mahout is always looking for contributions focused on the 3Cs. If you are
interested in contributing, please see our contribution page 
http://mahout.apache.org/developers/how-to-contribute.html or contact us via 
email at dev@mahout.apache.org.

As the project moves towards a 1.0 release, the community will be focused on
key algorithms that are proven to scale in production and have seen widespread
adoption.

[1] http://svn.apache.org/viewvc/mahout/trunk/CHANGELOG?view=markup&pathrev=1563661
[2] https://issues.apache.org/jira/browse/MAHOUT-1411?jql=project%20%3D%20MAHOUT%20AND%20fixVersion%20%3D%20%220.9%22








On Monday, December 23, 2013 7:41 PM, Dmitriy Lyubimov dlie...@gmail.com 
wrote:
 
On Sun, Dec 22, 2013 at 11:21 AM, Sebastian Schelter 

ssc.o...@googlemail.com wrote:


 
  - Mahout Math
      Lanczos in favour of SSVD

 IIRC, we agreed to not remove Lanczos, although it was initially
 deprecated. We should undeprecate it.


Some folks like Lanczos in Mahout (for reasons not really clear to me,
aside from accuracy when computing svd of a random noise, there are
actually 0 reasons to use Lanczos instead). I agree we don't  necessarily
want to cull it out -- but IMO there should be a clear steer posted in
favor of SSVD

Re: Mahout 0.9 Release

2014-02-02 Thread Suneel Marthi
Mahout 0.9 has been pushed to the mirrors and is available for download at 
http://www.apache.org/dyn/closer.cgi/mahout/




On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com 
wrote:
 
The release has passed with the required votes from PMC, will be pushing 0.9 to 
the mirrors and updating the release notes over the next day or two.




On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote:

+1



On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote:

 LGTM


 On 1/29/14, 4:27 PM, peng wrote:

 +1, can't see a bad side.

 On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:

 +1 from me





 On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
 s...@apache.org wrote:

 +1


 On 01/29/2014 05:25 AM, Andrew Musselman wrote:

 Looks good.

 +1


 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com
 wrote:

  a), b), c), d) all passed here.

 CosineDistance of clustered points from cluster-reuters.sh -1 kmeans
 were
 within the range [0,1].

  Date: Tue, 28 Jan 2014 16:45:42 -0800
 From: suneel_mar...@yahoo.com
 Subject: Mahout 0.9 Release
 To: u...@mahout.apache.org; dev@mahout.apache.org

 Fixed the issues that were reported with Clustering code this past
 week,

 upgraded codebase to Lucene 4.6.1 that was released today.


 Here's the URL for the 0.9 release in staging:-

  https://repository.apache.org/content/repositories/
 orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/


 The artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc

 Please:-
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run

 through all the different options in each script.


 Need a minimum of 3 '+1' votes from PMC for the release to be
 finalized.







Re: Mahout 0.9 Release

2014-02-02 Thread Andrew Musselman
Yes, thanks Suneel!

 On Feb 2, 2014, at 1:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:
 
 Mahout 0.9 has been pushed to the mirrors and is available for download at 
 http://www.apache.org/dyn/closer.cgi/mahout/
 
 
 
 
 On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com 
 wrote:
 
 The release has passed with the required votes from PMC, will be pushing 0.9 
 to the mirrors and updating the release notes over the next day or two.
 
 
 
 
 On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote:
 
 +1
 
 
 
 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote:
 
 LGTM
 
 
 On 1/29/14, 4:27 PM, peng wrote:
 
 +1, can't see a bad side.
 
 On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:
 
 +1 from me
 
 
 
 
 
 On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
 s...@apache.org wrote:
 
 +1
 
 
 On 01/29/2014 05:25 AM, Andrew Musselman wrote:
 
 Looks good.
 
 +1
 
 
 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com
 wrote:
 
   a), b), c), d) all passed here.
 
 CosineDistance of clustered points from cluster-reuters.sh -1 kmeans
 were
 within the range [0,1].
 
   Date: Tue, 28 Jan 2014 16:45:42 -0800
 From: suneel_mar...@yahoo.com
 Subject: Mahout 0.9 Release
 To: u...@mahout.apache.org; dev@mahout.apache.org
 
 Fixed the issues that were reported with Clustering code this past
 week,
 upgraded codebase to Lucene 4.6.1 that was released today.
 
 
 Here's the URL for the 0.9 release in staging:-
 
   https://repository.apache.org/content/repositories/
 orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/
 
 
 The artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc
 
 Please:-
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
 through all the different options in each script.
 
 
 Need a minimum of 3 '+1' votes from PMC for the release to be
 finalized.


Re: Mahout 0.9 Release

2014-02-02 Thread Ted Dunning
I just checked and the release has propagated to French mirrors.

On Sun, Feb 2, 2014 at 1:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 Mahout 0.9 has been pushed to the mirrors and is available for download at
 http://www.apache.org/dyn/closer.cgi/mahout/




 On Friday, January 31, 2014 11:21 PM, Suneel Marthi 
 suneel_mar...@yahoo.com wrote:

 The release has passed with the required votes from PMC, will be pushing
 0.9 to the mirrors and updating the release notes over the next day or two.



Re: Mahout 0.9 Release

2014-01-31 Thread Suneel Marthi
The release has passed with the required votes from the PMC. I will be pushing
0.9 to the mirrors and updating the release notes over the next day or two.




On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote:
 
+1



On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote:

 LGTM


 On 1/29/14, 4:27 PM, peng wrote:

 +1, can't see a bad side.

 On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:

 +1 from me





 On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
 s...@apache.org wrote:

 +1


 On 01/29/2014 05:25 AM, Andrew Musselman wrote:

 Looks good.

 +1


 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com
 wrote:

  a), b), c), d) all passed here.

 CosineDistance of clustered points from cluster-reuters.sh -1 kmeans
 were
 within the range [0,1].

  Date: Tue, 28 Jan 2014 16:45:42 -0800
 From: suneel_mar...@yahoo.com
 Subject: Mahout 0.9 Release
 To: u...@mahout.apache.org; dev@mahout.apache.org

 Fixed the issues that were reported with Clustering code this past
 week,

 upgraded codebase to Lucene 4.6.1 that was released today.


 Here's the URL for the 0.9 release in staging:-

  https://repository.apache.org/content/repositories/
 orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/


 The artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc

 Please:-
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run

 through all the different options in each script.


 Need a minimum of 3 '+1' votes from PMC for the release to be
 finalized.







Re: Mahout 0.9 Release

2014-01-30 Thread Sergey Svinarchuk
+1, all steps passed


On Thu, Jan 30, 2014 at 9:16 AM, Stevo Slavić ssla...@gmail.com wrote:

 +1


 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote:

  LGTM
 
 
  On 1/29/14, 4:27 PM, peng wrote:
 
  +1, can't see a bad side.
 
  On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:
 
  +1 from me
 
 
 
 
 
  On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
  s...@apache.org wrote:
 
  +1
 
 
  On 01/29/2014 05:25 AM, Andrew Musselman wrote:
 
  Looks good.
 
  +1
 
 
  On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com
  wrote:
 
   a), b), c), d) all passed here.
 
  CosineDistance of clustered points from cluster-reuters.sh -1 kmeans
  were
  within the range [0,1].
 
   Date: Tue, 28 Jan 2014 16:45:42 -0800
  From: suneel_mar...@yahoo.com
  Subject: Mahout 0.9 Release
  To: u...@mahout.apache.org; dev@mahout.apache.org
 
  Fixed the issues that were reported with Clustering code this past
  week,
 
  upgraded codebase to Lucene 4.6.1 that was released today.
 
 
  Here's the URL for the 0.9 release in staging:-
 
   https://repository.apache.org/content/repositories/
  orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/
 
 
  The artifacts have been signed with the following key:
  https://people.apache.org/keys/committer/smarthi.asc
 
  Please:-
  a) Verify that u can unpack the release (tar or zip)
  b) Verify u r able to compile the distro
  c)  Run through the unit tests: mvn clean test
  d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please
 run
 
  through all the different options in each script.
 
 
  Need a minimum of 3 '+1' votes from PMC for the release to be
  finalized.
 
 
 
 
 
 


-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.


Re: Mahout 0.9 Release

2014-01-30 Thread Suneel Marthi
Voting for this closes tomorrow; thus far there's been only a single +1 from 
the PMC. I'll go ahead and release the artifacts tomorrow regardless of whether 
we succeed/fail to muster the minimum 3 +1 votes.

Thanks
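
For reference, the "minimum 3 +1 votes" rule mentioned above can be sketched in code. Under the Apache release policy a release vote passes when at least three PMC members vote +1 and the binding +1 votes outnumber the binding -1 votes. The Python sketch below only illustrates that rule; the names `Vote` and `release_vote_passes` and the sample voters are hypothetical and do not belong to any Apache tooling.

```python
from typing import Iterable, NamedTuple


class Vote(NamedTuple):
    voter: str     # hypothetical label for the person voting
    value: int     # +1, 0 or -1
    binding: bool  # True when the vote was cast by a PMC member


def release_vote_passes(votes: Iterable[Vote], minimum_binding: int = 3) -> bool:
    """Return True if the binding votes reach majority approval."""
    # Only PMC (binding) votes count towards the result.
    binding = [v for v in votes if v.binding]
    plus = sum(1 for v in binding if v.value > 0)
    minus = sum(1 for v in binding if v.value < 0)
    # At least `minimum_binding` +1 votes, and more +1 than -1.
    return plus >= minimum_binding and plus > minus


# Two binding +1 votes and one non-binding +1: not enough to pass.
sample = [
    Vote("pmc-a", +1, True),
    Vote("pmc-b", +1, True),
    Vote("contributor-c", +1, False),
]
print(release_vote_passes(sample))  # False
```

Non-binding +1 votes from contributors are still useful as testing feedback, but in this sketch they do not change the outcome.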






On Thursday, January 30, 2014 9:45 AM, Sergey Svinarchuk 
ssvinarc...@hortonworks.com wrote:
 
+1, all steps passed


On Thu, Jan 30, 2014 at 9:16 AM, Stevo Slavić ssla...@gmail.com wrote:

 +1


 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote:

  LGTM
 
 
  On 1/29/14, 4:27 PM, peng wrote:
 
  +1, can't see a bad side.
 
  On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:
 
  +1 from me
 
 
 
 
 
  On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
  s...@apache.org wrote:
 
  +1
 
 
  On 01/29/2014 05:25 AM, Andrew Musselman wrote:
 
  Looks good.
 
  +1
 
 
  On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com
  wrote:
 
   a), b), c), d) all passed here.
 
  CosineDistance of clustered points from cluster-reuters.sh -1 kmeans
  were
  within the range [0,1].
 
   Date: Tue, 28 Jan 2014 16:45:42 -0800
  From: suneel_mar...@yahoo.com
  Subject: Mahout 0.9 Release
  To: u...@mahout.apache.org; dev@mahout.apache.org
 
  Fixed the issues that were reported with Clustering code this past
  week,
 
  upgraded codebase to Lucene 4.6.1 that was released today.
 
 
  Here's the URL for the 0.9 release in staging:-
 
   https://repository.apache.org/content/repositories/
  orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/
 
 
  The artifacts have been signed with the following key:
  https://people.apache.org/keys/committer/smarthi.asc
 
  Please:-
  a) Verify that u can unpack the release (tar or zip)
  b) Verify u r able to compile the distro
  c)  Run through the unit tests: mvn clean test
  d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please
 run
 
  through all the different options in each script.
 
 
  Need a minimum of 3 '+1' votes from PMC for the release to be
  finalized.
 
 
 
 
 
 



Re: Mahout 0.9 Release

2014-01-29 Thread Sebastian Schelter

+1

On 01/29/2014 05:25 AM, Andrew Musselman wrote:

Looks good.

+1


On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote:


a), b), c), d) all passed here.

CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were
within the range [0,1].


Date: Tue, 28 Jan 2014 16:45:42 -0800
From: suneel_mar...@yahoo.com
Subject: Mahout 0.9 Release
To: u...@mahout.apache.org; dev@mahout.apache.org

Fixed the issues that were reported with Clustering code this past week,

upgraded codebase to Lucene 4.6.1 that was released today.


Here's the URL for the 0.9 release in staging:-


https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/


The artifacts have been signed with the following key:
https://people.apache.org/keys/committer/smarthi.asc

Please:-
a) Verify that u can unpack the release (tar or zip)
b) Verify u r able to compile the distro
c)  Run through the unit tests: mvn clean test
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run

through all the different options in each script.


Need a minimum of 3 '+1' votes from PMC for the release to be finalized.









Re: Mahout 0.9 Release

2014-01-29 Thread Suneel Marthi
+1 from me 





On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org 
wrote:
 
+1


On 01/29/2014 05:25 AM, Andrew Musselman wrote:
 Looks good.

 +1


 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote:

 a), b), c), d) all passed here.

 CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were
 within the range [0,1].

 Date: Tue, 28 Jan 2014 16:45:42 -0800
 From: suneel_mar...@yahoo.com
 Subject: Mahout 0.9 Release
 To: u...@mahout.apache.org; dev@mahout.apache.org

 Fixed the issues that were reported with Clustering code this past week,
 upgraded codebase to Lucene 4.6.1 that was released today.

 Here's the URL for the 0.9 release in staging:-

 https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/

 The artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc

 Please:-
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
 through all the different options in each script.

 Need a minimum of 3 '+1' votes from PMC for the release to be finalized.




Re: Mahout 0.9 Release

2014-01-29 Thread peng

+1, can't see a bad side.

On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:

+1 from me





On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org 
wrote:

+1


On 01/29/2014 05:25 AM, Andrew Musselman wrote:

Looks good.

+1


On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote:


a), b), c), d) all passed here.

CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were
within the range [0,1].


Date: Tue, 28 Jan 2014 16:45:42 -0800
From: suneel_mar...@yahoo.com
Subject: Mahout 0.9 Release
To: u...@mahout.apache.org; dev@mahout.apache.org

Fixed the issues that were reported with Clustering code this past week,

upgraded codebase to Lucene 4.6.1 that was released today.


Here's the URL for the 0.9 release in staging:-


https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/


The artifacts have been signed with the following key:
https://people.apache.org/keys/committer/smarthi.asc

Please:-
a) Verify that u can unpack the release (tar or zip)
b) Verify u r able to compile the distro
c)  Run through the unit tests: mvn clean test
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run

through all the different options in each script.


Need a minimum of 3 '+1' votes from PMC for the release to be finalized.







Re: Mahout 0.9 Release

2014-01-29 Thread Shannon Quinn

LGTM

On 1/29/14, 4:27 PM, peng wrote:

+1, can't see a bad side.

On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:

+1 from me





On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
s...@apache.org wrote:


+1


On 01/29/2014 05:25 AM, Andrew Musselman wrote:

Looks good.

+1


On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com 
wrote:



a), b), c), d) all passed here.

CosineDistance of clustered points from cluster-reuters.sh -1 
kmeans were

within the range [0,1].


Date: Tue, 28 Jan 2014 16:45:42 -0800
From: suneel_mar...@yahoo.com
Subject: Mahout 0.9 Release
To: u...@mahout.apache.org; dev@mahout.apache.org

Fixed the issues that were reported with Clustering code this past 
week,

upgraded codebase to Lucene 4.6.1 that was released today.


Here's the URL for the 0.9 release in staging:-

https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ 



The artifacts have been signed with the following key:
https://people.apache.org/keys/committer/smarthi.asc

Please:-
a) Verify that u can unpack the release (tar or zip)
b) Verify u r able to compile the distro
c)  Run through the unit tests: mvn clean test
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please 
run

through all the different options in each script.


Need a minimum of 3 '+1' votes from PMC for the release to be 
finalized.









Re: Mahout 0.9 Release

2014-01-29 Thread Stevo Slavić
+1


On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote:

 LGTM


 On 1/29/14, 4:27 PM, peng wrote:

 +1, can't see a bad side.

 On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:

 +1 from me





 On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
 s...@apache.org wrote:

 +1


 On 01/29/2014 05:25 AM, Andrew Musselman wrote:

 Looks good.

 +1


 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com
 wrote:

  a), b), c), d) all passed here.

 CosineDistance of clustered points from cluster-reuters.sh -1 kmeans
 were
 within the range [0,1].

  Date: Tue, 28 Jan 2014 16:45:42 -0800
 From: suneel_mar...@yahoo.com
 Subject: Mahout 0.9 Release
 To: u...@mahout.apache.org; dev@mahout.apache.org

 Fixed the issues that were reported with Clustering code this past
 week,

 upgraded codebase to Lucene 4.6.1 that was released today.


 Here's the URL for the 0.9 release in staging:-

  https://repository.apache.org/content/repositories/
 orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/


 The artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc

 Please:-
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run

 through all the different options in each script.


 Need a minimum of 3 '+1' votes from PMC for the release to be
 finalized.








Mahout 0.9 Release

2014-01-28 Thread Suneel Marthi
Fixed the issues that were reported with the Clustering code this past week, 
and upgraded the codebase to Lucene 4.6.1, which was released today.

Here's the URL for the 0.9 release in staging:-
https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/

The artifacts have been signed with the following key:
https://people.apache.org/keys/committer/smarthi.asc

Please:-
a) Verify that you can unpack the release (tar or zip)
b) Verify that you are able to compile the distro
c) Run through the unit tests: mvn clean test
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through 
all the different options in each script.

Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
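
Step a) above can be scripted. The Python sketch below uses only the standard library to unpack a tar or zip archive into a directory and return the names of the entries it contains. The function name `unpack_release` is an illustrative assumption, and the sketch should only be pointed at archives you trust, since it extracts every entry as-is.

```python
import tarfile
import zipfile
from pathlib import Path
from typing import List


def unpack_release(archive: str, dest: str) -> List[str]:
    """Unpack a tar or zip archive into dest and return the member names."""
    dest_path = Path(dest)
    dest_path.mkdir(parents=True, exist_ok=True)
    # Try zip first, then any tar flavour (plain, gzip, bzip2, ...).
    if zipfile.is_zipfile(archive):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest_path)
            return zf.namelist()
    if tarfile.is_tarfile(archive):
        with tarfile.open(archive) as tf:
            names = tf.getnames()
            tf.extractall(dest_path)
            return names
    raise ValueError(f"not a tar or zip archive: {archive}")
```

A call such as `unpack_release("release.tar.gz", "unpacked")` (both names are placeholders) raises an exception if the archive is unreadable, which is the failure step a) is meant to catch. Step c) is then the `mvn clean test` command quoted above, run inside the unpacked source tree.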

RE: Mahout 0.9 Release

2014-01-28 Thread Andrew Palumbo
a), b), c), d) all passed here. 

CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were 
within the range [0,1].
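
As background on the [0,1] range: cosine distance is commonly defined as 1 minus the cosine similarity. For vectors whose components are all non-negative (term-weight document vectors, for example) it therefore lies between 0 and 1, while vectors with mixed signs can give values up to 2. The Python sketch below illustrates that definition only; it is not Mahout's implementation, and the name `cosine_distance` is hypothetical.

```python
import math
from typing import Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cos(angle between a and b) for two non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_a * norm_b)


# Orthogonal non-negative vectors sit at the top of the [0, 1] range.
print(cosine_distance([1.0, 0.0], [0.0, 1.0]))
# Parallel vectors sit at the bottom (up to floating-point rounding).
print(cosine_distance([1.0, 2.0], [2.0, 4.0]))
# Opposite vectors show why mixed-sign data can leave [0, 1].
print(cosine_distance([1.0, 0.0], [-1.0, 0.0]))
```

Because of floating-point rounding, a range check on real output is safer with a small tolerance than with exact comparisons against 0 and 1.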

 Date: Tue, 28 Jan 2014 16:45:42 -0800
 From: suneel_mar...@yahoo.com
 Subject: Mahout 0.9 Release
 To: u...@mahout.apache.org; dev@mahout.apache.org
 
 Fixed the issues that were reported with Clustering code this past week, 
 upgraded codebase to Lucene 4.6.1 that was released today.
 
 Here's the URL for the 0.9 release in staging:-
 https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/
 
 The artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc
 
 Please:-
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run 
 through all the different options in each script.
 
 Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
  

Re: Mahout 0.9 Release

2014-01-28 Thread Andrew Musselman
Looks good.

+1


On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote:

 a), b), c), d) all passed here.

 CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were
 within the range [0,1].

  Date: Tue, 28 Jan 2014 16:45:42 -0800
  From: suneel_mar...@yahoo.com
  Subject: Mahout 0.9 Release
  To: u...@mahout.apache.org; dev@mahout.apache.org
 
  Fixed the issues that were reported with Clustering code this past week,
 upgraded codebase to Lucene 4.6.1 that was released today.
 
  Here's the URL for the 0.9 release in staging:-
 
 https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/
 
  The artifacts have been signed with the following key:
  https://people.apache.org/keys/committer/smarthi.asc
 
  Please:-
  a) Verify that u can unpack the release (tar or zip)
  b) Verify u r able to compile the distro
  c)  Run through the unit tests: mvn clean test
  d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
 through all the different options in each script.
 
  Need a minimum of 3 '+1' votes from PMC for the release to be finalized.




Re: MAHOUT 0.9 Release - New URL

2014-01-25 Thread Suneel Marthi
Today I'll be rolling back the 0.9 Release that's presently in staging, in 
light of the issues reported in the last 2 days that need to be fixed as part 
of the Release.

Please hold off from committing any new code to trunk meanwhile.

Thanks.





On Friday, January 24, 2014 7:36 PM, Ted Dunning ted.dunn...@gmail.com wrote:
 


My schedule has opened up a bit and I can review as well.





On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com 
wrote:

I will try the next candidate again, so one vote is sure.
On 24.01.2014 at 23:54, Suneel Marthi suneel_mar...@yahoo.com wrote:


 I am open to having the conversation (and a part of me feels that the
 clusteringId fix should be in 0.9).

 If we decide to incorporate that into 0.9, I need to rollback the 0.9
 Release that's presently out there in staging (for the 5th time in a row
 now).
 I am fine with doing that.

 What do you think we should do?

 a) Go ahead with 0.9 release without the fix for M-1410 .
 b) Rollback 0.9 and include the fix for M-1410
 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes
 M-1410 and any other issues/enhancements that are fixed.


 I am leaning towards (b); my only concern is that, from my experience in
 the past few weeks, it's become really hard to muster the minimum 3 +1 PMC
 votes required for a release to pass.








 On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com
 wrote:



 Can we hold a separate discussion about whether the clustering id issue
 has to be in 0.9 while extending the vote deadline if necessary?

 If not, then all these votes are great and the release can go forward.

 If it is the sense that that fix has to be in, we should leave time for
 people to reverse their votes to -1.




 On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com
 wrote:

 Thanks for all those that volunteered.  The voting for 0.9 Release closes
 tomorrow.
 
 
 
 
 
 
 
 
 On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com
 wrote:
 
 Using CentOS 6.5 and hadoop 1.2.1, all passed.
 
 +1 from me
 
 Gokhan
 
 
 
 On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com
 wrote:
 
  a),b),c),d) all passed on CentOS for me
 
   Date: Thu, 23 Jan 2014 13:43:06 +0200
   Subject: Re: MAHOUT 0.9 Release - New URL
   From: ssvinarc...@hortonworks.com
   To: dev@mahout.apache.org
  
   I did a), b), c), d) and all steps pass.
   +1
  
  
   On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org
  wrote:
  
+1 from me.
   
On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com
 
wrote:
   
 Fixed the issues that were reported this week and restored FP
 mining
into the codebase.

 Here's the URL for the final release in staging:-

   
 
 https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/

 The artifacts have been signed with the
  following key:
 https://people.apache.org/keys/committer/smarthi.asc


 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under
  $MAHOUT_HOME/examples/bin. Please
  run
through all the different options in each script.

 Committers and PMC, need a minimum of 3 '+1' votes for the release
  to be
finalized.
   

Grant Ingersoll | @gsingers
http://www.lucidworks.com
   
   
   
   
   
   
  
 
 


Re: MAHOUT 0.9 Release - New URL

2014-01-25 Thread Suneel Marthi
Rolled back trunk to 0.9-SNAPSHOT, please go ahead and commit any changes.




On Saturday, January 25, 2014 4:19 AM, Suneel Marthi suneel_mar...@yahoo.com 
wrote:
 
I'll be rolling back the 0.9 Release today that's presently in staging in light 
of the issues that have been reported in the last 2 days and need to be fixed 
as part of the Release.

Please hold off from committing any new code to trunk meanwhile.

Thanks.






On Friday, January 24, 2014 7:36 PM, Ted Dunning ted.dunn...@gmail.com wrote:



My schedule has opened up a bit and I can review as well.





On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com 
wrote:

I will try the next candidate again, so one vote is sure.
On 24.01.2014 at 23:54, Suneel Marthi suneel_mar...@yahoo.com wrote:


 I am open to having the conversation (and a part of me feels that the
 clusteringId fix should be in 0.9).

 If we decide to incorporate that into 0.9, I need to rollback the 0.9
 Release that's presently out there in staging (for the 5th time in a row
 now).
 I am fine with doing that.

 What do you think we should do?

 a) Go ahead with 0.9 release without the fix for M-1410 .
 b) Rollback 0.9 and include the fix for M-1410
 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes
 M-1410 and any other issues/enhancements that are fixed.


 I am leaning towards (b); my only concern is that, from my experience in
 the past few weeks, it's become really hard to muster the minimum 3 +1 PMC
 votes required for a release to pass.








 On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com
 wrote:



 Can we hold a separate discussion about whether the clustering id issue
 has to be in 0.9 while extending the vote deadline if necessary?

 If not, then all these votes are great and the release can go forward.

 If it is the sense that that fix has to be in, we should leave time for
 people to reverse their votes to -1.




 On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com
 wrote:

 Thanks for all those that volunteered.  The voting for 0.9 Release closes
 tomorrow.
 
 
 
 
 
 
 
 
 On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com
 wrote:
 
 Using CentOS 6.5 and hadoop 1.2.1, all passed.
 
 +1 from me
 
 Gokhan
 
 
 
 On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com
 wrote:
 
  a),b),c),d) all passed on CentOS for me
 
   Date: Thu, 23 Jan 2014 13:43:06 +0200
   Subject: Re: MAHOUT 0.9 Release - New URL
   From: ssvinarc...@hortonworks.com
   To: dev@mahout.apache.org
  
   I did a), b), c), d) and all steps pass.
   +1
  
  
   On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org
  wrote:
  
+1 from me.
   
On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com
 
wrote:
   
 Fixed the issues that were reported this week and restored FP
 mining
into the codebase.

 Here's the URL for the final release in staging:-

   
 
 https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/

 The artifacts have been signed with the
  following key:
 https://people.apache.org/keys/committer/smarthi.asc


 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under
  $MAHOUT_HOME/examples/bin. Please
  run
through all the different options in each script.

 Committers and PMC, need a minimum of 3 '+1' votes for the release
  to be
finalized.
   

Grant Ingersoll | @gsingers
http://www.lucidworks.com
   
   
   
   
   
   
  
 
 


Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Gokhan Capan
Using CentOS 6.5 and hadoop 1.2.1, all passed.

+1 from me

Gokhan






Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Suneel Marthi
Thanks to all those who volunteered. The voting for the 0.9 release closes
tomorrow.










Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Ted Dunning
Can we hold a separate discussion about whether the clustering id issue has
to be in 0.9 while extending the vote deadline if necessary?

If not, then all these votes are great and the release can go forward.

If the sense is that the fix has to be in, we should leave time for
people to reverse their votes to -1.






Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Suneel Marthi
I am open to having the conversation (and a part of me feels that the
clusteringId fix should be in 0.9).

If we decide to incorporate that into 0.9, I need to roll back the 0.9 release
that's presently out there in staging (for the 5th time in a row now).
I am fine with doing that.

What do you think we should do?

a) Go ahead with the 0.9 release without the fix for M-1410.
b) Roll back 0.9 and include the fix for M-1410.
c) Go ahead with 0.9, then have an interim 1.0 Release Candidate that includes
M-1410 and any other issues/enhancements that are fixed.


I am leaning towards (b); my only concern is that, from my experience in the
past few weeks, it's become really hard to muster the minimum 3 +1 PMC votes
required for a release to pass.











Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Sebastian Schelter
I will try the next candidate again, so one vote is certain.


Re: MAHOUT 0.9 Release - New URL

2014-01-24 Thread Ted Dunning
My schedule has opened up a bit and I can review as well.







Re: MAHOUT 0.9 Release - New URL

2014-01-23 Thread Grant Ingersoll
+1 from me.



Grant Ingersoll | @gsingers
http://www.lucidworks.com







Re: MAHOUT 0.9 Release - New URL

2014-01-23 Thread Sergey Svinarchuk
I did a), b), c), d) and all steps pass.
+1









-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.


RE: MAHOUT 0.9 Release - New URL

2014-01-23 Thread Andrew Palumbo
a),b),c),d) all passed on CentOS for me

  

Re: MAHOUT 0.9 Release - New URL

2014-01-22 Thread Suneel Marthi
Fixed the issues that were reported this week and restored FP mining into the 
codebase.

Here's the URL for the final release in staging:-
https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/

The artifacts have been signed with the following key:
https://people.apache.org/keys/committer/smarthi.asc


a) Verify that you can unpack the release (tar or zip).
b) Verify that you are able to compile the distro.
c) Run through the unit tests: mvn clean test
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through
all the different options in each script.

Committers and PMC members: we need a minimum of 3 '+1' votes for the release
to be finalized.
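
For anyone keeping count in a thread like this one, the "minimum of 3 '+1' votes" rule can be sketched as a small tally. This is only an illustration: the `Vote` type, the voter names, and the `binding` flag are hypothetical, and the extra condition that +1 votes must outnumber -1 votes is an assumption that the message above does not state.

```python
from dataclasses import dataclass

# Threshold taken from the message above: "a minimum of 3 '+1' votes".
MIN_PLUS_ONES = 3


@dataclass(frozen=True)
class Vote:
    voter: str     # placeholder name, not a real list member
    value: int     # +1, 0 or -1
    binding: bool  # True if cast by a PMC member (assumed to be known)


def release_passes(votes):
    """Return True if the binding votes satisfy the assumed release rule."""
    binding = [v for v in votes if v.binding]
    plus = sum(1 for v in binding if v.value > 0)
    minus = sum(1 for v in binding if v.value < 0)
    # Assumption: besides the 3-vote minimum, +1 votes must outnumber -1 votes.
    return plus >= MIN_PLUS_ONES and plus > minus


if __name__ == "__main__":
    sample = [
        Vote("pmc-a", +1, True),
        Vote("pmc-b", +1, True),
        Vote("pmc-c", +1, True),
        Vote("contributor-d", +1, False),  # counted as non-binding here
    ]
    print(release_passes(sample))
```

Non-binding votes are ignored by the tally, which is one way to read the "if my vote counts" remarks elsewhere in the thread.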

Re: MAHOUT 0.9 Release - New URL

2014-01-22 Thread Sebastian Schelter

I did a) b) c) and d) without noting any problem so far. +1 from me.

--sebastian








Re: MAHOUT 0.9 Release - New URL

2014-01-22 Thread Suneel Marthi
Same here. I did a), b), c) and d) too and all tests pass. Here's my +1, if my 
vote counts.







Re: MAHOUT 0.9 Release - New URL

2014-01-22 Thread Andrew Musselman
Likewise, a) through d) work on an Amazon AMI and Ubuntu 12.04.

+1





Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Suneel Marthi
] First: NaN
     Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info
     INFO: Program took 669 ms (Minutes: 0.01115)
     cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train






  Date: Thu, 16 Jan 2014 06:41:09 -0800
  From: suneel_mar...@yahoo.com
  Subject: MAHOUT 0.9 Release - New URL
  To: u...@mahout.apache.org; dev@mahout.apache.org

  Third time's a charm!!!

  Here's the new URL for the Mahout 0.9 release:
  https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/

  For those volunteering to test this, some of the things to be verified:

  a) Verify that you can unpack the release (tar or zip).
  b) Verify that you are able to compile the distro.
  c) Run through the unit tests: mvn clean test
  d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
  through all the different options in each script.

  Committers and PMC members:
  ---

  Need 'at least 3 +1 votes' for the release to pass.

  Thanks and regards.




Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Andrew Musselman
   'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen.
 
 
  ./classify-20newsgroups.sh -1 [works]
  ./classify-20newsgroups.sh -2 [works]
 
 
  cluster-reuters.sh -1 [works]
 
  cluster-reuters.sh -2 [works]
  cluster-reuters.sh -3 [works]
 
  Same error as noted previously in the thread:

  cluster-reuters.sh -4 [0 clusters]

  [...]

  WARNING: No qualcluster.props found on classpath, will use command-line arguments only
  Num clusters: 0; maxDistance: 0.00
  [Dunn Index] First: Infinity
  [Davies-Bouldin Index] First: NaN
  Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info
  INFO: Program took 669 ms (Minutes: 0.01115)
  cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train
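
A possible reading of the "Infinity" and "NaN" values in the output above, given "Num clusters: 0": with no clusters, a minimum over an empty set of inter-cluster distances divided by a zero maximum diameter gives Infinity, and an average over zero clusters gives 0/0 = NaN under IEEE 754 arithmetic (which Java doubles follow). The sketch below only illustrates that arithmetic. It is not Mahout's code; the function names and the simplified per-cluster scores are assumptions.

```python
import math


def ieee_div(a, b):
    """Divide like IEEE 754 doubles: x/0 is +/-inf and 0/0 is NaN.

    Python raises ZeroDivisionError for float division by zero, so the
    IEEE behaviour is reproduced by hand (assuming b is +0.0, not -0.0).
    """
    try:
        return a / b
    except ZeroDivisionError:
        if a == 0 or math.isnan(a):
            return math.nan
        return math.copysign(math.inf, a)


def dunn_index(inter_cluster_distances, intra_cluster_diameters):
    """Smallest distance between clusters over the largest cluster diameter."""
    # With no clusters both sequences are empty, so the defaults are used.
    min_inter = min(inter_cluster_distances, default=math.inf)
    max_intra = max(intra_cluster_diameters, default=0.0)
    return ieee_div(min_inter, max_intra)


def davies_bouldin_index(per_cluster_scores):
    """Mean of per-cluster similarity scores (the scores are simplified here)."""
    total = sum(per_cluster_scores, 0.0)
    return ieee_div(total, float(len(per_cluster_scores)))


if __name__ == "__main__":
    # Degenerate case matching "Num clusters: 0".
    print(dunn_index([], []))
    print(davies_bouldin_index([]))
```

If this reading is right, the two values are a symptom of the empty clustering rather than separate bugs in the index calculations.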
 
 
 
 
 
 
 
 
 



RE: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Andrew Palumbo
from the asf-email-examples.sh script:

# You will need to download or otherwise obtain some or all of the Amazon ASF Email Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script.
# To obtain a full copy you will need to launch an EC2 instance and mount the dataset to download it, otherwise you can get a sample of it at
# http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout

It looks like the link
 http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout
is down.

Is there somewhere else that we can get a subset of the ASF emails?



Date: Tue, 21 Jan 2014 09:48:06 -0800
 Subject: Re: MAHOUT 0.9 Release - New URL
 From: andrew.mussel...@gmail.com
 To: dev@mahout.apache.org
 
 Sure thing; continuing to smoke test the other examples tonight
 
 
 On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.com wrote:
 
  Thanks, Andrew M. I see that some of the example scripts need to be fixed,
  as they still refer to the deprecated algorithms.
  I also see that Streaming KMeans has failed for you as well.
 
  I'll be rolling back the release today to fix these issues.
 
 
 
 
 
  On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman 
  andrew.mussel...@gmail.com wrote:
 
  Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit
  Linux AMI from tarball.
 
  All tests pass.
 
  *Output of examples:*
  *asf-email-examples.sh, run on mahout.apache.org:*
  *recommendations:*
  [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat
  /user/ec2-user/asf-output/prefs/recommendations/part-r-0  | less
  1  [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0]
  4  [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0]
  6  [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0]
  8  [12758:1.0,19409:1.0,2:1.0]
  11  [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0]
  14  [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0]
  15  [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0]
  16  [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0]
  18  [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0]
  19  [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0]
  20  [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0]
  [snip]
 
  *clustering; kmeans:*
  [snip]
  Weight : [props - optional]:  Point:
  1.0 :
   [distance-squared=1.0193102046188427]:
  /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus =
  [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204,
  7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159,
  10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244,
  19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095,
  25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075,
  31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130,
  39789:0.110, 40743:0.190, 45775:0.086]
  1.0 : [distance-squared=0.9823018320457279]:
  /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus =
  [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106,
  6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072,
  10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177,
  19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092,
  25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072,
  30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073,
  36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083]
  1.0 : [distance-squared=0.9509142993214911]:
  /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor =
  [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048,
   4419:0.076,
  4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048,
  7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077,
  7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081,
  10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087,
  12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061,
  14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124,
  20346:0.159, 21233:0.075, 23657:0.089, 23939:0.078, 23974:0.105,
  23998:0.146, 24962:0.122, 25003:0.093, 25084:0.151, 25128:0.052,
  29143:0.095, 30459:0.046, 30806:0.075, 31559:0.046, 31727:0.104,
  31895:0.105, 31900:0.153, 32149:0.079, 32993:0.069, 33112:0.177,
  33208:0.101, 33351:0.089, 33533:0.079, 33638:0.042, 35795:0.066,
  36189:0.078, 36491

Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Suneel Marthi
Thanks Andrew for reporting that. I rolled back the release to fix this and a few
other issues.

We have removed asf-examples*.sh from trunk as the sample file at the URL
mentioned in your email is not available.
This is something we need to fix and restore in 1.0.







On Tuesday, January 21, 2014 3:21 PM, Andrew Palumbo ap@outlook.com wrote:
 
from the asf-email-examples.sh script:

# You will need to download or otherwise obtain some or all of the Amazon ASF Email Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script.
# To obtain a full copy you will need to launch an EC2 instance and mount the dataset to download it, otherwise you can get a sample of it at
# http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout

It looks like the link
http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout
is down.

Is there somewhere else that we can get a subset of the ASF emails?




Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Andrew Musselman
75%
Reliability70.6238%
Reliability (standard deviation)0.2187
Log-likelihoodmean  :-1.1182
  25%-ile   :-1.6911
  75%-ile   :-0.0803

Jan 21, 2014 9:46:39 PM org.slf4j.impl.JCLLoggerAdapter info
INFO: Program took 10783 ms (Minutes: 0.17971)




On Tue, Jan 21, 2014 at 1:08 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 Thanks Andrew for reporting that. I rolled back the release to fix this
 and few other issues.

 We have removed asf-examples*.sh from trunk as the sample file at the url
 mentioned in ur email is not available.
 This is something we need to fix and restore in 1.0.







 On Tuesday, January 21, 2014 3:21 PM, Andrew Palumbo ap@outlook.com
 wrote:

 from the asf-email-examples.sh script:

 # You will need to download or otherwise obtain some or all of the Amazon ASF Email Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script.
 # To obtain a full copy you will need to launch an EC2 instance and mount the dataset to download it, otherwise you can get a sample of it at
 # http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout

 It looks like the:
 http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout

 link is down.

 Is there somewhere else that we can get a subset of the ASF emails?




Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Suneel Marthi
Hmmm... that's an issue. Since both Dirichlet and Meanshift clustering have 
been removed from 0.9, cluster-syntheticcontrol.sh options 4,5 are not gonna 
work and should have been removed for 0.9.

To PMC,

 - Should we roll back the release, fix this issue (and other patches that were submitted
in the last few days) and put out another release?







On Monday, January 20, 2014 12:33 AM, Andrew Palumbo ap@outlook.com wrote:
 
I ran through the tests on a CentOS VM (AMD64, 2 cores, 4 GB RAM).  Had a bit
of trouble getting the Hadoop natives to compile and therefore may have run
into some problems because of the Hadoop setup.  Ran into some problems in the
example scripts, particularly with ./cluster-syntheticcontrol.sh -4,5.  I
will run through the rest of the examples when I'm sure I've got Hadoop set up
right.


Apache Maven 3.1.2-SNAPSHOT 
Java version: 1.6.0_45, vendor: Sun Microsystems Inc.
Java home: /usr/java/jdk1.6.0_45/jre
OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: 
unix
$MAHOUT_LOCAL=true
Hadoop 2.2.0


a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ]

b) Verify u r able to compile the distro

    mvn compile [passed with warnings]

    [WARNING]  Expected all dependencies to require Scala version: 2.9.3
    [WARNING]  org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3
    [WARNING]  org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2
    [WARNING] Multiple versions of scala libraries detected!

c)  Run through the unit tests: mvn clean test
    mvn clean test [passed]

d) Run the example scripts under $MAHOUT_HOME/examples/bin. 
Please run through all the different options in each script

    Running example scripts with $MAHOUT_LOCAL=true

    ./cluster-syntheticcontrol.sh -1 [works]
    ./cluster-syntheticcontrol.sh -2 [works]
    ./cluster-syntheticcontrol.sh -3 [works]


    ./cluster-syntheticcontrol.sh -4 [exits, throws exception]
    [...]
    WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job
    java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job
        at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:171)
        at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237)
        at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128)
    Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn


    ./cluster-syntheticcontrol.sh -5 [exits, throws exception]

    WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job
    java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job
        at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:171)
        at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237)
        at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128)
    Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn
    WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only
    Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen.
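
Both traces above end in Class.forName being called from MahoutDriver.addClass, so options 4 and 5 appear to fail simply because the Dirichlet and mean shift Job classes are no longer on the classpath. A minimal sketch of that kind of check, assuming nothing about Mahout's internals (the class JobClassCheck and the method isLoadable are hypothetical names, not part of Mahout):

```java
final class JobClassCheck {

    /** Returns true if the named class can be resolved on the current classpath. */
    static boolean isLoadable(String className) {
        try {
            // initialize=false: resolve the class only, do not run its static initializers
            Class.forName(className, false, JobClassCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Class names copied from the warnings in the report above.
        String[] jobs = {
            "org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job",
            "org.apache.mahout.clustering.syntheticcontrol.meanshift.Job"
        };
        for (String job : jobs) {
            System.out.println(job + " -> " + (isLoadable(job) ? "found" : "missing"));
        }
    }
}
```

Run against the 0.9 job jar on the classpath, a check like this would be one way to confirm which menu options in an example script still point at existing classes.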


    ./classify-20newsgroups.sh -1 [works]
    ./classify-20newsgroups.sh -2 [works]


    cluster-reuters.sh -1 [works]
    cluster-reuters.sh -2 [works]
    cluster-reuters.sh -3 [works]
    
    Same error as noted previously in the thread:

    cluster-reuters.sh -4 [0 clusters]

    [...]

    WARNING: No qualcluster.props found on classpath, will use command-line arguments only
    Num clusters: 0; maxDistance: 0.00
    [Dunn Index] First: Infinity
    [Davies-Bouldin Index] First: NaN
    Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info
    INFO: Program took 669 ms (Minutes: 0.01115)
    
cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train






 Date: Thu, 16 Jan 2014 06:41:09 -0800
 From: suneel_mar...@yahoo.com
 Subject: MAHOUT 0.9 Release - New URL 
 To: u

Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Suneel Marthi
This is an issue (a trivial one, though) that needs to be fixed for the 0.9 release.
I will be re-rolling the release today (in the next few hrs) and putting out a new
release candidate in staging.

Thanks for reporting this Andrew P. 






Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Andrew Musselman
: suneel_mar...@yahoo.com
  Subject: MAHOUT 0.9 Release - New URL
  To: u...@mahout.apache.org; dev@mahout.apache.org
 
  Third time's a Charm!!!
 
 
  Here's the new URL for Mahout 0.9 Release:
 
 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/
 
  For those volunteering to test this, some of the things to be verified:
 
  a) Verify that u can unpack the release (tar or zip)
  b) Verify u r able to compile the distro
  c)  Run through the unit tests: mvn clean test
  d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
  through all the different options in each script.
 
 
  Committers and PMC members:
  ---
 
  Need 'at least 3 +1 votes' for the Release to pass.
 
 
  Thanks and Regards.



Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Andrew Musselman
, 2014 at 6:00 AM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 This is an issue (trivial one though) that needs to be fixed for 0.9
 Release, will be rerolling the release today (in the next few hrs) and
 putting out a new release candidate in staging.

 Thanks for reporting this Andrew P.






Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
Thanks Grant. 

Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 
0.9. 
Here's my +1 FWIW.

a) Attached is the draft of the Release notes for 0.9, would definitely 
appreciate feedback on that.

b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a 
majority of at least 3 +1 PMC votes are cast.
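
One reading of the pass condition in b), sketched in code: at least three binding +1 votes, and more binding +1 votes than binding -1 votes. The class and method names are hypothetical, and the "more +1 than -1" part is an interpretation of "majority", not something this message spells out.

```java
final class ReleaseVote {

    /**
     * Returns true if the vote passes under the assumed rule:
     * at least 3 binding +1 votes and strictly more +1 than -1.
     */
    static boolean passes(int bindingPlusOnes, int bindingMinusOnes) {
        return bindingPlusOnes >= 3 && bindingPlusOnes > bindingMinusOnes;
    }

    public static void main(String[] args) {
        // Example tallies only; these are not the actual counts from this thread.
        System.out.println("3 for, 0 against: " + passes(3, 0));
        System.out.println("2 for, 0 against: " + passes(2, 0));
        System.out.println("3 for, 3 against: " + passes(3, 3));
    }
}
```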

The release files, including signatures, digests, etc can be found at:
https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/

The staging repository for this release can be found at:
https://repository.apache.org/content/repositories/orgapachemahout-1002

Release artifacts have been signed with the following key:
https://people.apache.org/keys/committer/smarthi.asc
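
For testers who want to check a downloaded artifact against its published digest, here is a minimal sketch using only the JDK. It assumes you have the artifact on disk and the hex digest copied from the release directory; the class DigestCheck, its methods and the choice of algorithm are illustrative assumptions. Checking the signature against the key above is a separate step (for example with gpg) and is not shown.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

final class DigestCheck {

    /** Computes the digest of the data and returns it as lowercase hex. */
    static String hexDigest(byte[] data, String algorithm) {
        MessageDigest md;
        try {
            md = MessageDigest.getInstance(algorithm);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalArgumentException("Unknown digest algorithm: " + algorithm, e);
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : md.digest(data)) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    /** Compares the computed digest with the published one, ignoring case and surrounding whitespace. */
    static boolean matches(byte[] data, String algorithm, String publishedHex) {
        return hexDigest(data, algorithm).equalsIgnoreCase(publishedHex.trim());
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 3) {
            System.err.println("usage: java DigestCheck.java <file> <algorithm> <published-hex>");
            return;
        }
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(matches(data, args[1], args[2]) ? "digest OK" : "digest MISMATCH");
    }
}
```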







On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org 
wrote:
 
Ran the tests, verified sigs, tried out a few of the examples.

+1 (binding)



Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Frank Scholten
-1

The cluster reuters example results in zero clusters when choosing
streaming k-means. The other steps, unpacking and building do work.

I see this stacktrace:

INFO: Number of Centroids: 0
Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run
WARNING: job_local797072544_0001
java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0]
at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120)
at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176)
at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192)
at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107)
at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73)
at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37)
at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177)
at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)

Num clusters: 0; maxDistance: 0.00
[Dunn Index] First: Infinity
[Davies-Bouldin Index] First: NaN
Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info
INFO: Program took 278 ms (Minutes: 0.004633)
cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train

Here is the full log: http://pastebin.com/TxLV0rDr

I am not yet familiar with the streaming k-means code or the
algorithms behind it. If anyone has a suggestion about what is going wrong in the
code, I am happy to help where I can.
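
For anyone reading the trace: the exception comes from a precondition in BallKMeans.splitTrainTest, and the bracketed values suggest it was asked to take about 10% of 0 vectors as a test set. The sketch below is not the Mahout code; it is a plain-Java illustration, with hypothetical names (SplitCheck, testSetSize), of how a train/test split guard of this shape rejects an empty input. The unexpanded "%.1f %% of %d" in the log is consistent with Guava's checkArgument, which as far as I know substitutes only %s placeholders and appends leftover arguments in square brackets.

```java
import java.util.Locale;

final class SplitCheck {

    /**
     * Returns how many points would go to the test set, or throws if either
     * the test set or the training set would end up empty.
     */
    static int testSetSize(int numPoints, double testProbability) {
        int numTest = (int) (testProbability * numPoints);
        if (numTest <= 0 || numTest >= numPoints) {
            throw new IllegalArgumentException(String.format(Locale.ROOT,
                "Must have nonzero number of training and test vectors. "
                    + "Asked for %.1f %% of %d vectors for test",
                testProbability * 100, numPoints));
        }
        return numTest;
    }

    public static void main(String[] args) {
        System.out.println("100 points -> test set of " + testSetSize(100, 0.1));
        try {
            // Zero input points, as in the log line "Number of Centroids: 0".
            testSetSize(0, 0.1);
        } catch (IllegalArgumentException e) {
            System.out.println("0 points -> " + e.getMessage());
        }
    }
}
```

If that reading is right, the precondition is only the symptom; the question is why the reducer received zero centroids in the first place.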

Frank

On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.comwrote:

 Thanks Grant.

 Not sure if I can vote given my role as the BuildMeister/ReleaseMeister
 for 0.9.
 Here's my +1 FWIW.

 a) Attached is the draft of the Release notes for 0.9, would definitely
 appreciate feedback on that.

 b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a
 majority of at least 3 +1 PMC votes are cast.

 The release files, including signatures, digests, etc can be found at:

 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/

 The staging repository for this release can be found at:
 https://repository.apache.org/content/repositories/orgapachemahout-1002

 Release artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc






   On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll 
 gsing...@apache.org wrote:
  Ran the tests, verified sigs, tried out a few of the examples.

 +1 (binding)

 On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com
 wrote:

  Third time's a Charm!!!
 
 
  Here's the new URL for Mahout 0.9 Release:
 
 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/
 
  For those volunteering to test this, some of the things to be verified:
 
  a) Verify that u can unpack the release (tar or zip)
  b) Verify u r able to compile the distro
  c)  Run through the unit tests: mvn clean test
  d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
 through all the different options in each script.
 
 
  Committers and PMC members:
  ---
 
  Need 'at least 3 +1 votes' for the Release to pass.
 
 
  Thanks and Regards.
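
The verification checklist quoted above (unpack, compile, run the unit tests, run the example scripts) can be partly scripted. The sketch below covers only step a); the helper name unpack_release and its two arguments are hypothetical, and the archive path is whatever file was downloaded from the staging repository. Steps b) to d) are left as comments because they depend on the local checkout.

```shell
# Hypothetical helper for step a): unpack a release archive, choosing the
# tool from the file extension. $1 = archive path, $2 = destination directory.
unpack_release() {
  archive=$1
  dest=$2
  mkdir -p "$dest" || return 1
  case "$archive" in
    *.tar.gz|*.tgz) tar -xzf "$archive" -C "$dest" ;;
    *.zip)          unzip -q "$archive" -d "$dest" ;;
    *) echo "unsupported archive: $archive" >&2; return 1 ;;
  esac
}

# Steps b) to d), run by hand from the unpacked source tree:
#   mvn clean test                    # compiles the code and runs the unit tests
#   cd "$MAHOUT_HOME/examples/bin"    # then try each option of each script
```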






Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
Frank,

Were u running this with MAHOUT_LOCAL=true?






Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Frank Scholten
Exported MAHOUT_LOCAL=true and still get the same results.



Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
It's presently set up to run in MR mode (the way it's been coded in
cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this.
I am able to see this fail locally when MAHOUT_LOCAL=true.






Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Frank Scholten
OK, running in MR mode now.



Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Frank Scholten
When I run in MR mode I get the same problem.

See http://pastebin.com/TXJ5mQmt



Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
It works when both MAHOUT_LOCAL=true and the '-xm sequential' option are set.
Guess we'll have to cut a release again with the '-xm sequential' option set.





Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
The reason u r seeing the error is that there were no sequence files in
HDFS in MR mode to begin with => hence no term vectors generated => and hence
no vectors to cluster.

MR mode:

1. Set HADOOP_HOME
2. unset MAHOUT_LOCAL
3. clean up ur local /tmp/mahout-work-x directory
4. run ./examples/bin/cluster-reuters.sh => option 4

Sequential mode:

1. set MAHOUT_LOCAL=true
2. Add the -xm sequential flag to the cluster-reuters.sh script
3. run ./examples/bin/cluster-reuters.sh => option 4
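
The two procedures above can be sketched as shell functions that prepare the environment before cluster-reuters.sh is started. This is a minimal sketch under stated assumptions: the function names are hypothetical, WORK_DIR is a placeholder for the local /tmp/mahout-work-... directory (its exact name is not spelled out in this thread), and adding '-xm sequential' still means editing the script by hand.

```shell
# Sketch only: environment preparation for the two modes described above.
# WORK_DIR is a placeholder; point it at your own local Mahout work directory.

prepare_mr_mode() {
  # 1. HADOOP_HOME must already point at a Hadoop installation.
  if [ -z "${HADOOP_HOME:-}" ]; then
    echo "HADOOP_HOME is not set" >&2
    return 1
  fi
  # 2. MAHOUT_LOCAL must not be set in MR mode.
  unset MAHOUT_LOCAL
  # 3. Remove stale local working data, but only if a directory was named.
  if [ -n "${WORK_DIR:-}" ] && [ -d "$WORK_DIR" ]; then
    rm -rf "$WORK_DIR"
  fi
  echo "mode=mapreduce"
}

prepare_sequential_mode() {
  # 1. Run Mahout against the local file system.
  MAHOUT_LOCAL=true
  export MAHOUT_LOCAL
  # 2. The '-xm sequential' flag has to be added inside cluster-reuters.sh
  #    itself; this function does not modify the script.
  echo "mode=sequential"
}
```

After calling one of the functions in the current shell, run ./examples/bin/cluster-reuters.sh and choose option 4 as described in the steps.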








On Sunday, January 19, 2014 12:22 PM, Frank Scholten fr...@frankscholten.nl 
wrote:
 
When I run in MR mode I get the same problem.

See http://pastebin.com/TXJ5mQmt




On Sun, Jan 19, 2014 at 5:31 PM, Frank Scholten fr...@frankscholten.nl wrote:

OK, running in MR mode now.




On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:

Its presently setup to run in MR mode (the way its been coded in 
cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this.
I am able to see this fail locally when MAHOUT_LOCAL=true. 






On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl 
wrote:

Exported MAHOUT_LOCAL=true and still get the same results.



On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote:

 Frank,

 Were u running this with MAHOUT_LOCAL=true?





 On Sunday, January 19, 2014 10:29 AM, Frank Scholten 
 fr...@frankscholten.nl wrote:

 -1

 The cluster reuters example results in zero clusters when choosing
 streaming k-means. The other steps, unpacking and building do work.

 I see this stacktrace:

 INFO: Number of Centroids: 0
 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run
 WARNING: job_local797072544_0001
 java.lang.IllegalArgumentException: Must have nonzero number of training
 and test vectors. Asked for %.1f %% of %d vectors for test
 [10.00149011612, 0]
     at
 com.google.common.base.Preconditions.checkArgument(Preconditions.java:120)
     at
 org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176)
     at
 org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192)
     at
 org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107)
     at
 org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73)
     at
 org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37)
     at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177)
     at
 org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418)
     at
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)

 Num clusters: 0; maxDistance: 0.00
 [Dunn Index] First: Infinity
 [Davies-Bouldin Index] First: NaN
 Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info
 INFO: Program took 278 ms (Minutes: 0.004633)
 cluster,distance.mean,distance.sd
 ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train


 Here is the full log: http://pastebin.com/TxLV0rDr

 As of  yet I am  unfamiliar with the streaming k-means code and the
 algorithms behind it. If anyone has suggestion on what goes wrong in the
 code I am I happy to help  where I can.


 Frank



 On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com
 wrote:

 Thanks Grant.
 
 Not sure if I can vote given my role as the BuildMeister/ReleaseMeister
 for 0.9.
 Here's my +1 FWIW.
 
 a) Attached is the draft of the Release notes for 0.9, would definitely
 appreciate feedback on that.
 
 b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if
 a majority of at least 3 +1 PMC votes are cast.
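
The pass rule in b) can be written down as a small check. This is only a sketch: it assumes that just PMC (binding) votes are counted and that "majority" means more +1 than -1 votes. The function name and the vote encoding are made up for illustration.

```python
def release_vote_passes(binding_votes):
    """binding_votes: iterable of +1, 0 or -1 cast by PMC members."""
    votes = list(binding_votes)
    plus = sum(1 for v in votes if v == 1)
    minus = sum(1 for v in votes if v == -1)
    # At least three +1 votes, and more +1 than -1.
    return plus >= 3 and plus > minus
```

Under these assumptions, three +1 votes and one -1 would still pass, while two +1 votes alone would not.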
 
 The release files, including signatures, digests, etc can be found at:
 
 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/
 
 The staging repository for this release can be found at:
 https://repository.apache.org/content/repositories/orgapachemahout-1002
 
 Release artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc
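
For testers who want to check a digest by hand, here is a minimal sketch using only the Python standard library. It assumes the published digest is a hex string in a sidecar file next to the artifact, and it defaults to SHA-1, which is an assumption about which digest was published. Verifying the PGP signature against the key above still needs gpg and is not covered here.

```python
import hashlib


def file_digest(path, algorithm="sha1", chunk_size=1 << 20):
    """Return the hex digest of a file, read in chunks to bound memory use."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def digest_matches(path, expected_hex, algorithm="sha1"):
    """Compare a file against a published digest string.

    Digest files sometimes carry the file name after the hash, so only the
    first whitespace-separated token is compared, case-insensitively.
    """
    expected = expected_hex.split()[0].lower()
    return file_digest(path, algorithm) == expected
```

A call would look like digest_matches("artifact.tar.gz", open("artifact.tar.gz.sha1").read()), where both file names are placeholders.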
 
 
 
 
 
 
 
 
 On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll 
 gsing...@apache.org wrote:
 
 Ran the tests, verified sigs, tried out a few of the examples.
 
 +1 (binding)
 
 
 On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com
 wrote:
 
  Third time's a Charm!!!
 
 
  Here's the new URL for Mahout 0.9 Release:
 
 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/
 
  For those volunteering to test this, some of the things to be verified:
 
  a) Verify that u can
  unpack the release (tar or zip)
  b) Verify u r able to compile the distro
  c)  Run

Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Dmitriy Lyubimov
I'll try to test out soon


Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Stevo Slavić
+1 (binding)


On Sun, Jan 19, 2014 at 7:49 PM, Dmitriy Lyubimov dlie...@gmail.com wrote:

 I'll try to test out soon



Re: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Suneel Marthi
Stevo,  could u test streaming kmeans?

Sent from my iPhone

 On Jan 19, 2014, at 8:10 PM, Stevo Slavić ssla...@gmail.com wrote:
 
 +1 (binding)
 
 
 On Sun, Jan 19, 2014 at 7:49 PM, Dmitriy Lyubimov dlie...@gmail.com wrote:
 
 I'll try to test out soon
 


RE: MAHOUT 0.9 Release - New URL

2014-01-19 Thread Andrew Palumbo
I ran through the tests on a CentOS VM (AMD64, 2 cores, 4 GB RAM). I had a bit 
of trouble getting the Hadoop natives to compile, so some of the problems below 
may be due to my Hadoop setup. I ran into problems in the example scripts, 
particularly with ./cluster-syntheticcontrol.sh options -4 and -5. I will run 
through the rest of the examples when I'm sure I've got Hadoop set up right.


Apache Maven 3.1.2-SNAPSHOT 
Java version: 1.6.0_45, vendor: Sun Microsystems Inc.
Java home: /usr/java/jdk1.6.0_45/jre
OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: 
unix
$MAHOUT_LOCAL=true
Hadoop 2.2.0


a) Verify that u can unpack the release (tar or zip) [passed (tar)]

b) Verify u r able to compile the distro

mvn compile [passed with warnings]

[WARNING]  Expected all dependencies to require Scala version: 2.9.3
[WARNING]  org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3
[WARNING]  org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2
[WARNING] Multiple versions of scala libraries detected!

c)  Run through the unit tests: mvn clean test
mvn clean test [passed]

d) Run the example scripts under $MAHOUT_HOME/examples/bin. 
Please run through all the different options in each script

Running example scripts with $MAHOUT_LOCAL=true

./cluster-syntheticcontrol.sh -1 [works]
./cluster-syntheticcontrol.sh -2 [works]
./cluster-syntheticcontrol.sh -3 [works]


./cluster-syntheticcontrol.sh -4 [exits, throws exception]
[...]
WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job
java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:171)
at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128)
Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn


./cluster-syntheticcontrol.sh -5 [exits, throws exception]

WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job
java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:171)
at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128)
Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn
WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only
Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen.


./classify-20newsgroups.sh -1 [works]
./classify-20newsgroups.sh -2 [works]


cluster-reuters.sh -1 [works]
cluster-reuters.sh -2 [works]
cluster-reuters.sh -3 [works]

Same error as noted previously in the thread:

cluster-reuters.sh -4 [0 clusters]

[...]

WARNING: No qualcluster.props found on classpath, will use command-line arguments only
Num clusters: 0; maxDistance: 0.00
[Dunn Index] First: Infinity
[Davies-Bouldin Index] First: NaN
Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info
INFO: Program took 669 ms (Minutes: 0.01115)

cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train





 Date: Thu, 16 Jan 2014 06:41:09 -0800
 From: suneel_mar...@yahoo.com
 Subject: MAHOUT 0.9 Release - New URL 
 To: u...@mahout.apache.org; dev@mahout.apache.org
 
 Third time's a Charm!!!
 
 
 Here's the new URL for Mahout 0.9 Release:
 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/
 
 For those volunteering to test this, some of the things to be verified:
 
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through

Re: Mahout 0.9 Release - Call for Volunteers

2014-01-18 Thread Grant Ingersoll
Sorry, I've been traveling.  Looking at it this AM

-Grant

On Jan 18, 2014, at 3:07 AM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 Thanks to Andrew, Sotiris and others for helping test the release.  
 
 Need votes of fellow Mahout Committers and PMC members for the release to be 
 finalized.
 
 
 
 
 
 
 
 
 On Friday, January 17, 2014 5:27 AM, Andrew Psaltis 
 andrew.psal...@webtrends.com wrote:
 
 Suneel,
 I removed all Lucene entries from my local maven repo and indeed the tests 
 that were failing due to that succeed. Now I just get the good ole: Unable to 
 load realm mapping info from SCDynamicStore and the subsequently expected 
 KrbException
 
 Thanks,
 Andrew
 
 
 From: Suneel Marthi suneel_mar...@yahoo.com
 Reply-To: Suneel Marthi suneel_mar...@yahoo.com
 Date: Thursday, January 16, 2014 11:24 AM
 To: dev@mahout.apache.org dev@mahout.apache.org, Andrew Psaltis 
 andrew.psal...@webtrends.com, i...@eprice.gr i...@eprice.gr
 Cc: u...@mahout.apache.org u...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers
 
 
 This is not a maven issue.
 
 Andrew, r u on Mac OS 10.8?  If so u would be seeing these errors. 
 These errors are being spewed by Carrot RandomizedRunner and, per the 
 conversation in MAHOUT-1345, this happens on Mac OS X due to an issue in 
 Lucene 4.3.1 and below that was fixed in later Lucene releases.
 
 Do u have an older version of Lucene (< 4.4) somewhere on ur system that 
 could be causing this?
 
 Mahout 0.9 Release Lucene version is 4.6.0.
 
 
 
 
 
 
 On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.gr 
 wrote:
 
 Hi Andrew, 
 
 Could you please check if you have the latest maven?
 
 http://maven.apache.org/download.cgi 
 
 Or which version of Maven are you using? I believe this is the issue. 
 
 Regards
 Sotiris
 
 
 -Original Message-
 From: Andrew Psaltis [mailto:andrew.psal...@webtrends.com] 
 Sent: Thursday, January 16, 2014 8:01 PM
 To: dev@mahout.apache.org; i...@eprice.gr
 Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers
 
 Tests for Mahout Core fail on
 
 OS X 10.8.5 (12F45)
 java version 1.7.0_17
 Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM) 64-Bit
 Server VM (build 23.7-b01, mixed mode)
 
 Attached is the output from running: mvn clean test
 
 I have not started to look at the test cases that failed in a debugger, but
 would be glad to.
 
 Thanks,
 Andrew
 
 
 
 
 
 On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.com wrote:
 
 Tested on my mac and a server with ubuntu 12.04 LTS.
 
 All tests passed.
 
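 [INFO] ------------------------------------------------------------------------
 [INFO] Reactor Summary:
 [INFO]
 [INFO] Mahout Build Tools ................................ SUCCESS [1.964s]
 [INFO] Apache Mahout ..................................... SUCCESS [0.400s]
 [INFO] Mahout Math ....................................... SUCCESS [1:53.067s]
 [INFO] Mahout Core ....................................... SUCCESS [9:09.716s]
 [INFO] Mahout Integration ................................ SUCCESS [1:04.662s]
 [INFO] Mahout Examples ................................... SUCCESS [3.331s]
 [INFO] Mahout Release Package ............................ SUCCESS [0.000s]
 [INFO] Mahout Math/Scala wrappers ........................ SUCCESS [11.356s]
 [INFO] ------------------------------------------------------------------------
 [INFO] BUILD SUCCESS
 [INFO] ------------------------------------------------------------------------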
 
 Regards,
 Yexi
 
 2014/1/16 Sotiris Salloumis i...@eprice.gr
 
 From unix you should try the following with wget or curl; make sure that
 the email client does not wrap the URL when you copy it:
 
 http://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.gz
 
 Above link via Google url shortener for easy copy/paste
 http://goo.gl/gX6xGz
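
If a mail client has already wrapped a long URL, the pieces can be stitched back together before handing the result to wget or curl. The helper below is a small sketch of that idea; the function name is made up, and it assumes every line passed in is a fragment of one URL, optionally prefixed by ">" quote markers.

```python
import re


def rejoin_wrapped_url(lines):
    """Join the pieces of one URL that was wrapped across several lines."""
    parts = []
    for line in lines:
        # Drop leading quote markers and whitespace, and trailing whitespace.
        piece = re.sub(r"^[>\s]+", "", line).rstrip()
        if piece:
            parts.append(piece)
    return "".join(parts)


fragments = [
    "> http://repository.apache.org/content/repositories/orgapachemahout-1002/",
    "> org",
    "> /a",
    ">  pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.",
    ">  gz",
]
url = rejoin_wrapped_url(fragments)
```

The fragments list mirrors the way the source tarball link was broken up in this thread.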
 
 
 Regards
 Sotiris
 
 -Original Message-
 From: Yexi Jiang [mailto:yexiji...@gmail.com]
 Sent: Thursday, January 16, 2014 5:59 PM
 To: mahout
 Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers
 
 Got the same error.
 
 Regards,
 Yexi
 
 
 2014/1/16 Chameera Wijebandara chameerawijeband...@gmail.com
 
 Hi Suneel,
 
 I am still getting a 404 error.
 
 Thanks,
 Chameera
 
 
 On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi 
 suneel_mar...@yahoo.com
 wrote:
 
 Here's the new URL for Mahout 0.9 Release:
 
 
 https://repository.apache.org/content/repositories/orgapachemahout-1001/org/apache/mahout/mahout-buildtools/0.9/
 
 For those volunteering to test this, some of the things to be
 verified:
 
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under

Re: MAHOUT 0.9 Release - New URL

2014-01-18 Thread Grant Ingersoll
Ran the tests, verified sigs, tried out a few of the examples.

+1 (binding)

On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 Third time's a Charm!!!
 
 
 Here's the new URL for Mahout 0.9 Release:
 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/
 
 For those volunteering to test this, some of the things to be verified:
 
 a) Verify that u can unpack the release (tar or zip)
 b) Verify u r able to compile the distro
 c)  Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run 
 through all the different options in each script.
  
 
 Committers
 and PMC members:
 ---
 
 Need 'at least 3 +1 votes' for the Release to pass. 
 
 
 Thanks and Regards.




Re: Mahout 0.9 Release - Call for Volunteers

2014-01-17 Thread Andrew Psaltis
Suneel,
I removed all Lucene entries from my local maven repo and indeed the tests that 
were failing due to that succeed. Now I just get the good ole: Unable to load 
realm mapping info from SCDynamicStore and the subsequently expected 
KrbException

Thanks,
Andrew


From: Suneel Marthi suneel_mar...@yahoo.com
Reply-To: Suneel Marthi suneel_mar...@yahoo.com
Date: Thursday, January 16, 2014 11:24 AM
To: dev@mahout.apache.org dev@mahout.apache.org, Andrew Psaltis 
andrew.psal...@webtrends.com, i...@eprice.gr i...@eprice.gr
Cc: u...@mahout.apache.org u...@mahout.apache.org
Subject: Re: Mahout 0.9 Release - Call for Volunteers

This is not a maven issue.

Andrew, r u on Mac OS 10.8?  If so u would be seeing these errors.
These errors are being spewed by Carrot RandomizedRunner and, per the conversation 
in MAHOUT-1345, this happens on Mac OS X due to an issue in Lucene 4.3.1 and 
below that was fixed in later Lucene releases.

Do u have an older version of Lucene (< 4.4) somewhere on ur system that 
could be causing this?

Mahout 0.9 Release Lucene version is 4.6.0.



On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.gr wrote:
Hi Andrew,

Could you please check if you have the latest maven?

http://maven.apache.org/download.cgi

Or which version of Maven are you using? I believe this is the issue.

Regards
Sotiris

-Original Message-
From: Andrew Psaltis [mailto:andrew.psal...@webtrends.com]
Sent: Thursday, January 16, 2014 8:01 PM
To: dev@mahout.apache.org; i...@eprice.gr
Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org
Subject: Re: Mahout 0.9 Release - Call for Volunteers

Tests for Mahout Core fail on

OS X 10.8.5 (12F45)
java version 1.7.0_17
Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM) 64-Bit
Server VM (build 23.7-b01, mixed mode)

Attached is the output from running: mvn clean test

I have not started to look at the test cases that failed in a debugger, but
would be glad to.

Thanks,
Andrew





On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.com wrote:

Tested on my mac and a server with ubuntu 12.04 LTS.

All tests passed.

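[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO]
[INFO] Mahout Build Tools ................................ SUCCESS [1.964s]
[INFO] Apache Mahout ..................................... SUCCESS [0.400s]
[INFO] Mahout Math ....................................... SUCCESS [1:53.067s]
[INFO] Mahout Core ....................................... SUCCESS [9:09.716s]
[INFO] Mahout Integration ................................ SUCCESS [1:04.662s]
[INFO] Mahout Examples ................................... SUCCESS [3.331s]
[INFO] Mahout Release Package ............................ SUCCESS [0.000s]
[INFO] Mahout Math/Scala wrappers ........................ SUCCESS [11.356s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------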

Regards,
Yexi

2014/1/16 Sotiris Salloumis i...@eprice.gr

 From unix you should try the following with wget or curl; make sure that
 the email client does not wrap the URL when you copy it:

 http://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.gz

 Above link via Google url shortener for easy copy/paste
http://goo.gl/gX6xGz


 Regards
 Sotiris

 -Original Message-
 From: Yexi Jiang [mailto:yexiji...@gmail.com]
 Sent: Thursday, January 16, 2014 5:59 PM
 To: mahout
 Cc: Suneel Marthi; u...@mahout.apache.org; priv...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers

 Got the same error.

 Regards,
 Yexi


 2014/1/16 Chameera Wijebandara chameerawijeband...@gmail.com

  Hi Suneel,
 
  I am still getting a 404 error.
 
  Thanks,
 Chameera
 
 
  On Thu, Jan 16, 2014 at 7:11 PM, Suneel Marthi suneel_mar...@yahoo.com
  wrote:
 
   Here's the new URL for Mahout 0.9 Release:
  
  https://repository.apache.org/content/repositories/orgapachemahout-1001/org/apache/mahout/mahout-buildtools/0.9/
  
   For those volunteering to test this, some of the things to be
verified:
  
   a) Verify that u can unpack the release (tar or zip)
   b) Verify u r able to compile the distro
   c

Re: Mahout 0.9 Release - Call for Volunteers

2014-01-17 Thread Chameera Wijebandara
Hi,

Windows 7 Home Premium
Service Pack 1
64-bit

java version 1.7.0_45
Java(TM) SE Runtime Environment (build 1.7.0_45-b18)
Java HotSpot(TM) 64-Bit Server VM (build 24.45-b08, mixed mode)

I have downloaded the zip file.

https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.zip

Unpack it - success
mvn compile - success
mvn install - the build succeeded, but some tests failed. I have attached the
results of the test run to this mail.

Is this the right way to test and submit the test results? Please correct me
if I did something wrong.


Thanks,
Chameera.






On Fri, Jan 17, 2014 at 3:57 PM, Andrew Psaltis 
andrew.psal...@webtrends.com wrote:

 Suneel,
 I removed all Lucene entries from my local maven repo and indeed the tests
 that were failing due to that succeed. Now I just get the good ole: Unable
 to load realm mapping info from SCDynamicStore and the subsequently
 expected KrbException

 Thanks,
 Andrew


 From: Suneel Marthi suneel_mar...@yahoo.commailto:
 suneel_mar...@yahoo.com
 Reply-To: Suneel Marthi suneel_mar...@yahoo.commailto:
 suneel_mar...@yahoo.com
 Date: Thursday, January 16, 2014 11:24 AM
 To: dev@mahout.apache.orgmailto:dev@mahout.apache.org 
 dev@mahout.apache.orgmailto:dev@mahout.apache.org, Andrew Psaltis 
 andrew.psal...@webtrends.commailto:andrew.psal...@webtrends.com, 
 i...@eprice.grmailto:i...@eprice.gr i...@eprice.grmailto:
 i...@eprice.gr
 Cc: u...@mahout.apache.orgmailto:u...@mahout.apache.org 
 u...@mahout.apache.orgmailto:u...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers

 This is not a maven issue.

 Andrew, r u on Mac OS 10.8?  If so u would be seeing these errors.
 These errors are being spewed by Carrot RandomizedRunner and, per the
 conversation in MAHOUT-1345, this happens on Mac OS X due to an issue in
 Lucene 4.3.1 and below that was fixed in later Lucene releases.

 Do u have an older version of Lucene (< 4.4) somewhere on ur system that
 could be causing this?

 Mahout 0.9 Release Lucene version is 4.6.0.



 On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.gr
 mailto:i...@eprice.gr wrote:
 Hi Andrew,

 Could please check if you have the latest maven?

 http://maven.apache.org/download.cgi

 Or what version of Maven you are using, I believe this is the issue.

 Regards
 Sotiris

 -Original Message-
 From: Andrew Psaltis [mailto:andrew.psal...@webtrends.commailto:
 andrew.psal...@webtrends.com]
 Sent: Thursday, January 16, 2014 8:01 PM
 To: dev@mahout.apache.orgmailto:dev@mahout.apache.org; i...@eprice.gr
 mailto:i...@eprice.gr
 Cc: Suneel Marthi; u...@mahout.apache.orgmailto:u...@mahout.apache.org;
 priv...@mahout.apache.orgmailto:priv...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers

 Tests for Mahout Core fail on

 OS X 10.8.5 (12F45)
 java version 1.7.0_17
 Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM)
 64-Bit
 Server VM (build 23.7-b01, mixed mode)

 Attached is the output from running: mvn clean test

 I have not started to look at the test cases that failed in a debugger, but
 would be glad to.

 Thanks,
 Andrew





 On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.commailto:
 yexiji...@gmail.com wrote:

 Tested on my mac and a server with ubuntu 12.04 LTS.
 
 All tests passed.
 
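 [INFO] ------------------------------------------------------------------------
 [INFO] Reactor Summary:
 [INFO]
 [INFO] Mahout Build Tools ................................ SUCCESS [1.964s]
 [INFO] Apache Mahout ..................................... SUCCESS [0.400s]
 [INFO] Mahout Math ....................................... SUCCESS [1:53.067s]
 [INFO] Mahout Core ....................................... SUCCESS [9:09.716s]
 [INFO] Mahout Integration ................................ SUCCESS [1:04.662s]
 [INFO] Mahout Examples ................................... SUCCESS [3.331s]
 [INFO] Mahout Release Package ............................ SUCCESS [0.000s]
 [INFO] Mahout Math/Scala wrappers ........................ SUCCESS [11.356s]
 [INFO] ------------------------------------------------------------------------
 [INFO] BUILD SUCCESS
 [INFO] ------------------------------------------------------------------------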
 
 Regards,
 Yexi
 
 2014/1/16 Sotiris Salloumis i...@eprice.grmailto:i...@eprice.gr
 
  From unix you should try the following with wget or curl, make sure
 during  copy the email client will not wrap it up
 
 
 http://repository.apache.org/content/repositories/orgapachemahout-1002/
 org
 /a
  pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.
  gz
 
  Above link via Google url shortener for easy copy/paste
 http://goo.gl/gX6xGz
 
 
  Regards
  Sotiris
 
  -Original Message-
  From: Yexi Jiang [mailto:yexiji...@gmail.commailto:yexiji...@gmail.com
 ]
  Sent: Thursday, January 16, 2014 5:59 PM
  To: mahout
  Cc

Re: Mahout 0.9 Release - Call for Volunteers

2014-01-17 Thread Sergey Svinarchuk
I think all of the failing tests hit an error when setting permissions on a
directory. Also, Mahout 0.9 does not support Windows.


On Fri, Jan 17, 2014 at 2:04 PM, Chameera Wijebandara 
chameerawijeband...@gmail.com wrote:

 Hi,

 Windows 7 Home Premium
 Service Pack 1
 64-bit

 java version 1.7.0_45
 Java(TM) SE Runtime Environment (build 1.7.0_45-b18)
 Java HotSpot(TM) 64-Bit Server VM (build 24.45-b08, mixed mode)

 I have download the zip file.


 https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.zip

 Unpack it - success
 mvn compile - success
 mvn install - build was success some test are failed. I have attached the
 result of the test run with this mail.

 Is this the way to test and bubit the test results. please update and
 correct me if i made something wrong.


 Thanks,
 Chameera.






 On Fri, Jan 17, 2014 at 3:57 PM, Andrew Psaltis 
 andrew.psal...@webtrends.com wrote:

 Suneel,
 I removed all Lucene entries from my local maven repo and indeed the
 tests that were failing due to that succeed. Now I just get the good ole:
 Unable to load realm mapping info from SCDynamicStore and the subsequently
 expected KrbException

 Thanks,
 Andrew


 From: Suneel Marthi suneel_mar...@yahoo.commailto:
 suneel_mar...@yahoo.com
 Reply-To: Suneel Marthi suneel_mar...@yahoo.commailto:
 suneel_mar...@yahoo.com
 Date: Thursday, January 16, 2014 11:24 AM
 To: dev@mahout.apache.orgmailto:dev@mahout.apache.org 
 dev@mahout.apache.orgmailto:dev@mahout.apache.org, Andrew Psaltis 
 andrew.psal...@webtrends.commailto:andrew.psal...@webtrends.com, 
 i...@eprice.grmailto:i...@eprice.gr i...@eprice.grmailto:
 i...@eprice.gr
 Cc: u...@mahout.apache.orgmailto:u...@mahout.apache.org 
 u...@mahout.apache.orgmailto:u...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers

 This is not a maven issue.

 Andrew, r u on Mac OS 10.8?  If so u would be seeing these errors.
 These errors are being spewed by Carrot RandomizedRunner and, per the
 conversation in MAHOUT-1345, this happens on Mac OS X due to an issue in
 Lucene 4.3.1 and below that was fixed in later Lucene releases.

 Do u have an older version of Lucene (< 4.4) somewhere on ur system that
 could be causing this?

 Mahout 0.9 Release Lucene version is 4.6.0.



 On Thursday, January 16, 2014 1:03 PM, Sotiris Salloumis i...@eprice.gr
 mailto:i...@eprice.gr wrote:
 Hi Andrew,

 Could please check if you have the latest maven?

 http://maven.apache.org/download.cgi

 Or what version of Maven you are using, I believe this is the issue.

 Regards
 Sotiris

 -Original Message-
 From: Andrew Psaltis [mailto:andrew.psal...@webtrends.commailto:
 andrew.psal...@webtrends.com]
 Sent: Thursday, January 16, 2014 8:01 PM
 To: dev@mahout.apache.orgmailto:dev@mahout.apache.org; i...@eprice.gr
 mailto:i...@eprice.gr
 Cc: Suneel Marthi; u...@mahout.apache.orgmailto:u...@mahout.apache.org;
 priv...@mahout.apache.orgmailto:priv...@mahout.apache.org
 Subject: Re: Mahout 0.9 Release - Call for Volunteers

 Tests for Mahout Core fail on

 OS X 10.8.5 (12F45)
 java version 1.7.0_17
 Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM)
 64-Bit
 Server VM (build 23.7-b01, mixed mode)

 Attached is the output from running: mvn clean test

 I have not started to look at the test cases that failed in a debugger,
 but
 would be glad to.

 Thanks,
 Andrew





 On 1/16/14 9:52 AM, Yexi Jiang yexiji...@gmail.commailto:
 yexiji...@gmail.com wrote:

 Tested on my mac and a server with ubuntu 12.04 LTS.
 
 All tests passed.
 
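 [INFO] ------------------------------------------------------------------------
 [INFO] Reactor Summary:
 [INFO]
 [INFO] Mahout Build Tools ................................ SUCCESS [1.964s]
 [INFO] Apache Mahout ..................................... SUCCESS [0.400s]
 [INFO] Mahout Math ....................................... SUCCESS [1:53.067s]
 [INFO] Mahout Core ....................................... SUCCESS [9:09.716s]
 [INFO] Mahout Integration ................................ SUCCESS [1:04.662s]
 [INFO] Mahout Examples ................................... SUCCESS [3.331s]
 [INFO] Mahout Release Package ............................ SUCCESS [0.000s]
 [INFO] Mahout Math/Scala wrappers ........................ SUCCESS [11.356s]
 [INFO] ------------------------------------------------------------------------
 [INFO] BUILD SUCCESS
 [INFO] ------------------------------------------------------------------------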
 
 Regards,
 Yexi
 
 2014/1/16 Sotiris Salloumis i...@eprice.grmailto:i...@eprice.gr
 
  From unix you should try the following with wget or curl, make sure
 during  copy the email client will not wrap it up
 
 
 http://repository.apache.org/content/repositories/orgapachemahout-1002/
 org
 /a
  pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.
  gz
 
  Above link via Google url shortener for easy

Re: MAHOUT 0.9 Release - New URL

2014-01-17 Thread Ted Dunning
On Thu, Jan 16, 2014 at 7:35 AM, Sotiris Salloumis i...@eprice.gr wrote:

 c)  Run through the unit tests: mvn clean test [ Passed: 370 milliseconds]



?!

Was that seconds?  Or really milliseconds?


RE: MAHOUT 0.9 Release - New URL

2014-01-17 Thread Sotiris Salloumis
Sorry, my mistake: the milliseconds figure was for the last test only. The full results are below.

 

~/mahout/apache-maven-3.1.1/bin/mvn -DskipTests clean install  

 

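[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 52.312s
[INFO] Finished at: Sat Jan 18 02:04:29 CET 2014
[INFO] Final Memory: 46M/305M
[INFO] ------------------------------------------------------------------------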

 

~/mahout/apache-maven-3.1.1/bin/mvn clean test

 

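[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO]
[INFO] Mahout Build Tools ................................ SUCCESS [1.166s]
[INFO] Apache Mahout ..................................... SUCCESS [0.264s]
[INFO] Mahout Math ....................................... SUCCESS [58.639s]
[INFO] Mahout Core ....................................... SUCCESS [4:01.640s]
[INFO] Mahout Integration ................................ SUCCESS [21.481s]
[INFO] Mahout Examples ................................... SUCCESS [1.980s]
[INFO] Mahout Release Package ............................ SUCCESS [0.003s]
[INFO] Mahout Math/Scala wrappers ........................ SUCCESS [14.149s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 5:39.563s
[INFO] Finished at: Sat Jan 18 02:10:53 CET 2014
[INFO] Final Memory: 51M/1068M
[INFO] ------------------------------------------------------------------------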
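
To keep seconds and milliseconds apart when reading figures like these, the reactor times can be converted to plain seconds and summed. This is a small sketch, not part of Maven; the helper name is made up, and it assumes colon-separated fields are base-60 with a trailing "s", as in the output of this "mvn clean test" run.

```python
def parse_maven_duration(text):
    """Convert a duration such as '1.166s' or '4:01.640s' to seconds."""
    seconds = 0.0
    for field in text.strip().rstrip("s").split(":"):
        # Each colon shifts the running value up by a factor of 60.
        seconds = seconds * 60 + float(field)
    return seconds


# Per-module times copied from the run reported in this message.
module_times = ["1.166s", "0.264s", "58.639s", "4:01.640s",
                "21.481s", "1.980s", "0.003s", "14.149s"]
modules_total = sum(parse_maven_duration(t) for t in module_times)
reported_total = parse_maven_duration("5:39.563s")
```

The module times add up to slightly less than the reported total, which is expected since the total also covers time spent outside the individual modules. Mahout Core alone accounts for about four minutes, so a figure of 370 milliseconds cannot describe the whole test run.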

 

From: Ted Dunning [mailto:ted.dunn...@gmail.com] 
Sent: Saturday, January 18, 2014 2:50 AM
To: Mahout Dev List; Sotiris Salloumis
Cc: Suneel Marthi; u...@mahout.apache.org
Subject: Re: MAHOUT 0.9 Release - New URL

 

 

On Thu, Jan 16, 2014 at 7:35 AM, Sotiris Salloumis i...@eprice.gr 
mailto:i...@eprice.gr  wrote:

c)  Run through the unit tests: mvn clean test [ Passed: 370 milliseconds]

 

?!

 

Was that seconds?  Or really milliseconds?



Re: Mahout 0.9 Release Candidate - VOTE

2014-01-16 Thread Suneel Marthi
It would be a .tar.gz file, and you would find it under mahout/distribution.





On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara 
chameerawijeband...@gmail.com wrote:
 
OK, let's see after the URL is fixed.

Thank you Tharindu for your cooperation. :)


On Thu, Jan 16, 2014 at 9:55 AM, Tharindu Rusira
tharindurus...@gmail.com wrote:

 On Thu, Jan 16, 2014 at 9:31 AM, Chameera Wijebandara 
 chameerawijeband...@gmail.com wrote:

  Tharindu,
 
  There is no .tar file at the given link (before it started giving the 404 error).
 
 Well, what about a .zip file? As I can remember, it was a .zip file.


  Thanks,
      Chameera
 
 
  On Thu, Jan 16, 2014 at 9:27 AM, Tharindu Rusira
  tharindurus...@gmail.comwrote:
 
   On Wed, Jan 15, 2014 at 6:48 PM, Chameera Wijebandara 
   chameerawijeband...@gmail.com wrote:
  
Hi Tharindu,
   
   Chameera, sorry for the late reply. I'm having issues with my personal
   computer these days :)
  
  
Still I am not able to download the artifacts. Could you please help me
to test the Release?
   
   Have you figured out a way to proceed?
   I think the given URL is down, as Suneel mentioned. Once the fixed release
   candidate is posted, you can download the source tar and check it as any
   other Mahout release.
  
   Regards,
  
   
Thanks
        Chameera
   
   
On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi 
  suneel_mar...@yahoo.com
wrote:
   
 Thanks Tharindu.





 On Tuesday, January 14, 2014 11:30 PM, Tharindu Rusira 
 tharindurus...@gmail.com wrote:

 Hi Suneel,
 I tested the installation process with unit tests and everything went well
 (Ubuntu 12.10 32bit, Java 1.7.0_40).
 Please note that I did not clean my local maven repository before the
 installation, so I assumed maven dependencies are all available.


 On Tue, Jan 14, 2014 at 7:03 PM, Suneel Marthi 
   suneel_mar...@yahoo.com
 wrote:

  Here's the link to Release artifacts for Mahout 0.9:
 
   
  https://repository.apache.org/content/repositories/orgapachemahout-1000/
 
  For those volunteering to test this, some of the stuff to look
 out
   for:
  a)  Verify u can unpack the Release tar.
 
 Verified


  b)  Verify u are able to compile the distribution
 
 Verified

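 [INFO] ------------------------------------------------------------------------
 [INFO] Reactor Summary:
 [INFO]
 [INFO] Mahout Build Tools ................................ SUCCESS [4.380s]
 [INFO] Apache Mahout ..................................... SUCCESS [0.965s]
 [INFO] Mahout Math ....................................... SUCCESS [2:07.687s]
 [INFO] Mahout Core ....................................... SUCCESS [10:34.651s]
 [INFO] Mahout Integration ................................ SUCCESS [1:03.250s]
 [INFO] Mahout Examples ................................... SUCCESS [16.607s]
 [INFO] Mahout Release Package ............................ SUCCESS [0.469s]
 [INFO] Mahout Math/Scala wrappers ........................ SUCCESS [35.562s]
 [INFO] ------------------------------------------------------------------------
 [INFO] BUILD SUCCESS
 [INFO] ------------------------------------------------------------------------
 [INFO] Total time: 14:44.158s
 [INFO] Finished at: Wed Jan 15 09:06:26 IST 2014
 [INFO] Final Memory: 41M/252M
 [INFO] ------------------------------------------------------------------------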

 c) Run through the unit tests: mvn clean test
 
 Verified.

 
  d)  Run the example scripts under $MAHOUT_HOME/examples/bin.
 

 I'm yet to test the example scripts and I will give an update soon.

 Regards,



 
  See
http://incubator.apache.org/guides/releasemanagement.html#check-list
  for more details.
 
 
 
  On Tuesday, January 14, 2014 8:26 AM, spa...@gmail.com 
 spa...@gmail.com
  wrote:
 
  I want to volunteer to test this release. What is the
  procedure/steps
to
  get started and what pre-reqs I need to have?
 
  Cheers
  .S
 
 
 
  On Tue, Jan 14, 2014 at 6:52 PM, Suneel Marthi 
suneel_mar...@yahoo.com
  wrote:
 
   Calling for volunteers to test this Release.
  
  
  
  
   On Friday, January 10, 2014 7:39 PM, Suneel Marthi 
   suneel_mar...@yahoo.com wrote:
  
   Pushed the Mahout 0.9 Release candidate.
   See
  

  
 https://repository.apache.org/content/repositories/orgapachemahout-1000/
  
   This is a call for Vote.
  
 
 
 
  --
  http://spawgi.wordpress.com
  We can do it and do it better.
 



 --
 M.P. Tharindu Rusira Kumara

 Department of Computer

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-16 Thread Chameera Wijebandara
Thanks Suneel.


On Thu, Jan 16, 2014 at 7:04 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 It would be .tar.gz file and you would find it under mahout/distribution.





 On Wednesday, January 15, 2014 11:45 PM, Chameera Wijebandara 
 chameerawijeband...@gmail.com wrote:

 Ok let's see after fixed the URL

 Thank you Tharindu for your cooperation. :)


 On Thu, Jan 16, 2014 at 9:55 AM, Tharindu Rusira
 tharindurus...@gmail.comwrote:

  On Thu, Jan 16, 2014 at 9:31 AM, Chameera Wijebandara 
  chameerawijeband...@gmail.com wrote:
 
   Tharindu,
  
   There is no .tar file in the given link (befor give the 404 error).
  
  Well, what about a .zip file? As I can remember, it was a .zip file.
 
 
   Thanks,
   Chameera
  
  
   On Thu, Jan 16, 2014 at 9:27 AM, Tharindu Rusira
   tharindurus...@gmail.comwrote:
  
On Wed, Jan 15, 2014 at 6:48 PM, Chameera Wijebandara 
chameerawijeband...@gmail.com wrote:
   
 Hi Tharindu,

Chameera, sorry for the late reply. I'm having issues with my
 personal
computer these days :)
   
   
 Still I could not able to download the artifacts. Could you please
  hep
   me
 to test the Release

Have you figured out a way to proceed?
I think the given URL is down as Suneel mentioned. Once the fixed
  release
candidate is posted, you can download the source tar and check it as
  any
other Mahout release.
   
Regards,
   

 Thanks
 Chameera


 On Wed, Jan 15, 2014 at 12:21 PM, Suneel Marthi 
   suneel_mar...@yahoo.com
 wrote:

  Thanks Tharindu.
 
 
 
 
 
  On Tuesday, January 14, 2014 11:30 PM, Tharindu Rusira 
  tharindurus...@gmail.com wrote:
 
  Hi Suneel,
  I tested
  the
  installation process with unit tests and everything went well.
   (Ubuntu
  12.10 32bit, Java 1.7.0_40).
  Please note that I did not clean my local maven repository before
  the
  installation so I assumed maven dependencies are all available
  .
 
 

Mahout 0.9 Release - Call for Volunteers

2014-01-16 Thread Suneel Marthi
Here's the new URL for Mahout 0.9 Release:
https://repository.apache.org/content/repositories/orgapachemahout-1001/org/apache/mahout/mahout-buildtools/0.9/

For those volunteering to test this, some of the things to be verified:

a) Verify that u can unpack the release (tar or zip)
b) Verify u r able to compile the distro
c)  Run through the unit tests: mvn clean test
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through 
all the different options in each script.
 


Committers and PMC members:
---------------------------

Need at least 3 +1 votes from this group for the Release to pass.


Thanks and Regards.

Re: Mahout 0.9 Release Candidate - VOTE

2014-01-16 Thread spawgi
The link https://repository.apache.org/content/repositories/orgapachemahout-1000/
still returns a 404.
Has the latest release candidate for 0.9 been made available?



Re: Mahout 0.9 Release Candidate - VOTE

2014-01-16 Thread spawgi
Sorry, sent a little too early :). Got the email from Suneel.



Re: Mahout 0.9 Release Candidate - VOTE

2014-01-16 Thread Suneel Marthi
Please hold off on this; I screwed up the future development version number
and have to redo it.
Sorry about that.






Re: Mahout 0.9 Release - Call for Volunteers

2014-01-16 Thread Chameera Wijebandara
Hi Suneel,

It is still returning a 404 error.

Thanks,
Chameera




MAHOUT 0.9 Release - New URL

2014-01-16 Thread Suneel Marthi
Third time's a Charm!!!


Here's the new URL for Mahout 0.9 Release:
https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/

For those volunteering to test this, some of the things to be verified:

a) Verify that u can unpack the release (tar or zip)
b) Verify u r able to compile the distro
c)  Run through the unit tests: mvn clean test
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through 
all the different options in each script.
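
Steps a) to c) above could be scripted roughly as follows. This is only a sketch under stated assumptions: the helper names `src_tarball_url` and `check_release` are made up for illustration, the artifact file name is the one posted later in this thread, and the name of the unpacked top-level directory is a guess. Step d) is left out because the example scripts need to be run by hand with their different options.

```shell
#!/bin/sh
# Staging repository announced in this message.
STAGING_REPO="https://repository.apache.org/content/repositories/orgapachemahout-1002"

# Print the download URL of the source tarball for the given version.
src_tarball_url() {
  printf '%s/org/apache/mahout/mahout-distribution/%s/mahout-distribution-%s-src.tar.gz\n' \
    "$STAGING_REPO" "$1" "$1"
}

# Run checklist steps a) to c) in a throwaway directory.
# The body is a subshell, so the cd calls do not affect the caller.
check_release() (
  version="$1"
  workdir="$(mktemp -d)" || exit 1
  cd "$workdir" || exit 1
  # -f makes curl fail on HTTP errors such as the 404 seen earlier.
  curl -fLO "$(src_tarball_url "$version")" || exit 1
  # a) unpack the release
  tar -xzf "mahout-distribution-$version-src.tar.gz" || exit 1
  # Assumption: the archive unpacks into this directory name.
  cd "mahout-distribution-$version" || exit 1
  # b) compile and c) run the unit tests
  mvn clean test
)
```

Calling `check_release 0.9` would then download, unpack, and build the candidate; nothing runs until the function is called.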
 

Committers and PMC members:
---------------------------

Need 'at least 3 +1 votes' for the Release to pass. 


Thanks and Regards.


RE: MAHOUT 0.9 Release - New URL

2014-01-16 Thread Sotiris Salloumis
Hi Suneel, 

Below are the results of the first round of tests:

Environment: SMP Debian 3.2.51-1 x86_64
Machine: Intel(R) Core(TM) i7 CPU 950 @ 3.07GHz stepping 05, 12GB RAM
OpenJDK: javac 1.6.0_27

a) Verify that u can unpack the release (tar or zip) [Passed: tar -zxvf]
b) Verify u r able to compile the distro [Passed: with OpenJDK, latest Maven on latest Debian]
c) Run through the unit tests: mvn clean test [Passed: 370 milliseconds]
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
through all the different options in each script. [Ongoing, will update later]

Regards
Sotiris



Re: MAHOUT 0.9 Release - New URL

2014-01-16 Thread Shannon Quinn
a), b), and c) all pass for me. Don't have the setup yet at work to go 
through d), will wait for others to verify.



Re: MAHOUT 0.9 Release - New URL

2014-01-16 Thread Sergey Svinarchuk
I tested mahout 0.9 on Ubuntu 12.04 64bit, java version 1.6.0_27

a) Verify that u can unpack the release (tar or zip) - passed
b) Verify u r able to compile the distro - passed
c) Run through the unit tests: mvn clean test - passed
d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
through all the different options in each script. - will update later




Re: Mahout 0.9 Release - Call for Volunteers

2014-01-16 Thread Yexi Jiang
Got the same error.

Regards,
Yexi



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


RE: Mahout 0.9 Release - Call for Volunteers

2014-01-16 Thread Sotiris Salloumis
From Unix, try fetching the following with wget or curl; when copying the URL,
make sure the email client has not wrapped it across lines:

http://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.gz

The same link via the Google URL shortener, for easy copy/paste: http://goo.gl/gX6xGz
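
If a mail client has already wrapped a long link, the pieces can be glued back together before handing them to wget or curl. A minimal sketch, where `join_wrapped_url` is a made-up helper name and the sample value is the link from this message split the way a mail client might split it:

```shell
#!/bin/sh
# Strip every whitespace character (spaces, tabs, line breaks) from the
# text given as arguments, or from stdin when no arguments are given,
# and print the result on a single line.
join_wrapped_url() {
  if [ "$#" -gt 0 ]; then
    printf '%s' "$*" | tr -d '[:space:]'
  else
    tr -d '[:space:]'
  fi
  printf '\n'
}

# A link broken across two lines by line wrapping.
wrapped='http://repository.apache.org/content/repositories/orgapachemahout-1002/org/a
pache/mahout/mahout-distribution/0.9/mahout-distribution-0.9-src.tar.gz'

# Prints the link on one line; pass the result to wget or curl, e.g.
#   wget "$(join_wrapped_url "$wrapped")"
join_wrapped_url "$wrapped"
```

This only works because URLs cannot contain literal whitespace, so removing all of it is safe.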


Regards
Sotiris




Re: Mahout 0.9 Release - Call for Volunteers

2014-01-16 Thread Yexi Jiang
Tested on my mac and a server with ubuntu 12.04 LTS.

All tests passed.

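[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO]
[INFO] Mahout Build Tools ................................ SUCCESS [1.964s]
[INFO] Apache Mahout ..................................... SUCCESS [0.400s]
[INFO] Mahout Math ....................................... SUCCESS [1:53.067s]
[INFO] Mahout Core ....................................... SUCCESS [9:09.716s]
[INFO] Mahout Integration ................................ SUCCESS [1:04.662s]
[INFO] Mahout Examples ................................... SUCCESS [3.331s]
[INFO] Mahout Release Package ............................ SUCCESS [0.000s]
[INFO] Mahout Math/Scala wrappers ........................ SUCCESS [11.356s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------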
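
A summary like this can also be checked mechanically rather than by eye. The following is a minimal sketch, assuming the Maven output was saved to a file (for example with `mvn clean test | tee build.log`); the function name `reactor_all_success` and the log file name are made up for illustration.

```shell
#!/bin/sh
# Scan Maven output (files given as arguments, or stdin) for [INFO] lines
# that carry a SUCCESS, FAILURE or SKIPPED status, which covers the
# per-module reactor lines and the final BUILD line.
# Prints every such line whose status is not SUCCESS.
# Returns 0 only if at least one status line was seen and all were SUCCESS.
reactor_all_success() {
  awk '
    $1 == "[INFO]" {
      for (i = 2; i <= NF; i++) {
        if ($i == "SUCCESS" || $i == "FAILURE" || $i == "SKIPPED") {
          seen++
          if ($i != "SUCCESS") { bad++; print }
          break
        }
      }
    }
    END { exit (seen == 0 || bad > 0) }
  ' "$@"
}
```

With a saved log, `reactor_all_success build.log && echo "all modules passed"` reports a clean build, and any failed or skipped module is printed so it can be named in the vote reply.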


Regards,
Yexi
