Re: MAHOUT 0.9 Release - New URL
I'll be rolling back the 0.9 Release today that's presently in staging in light of the issues that have been reported in the last 2 days and need to be fixed as part of the Release. Please hold off from committing any new code to trunk meanwhile. Thanks. On Friday, January 24, 2014 7:36 PM, Ted Dunning ted.dunn...@gmail.com wrote: My schedule has opened up a bit and I can review as well. On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Rolled back trunk to 0.9-SNAPSHOT, please go ahead and commit any changes. On Saturday, January 25, 2014 4:19 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: I'll be rolling back the 0.9 Release today that's presently in staging in light of the issues that have been reported in the last 2 days and need to be fixed as part of the Release. Please hold off from committing any new code to trunk meanwhile. Thanks. On Friday, January 24, 2014 7:36 PM, Ted Dunning ted.dunn...@gmail.com wrote: My schedule has opened up a bit and I can review as well. On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
My schedule has opened up a bit and I can review as well. On Fri, Jan 24, 2014 at 3:06 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: I will try the next candidate agaim, so one vote is sure. Am 24.01.2014 23:54 schrieb Suneel Marthi suneel_mar...@yahoo.com: I am open to having the conversation (and a part of me feels that the clusteringId fix should be in 0.9). If we decide to incorporate that into 0.9, I need to rollback the 0.9 Release that's presently out there in staging (for the 5th time in a row now). I am fine with doing that. What do you think we should do? a) Go ahead with 0.9 release without the fix for M-1410 . b) Rollback 0.9 and include the fix for M-1410 c) Go ahead with 0.9, have an interim 1.0 Release Candidate that includes M-1410 and any other issues/enhancements that are fixed. I am leaning towards (b), my only concern being that from my experience in the past few weeks; its become real hard to muster the minimum 3 +1 PMC votes required for a release to pass. On Friday, January 24, 2014 5:45 PM, Ted Dunning ted.dunn...@gmail.com wrote: Can we hold a separate discussion about whether the clustering id issue has to be in 0.9 while extending the vote deadline if necessary? If not, then all these votes are great and the release can go forward. If it is the sense that that fix has to be in, we should leave time for people for people to reverse their votes to -1. On Fri, Jan 24, 2014 at 2:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks for all those that volunteered. The voting for 0.9 Release closes tomorrow. On Friday, January 24, 2014 4:05 AM, Gokhan Capan gkhn...@gmail.com wrote: Using CentOS 6.5 and hadoop 1.2.1, all passed. +1 from me Gokhan On Thu, Jan 23, 2014 at 6:01 PM, Andrew Palumbo ap@outlook.com wrote: a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.org wrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
+1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com
Re: MAHOUT 0.9 Release - New URL
I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.orgwrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
RE: MAHOUT 0.9 Release - New URL
a),b),c),d) all passed on CentOS for me Date: Thu, 23 Jan 2014 13:43:06 +0200 Subject: Re: MAHOUT 0.9 Release - New URL From: ssvinarc...@hortonworks.com To: dev@mahout.apache.org I did a), b), c), d) and all steps pass. +1 On Thu, Jan 23, 2014 at 1:40 PM, Grant Ingersoll gsing...@apache.orgwrote: +1 from me. On Jan 22, 2014, at 5:55 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized. Grant Ingersoll | @gsingers http://www.lucidworks.com -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
I did a) b) c) and d) without noting any problem so far. +1 from me. --sebastian On 01/22/2014 11:55 PM, Suneel Marthi wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
Same here. I did a), b), c) and d) too and all tests pass. Here's my +1, if my vote counts. On Wednesday, January 22, 2014 7:11 PM, Sebastian Schelter s...@apache.org wrote: I did a) b) c) and d) without noting any problem so far. +1 from me. --sebastian On 01/22/2014 11:55 PM, Suneel Marthi wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
Likewise, a) through d) work on an Amazon AMI and Ubuntu 12.04. +1 On Wed, Jan 22, 2014 at 6:38 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Same here. I did a), b), c) and d) too and all tests pass. Here's my +1, if my vote counts. On Wednesday, January 22, 2014 7:11 PM, Sebastian Schelter s...@apache.org wrote: I did a) b) c) and d) without noting any problem so far. +1 from me. --sebastian On 01/22/2014 11:55 PM, Suneel Marthi wrote: Fixed the issues that were reported this week and restored FP mining into the codebase. Here's the URL for the final release in staging:- https://repository.apache.org/content/repositories/orgapachemahout-1003/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC, need a minimum of 3 '+1' votes for the release to be finalized.
Re: MAHOUT 0.9 Release - New URL
Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0 : [distance-squared=0.9509142993214911]: /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor = [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048, 4419:0.076, 4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048, 7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077, 7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081, 10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087, 12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061, 14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124, 20346:0.159, 21233:0.075, 23657:0.089, 23939:0.078, 23974:0.105, 23998:0.146, 24962:0.122, 25003:0.093, 25084:0.151, 25128:0.052, 29143:0.095, 30459:0.046, 30806:0.075, 31559:0.046, 31727:0.104, 31895:0.105, 31900:0.153, 32149:0.079, 32993:0.069, 33112:0.177, 33208:0.101, 33351:0.089, 33533:0.079, 33638:0.042, 35795:0.066, 36189:0.078, 36491:0.046, 36500:0.093, 36625:0.200, 37111:0.071, 39336:0.079, 39789:0.067, 39933:0.073, 39967:0.079, 41155:0.167, 41280:0.065, 41696:0.072, 41947:0.118, 43685:0.086, 44077:0.308, 44353:0.215, 44423:0.085, 45215:0.151, 45775:0.052, 46766:0.074, 47823:0.082, 48120:0.080, 48212:0.109, 48436:0.110] [snip] *clustering; dirichlet:* Get this complaint: Running Dirichlet with K = 8 Running on hadoop, using /home/ec2-user/hadoop-1.2.1/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /home/ec2-user/mahout-distribution-0.9/examples/target/mahout-examples-0.9-job.jar 14/01/21 05:16:35 WARN driver.MahoutDriver: Unable to add class: dirichlet 14/01/21 05:16:35 WARN driver.MahoutDriver: No dirichlet.props found on classpath, will use command-line arguments only Unknown program 'dirichlet' chosen. *clustering: minhash:* Running Minhash Running on hadoop, using /home/ec2-user/hadoop-1.2.1/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /home/ec2-user/mahout-distribution-0.9/examples/target/mahout-examples-0.9-job.jar 14/01/21 05:17:27 WARN driver.MahoutDriver: Unable to add class: minhash
Re: MAHOUT 0.9 Release - New URL
Sure thing; continuing to smoke test the other examples tonight On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0 : [distance-squared=0.9509142993214911]: /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor = [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048, 4419:0.076, 4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048, 7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077, 7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081, 10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087, 12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061, 14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124, 20346:0.159, 21233:0.075, 23657:0.089, 23939:0.078, 23974:0.105, 23998:0.146, 24962:0.122, 25003:0.093, 25084:0.151, 25128:0.052, 29143:0.095, 30459:0.046, 30806:0.075, 31559:0.046, 31727:0.104, 31895:0.105, 31900:0.153, 32149:0.079, 32993:0.069, 33112:0.177, 33208:0.101, 33351:0.089, 33533:0.079, 33638:0.042, 35795:0.066, 36189:0.078, 36491:0.046, 36500:0.093, 36625:0.200, 37111:0.071, 39336:0.079, 39789:0.067, 39933:0.073, 39967:0.079, 41155:0.167, 41280:0.065, 41696:0.072, 41947:0.118, 43685:0.086, 44077:0.308, 44353:0.215, 44423:0.085, 45215:0.151, 45775:0.052, 46766:0.074, 47823:0.082, 48120:0.080, 48212:0.109, 48436:0.110] [snip] *clustering; dirichlet:* Get this complaint: Running Dirichlet with K = 8 Running on hadoop, using /home/ec2-user/hadoop-1.2.1/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /home/ec2-user/mahout-distribution-0.9/examples/target/mahout-examples-0.9-job.jar 14/01/21 05:16:35 WARN driver.MahoutDriver: Unable to add class: dirichlet 14/01/21 05:16:35 WARN driver.MahoutDriver: No dirichlet.props found on classpath, will use command-line arguments only Unknown program 'dirichlet' chosen. *clustering: minhash:* Running Minhash Running on
RE: MAHOUT 0.9 Release - New URL
from the asf-email-examples.sh script: # You will need to download or otherwise obtain some or all of the Amazon ASF Em ail Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script. # To obtain a full copy you will need to launch an EC2 instance and mount the da taset to download it, otherwise you can get a sample of it at # http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout It looks like the: http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout link is down. Is there somewhere else that we can get a subset of the ASF emails? Date: Tue, 21 Jan 2014 09:48:06 -0800 Subject: Re: MAHOUT 0.9 Release - New URL From: andrew.mussel...@gmail.com To: dev@mahout.apache.org Sure thing; continuing to smoke test the other examples tonight On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0 : [distance-squared=0.9509142993214911]: /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor = [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048, 4419:0.076, 4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048, 7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077, 7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081, 10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087, 12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061, 14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124, 20346:0.159, 21233:0.075, 23657:0.089, 23939:0.078, 23974:0.105, 23998:0.146, 24962:0.122, 25003:0.093, 25084:0.151, 25128:0.052, 29143:0.095, 30459:0.046, 30806:0.075, 31559:0.046, 31727:0.104, 31895:0.105, 31900:0.153, 32149:0.079, 32993:0.069, 33112:0.177, 33208:0.101, 33351:0.089, 33533:0.079, 33638:0.042, 35795:0.066, 36189:0.078, 36491
Re: MAHOUT 0.9 Release - New URL
Thanks Andrew for reporting that. I rolled back the release to fix this and few other issues. We have removed asf-examples*.sh from trunk as the sample file at the url mentioned in ur email is not available. This is something we need to fix and restore in 1.0. On Tuesday, January 21, 2014 3:21 PM, Andrew Palumbo ap@outlook.com wrote: from the asf-email-examples.sh script: # You will need to download or otherwise obtain some or all of the Amazon ASF Em ail Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script. # To obtain a full copy you will need to launch an EC2 instance and mount the da taset to download it, otherwise you can get a sample of it at # http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout It looks like the: http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout link is down. Is there somewhere else that we can get a subset of the ASF emails? Date: Tue, 21 Jan 2014 09:48:06 -0800 Subject: Re: MAHOUT 0.9 Release - New URL From: andrew.mussel...@gmail.com To: dev@mahout.apache.org Sure thing; continuing to smoke test the other examples tonight On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0 : [distance-squared=0.9509142993214911]: /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor = [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048, 4419:0.076, 4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048, 7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077, 7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081, 10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087, 12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061, 14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124, 20346
Re: MAHOUT 0.9 Release - New URL
75% Reliability70.6238% Reliability (standard deviation)0.2187 Log-likelihoodmean :-1.1182 25%-ile :-1.6911 75%-ile :-0.0803 Jan 21, 2014 9:46:39 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 10783 ms (Minutes: 0.17971) On Tue, Jan 21, 2014 at 1:08 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Andrew for reporting that. I rolled back the release to fix this and few other issues. We have removed asf-examples*.sh from trunk as the sample file at the url mentioned in ur email is not available. This is something we need to fix and restore in 1.0. On Tuesday, January 21, 2014 3:21 PM, Andrew Palumbo ap@outlook.com wrote: from the asf-email-examples.sh script: # You will need to download or otherwise obtain some or all of the Amazon ASF Em ail Public Dataset (http://aws.amazon.com/datasets/7791434387204566) to use this script. # To obtain a full copy you will need to launch an EC2 instance and mount the da taset to download it, otherwise you can get a sample of it at # http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout It looks like the: http://www.lucidimagination.com/devzone/technical-articles/scaling-mahout link is down. Is there somewhere else that we can get a subset of the ASF emails? Date: Tue, 21 Jan 2014 09:48:06 -0800 Subject: Re: MAHOUT 0.9 Release - New URL From: andrew.mussel...@gmail.com To: dev@mahout.apache.org Sure thing; continuing to smoke test the other examples tonight On Tue, Jan 21, 2014 at 9:23 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Andrew M., see that some of the example scripts need to be fixed as they still refer to the deprecated algorithms. See that the Streaming KMeans has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0
Re: MAHOUT 0.9 Release - New URL
Hmmm... that's an issue. Since both Dirichlet and Meanshift clustering have been removed from 0.9, cluster-syntheticcontrol.sh options 4,5 are not gonna work and should have been removed for 0.9. To PMC, - rollback the release, fix this issue (and other patches that were submitted in the last few days) and put out another release ? On Monday, January 20, 2014 12:33 AM, Andrew Palumbo ap@outlook.com wrote: I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To:
Re: MAHOUT 0.9 Release - New URL
This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging. Thanks for reporting this Andrew P. On Monday, January 20, 2014 12:34 AM, Andrew Palumbo ap@outlook.com wrote: I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u...@mahout.apache.org; dev@mahout.apache.org Third time's a Charm!!! Here's the new URL for Mahout 0.9
Re: MAHOUT 0.9 Release - New URL
Trying out the build today On Mon, Jan 20, 2014 at 6:00 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging. Thanks for reporting this Andrew P. On Monday, January 20, 2014 12:34 AM, Andrew Palumbo ap@outlook.com wrote: I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From:
Re: MAHOUT 0.9 Release - New URL
Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output of examples:* *asf-email-examples.sh, run on mahout.apache.org http://mahout.apache.org:* *recommendations:* [ec2-user@ip-10-73-146-199 bin]$ hadoop fs -cat /user/ec2-user/asf-output/prefs/recommendations/part-r-0 | less 1 [21935:1.0,23122:1.0,24084:1.0,26397:1.0,1755:1.0,20743:1.0,13428:1.0,19483:1.0,24067:1.0] 4 [14372:1.0,28069:1.0,12258:1.0,18412:1.0,26707:1.0,14610:1.0,2909:1.0,14777:1.0,11792:1.0,26764:1.0] 6 [5442:1.0,18416:1.0,17554:1.0,14610:1.0,16767:1.0,16740:1.0,26743:1.0,11792:1.0,26707:1.0,28116:1.0] 8 [12758:1.0,19409:1.0,2:1.0] 11 [25890:1.0,26743:1.0,9122:1.0,14512:1.0,28116:1.0,17499:1.0,14976:1.0,14561:1.0,3686:1.0,26707:1.0] 14 [29596:1.0,25567:1.0,19520:1.0,26327:1.0,13809:1.0,29435:1.0,17331:1.0,17290:1.0,17819:1.0,3829:1.0] 15 [15355:1.0,15322:1.0,23191:1.0,7990:1.0,15318:1.0,15236:1.0,17789:1.0,15286:1.0,20916:1.0,2812:1.0] 16 [23647:1.0,18137:1.0,1692:1.0,11490:1.0,4303:1.0,12906:1.0,5120:1.0,29503:1.0,19409:1.0,27700:1.0] 18 [29738:1.0,12070:1.0,24078:1.0,19449:1.0,17819:1.0,11549:1.0,25410:1.0,15228:1.0,24930:1.0,23708:1.0] 19 [28008:1.0,18416:1.0,2909:1.0,29250:1.0,28023:1.0,14974:1.0] 20 [19313:1.0,3464:1.0,12394:1.0,18665:1.0,16601:1.0,25816:1.0,10212:1.0,11626:1.0,18577:1.0,16734:1.0] [snip] *clustering; kmeans:* [snip] Weight : [props - optional]: Point: 1.0 : [distance-squared=1.0193102046188427]: /commits/200802.gz/20835820.1202052180347.JavaMail.www-data@brutus = [1065:0.195, 1977:0.355, 2246:0.091, 3008:0.078, 5336:0.110, 7573:0.204, 7683:0.126, 7715:0.365, 7812:0.180, 7832:0.075, 8268:0.093, 9779:0.159, 10257:0.133, 10972:0.158, 11663:0.143, 15313:0.065, 17007:0.244, 19359:0.183, 19399:0.338, 19525:0.139, 20224:0.140, 24649:0.095, 25003:0.076, 29143:0.156, 30459:0.075, 31537:0.156, 31559:0.075, 31668:0.139, 33208:0.117, 33425:0.218, 36491:0.075, 38378:0.130, 39789:0.110, 40743:0.190, 45775:0.086] 1.0 : [distance-squared=0.9823018320457279]: /commits/200808.gz/1722278226.1219149603005.JavaMail.www-data@brutus = [1065:0.188, 2246:0.088, 3008:0.076, 3620:0.239, 5200:0.104, 5336:0.106, 6404:0.088, 7552:0.335, 7683:0.122, 7715:0.376, 7812:0.173, 7832:0.072, 10257:0.128, 11663:0.195, 15313:0.063, 16660:0.094, 19359:0.177, 19525:0.134, 19551:0.101, 20025:0.183, 21233:0.098, 24649:0.092, 25003:0.112, 27650:0.283, 27653:0.216, 29143:0.150, 30459:0.072, 30868:0.208, 31559:0.126, 31565:0.203, 33208:0.113, 36491:0.073, 36610:0.141, 36767:0.208, 38378:0.125, 39789:0.106, 45775:0.083] 1.0 : [distance-squared=0.9509142993214911]: /commits/201006.gz/5844140.863.1277658000780.JavaMail.confluence@thor = [648:0.100, 914:0.066, 2040:0.076, 2246:0.078, 3008:0.048, 4419:0.076, 4452:0.070, 5200:0.065, 5203:0.140, 5336:0.067, 6404:0.056, 7235:0.048, 7310:0.077, 7464:0.067, 7471:0.060, 7489:0.093, 7505:0.123, 7683:0.077, 7715:0.145, 7814:0.072, 7912:0.155, 8268:0.098, 9835:0.118, 10225:0.081, 10257:0.114, 11127:0.112, 11510:0.086, 11589:0.139, 11663:0.087, 12641:0.117, 13837:0.052, 14030:0.062, 14089:0.051, 14352:0.061, 14396:0.185, 17015:0.115, 17240:0.097, 18767:0.149, 19774:0.124, 20346:0.159, 21233:0.075, 23657:0.089, 23939:0.078, 23974:0.105, 23998:0.146, 24962:0.122, 25003:0.093, 25084:0.151, 25128:0.052, 29143:0.095, 30459:0.046, 30806:0.075, 31559:0.046, 31727:0.104, 31895:0.105, 31900:0.153, 32149:0.079, 32993:0.069, 33112:0.177, 33208:0.101, 33351:0.089, 33533:0.079, 33638:0.042, 35795:0.066, 36189:0.078, 36491:0.046, 36500:0.093, 36625:0.200, 37111:0.071, 39336:0.079, 39789:0.067, 39933:0.073, 39967:0.079, 41155:0.167, 41280:0.065, 41696:0.072, 41947:0.118, 43685:0.086, 44077:0.308, 44353:0.215, 44423:0.085, 45215:0.151, 45775:0.052, 46766:0.074, 47823:0.082, 48120:0.080, 48212:0.109, 48436:0.110] [snip] *clustering; dirichlet:* Get this complaint: Running Dirichlet with K = 8 Running on hadoop, using /home/ec2-user/hadoop-1.2.1/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /home/ec2-user/mahout-distribution-0.9/examples/target/mahout-examples-0.9-job.jar 14/01/21 05:16:35 WARN driver.MahoutDriver: Unable to add class: dirichlet 14/01/21 05:16:35 WARN driver.MahoutDriver: No dirichlet.props found on classpath, will use command-line arguments only Unknown program 'dirichlet' chosen. *clustering: minhash:* Running Minhash Running on hadoop, using /home/ec2-user/hadoop-1.2.1/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /home/ec2-user/mahout-distribution-0.9/examples/target/mahout-examples-0.9-job.jar 14/01/21 05:17:27 WARN driver.MahoutDriver: Unable to add class: minhash 14/01/21 05:17:27 WARN driver.MahoutDriver: No minhash.props found on classpath, will use command-line arguments only Unknown program 'minhash' chosen. *classification; standard:* === Summary --- Correctly Classified Instances
Re: MAHOUT 0.9 Release - New URL
Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
-1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.aschttps://people.apache.org/keys/committer/pwendell.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
OK, running in MR mode now. On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
When I run in MR mode I get the same problem. See http://pastebin.com/TXJ5mQmt On Sun, Jan 19, 2014 at 5:31 PM, Frank Scholten fr...@frankscholten.nlwrote: OK, running in MR mode now. On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
It works when both MAHOUT_LOCAL=true and '-xm sequential' option are set. Guess will have to cut a release again with '-xm sequential' option set. On Sunday, January 19, 2014 11:31 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
The reason u r seeing the error is because there are were no sequence files in HDFS in MR mode to begin with = hence no term vectors generated = and hence no vectors to cluster. MR mode: 1. Set HADOOP_HOME 2. unset MAHOUT_LOCAL 3. clean up ur local /tmp/mahout-work-x directory 4. run ./examples/bin/cluster-reuters.sh = option 4 Sequential Mode: - 1. set MAHOUT_LOCAL=true 2. Add -xm sequential flag to cluster-reuters.sh script 3. run ./examples/bin/cluster-reuters.sh = option 4 On Sunday, January 19, 2014 12:22 PM, Frank Scholten fr...@frankscholten.nl wrote: When I run in MR mode I get the same problem. See http://pastebin.com/TXJ5mQmt On Sun, Jan 19, 2014 at 5:31 PM, Frank Scholten fr...@frankscholten.nl wrote: OK, running in MR mode now. On Sun, Jan 19, 2014 at 5:30 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Its presently setup to run in MR mode (the way its been coded in cluster-reuters.sh). So setting MAHOUT_LOCAL=true is gonna fail for this. I am able to see this fail locally when MAHOUT_LOCAL=true. On Sunday, January 19, 2014 11:17 AM, Frank Scholten fr...@frankscholten.nl wrote: Exported MAHOUT_LOCAL=true and still get the same results. On Sun, Jan 19, 2014 at 5:00 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Frank, Were u running this with MAHOUT_LOCAL=true? On Sunday, January 19, 2014 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: -1 The cluster reuters example results in zero clusters when choosing streaming k-means. The other steps, unpacking and building do work. I see this stacktrace: INFO: Number of Centroids: 0 Jan 19, 2014 3:51:08 PM org.apache.hadoop.mapred.LocalJobRunner$Job run WARNING: job_local797072544_0001 java.lang.IllegalArgumentException: Must have nonzero number of training and test vectors. Asked for %.1f %% of %d vectors for test [10.00149011612, 0] at com.google.common.base.Preconditions.checkArgument(Preconditions.java:120) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.splitTrainTest(BallKMeans.java:176) at org.apache.mahout.clustering.streaming.cluster.BallKMeans.cluster(BallKMeans.java:192) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.getBestCentroids(StreamingKMeansReducer.java:107) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:73) at org.apache.mahout.clustering.streaming.mapreduce.StreamingKMeansReducer.reduce(StreamingKMeansReducer.java:37) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:177) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:418) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 3:51:09 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 278 ms (Minutes: 0.004633) cluster,distance.mean,distance.sd ,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Here is the full log: http://pastebin.com/TxLV0rDr As of yet I am unfamiliar with the streaming k-means code and the algorithms behind it. If anyone has suggestion on what goes wrong in the code I am I happy to help where I can. Frank On Sun, Jan 19, 2014 at 10:55 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Thanks Grant. Not sure if I can vote given my role as the BuildMeister/ReleaseMeister for 0.9. Here's my +1 FWIW. a) Attached is the draft of the Release notes for 0.9, would definitely appreciate feedback on that. b) The vote is open until Monday, Jan 20, 2014 11:59PM EST and passes if a majority of atleast 3 +1 PMC votes are cast. The release files, including signatures, digests, etc can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ The staging repository for this release can be found at: https://repository.apache.org/content/repositories/orgapachemahout-1002 Release artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc On Saturday, January 18, 2014 12:27 PM, Grant Ingersoll gsing...@apache.org wrote: Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run
Re: MAHOUT 0.9 Release - New URL
I'll try to test out soon
Re: MAHOUT 0.9 Release - New URL
+1 (binding) On Sun, Jan 19, 2014 at 7:49 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: I'll try to test out soon
Re: MAHOUT 0.9 Release - New URL
Stevo, could u test streaming kmeans? Sent from my iPhone On Jan 19, 2014, at 8:10 PM, Stevo Slavić ssla...@gmail.com wrote: +1 (binding) On Sun, Jan 19, 2014 at 7:49 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: I'll try to test out soon
RE: MAHOUT 0.9 Release - New URL
I ran through the tests with on a CentOS VM AMD64 2 cores 4 GB RAM. Had a bit of trouble getting the Hadoop natives to compile and therefore may have run into some problems because of the hadoop setup. Ran into some problems in the example scripts. Particularly with ./cluster-syntheticcontrol.sh -4,5. I will run through the rest of the examples when im sure I've got hadoop setup right. Apache Maven 3.1.2-SNAPSHOT Java version: 1.6.0_45, vendor: Sun Microsystems Inc. Java home: /usr/java/jdk1.6.0_45/jre OS name: linux, version: 2.6.32-358.23.2.el6.x86_64, arch: amd64, family: unix $MAHOUT_LOCAL=true Hadoop 2.2.0 a) Verify that u can unpack the release (tar or zip) ...passed (tar) [passed ] b) Verify u r able to compile the distro mvn compile- [passed with warnings] [WARNING] Expected all dependencies to require Scala version: 2.9.3 [WARNING] org.apache.mahout:mahout-math-scala:0.9 requires scala version: 2.9.3 [WARNING] org.scalatest:scalatest_2.9.2:1.9.1 requires scala version: 2.9.2 [WARNING] Multiple versions of scala libraries detected! c) Run through the unit tests: mvn clean test mvn clean test [passed] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script Running example scripts with $MAHOUT_LOCAL=true ./cluster-syntheticcontrol.sh -1 [works] ./cluster-syntheticcontrol.sh -2 [works] ./cluster-syntheticcontrol.sh -3 [works] ./cluster-syntheticcontrol.sh -4 [exits, throws exception] [...] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:55:31 PM org.slf4j.impl.JCLLoggerAdapter warn ./cluster-syntheticcontrol.sh -5 [exits, throws exception] WARNING: Unable to add class: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job java.lang.ClassNotFoundException: org.apache.mahout.clustering.syntheticcontrol.meanshift.Job at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:190) at java.lang.ClassLoader.loadClass(ClassLoader.java:306) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:247) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:171) at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:128) Jan 19, 2014 7:59:51 PM org.slf4j.impl.JCLLoggerAdapter warn WARNING: No org.apache.mahout.clustering.syntheticcontrol.meanshift.Job.props found on classpath, will use command-line arguments only Unknown program 'org.apache.mahout.clustering.syntheticcontrol.meanshift.Job' chosen. ./classify-20newsgroups.sh -1 [works] ./classify-20newsgroups.sh -2 [works] cluster-reuters.sh -1 [works] cluster-reuters.sh -2 [works] cluster-reuters.sh -3 [works] Same error as noted previosly in the thread: cluster-reuters.sh -4 [0 clusters] [...] WARNING: No qualcluster.props found on classpath, will use command-line arguments only Num clusters: 0; maxDistance: 0.00 [Dunn Index] First: Infinity [Davies-Bouldin Index] First: NaN Jan 19, 2014 7:13:57 PM org.slf4j.impl.JCLLoggerAdapter info INFO: Program took 669 ms (Minutes: 0.01115) cluster,distance.mean,distance.sd,distance.q0,distance.q1,distance.q2,distance.q3,distance.q4,count,is.train Date: Thu, 16 Jan 2014 06:41:09 -0800 From: suneel_mar...@yahoo.com Subject: MAHOUT 0.9 Release - New URL To: u...@mahout.apache.org; dev@mahout.apache.org Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the
Re: MAHOUT 0.9 Release - New URL
Ran the tests, verified sigs, tried out a few of the examples. +1 (binding) On Jan 16, 2014, at 9:41 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
On Thu, Jan 16, 2014 at 7:35 AM, Sotiris Salloumis i...@eprice.gr wrote: c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] ?! Was that seconds? Or really milliseconds?
RE: MAHOUT 0.9 Release - New URL
Sorry my mistake milliseconds was the last test … below the full results ~/mahout/apache-maven-3.1.1/bin/mvn -DskipTests clean install [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 52.312s [INFO] Finished at: Sat Jan 18 02:04:29 CET 2014 [INFO] Final Memory: 46M/305M [INFO] ~/mahout/apache-maven-3.1.1/bin/mvn clean test [INFO] [INFO] Reactor Summary: [INFO] [INFO] Mahout Build Tools SUCCESS [1.166s] [INFO] Apache Mahout . SUCCESS [0.264s] [INFO] Mahout Math ... SUCCESS [58.639s] [INFO] Mahout Core ... SUCCESS [4:01.640s] [INFO] Mahout Integration SUCCESS [21.481s] [INFO] Mahout Examples ... SUCCESS [1.980s] [INFO] Mahout Release Package SUCCESS [0.003s] [INFO] Mahout Math/Scala wrappers SUCCESS [14.149s] [INFO] [INFO] BUILD SUCCESS [INFO] [INFO] Total time: 5:39.563s [INFO] Finished at: Sat Jan 18 02:10:53 CET 2014 [INFO] Final Memory: 51M/1068M [INFO] From: Ted Dunning [mailto:ted.dunn...@gmail.com] Sent: Saturday, January 18, 2014 2:50 AM To: Mahout Dev List; Sotiris Salloumis Cc: Suneel Marthi; u...@mahout.apache.org Subject: Re: MAHOUT 0.9 Release - New URL On Thu, Jan 16, 2014 at 7:35 AM, Sotiris Salloumis i...@eprice.gr mailto:i...@eprice.gr wrote: c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] ?! Was that seconds? Or really milliseconds?
RE: MAHOUT 0.9 Release - New URL
Hi Suneel, Below first round of tests, Environment: SMP Debian 3.2.51-1 x86_64 Machine: Intel(R) Core(TM) i7 CPU 950 @ 3.07GHz stepping 05 12GB RAM OpenJDK: javac 1.6.0_27 a) Verify that u can unpack the release (tar or zip) [ Passed: tar -zxvf ] b) Verify u r able to compile the distro [ Passed: With OpenJDK, Latest Maven on LatestDebian ] c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. [Ongoing will update later] Regards Sotiris -Original Message- From: Suneel Marthi [mailto:suneel_mar...@yahoo.com] Sent: Thursday, January 16, 2014 4:41 PM To: u...@mahout.apache.org; mahout Subject: MAHOUT 0.9 Release - New URL Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/ apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
a), b), and c) all pass for me. Don't have the setup yet at work to go through d), will wait for others to verify. On 1/16/14, 9:41 AM, Suneel Marthi wrote: Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.
Re: MAHOUT 0.9 Release - New URL
I tested mahout 0.9 on Ubuntu 12.04 64bit, java version 1.6.0_27 a) Verify that u can unpack the release (tar or zip) - passed b) Verify u r able to compile the distro - passed c) Run through the unit tests: mvn clean test -passed d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. - will update later On Thu, Jan 16, 2014 at 5:35 PM, Sotiris Salloumis i...@eprice.gr wrote: Hi Suneel, Below first round of tests, Environment: SMP Debian 3.2.51-1 x86_64 Machine: Intel(R) Core(TM) i7 CPU 950 @ 3.07GHz stepping 05 12GB RAM OpenJDK: javac 1.6.0_27 a) Verify that u can unpack the release (tar or zip) [ Passed: tar -zxvf ] b) Verify u r able to compile the distro [ Passed: With OpenJDK, Latest Maven on LatestDebian ] c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. [Ongoing will update later] Regards Sotiris -Original Message- From: Suneel Marthi [mailto:suneel_mar...@yahoo.com] Sent: Thursday, January 16, 2014 4:41 PM To: u...@mahout.apache.org; mahout Subject: MAHOUT 0.9 Release - New URL Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/ apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: MAHOUT 0.9 Release - New URL
OS X 10.9.1, java version 1.6.0_65. On 1/16/14, 10:41 AM, Sergey Svinarchuk wrote: I tested mahout 0.9 on Ubuntu 12.04 64bit, java version 1.6.0_27 a) Verify that u can unpack the release (tar or zip) - passed b) Verify u r able to compile the distro - passed c) Run through the unit tests: mvn clean test -passed d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. - will update later On Thu, Jan 16, 2014 at 5:35 PM, Sotiris Salloumis i...@eprice.gr wrote: Hi Suneel, Below first round of tests, Environment: SMP Debian 3.2.51-1 x86_64 Machine: Intel(R) Core(TM) i7 CPU 950 @ 3.07GHz stepping 05 12GB RAM OpenJDK: javac 1.6.0_27 a) Verify that u can unpack the release (tar or zip) [ Passed: tar -zxvf ] b) Verify u r able to compile the distro [ Passed: With OpenJDK, Latest Maven on LatestDebian ] c) Run through the unit tests: mvn clean test [ Passed: 370 milliseconds] d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. [Ongoing will update later] Regards Sotiris -Original Message- From: Suneel Marthi [mailto:suneel_mar...@yahoo.com] Sent: Thursday, January 16, 2014 4:41 PM To: u...@mahout.apache.org; mahout Subject: MAHOUT 0.9 Release - New URL Third time's a Charm!!! Here's the new URL for Mahout 0.9 Release: https://repository.apache.org/content/repositories/orgapachemahout-1002/org/ apache/mahout/mahout-distribution/0.9/ For those volunteering to test this, some of the things to be verified: a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Committers and PMC members: --- Need 'at least 3 +1 votes' for the Release to pass. Thanks and Regards.