Re: [VOTE] Mahout 0.10.1 Release Candidate
Thanks Stevo.

This release has passed with 5 +1 binding PMC votes, and voting is officially closed. I will send the release announcement later today.

On Sun, May 31, 2015 at 3:35 PM, Stevo Slavić ssla...@gmail.com wrote:

+1 (binding)

Verified hashes and signatures; the distribution source tarball and zip unpack well, and the build passes from the unpacked sources.

On Sun, May 31, 2015 at 8:34 PM, Andrew Musselman andrew.mussel...@gmail.com wrote:

+1 (binding)

Verified that tests pass for the src tarball and zip. I'm comfortable skipping EMR smoke testing for a point release, given the team's opinion that it's not required.

On Sun, May 31, 2015 at 9:43 AM, Andrew Palumbo ap@outlook.com wrote:

+1 (binding)

Ran (on Hadoop 2.4.1 + Spark 1.2.1) all examples with all options in the .tar.gz binary archive in pseudo-cluster mode, and one with MAHOUT_LOCAL=true, with only the previously noted minor data issue, which I agree can wait for the next release. Ran a mix and match of the .zip binary archive examples with MAHOUT_LOCAL=true and in pseudo-cluster mode without issue. Tested the shell from both archives for the QR and matrix display fixes.

On 05/31/2015 12:09 PM, Pat Ferrel wrote:

+1 (binding)

Verified on Spark 1.3 pseudo-clustered HDFS 2.4. Some example data issues need cleanup, but that can wait for the next release.

On May 30, 2015, at 8:16 PM, Suneel Marthi smar...@apache.org wrote:

Verified build and tests locally for {source} * {zip, tar}. No issues found.

+1 (binding)

On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi smar...@apache.org wrote:

Andrew Palumbo / Dmitriy: Please also verify the various scenarios as described in M-1693.

On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi smar...@apache.org wrote:

Here's the new 0.10.1 Release Candidate:
https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/

Voting ends on Sunday, May 31, 2015. We need a +1 from the PMC for each of the line items below for the release to pass.

1. Ted/Grant: Verify hashes and checksums - {binary, source} x {zip, tar} + pom
2. AKM: Verify examples on EMR - {binary, source} * {zip, tar}
3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}
4. Suneel: Verify build and tests - {source} * {zip, tar}
5. Pat: Verify examples locally - {source} * {zip, tar}

The LICENSE and NOTICE files have not been updated this time and will be addressed in future releases.

On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi suneel.mar...@gmail.com wrote:

Please hold your votes; I will be refreshing staging with another build in the next hour.

On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman andrew.mussel...@gmail.com wrote:

Likewise, the source zip and tarballs build and pass tests.

On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi smar...@apache.org wrote:

Verified {source} * {zip, tar} and all tests pass.

+1 (binding)

On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi smar...@apache.org wrote:

This is a call for a VOTE to pass the Mahout 0.10.1 release candidate, which is available at
https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/

We need at least 3 PMC +1 (binding) votes to cut the release.

Below is the task breakdown for the PMC and committers:

Andy Palumbo, Pat Ferrel: verify the binary artifacts and run tests
Suneel, AKM: verify the src artifacts
Ted/Grant/Drew: verify the hashes and sigs

The LICENSE.txt and NOTICE.txt still need to be updated and will not be addressed as part of the 0.10.1 release.
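
For readers unfamiliar with the "verify the hashes" task in the thread above, here is a minimal Python sketch of the checksum half of that step. It is not the procedure the voters actually ran. The function names, the default `sha1` algorithm, and the assumption that the published digest is the first whitespace-separated token of a sidecar file are all illustrative choices. Signature verification needs an OpenPGP tool and is not covered.

```python
import hashlib
from pathlib import Path


def file_digest(path, algorithm="sha1", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(artifact, digest_file, algorithm="sha1"):
    """Compare an artifact's digest with its sidecar digest file.

    Assumption: the sidecar file starts with the hex digest, optionally
    followed by a file name. Other layouts would need different parsing.
    """
    tokens = Path(digest_file).read_text().split()
    published = tokens[0].lower() if tokens else ""
    return file_digest(artifact, algorithm) == published
```

A caller would pass the path of a downloaded archive and the path of the digest file published next to it; a `False` result means the download should not be trusted.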
Re: [VOTE] Mahout 0.10.1 Release Candidate
+1 (binding)

Ran (on Hadoop 2.4.1 + Spark 1.2.1) all examples with all options in the .tar.gz binary archive in pseudo-cluster mode, and one with MAHOUT_LOCAL=true, with only the previously noted minor data issue, which I agree can wait for the next release. Ran a mix and match of the .zip binary archive examples with MAHOUT_LOCAL=true and in pseudo-cluster mode without issue. Tested the shell from both archives for the QR and matrix display fixes.
Re: [VOTE] Mahout 0.10.1 Release Candidate
+1 (binding)

Verified hashes and signatures; the distribution source tarball and zip unpack well, and the build passes from the unpacked sources.
[ANNOUNCE] Apache Mahout 0.10.1 Released
The Apache Mahout PMC is pleased to announce the release of Mahout 0.10.1.

Mahout's goal is to create an environment for quickly creating machine learning applications that scale and run on the highest performance parallel computation engines available. Mahout comprises an interactive environment and library that supports generalized scalable linear algebra and includes many modern machine learning algorithms.

The Mahout Math environment we call “Samsara” for its symbol of universal renewal. It reflects a fundamental rethinking of how scalable machine learning algorithms are built and customized. Mahout-Samsara is here to help people create their own math while providing some off-the-shelf algorithm implementations. At its base are general linear algebra and statistical operations along with the data structures to support them. It’s written in Scala with Mahout-specific extensions, and runs most fully on Spark.

To get started with Apache Mahout 0.10.1, download the release artifacts and signatures from http://www.apache.org/dist/mahout/0.10.1/.

Many thanks to the contributors and committers who were part of this release. Please see below for the Release Highlights.

RELEASE HIGHLIGHTS

This is an incremental minor release over Mahout 0.10.0, meant to fix a few bugs and to support Spark 1.2.2 or earlier.

Mahout 0.10.1

1. This release fixes a major memory usage bug (MAHOUT-1707) in the co-occurrence analysis used by the spark-itemsimilarity driver. The driver now requires far less memory in the executor.
2. Support for Spark 1.2.2 or earlier: due to a bug in the JavaSerializer in Spark 1.2+ (SPARK-6069), we removed the use of Guava from any code executed in Spark executors. To do this we created a Scala Collections based BiMap, so any example code showing how to use the old Guava collections is obsolete.
3. Some minor fixes to Mahout-Samsara QR decomposition and matrix ops.
4. Trimmed the package size down to 200MB (MAHOUT-1704).
5. Minor testing indicates binary compatibility with Spark 1.3, except for the Mahout Shell, which does not run.

STATS

A total of 9 separate JIRA issues are addressed in this release [2], including 5 bug fixes.

Scope of Mahout 0.10.2 (targeted for June 28, 2015)

1. In-core transpose view rewrites. Modifiable transpose views (for (col <- a.t) col := 5).
2. Matrix structure flavor additions (understand general matrix structure and stride direction).
3. %*% optimization based on matrix flavors.
4. In-core ::= sparse assignment functions.
5. Assign := optimization (do proper traversal based on matrix flavors, similarly to %*%).
6. Adding in-place elementwise functional assignment (e.g. mxA := exp _, mxA ::= exp _).
7. Distributed and in-core versions of simple elementwise analogues of scala.math._. For example, for log(x) the convention is dlog(drm), mlog(mx), vlog(vec). Unfortunately we cannot overload these functions over what is done in scala.math, i.e. Scala would not allow log(mx) or log(drm) and log(Double) at the same time, mainly because they are defined in different packages.
8. Distributed performance bug fixes. This relates mostly to (a) matrix multiplication deficiencies, and (b) handling parallelism.
9. Distributed allreduceBlock predicate.
10. Distributed optimizer operators for elementwise functions. Rewrites recognizing e.g. 1 + drmX * dexp(drmX) as a single fused elementwise physical operator.
11. More cbind, rbind flavors (e.g. 1 cbind mxX, 1 cbind drmX or the other way around).

Mahout 0.11.0-snapshot (ongoing, but available)

1. Support for Spark 1.3 sequence file write.
2. Spark Shell (timing TBD).
3. First release that would see integration of Apache Mahout with Apache Flink as a backend.

GETTING STARTED

Download the release artifacts and signatures at http://www.apache.org/dist/mahout/0.10.1/

The examples directory contains several working examples of the core functionality available in Mahout. These can be run via scripts in the examples/bin directory. Most examples do not need a Hadoop cluster in order to run.

FUTURE PLANS

We will continue bug fixes and enhancements on the 0.10.x branch, which will remain dependent on Spark 1.2.x. Support for Spark 1.3 will be in the master branch, reflecting Mahout-0.11.0-SNAPSHOT. To see progress on this branch, look here: https://github.com/apache/mahout/commits/master. As of this writing it is not yet ready to build for Spark 1.3.

Integration with Apache Flink is in the works, in collaboration with TU Berlin and Data Artisans, to add Flink as the 3rd execution engine for Mahout. This would be in addition to the existing Apache Spark and H2O engines.

CONTRIBUTING

If you are interested in contributing, please see our How to Contribute http://mahout.apache.org/developers/how-to-contribute.html[3] page or contact us via email at
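
Release highlight 2 in the announcement above mentions replacing Guava's BiMap with a Scala Collections based one. For readers who have not met the data structure, the following is a minimal Python sketch of what a bidirectional map does. It is not Mahout's implementation and does not mirror its API. The class name, method names, and the item-ID example are hypothetical and chosen only for illustration.

```python
class BiMap:
    """A one-to-one mapping that can be looked up in both directions."""

    def __init__(self, pairs=()):
        self._forward = {}   # key -> value
        self._backward = {}  # value -> key
        for key, value in pairs:
            self.put(key, value)

    def put(self, key, value):
        """Add a pair, rejecting a value already bound to a different key."""
        if value in self._backward and self._backward[value] != key:
            raise ValueError(f"value {value!r} is already mapped")
        if key in self._forward:
            # Rebinding a key: drop its old value from the inverse view.
            del self._backward[self._forward[key]]
        self._forward[key] = value
        self._backward[value] = key

    def get(self, key):
        """Look up the value bound to a key."""
        return self._forward[key]

    def inverse_get(self, value):
        """Look up the key bound to a value."""
        return self._backward[value]

    def __len__(self):
        return len(self._forward)


# Hypothetical usage: translate external item IDs to matrix indices and back.
item_ids = BiMap([("item-a", 0), ("item-b", 1)])
index_of_b = item_ids.get("item-b")
id_at_zero = item_ids.inverse_get(0)
```

Keeping two dictionaries in sync trades extra memory for constant-time lookup in either direction, which is the property that makes such a map useful for ID dictionaries.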
Re: [VOTE] Mahout 0.10.1 Release Candidate
+1 (binding)

Verified that tests pass for the src tarball and zip. I'm comfortable skipping EMR smoke testing for a point release, given the team's opinion that it's not required.
[jira] [Commented] (MAHOUT-1699) Trim down Mahout packaging for next release
[ https://issues.apache.org/jira/browse/MAHOUT-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14566532#comment-14566532 ]

Andrew Palumbo commented on MAHOUT-1699:

MAHOUT-1705 and MAHOUT-1706 are optimizations rather than critical fixes, so I will resolve this.

Trim down Mahout packaging for next release
---

Key: MAHOUT-1699
URL: https://issues.apache.org/jira/browse/MAHOUT-1699
Project: Mahout
Issue Type: Improvement
Components: build
Affects Versions: 0.10.0
Reporter: Suneel Marthi
Priority: Critical
Fix For: 0.10.1

The Mahout 0.10.0 package size is 210MB; this needs to be trimmed down to a more manageable size. The current size also makes it hard to package Mahout into the BigTop distro, not to mention having to seek an infra waiver for the 200MB size at the time of release.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (MAHOUT-1699) Trim down Mahout packaging for next release
[ https://issues.apache.org/jira/browse/MAHOUT-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Palumbo resolved MAHOUT-1699.

Resolution: Fixed
Assignee: Andrew Palumbo

Trim down Mahout packaging for next release
---

Key: MAHOUT-1699
URL: https://issues.apache.org/jira/browse/MAHOUT-1699
Project: Mahout
Issue Type: Improvement
Components: build
Affects Versions: 0.10.0
Reporter: Suneel Marthi
Assignee: Andrew Palumbo
Priority: Critical
Fix For: 0.10.1

The Mahout 0.10.0 package size is 210MB; this needs to be trimmed down to a more manageable size. The current size also makes it hard to package Mahout into the BigTop distro, not to mention having to seek an infra waiver for the 200MB size at the time of release.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (MAHOUT-1680) Rename project release artifact names to be prefixed with 'apache-mahout-*
[ https://issues.apache.org/jira/browse/MAHOUT-1680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Stevo Slavic updated MAHOUT-1680:

Fix Version/s: 0.10.1 (was: 0.11.0)

Rename project release artifact names to be prefixed with 'apache-mahout-*
---

Key: MAHOUT-1680
URL: https://issues.apache.org/jira/browse/MAHOUT-1680
Project: Mahout
Issue Type: Improvement
Components: build
Affects Versions: 0.10.0
Reporter: Suneel Marthi
Assignee: Stevo Slavic
Fix For: 0.10.1

The artifacts generated now read as 'mahout-distribution-*'; the build configuration needs to change so that the artifacts are named 'apache-mahout-distribution-*'.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)