Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Suneel Marthi
Thanks Stevo.

This release has passed with 5 +1 PMC binding votes and the Voting is
officially closed, will send the Release Announce later today.



On Sun, May 31, 2015 at 3:35 PM, Stevo Slavić ssla...@gmail.com wrote:

 +1 (binding)

 Verified hashes and signatures; distribution sources tarball and zip unpack
 well, build passes from unpacked sources.

 On Sun, May 31, 2015 at 8:34 PM, Andrew Musselman 
 andrew.mussel...@gmail.com wrote:

  +1 (binding)
 
  Verified tests pass for src tarball and zip; I'm comfortable skipping EMR
  smoke testing for a point release given team opinion that it's not
  required.
 
  On Sun, May 31, 2015 at 9:43 AM, Andrew Palumbo ap@outlook.com
  wrote:
 
   +1 (binding)
  
   Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in
 the
   |.tar.gz| binary archive in pseudo-cluster mode and one with
   MAHOUT_LOCAL=true with only the previously noted minor data issue,
 which
  I
   agree can wait for the next release.
  
   Ran a mix and match of the |.zip| binary archive examples with
   MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.
  
   Tested the shell from both archives for qr and matrix display fixes.
  
  
  
   On 05/31/2015 12:09 PM, Pat Ferrel wrote:
  
   +1 (binding)
  
   Verified on Spark 1.3 psuedo-clustered HDFS 2.4
  
   There are some cleanup of example data issues that can wait for next
   release.
  
  
   On May 30, 2015, at 8:16 PM, Suneel Marthi smar...@apache.org
 wrote:
  
   Verified locally build and tests for {source} * {zip, tar}. No issues
   found.
  
   +1 (binding)
  
   On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi smar...@apache.org
   wrote:
  
Andrew Palumbo / Dmitriy:  Please also verify the various scenarios
 as
   described in M-1693
  
   On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi smar...@apache.org
   wrote:
  
Here's the new 0.10.1 Release Candidate
  
  
  
  
 
 https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/
  
   The Voting ends on Sunday, May 31 2015.
  
   Need a +1 from the PMC for each of the line items below for the
  release
   to pass.
  
   1. Ted/Grant:  Verify hashes and checksums - {binary,source} x
  {zip,tar}
   + pom
  
   2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}
  
   3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}
  
   4. Suneel: Verify build and tests - {source} * {zip, tar}
  
   5. Pat:  Verify examples locally - {source} * {zip, tar}
  
   The LICENSE and NOTICE files have not been updated this time and
 will
  be
   addressed in future releases.
  
  
  
   On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi 
  suneel.mar...@gmail.com
   
   wrote:
  
Please hold ur votes, will be refreshing staging with another build
  in
   the next hour
  
   On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman 
   andrew.mussel...@gmail.com wrote:
  
Likewise source zip and tarballs build and pass tests.
  
   On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi 
 smar...@apache.org
   wrote:
  
Verified {source} * {zip, tar} and all tests pass.
  
   +1 (binding)
  
   On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi 
 smar...@apache.org
  
  
   wrote:
  
   This is a call for VOTE to pass Mahout 0.10.1 release candidate
  
   that's
  
   available at
  
  
  
  
  
 
 https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/
  
   Need atleast 3 PMC +1 (binding) votes to cut the release
  
   Below are the tasks breakdown for the PMC and committers:
  
   Andy Palumbo  Pat Ferrel: verify the binary artifacts and run
  tests
  
   Suneel  AKM:  verify the src artifacts
  
   Ted/Grant/Drew: verify the hashes and Sigs
  
   The LICENSE.txt and NOTICE.txt still need to be updated and will
  
   not be
  
   addressed as part of 0.10.1 release.
  
  
  
  
  
  
 



Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Andrew Palumbo

+1 (binding)

Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in the 
|.tar.gz| binary archive in pseudo-cluster mode and one with 
MAHOUT_LOCAL=true with only the previously noted minor data issue, which 
I agree can wait for the next release.


Ran a mix and match of the |.zip| binary archive examples with 
MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.


Tested the shell from both archives for qr and matrix display fixes.


On 05/31/2015 12:09 PM, Pat Ferrel wrote:

+1 (binding)

Verified on Spark 1.3 psuedo-clustered HDFS 2.4

There are some cleanup of example data issues that can wait for next release.


On May 30, 2015, at 8:16 PM, Suneel Marthi smar...@apache.org wrote:

Verified locally build and tests for {source} * {zip, tar}. No issues found.

+1 (binding)

On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi smar...@apache.org wrote:


Andrew Palumbo / Dmitriy:  Please also verify the various scenarios as
described in M-1693

On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi smar...@apache.org
wrote:


Here's the new 0.10.1 Release Candidate


https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/

The Voting ends on Sunday, May 31 2015.

Need a +1 from the PMC for each of the line items below for the release
to pass.

1. Ted/Grant:  Verify hashes and checksums - {binary,source} x {zip,tar}
+ pom

2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}

3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}

4. Suneel: Verify build and tests - {source} * {zip, tar}

5. Pat:  Verify examples locally - {source} * {zip, tar}

The LICENSE and NOTICE files have not been updated this time and will be
addressed in future releases.



On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi suneel.mar...@gmail.com
wrote:


Please hold ur votes, will be refreshing staging with another build in
the next hour

On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman 
andrew.mussel...@gmail.com wrote:


Likewise source zip and tarballs build and pass tests.

On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi smar...@apache.org
wrote:


Verified {source} * {zip, tar} and all tests pass.

+1 (binding)

On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi smar...@apache.org

wrote:

This is a call for VOTE to pass Mahout 0.10.1 release candidate

that's

available at




https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/

Need atleast 3 PMC +1 (binding) votes to cut the release

Below are the tasks breakdown for the PMC and committers:

Andy Palumbo  Pat Ferrel: verify the binary artifacts and run tests

Suneel  AKM:  verify the src artifacts

Ted/Grant/Drew: verify the hashes and Sigs

The LICENSE.txt and NOTICE.txt still need to be updated and will

not be

addressed as part of 0.10.1 release.









Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Stevo Slavić
+1 (binding)

Verified hashes and signatures; distribution sources tarball and zip unpack
well, build passes from unpacked sources.

On Sun, May 31, 2015 at 8:34 PM, Andrew Musselman 
andrew.mussel...@gmail.com wrote:

 +1 (binding)

 Verified tests pass for src tarball and zip; I'm comfortable skipping EMR
 smoke testing for a point release given team opinion that it's not
 required.

 On Sun, May 31, 2015 at 9:43 AM, Andrew Palumbo ap@outlook.com
 wrote:

  +1 (binding)
 
  Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in the
  |.tar.gz| binary archive in pseudo-cluster mode and one with
  MAHOUT_LOCAL=true with only the previously noted minor data issue, which
 I
  agree can wait for the next release.
 
  Ran a mix and match of the |.zip| binary archive examples with
  MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.
 
  Tested the shell from both archives for qr and matrix display fixes.
 
 
 
  On 05/31/2015 12:09 PM, Pat Ferrel wrote:
 
  +1 (binding)
 
  Verified on Spark 1.3 psuedo-clustered HDFS 2.4
 
  There are some cleanup of example data issues that can wait for next
  release.
 
 
  On May 30, 2015, at 8:16 PM, Suneel Marthi smar...@apache.org wrote:
 
  Verified locally build and tests for {source} * {zip, tar}. No issues
  found.
 
  +1 (binding)
 
  On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi smar...@apache.org
  wrote:
 
   Andrew Palumbo / Dmitriy:  Please also verify the various scenarios as
  described in M-1693
 
  On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi smar...@apache.org
  wrote:
 
   Here's the new 0.10.1 Release Candidate
 
 
 
 
 https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/
 
  The Voting ends on Sunday, May 31 2015.
 
  Need a +1 from the PMC for each of the line items below for the
 release
  to pass.
 
  1. Ted/Grant:  Verify hashes and checksums - {binary,source} x
 {zip,tar}
  + pom
 
  2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}
 
  3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}
 
  4. Suneel: Verify build and tests - {source} * {zip, tar}
 
  5. Pat:  Verify examples locally - {source} * {zip, tar}
 
  The LICENSE and NOTICE files have not been updated this time and will
 be
  addressed in future releases.
 
 
 
  On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi 
 suneel.mar...@gmail.com
  
  wrote:
 
   Please hold ur votes, will be refreshing staging with another build
 in
  the next hour
 
  On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman 
  andrew.mussel...@gmail.com wrote:
 
   Likewise source zip and tarballs build and pass tests.
 
  On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi smar...@apache.org
  wrote:
 
   Verified {source} * {zip, tar} and all tests pass.
 
  +1 (binding)
 
  On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi smar...@apache.org
 
 
  wrote:
 
  This is a call for VOTE to pass Mahout 0.10.1 release candidate
 
  that's
 
  available at
 
 
 
 
 
 https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/
 
  Need atleast 3 PMC +1 (binding) votes to cut the release
 
  Below are the tasks breakdown for the PMC and committers:
 
  Andy Palumbo  Pat Ferrel: verify the binary artifacts and run
 tests
 
  Suneel  AKM:  verify the src artifacts
 
  Ted/Grant/Drew: verify the hashes and Sigs
 
  The LICENSE.txt and NOTICE.txt still need to be updated and will
 
  not be
 
  addressed as part of 0.10.1 release.
 
 
 
 
 
 



[ANNOUNCE] Apache Mahout 0.10.1 Released

2015-05-31 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.10.1.
Mahout's goal is to create an environment for quickly creating machine
learning applications that scale and run on the highest performance
parallel computation engines available. Mahout comprises an interactive
environment and library that supports generalized scalable linear algebra
and includes many modern machine learning algorithms.

The Mahout Math environment we call “Samsara” for its symbol of universal
renewal. It reflects a fundamental rethinking of how scalable machine
learning algorithms are built and customized. Mahout-Samsara is here to
help people create their own math while providing some off-the-shelf
algorithm implementations. At its base are general linear algebra and
statistical operations along with the data structures to support them. It’s
written in Scala with Mahout-specific extensions, and runs most fully on
Spark.

To get started with Apache Mahout 0.10.1, download the release artifacts
and signatures from http://www.apache.org/dist/mahout/0.10.1/.


Many thanks to the contributors and committers who were part of this
release. Please see below for the Release Highlights.


RELEASE HIGHLIGHTS

This is an incremental minor release over Mahout 0.10.0 meant to fix a few
bugs and upgrade to Spark 1.2.2 or less.

Mahout 0.10.1

   1.

   This release fixes a major memory usage bug in co-occurrence analysis
   used by the driver spark-itemsimilarity MAHOUT-1707. This will now
   require far less memory in the executor.
   2.

   Support Spark 1.2.2 or less - due to a bug in Spark 1.2+ in the
   JavaSerializer (SPARK-6069) we removed the use of Guava from any code
   executed in Spark Executors. To do this we created a Scala Collections
   based BiMap so any example code showing how to use the old Guava
   collections is obsolete.
   3.

   Some minor fixes to Mahout-Samsara QR Decomposition and matrix ops.
   4.

   Trim down packages size to  200MB - MAHOUT-1704.
   5.

   Minor testing indicates binary compatibility with Spark 1.3 except for
   the Mahout Shell, which does not run.


STATS

A total of 9 separate JIRA issues are addressed in this release [2] with 5
bugfixes.


Scope of Mahout 0.10.2 ~ targeted for June 28, 2015

   1.

   In-core transpose view rewrites. Modifiable transpose views (for (col -
   a.t) col := 5).
   2.

   Matrix structure flavor additions. (understand general matrix structure
   and stride direction).
   3.

   %*% optimization based on matrix flavors.
   4.

   In-core ::= sparse assignment functions.
   5.

   Assign := optimization (do proper traversal based on matrix flavors,
   similarly to %*%).
   6.

   Adding in-place elementwise functional assignment (e.g. mxA := exp _,
   mxA ::= exp _).
   7.

   Distributed and in-core version of simple elementwise analogues of
   scala.math._. for example, for log(x) the convention is dlog(drm),
   mlog(mx), vlog(vec). Unfortunately we cannot overload these functions over
   what is done in scala.math, i.e. scala would not allow log(mx) or log(drm)
   and log(Double) at the same time, mainly because they are being defined in
   different packages.
   8.

   Distributed performance bug fixes. This relates mostly to (a) matrix
   multiplication deficiencies, and (b) handling parallelism.
   9.

   Distributed allreduceBlock predicate.
   10.

   Distributed optimizer operators for elementwise functions. Rewrites
   recognizing e.g. 1+ drmX * dexp(drmX) as a single fused elementwise
   physical operator.
   11.

   More cbind, rbind flavors (e.g. 1 cbind mxX, 1 cbind drmX or the other
   way around).



Mahout 0.11.0-snapshot (ongoing, but available)

   1.

   Support for Spark 1.3 sequence file write.
   2.

   Spark Shell (timing TBD).
   3.

   First release that would see integration of Apache Mahout with Apache
   Flink as a backend.


GETTING STARTED

Download the release artifacts and signatures at
http://www.apache.org/dist/mahout/0.10.1/
The examples directory contains several working examples of the core
functionality available in Mahout. These can be run via scripts in the
examples/bin directory. Most examples do not need a Hadoop cluster in order
to run.

FUTURE PLANS

We will continue bug fixes and enhancements on the 0.10.x branch, which
will remain dependent on Spark 1.2.x. Support for Spark 1.3 will be in the
master branch reflecting Mahout-0.11.0-SNAPSHOT. To see progress on this
branch look here: https://github.com/apache/mahout/commits/master.  As of
this writing it is not ready yet to build for Spark 1.3.

Integration with Apache Flink is in the works in collaboration with TU
Berlin and Data Artisans to add Flink as the 3rd execution engine to
Mahout. This would be in addition to existing Apache Spark and H2O engines.

CONTRIBUTING


If you are interested in contributing, please see our How to Contribute
http://mahout.apache.org/developers/how-to-contribute.html[3] page or
contact us via email at 

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Andrew Musselman
+1 (binding)

Verified tests pass for src tarball and zip; I'm comfortable skipping EMR
smoke testing for a point release given team opinion that it's not required.

On Sun, May 31, 2015 at 9:43 AM, Andrew Palumbo ap@outlook.com wrote:

 +1 (binding)

 Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in the
 |.tar.gz| binary archive in pseudo-cluster mode and one with
 MAHOUT_LOCAL=true with only the previously noted minor data issue, which I
 agree can wait for the next release.

 Ran a mix and match of the |.zip| binary archive examples with
 MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.

 Tested the shell from both archives for qr and matrix display fixes.



 On 05/31/2015 12:09 PM, Pat Ferrel wrote:

 +1 (binding)

 Verified on Spark 1.3 psuedo-clustered HDFS 2.4

 There are some cleanup of example data issues that can wait for next
 release.


 On May 30, 2015, at 8:16 PM, Suneel Marthi smar...@apache.org wrote:

 Verified locally build and tests for {source} * {zip, tar}. No issues
 found.

 +1 (binding)

 On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi smar...@apache.org
 wrote:

  Andrew Palumbo / Dmitriy:  Please also verify the various scenarios as
 described in M-1693

 On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi smar...@apache.org
 wrote:

  Here's the new 0.10.1 Release Candidate



 https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/

 The Voting ends on Sunday, May 31 2015.

 Need a +1 from the PMC for each of the line items below for the release
 to pass.

 1. Ted/Grant:  Verify hashes and checksums - {binary,source} x {zip,tar}
 + pom

 2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}

 3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}

 4. Suneel: Verify build and tests - {source} * {zip, tar}

 5. Pat:  Verify examples locally - {source} * {zip, tar}

 The LICENSE and NOTICE files have not been updated this time and will be
 addressed in future releases.



 On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi suneel.mar...@gmail.com
 
 wrote:

  Please hold ur votes, will be refreshing staging with another build in
 the next hour

 On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman 
 andrew.mussel...@gmail.com wrote:

  Likewise source zip and tarballs build and pass tests.

 On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi smar...@apache.org
 wrote:

  Verified {source} * {zip, tar} and all tests pass.

 +1 (binding)

 On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi smar...@apache.org

 wrote:

 This is a call for VOTE to pass Mahout 0.10.1 release candidate

 that's

 available at




 https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/

 Need atleast 3 PMC +1 (binding) votes to cut the release

 Below are the tasks breakdown for the PMC and committers:

 Andy Palumbo  Pat Ferrel: verify the binary artifacts and run tests

 Suneel  AKM:  verify the src artifacts

 Ted/Grant/Drew: verify the hashes and Sigs

 The LICENSE.txt and NOTICE.txt still need to be updated and will

 not be

 addressed as part of 0.10.1 release.








[jira] [Commented] (MAHOUT-1699) Trim down Mahout packaging for next release

2015-05-31 Thread Andrew Palumbo (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14566532#comment-14566532
 ] 

Andrew Palumbo commented on MAHOUT-1699:


MAHOUT-1705 and MAHOUT-1706 are more optimization and not critical.  will 
resolve this. 

 Trim down Mahout packaging for next release
 ---

 Key: MAHOUT-1699
 URL: https://issues.apache.org/jira/browse/MAHOUT-1699
 Project: Mahout
  Issue Type: Improvement
  Components: build
Affects Versions: 0.10.0
Reporter: Suneel Marthi
Priority: Critical
 Fix For: 0.10.1


 Mahout 0.10.0 package size is 210MB, this needs to be trimmed down to a more 
 manageable size.
 This also makes it hard to package Mahout into the BigTop distro and not to 
 mention seeking an infra waiver at the time of release for the  200MB size.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (MAHOUT-1699) Trim down Mahout packaging for next release

2015-05-31 Thread Andrew Palumbo (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Palumbo resolved MAHOUT-1699.

Resolution: Fixed
  Assignee: Andrew Palumbo

 Trim down Mahout packaging for next release
 ---

 Key: MAHOUT-1699
 URL: https://issues.apache.org/jira/browse/MAHOUT-1699
 Project: Mahout
  Issue Type: Improvement
  Components: build
Affects Versions: 0.10.0
Reporter: Suneel Marthi
Assignee: Andrew Palumbo
Priority: Critical
 Fix For: 0.10.1


 Mahout 0.10.0 package size is 210MB, this needs to be trimmed down to a more 
 manageable size.
 This also makes it hard to package Mahout into the BigTop distro and not to 
 mention seeking an infra waiver at the time of release for the  200MB size.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MAHOUT-1680) Rename project release artifact names to be prefixed with 'apache-mahout-*

2015-05-31 Thread Stevo Slavic (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stevo Slavic updated MAHOUT-1680:
-
Fix Version/s: (was: 0.11.0)
   0.10.1

 Rename project release artifact names to be prefixed with 'apache-mahout-*
 --

 Key: MAHOUT-1680
 URL: https://issues.apache.org/jira/browse/MAHOUT-1680
 Project: Mahout
  Issue Type: Improvement
  Components: build
Affects Versions: 0.10.0
Reporter: Suneel Marthi
Assignee: Stevo Slavic
 Fix For: 0.10.1


 The artifacts generated now read as 'mahout-distribution-*', need to change 
 the build configuration to rename the artifacts as 
 'apache-mahout-distribution-*'



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)