RE: Regression using MapReduce

2015-05-31 Thread Scott Lett, PhD
Ted,
Where is the first place to start learning Mahout-Samsara?  
Looking forward to learning more,
Scott

-Original Message-
From: Ted Dunning [mailto:ted.dunn...@gmail.com] 
Sent: Saturday, May 30, 2015 12:33 AM
To: user@mahout.apache.org
Subject: Re: Regression using MapReduce

Mahout is deprecating pretty much all of the classic MapReduce implementations 
in any case in favor of algorithms based fundamentally on a new linear algebra 
system known as Mahout-Samsara.



On Fri, May 29, 2015 at 10:52 PM, Punit Naik  wrote:

> Hello all users
>
> I just wanted to know if Mahout has MapReduce implementation of 
> Regression algorithms or not.
>
> Thank You
>
> Punit Naik
>



Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Pat Ferrel
+1 (binding)

Verified on Spark 1.3 psuedo-clustered HDFS 2.4

There are some cleanup of example data issues that can wait for next release.


On May 30, 2015, at 8:16 PM, Suneel Marthi  wrote:

Verified locally build and tests for {source} * {zip, tar}. No issues found.

+1 (binding)

On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi  wrote:

> Andrew Palumbo / Dmitriy:  Please also verify the various scenarios as
> described in M-1693
> 
> On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi 
> wrote:
> 
>> Here's the new 0.10.1 Release Candidate
>> 
>> 
>> https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/
>> 
>> The Voting ends on Sunday, May 31 2015.
>> 
>> Need a +1 from the PMC for each of the line items below for the release
>> to pass.
>> 
>> 1. Ted/Grant:  Verify hashes and checksums - {binary,source} x {zip,tar}
>> + pom
>> 
>> 2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}
>> 
>> 3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}
>> 
>> 4. Suneel: Verify build and tests - {source} * {zip, tar}
>> 
>> 5. Pat:  Verify examples locally - {source} * {zip, tar}
>> 
>> The LICENSE and NOTICE files have not been updated this time and will be
>> addressed in future releases.
>> 
>> 
>> 
>> On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi 
>> wrote:
>> 
>>> Please hold ur votes, will be refreshing staging with another build in
>>> the next hour
>>> 
>>> On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman <
>>> andrew.mussel...@gmail.com> wrote:
>>> 
 Likewise source zip and tarballs build and pass tests.
 
 On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi 
 wrote:
 
> Verified {source} * {zip, tar} and all tests pass.
> 
> +1 (binding)
> 
> On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi 
 wrote:
> 
>> This is a call for VOTE to pass Mahout 0.10.1 release candidate
 that's
>> available at
>> 
>> 
>> 
> 
 https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/
>> 
>> Need atleast 3 PMC +1 (binding) votes to cut the release
>> 
>> Below are the tasks breakdown for the PMC and committers:
>> 
>> Andy Palumbo & Pat Ferrel: verify the binary artifacts and run tests
>> 
>> Suneel & AKM:  verify the src artifacts
>> 
>> Ted/Grant/Drew: verify the hashes and Sigs
>> 
>> The LICENSE.txt and NOTICE.txt still need to be updated and will
 not be
>> addressed as part of 0.10.1 release.
>> 
>> 
>> 
> 
 
>>> 
>>> 
>> 
> 



Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Andrew Palumbo

+1 (binding)

Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in the 
|.tar.gz| binary archive in pseudo-cluster mode and one with 
MAHOUT_LOCAL=true with only the previously noted minor data issue, which 
I agree can wait for the next release.


Ran a mix and match of the |.zip| binary archive examples with 
MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.


Tested the shell from both archives for qr and matrix display fixes.


On 05/31/2015 12:09 PM, Pat Ferrel wrote:

+1 (binding)

Verified on Spark 1.3 psuedo-clustered HDFS 2.4

There are some cleanup of example data issues that can wait for next release.


On May 30, 2015, at 8:16 PM, Suneel Marthi  wrote:

Verified locally build and tests for {source} * {zip, tar}. No issues found.

+1 (binding)

On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi  wrote:


Andrew Palumbo / Dmitriy:  Please also verify the various scenarios as
described in M-1693

On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi 
wrote:


Here's the new 0.10.1 Release Candidate


https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/

The Voting ends on Sunday, May 31 2015.

Need a +1 from the PMC for each of the line items below for the release
to pass.

1. Ted/Grant:  Verify hashes and checksums - {binary,source} x {zip,tar}
+ pom

2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}

3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}

4. Suneel: Verify build and tests - {source} * {zip, tar}

5. Pat:  Verify examples locally - {source} * {zip, tar}

The LICENSE and NOTICE files have not been updated this time and will be
addressed in future releases.



On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi 
wrote:


Please hold ur votes, will be refreshing staging with another build in
the next hour

On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman <
andrew.mussel...@gmail.com> wrote:


Likewise source zip and tarballs build and pass tests.

On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi 
wrote:


Verified {source} * {zip, tar} and all tests pass.

+1 (binding)

On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi 

wrote:

This is a call for VOTE to pass Mahout 0.10.1 release candidate

that's

available at




https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/

Need atleast 3 PMC +1 (binding) votes to cut the release

Below are the tasks breakdown for the PMC and committers:

Andy Palumbo & Pat Ferrel: verify the binary artifacts and run tests

Suneel & AKM:  verify the src artifacts

Ted/Grant/Drew: verify the hashes and Sigs

The LICENSE.txt and NOTICE.txt still need to be updated and will

not be

addressed as part of 0.10.1 release.









Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Andrew Musselman
+1 (binding)

Verified tests pass for src tarball and zip; I'm comfortable skipping EMR
smoke testing for a point release given team opinion that it's not required.

On Sun, May 31, 2015 at 9:43 AM, Andrew Palumbo  wrote:

> +1 (binding)
>
> Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in the
> |.tar.gz| binary archive in pseudo-cluster mode and one with
> MAHOUT_LOCAL=true with only the previously noted minor data issue, which I
> agree can wait for the next release.
>
> Ran a mix and match of the |.zip| binary archive examples with
> MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.
>
> Tested the shell from both archives for qr and matrix display fixes.
>
>
>
> On 05/31/2015 12:09 PM, Pat Ferrel wrote:
>
>> +1 (binding)
>>
>> Verified on Spark 1.3 psuedo-clustered HDFS 2.4
>>
>> There are some cleanup of example data issues that can wait for next
>> release.
>>
>>
>> On May 30, 2015, at 8:16 PM, Suneel Marthi  wrote:
>>
>> Verified locally build and tests for {source} * {zip, tar}. No issues
>> found.
>>
>> +1 (binding)
>>
>> On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi 
>> wrote:
>>
>>  Andrew Palumbo / Dmitriy:  Please also verify the various scenarios as
>>> described in M-1693
>>>
>>> On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi 
>>> wrote:
>>>
>>>  Here's the new 0.10.1 Release Candidate



 https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/

 The Voting ends on Sunday, May 31 2015.

 Need a +1 from the PMC for each of the line items below for the release
 to pass.

 1. Ted/Grant:  Verify hashes and checksums - {binary,source} x {zip,tar}
 + pom

 2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}

 3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}

 4. Suneel: Verify build and tests - {source} * {zip, tar}

 5. Pat:  Verify examples locally - {source} * {zip, tar}

 The LICENSE and NOTICE files have not been updated this time and will be
 addressed in future releases.



 On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi >>> >
 wrote:

  Please hold ur votes, will be refreshing staging with another build in
> the next hour
>
> On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman <
> andrew.mussel...@gmail.com> wrote:
>
>  Likewise source zip and tarballs build and pass tests.
>>
>> On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi 
>> wrote:
>>
>>  Verified {source} * {zip, tar} and all tests pass.
>>>
>>> +1 (binding)
>>>
>>> On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi 
>>>
>> wrote:
>>
>>> This is a call for VOTE to pass Mahout 0.10.1 release candidate

>>> that's
>>
>>> available at




>> https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/
>>
>>> Need atleast 3 PMC +1 (binding) votes to cut the release

 Below are the tasks breakdown for the PMC and committers:

 Andy Palumbo & Pat Ferrel: verify the binary artifacts and run tests

 Suneel & AKM:  verify the src artifacts

 Ted/Grant/Drew: verify the hashes and Sigs

 The LICENSE.txt and NOTICE.txt still need to be updated and will

>>> not be
>>
>>> addressed as part of 0.10.1 release.




>
>


Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Stevo Slavić
+1 (binding)

Verified hashes and signatures; distribution sources tarball and zip unpack
well, build passes from unpacked sources.

On Sun, May 31, 2015 at 8:34 PM, Andrew Musselman <
andrew.mussel...@gmail.com> wrote:

> +1 (binding)
>
> Verified tests pass for src tarball and zip; I'm comfortable skipping EMR
> smoke testing for a point release given team opinion that it's not
> required.
>
> On Sun, May 31, 2015 at 9:43 AM, Andrew Palumbo 
> wrote:
>
> > +1 (binding)
> >
> > Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in the
> > |.tar.gz| binary archive in pseudo-cluster mode and one with
> > MAHOUT_LOCAL=true with only the previously noted minor data issue, which
> I
> > agree can wait for the next release.
> >
> > Ran a mix and match of the |.zip| binary archive examples with
> > MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.
> >
> > Tested the shell from both archives for qr and matrix display fixes.
> >
> >
> >
> > On 05/31/2015 12:09 PM, Pat Ferrel wrote:
> >
> >> +1 (binding)
> >>
> >> Verified on Spark 1.3 psuedo-clustered HDFS 2.4
> >>
> >> There are some cleanup of example data issues that can wait for next
> >> release.
> >>
> >>
> >> On May 30, 2015, at 8:16 PM, Suneel Marthi  wrote:
> >>
> >> Verified locally build and tests for {source} * {zip, tar}. No issues
> >> found.
> >>
> >> +1 (binding)
> >>
> >> On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi 
> >> wrote:
> >>
> >>  Andrew Palumbo / Dmitriy:  Please also verify the various scenarios as
> >>> described in M-1693
> >>>
> >>> On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi 
> >>> wrote:
> >>>
> >>>  Here's the new 0.10.1 Release Candidate
> 
> 
> 
> 
> https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/
> 
>  The Voting ends on Sunday, May 31 2015.
> 
>  Need a +1 from the PMC for each of the line items below for the
> release
>  to pass.
> 
>  1. Ted/Grant:  Verify hashes and checksums - {binary,source} x
> {zip,tar}
>  + pom
> 
>  2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}
> 
>  3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}
> 
>  4. Suneel: Verify build and tests - {source} * {zip, tar}
> 
>  5. Pat:  Verify examples locally - {source} * {zip, tar}
> 
>  The LICENSE and NOTICE files have not been updated this time and will
> be
>  addressed in future releases.
> 
> 
> 
>  On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi <
> suneel.mar...@gmail.com
>  >
>  wrote:
> 
>   Please hold ur votes, will be refreshing staging with another build
> in
> > the next hour
> >
> > On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman <
> > andrew.mussel...@gmail.com> wrote:
> >
> >  Likewise source zip and tarballs build and pass tests.
> >>
> >> On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi 
> >> wrote:
> >>
> >>  Verified {source} * {zip, tar} and all tests pass.
> >>>
> >>> +1 (binding)
> >>>
> >>> On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi  >
> >>>
> >> wrote:
> >>
> >>> This is a call for VOTE to pass Mahout 0.10.1 release candidate
> 
> >>> that's
> >>
> >>> available at
> 
> 
> 
> 
> >>
> https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/
> >>
> >>> Need atleast 3 PMC +1 (binding) votes to cut the release
> 
>  Below are the tasks breakdown for the PMC and committers:
> 
>  Andy Palumbo & Pat Ferrel: verify the binary artifacts and run
> tests
> 
>  Suneel & AKM:  verify the src artifacts
> 
>  Ted/Grant/Drew: verify the hashes and Sigs
> 
>  The LICENSE.txt and NOTICE.txt still need to be updated and will
> 
> >>> not be
> >>
> >>> addressed as part of 0.10.1 release.
> 
> 
> 
> 
> >
> >
>


Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Suneel Marthi
Thanks Stevo.

This release has passed with 5 +1 PMC binding votes and the Voting is
officially closed, will send the Release Announce later today.



On Sun, May 31, 2015 at 3:35 PM, Stevo Slavić  wrote:

> +1 (binding)
>
> Verified hashes and signatures; distribution sources tarball and zip unpack
> well, build passes from unpacked sources.
>
> On Sun, May 31, 2015 at 8:34 PM, Andrew Musselman <
> andrew.mussel...@gmail.com> wrote:
>
> > +1 (binding)
> >
> > Verified tests pass for src tarball and zip; I'm comfortable skipping EMR
> > smoke testing for a point release given team opinion that it's not
> > required.
> >
> > On Sun, May 31, 2015 at 9:43 AM, Andrew Palumbo 
> > wrote:
> >
> > > +1 (binding)
> > >
> > > Ran (on Hadoop 2.4.1 + spark 1.2.1) all examples with all options in
> the
> > > |.tar.gz| binary archive in pseudo-cluster mode and one with
> > > MAHOUT_LOCAL=true with only the previously noted minor data issue,
> which
> > I
> > > agree can wait for the next release.
> > >
> > > Ran a mix and match of the |.zip| binary archive examples with
> > > MAHOUT_LOCAL=true and in pseudo-cluster mode without issue.
> > >
> > > Tested the shell from both archives for qr and matrix display fixes.
> > >
> > >
> > >
> > > On 05/31/2015 12:09 PM, Pat Ferrel wrote:
> > >
> > >> +1 (binding)
> > >>
> > >> Verified on Spark 1.3 psuedo-clustered HDFS 2.4
> > >>
> > >> There are some cleanup of example data issues that can wait for next
> > >> release.
> > >>
> > >>
> > >> On May 30, 2015, at 8:16 PM, Suneel Marthi 
> wrote:
> > >>
> > >> Verified locally build and tests for {source} * {zip, tar}. No issues
> > >> found.
> > >>
> > >> +1 (binding)
> > >>
> > >> On Sat, May 30, 2015 at 11:14 PM, Suneel Marthi 
> > >> wrote:
> > >>
> > >>  Andrew Palumbo / Dmitriy:  Please also verify the various scenarios
> as
> > >>> described in M-1693
> > >>>
> > >>> On Sat, May 30, 2015 at 10:32 PM, Suneel Marthi 
> > >>> wrote:
> > >>>
> > >>>  Here's the new 0.10.1 Release Candidate
> > 
> > 
> > 
> > 
> >
> https://repository.apache.org/content/repositories/orgapachemahout-1009/org/apache/mahout/apache-mahout-distribution/0.10.1/
> > 
> >  The Voting ends on Sunday, May 31 2015.
> > 
> >  Need a +1 from the PMC for each of the line items below for the
> > release
> >  to pass.
> > 
> >  1. Ted/Grant:  Verify hashes and checksums - {binary,source} x
> > {zip,tar}
> >  + pom
> > 
> >  2. AKM:  Verify examples on EMR  - {binary, source} * {zip, tar}
> > 
> >  3. Andrew Palumbo: Verify examples locally - {binary} * {zip, tar}
> > 
> >  4. Suneel: Verify build and tests - {source} * {zip, tar}
> > 
> >  5. Pat:  Verify examples locally - {source} * {zip, tar}
> > 
> >  The LICENSE and NOTICE files have not been updated this time and
> will
> > be
> >  addressed in future releases.
> > 
> > 
> > 
> >  On Sat, May 30, 2015 at 8:32 PM, Suneel Marthi <
> > suneel.mar...@gmail.com
> >  >
> >  wrote:
> > 
> >   Please hold ur votes, will be refreshing staging with another build
> > in
> > > the next hour
> > >
> > > On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman <
> > > andrew.mussel...@gmail.com> wrote:
> > >
> > >  Likewise source zip and tarballs build and pass tests.
> > >>
> > >> On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi <
> smar...@apache.org>
> > >> wrote:
> > >>
> > >>  Verified {source} * {zip, tar} and all tests pass.
> > >>>
> > >>> +1 (binding)
> > >>>
> > >>> On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi <
> smar...@apache.org
> > >
> > >>>
> > >> wrote:
> > >>
> > >>> This is a call for VOTE to pass Mahout 0.10.1 release candidate
> > 
> > >>> that's
> > >>
> > >>> available at
> > 
> > 
> > 
> > 
> > >>
> >
> https://repository.apache.org/content/repositories/orgapachemahout-1008/org/apache/mahout/mahout-distribution/0.10.1/
> > >>
> > >>> Need atleast 3 PMC +1 (binding) votes to cut the release
> > 
> >  Below are the tasks breakdown for the PMC and committers:
> > 
> >  Andy Palumbo & Pat Ferrel: verify the binary artifacts and run
> > tests
> > 
> >  Suneel & AKM:  verify the src artifacts
> > 
> >  Ted/Grant/Drew: verify the hashes and Sigs
> > 
> >  The LICENSE.txt and NOTICE.txt still need to be updated and will
> > 
> > >>> not be
> > >>
> > >>> addressed as part of 0.10.1 release.
> > 
> > 
> > 
> > 
> > >
> > >
> >
>


[ANNOUNCE] Apache Mahout 0.10.1 Released

2015-05-31 Thread Suneel Marthi
The Apache Mahout PMC is pleased to announce the release of Mahout 0.10.1.
Mahout's goal is to create an environment for quickly creating machine
learning applications that scale and run on the highest performance
parallel computation engines available. Mahout comprises an interactive
environment and library that supports generalized scalable linear algebra
and includes many modern machine learning algorithms.

The Mahout Math environment we call “Samsara” for its symbol of universal
renewal. It reflects a fundamental rethinking of how scalable machine
learning algorithms are built and customized. Mahout-Samsara is here to
help people create their own math while providing some off-the-shelf
algorithm implementations. At its base are general linear algebra and
statistical operations along with the data structures to support them. It’s
written in Scala with Mahout-specific extensions, and runs most fully on
Spark.

To get started with Apache Mahout 0.10.1, download the release artifacts
and signatures from http://www.apache.org/dist/mahout/0.10.1/.


Many thanks to the contributors and committers who were part of this
release. Please see below for the Release Highlights.


RELEASE HIGHLIGHTS

This is an incremental minor release over Mahout 0.10.0 meant to fix a few
bugs and upgrade to Spark 1.2.2 or less.

Mahout 0.10.1

   1.

   This release fixes a major memory usage bug in co-occurrence analysis
   used by the driver spark-itemsimilarity MAHOUT-1707. This will now
   require far less memory in the executor.
   2.

   Support Spark 1.2.2 or less - due to a bug in Spark 1.2+ in the
   JavaSerializer (SPARK-6069) we removed the use of Guava from any code
   executed in Spark Executors. To do this we created a Scala Collections
   based BiMap so any example code showing how to use the old Guava
   collections is obsolete.
   3.

   Some minor fixes to Mahout-Samsara QR Decomposition and matrix ops.
   4.

   Trim down packages size to < 200MB - MAHOUT-1704.
   5.

   Minor testing indicates binary compatibility with Spark 1.3 except for
   the Mahout Shell, which does not run.


STATS

A total of 9 separate JIRA issues are addressed in this release [2] with 5
bugfixes.


Scope of Mahout 0.10.2 ~ targeted for June 28, 2015

   1.

   In-core transpose view rewrites. Modifiable transpose views (for (col <-
   a.t) col := 5).
   2.

   Matrix structure flavor additions. (understand general matrix structure
   and stride direction).
   3.

   %*% optimization based on matrix flavors.
   4.

   In-core ::= sparse assignment functions.
   5.

   Assign := optimization (do proper traversal based on matrix flavors,
   similarly to %*%).
   6.

   Adding in-place elementwise functional assignment (e.g. mxA := exp _,
   mxA ::= exp _).
   7.

   Distributed and in-core version of simple elementwise analogues of
   scala.math._. for example, for log(x) the convention is dlog(drm),
   mlog(mx), vlog(vec). Unfortunately we cannot overload these functions over
   what is done in scala.math, i.e. scala would not allow log(mx) or log(drm)
   and log(Double) at the same time, mainly because they are being defined in
   different packages.
   8.

   Distributed performance bug fixes. This relates mostly to (a) matrix
   multiplication deficiencies, and (b) handling parallelism.
   9.

   Distributed allreduceBlock predicate.
   10.

   Distributed optimizer operators for elementwise functions. Rewrites
   recognizing e.g. 1+ drmX * dexp(drmX) as a single fused elementwise
   physical operator.
   11.

   More cbind, rbind flavors (e.g. 1 cbind mxX, 1 cbind drmX or the other
   way around).



Mahout 0.11.0-snapshot (ongoing, but available)

   1.

   Support for Spark 1.3 sequence file write.
   2.

   Spark Shell (timing TBD).
   3.

   First release that would see integration of Apache Mahout with Apache
   Flink as a backend.


GETTING STARTED

Download the release artifacts and signatures at
http://www.apache.org/dist/mahout/0.10.1/
The examples directory contains several working examples of the core
functionality available in Mahout. These can be run via scripts in the
examples/bin directory. Most examples do not need a Hadoop cluster in order
to run.

FUTURE PLANS

We will continue bug fixes and enhancements on the 0.10.x branch, which
will remain dependent on Spark 1.2.x. Support for Spark 1.3 will be in the
master branch reflecting Mahout-0.11.0-SNAPSHOT. To see progress on this
branch look here: https://github.com/apache/mahout/commits/master.  As of
this writing it is not ready yet to build for Spark 1.3.

Integration with Apache Flink is in the works in collaboration with TU
Berlin and Data Artisans to add Flink as the 3rd execution engine to
Mahout. This would be in addition to existing Apache Spark and H2O engines.

CONTRIBUTING


If you are interested in contributing, please see our How to Contribute
[3] page or
contact us via email at d...@mahout.apac