This is great.

So we’ve talked about a name change and shortly we’ll be forced to come up with 
something the describes what Mahout has become. Most past users think of it as 
a scalable ML library on Hadoop. That may describe Mahout-Legacy but it seems 
like we need a name for the Scala DSL/Spark/other? part of the project. Lots of 
projects have sub-projects so we know there is no issue with naming 
sub-projects. So my question to everyone is:

Should (or can) the Top Level Project be renamed? If so to what?

If we don’t rename the TLP then what should we call legacy (not very appealing) 
and scala/DSL (not a name really)

My opinion:
Since we are deemphasizing legacy I’m not sure there is a need to call 
attention to it by giving it a subproject name. However it is not deprecated so 
we need to include it in releases and even fix the minimum of critical bugs for 
some time to come.

Mahout is getting beat up in the circles of those who talk about such things 
and much of this is because people don’t understand what it has become. 
Therefore I’d like to see a project rename to reset expectations. Leave the 
name Mahout for legacy stuff and give a new name to the Scala environment. 
Split the builds and create new docs for the Scala stuff. This would seem to 
make it easier to document since legacy is most of what the CMS documents, we 
could create whole new template for the new project name.

Failing this, many of the same benefits could be gained by creating legacy and 
scala sub-projects with better names. This I know we can do and recall that 
things like MLlib are generally not tied to Spark when speaking about them. So 
a subproject could have very much its own identity.

Looking at the long history of Mahout it seems like the current generality was 
hard gained through implementing many special purpose algorithms, some of which 
were grad student projects. This is where MLlib is today in some ways. So a 
general framework and environment makes a lot of sense as the evolution of 
Mahout. Let’s give it a name, something better than DSL.  

On Mar 5, 2015, at 7:43 PM, Andrew Musselman <andrew.mussel...@gmail.com> wrote:

Thanks AP

On Thursday, March 5, 2015, Andrew Palumbo <ap....@outlook.com> wrote:

> I went through all of the unresolved JIRA issues and marked all with at
> least a "legacy" or "scala". (for lack of a better name for all that is not
> legacy) label. Hopefully I got them all.
> 
> Some are labelled with both (math, build, documentation related to both or
> neither, etc.)
> 
> legacy issues:
> 
> https://issues.apache.org/jira/browse/MAHOUT-1522?jql=
> project%20%3D%20MAHOUT%20AND%20resolution%20%3D%
> 20Unresolved%20AND%20labels%20%3D%20scala%20ORDER%20BY%20priority%20DESC
> 
> "scala" issues:
> 
> https://issues.apache.org/jira/browse/MAHOUT-1522?jql=
> project%20%3D%20MAHOUT%20AND%20resolution%20%3D%
> 20Unresolved%20AND%20labels%20%3D%20legacy%20ORDER%20BY%20priority%20DESC
> 
> Hopefully this will help us get started closing up some old issues. I'll
> try to make another pass over them and close tomorrow and try to find some
> that need to be closed out.
> 

Reply via email to