GitHub user dbtsai opened a pull request:

    https://github.com/apache/spark/pull/12172

    [SPARK-13944][ML][WIP] Separate out local linear algebra as a standalone 
module without Spark dependency

    ## What changes were proposed in this pull request?
    
    Separate out linear algebra as a standalone module without Spark dependency 
to simplify production deployment. We can call the new module 
spark-mllib-local, which might contain local models in the future.
    
    The major issue is to remove dependencies on user-defined types.
    The package name will be changed from mllib to ml. For example, Vector will 
be changed from `org.apache.spark.mllib.linalg.Vector` to 
`org.apache.spark.ml.linalg.Vector`. The new ML pipeline will return vectors 
of the type in the ml package; however, the existing mllib code will not be 
touched. As a result, this will potentially break the API. Also, when a vector 
stored as an mllib vector is loaded by Spark SQL, it will automatically be 
converted into the one in the ml package.
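
The potential API break described above follows from the two vector types being unrelated classes in different packages. A minimal sketch of what that means for calling code, with no Spark dependency: the `mllib` and `ml` objects, the `DenseVector` case classes, and the `toML` helper below are all hypothetical stand-ins for illustration, not the classes or methods this PR adds.

```scala
// Hypothetical stand-ins for the two vector types; these are NOT the real Spark classes.
object LinalgMigrationSketch {
  // Models the existing type under org.apache.spark.mllib.linalg.
  object mllib {
    final case class DenseVector(values: Array[Double])
  }

  // Models the proposed type under org.apache.spark.ml.linalg.
  object ml {
    final case class DenseVector(values: Array[Double])
  }

  // Hypothetical helper: copy an old-package vector into the new-package type.
  def toML(v: mllib.DenseVector): ml.DenseVector =
    ml.DenseVector(v.values.clone())

  def main(args: Array[String]): Unit = {
    val old = mllib.DenseVector(Array(1.0, 0.0, 3.0))
    // The two types are unrelated, so passing `old` where an ml.DenseVector
    // is expected would not compile; user code has to convert explicitly.
    val converted: ml.DenseVector = toML(old)
    println(converted.values.mkString("[", ", ", "]"))
  }
}
```

Code that passes mllib vectors directly into the new pipeline would need a conversion step of this kind, whereas the Spark SQL loading path is described above as converting automatically.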
    
    ## How was this patch tested?
    
    WIP


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/dbtsai/spark dbtsai-linear-algebra

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/12172.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #12172
    
----
commit 0cfc65d1aa4ab3c82459ab0cd3598fd2969387b6
Author: DB Tsai <d...@netflix.com>
Date:   2016-03-22T23:54:11Z

    dbtsai-linear-algebra

commit cb95b0c5194e0d53614c5ae9fd77f110bbd62826
Author: DB Tsai <d...@netflix.com>
Date:   2016-04-05T07:32:11Z

    more work

----


