GitHub user viirya reopened a pull request:

    https://github.com/apache/spark/pull/19442

    [SPARK-8515][ML][WIP] Improve ML Attribute API

    ## What changes were proposed in this pull request?
    
    The current ML attribute API is inefficient and hard to use. This work
    tries to improve the API with the following main changes:
    
    * Support Spark vector-typed attributes.
    * Simplify vector-typed attribute serialization.
    * Keep only the minimum set of APIs needed to support ML attributes.
    
    ** THIS WORK is not ready and is still a work in progress.
    
    ## How was this patch tested?
    
    Added tests.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/viirya/spark-1 SPARK-8515

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/19442.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #19442
    
----
commit 77d657d8bc8102081e4b0d7b5d42a256e64514d4
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-02T15:03:54Z

    Init design of ml attribute.

commit 7837778e7cbbf83851b1a2b5047f4e6a8039f809
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-03T15:03:31Z

    revise.

commit 97f6848f0cbb1a76b4434930ce8938da50eaafbe
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-03T15:14:02Z

    revise.

commit 2e3a3541fc7a59ac63b2118228de8015c238de40
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-04T05:15:58Z

    revise.

commit 0d76eac84f5837aefebc763687fa9c5c7e1aeb4d
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-04T15:07:57Z

    revise.

commit 81cca5cccfa2556ff0bba5a73764d3f503040b13
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-05T04:30:48Z

    revise.

commit 4813fe8a4bd19a02b7b6bff138f04e7e50f7cdd7
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-05T06:15:53Z

    revise.

commit 7951f59027418962ad95465e439bff41876ecfa8
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-05T07:51:50Z

    revise.

commit a381af3edf52132086af64360789cb3a7d20d61e
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-05T09:00:02Z

    Add builder and test.

commit f25c89dbded0eb9dce25d8da63a1a1aa49ad459f
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-05T15:10:11Z

    revise test.

commit 7e237f38088f2375f40f9a4c97aee2e6acd54328
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-06T02:46:07Z

    Add new test.

commit 77ced957e7be2169ac0c59c76f60ab9d4fcac3ef
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-06T03:57:12Z

    Add more tests.

commit de0aa76199141255258d9d5b12a0d31b1758c6f1
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-06T06:17:29Z

    revise.

commit d828cf3d3b13a2b2b1990bdff9593b49e53f6cf9
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-06T13:55:41Z

    Add java-friendly APIs for attribute types.

commit 5844fbaef5d5825eafadb7c53196fb2132937e4e
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-09T03:24:26Z

    Revise APIs.

commit da0fcef7d3370ebca97d200f01e9f2814a9ed755
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-09T03:26:15Z

    revise.

commit 66be26cd7f25614137cfb9722f859f36d9f80c0c
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-09T03:47:43Z

    Add default constructors to attribute types.

commit ce80ed5b693745fa4a650e508c6cd9e24350c52e
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-10T12:52:22Z

    Use Array instead of Seq in APIs.

commit 2b94dd5c192b1d9302e24c0392fc9a5aaaedb596
Author: Liang-Chi Hsieh <vii...@gmail.com>
Date:   2017-10-14T00:21:04Z

    Add more compatibility tests.

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org
