GitHub user wangmiao1981 opened a pull request:

    https://github.com/apache/spark/pull/15770

    [SPARK-15784][ML]:Add Power Iteration Clustering to spark.ml

    ## What changes were proposed in this pull request?
    
    As we discssed in the JIRA, `PowerIterationClustering` is added as a 
`Transformer`. The `featureCol` is `vector` type. In the `transform` method, it 
calls `MLlibPowerIterationClustering().run(rdd)` method and transforms the 
return value `assignments` (the Kmeans output of the pseudo-eigenvector) as a 
Dataframe (`id`: `LongType`, `cluster`: `IntegerType`).   
    
    ## How was this patch tested?
    Add new unit tests similar to `MLlibPowerIterationClustering`.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/wangmiao1981/spark pic

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/15770.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #15770
    
----
commit 33b2efe83aefb2c77f4e7bfee645f110a19681a8
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-13T19:47:42Z

    add pic framework (model, class etc)

commit a034a981a7979607dcbb03a687736f53660703c3
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-13T23:28:09Z

    change a comment

commit 9f7d66f44e4602421d3434b53b7004b4c7192878
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-17T17:27:55Z

    add missing functions fit predict load save etc.

commit 1ccc7ac291beae8ba65f73bdec997b418a5eebfa
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-18T01:12:41Z

    add unit test flie

commit cc68a25f6fd1cc2215b6b8a0bf43f3eeebb0645e
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-20T17:35:05Z

    add test cases part 1

commit 0cb2e5dad00e608ea669fe458be491c92d4c090c
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-20T20:29:54Z

    add unit test part 2: test fit, parameters etc.

commit f11ebab1acd293f30f97d2e0ee5d40aa9b416692
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-20T21:22:59Z

    fix a type issue

commit c2e2092450aa1adfd18003f13f09f94249874290
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-21T20:07:27Z

    add more unit tests

commit 98ec46a89b08b663e76ec296f9245b7dfa9285f7
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-06-21T21:46:25Z

    delete unused import and add comments

commit 0170775cafbaa982323458124c33687cc48190f3
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-10-25T21:28:12Z

    change version to 2.1.0

commit 2c315400b2437cec2ec53b8ecf47af7e5d623479
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-11-03T23:26:01Z

    change PIC as a Transformer

commit 8dd3ca273855895e4076ad1bfdc133b19af4dac4
Author: wm...@hotmail.com <wm...@hotmail.com>
Date:   2016-11-04T17:28:26Z

    add LabelCol

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to