[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207830320
  
Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207830121
  
**[Test build #55435 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55435/consoleFull)**
 for PR 12274 at commit 
[`1a62563`](https://github.com/apache/spark/commit/1a62563960d3571379ba2e25f5a98de7f64dc78f).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207830324
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55435/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207826076
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55434/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207826073
  
Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207825861
  
**[Test build #55434 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55434/consoleFull)**
 for PR 12274 at commit 
[`42bda51`](https://github.com/apache/spark/commit/42bda51968ba6ea0cef6df41451c025f94b63bd3).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207823845
  
**[Test build #55435 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55435/consoleFull)**
 for PR 12274 at commit 
[`1a62563`](https://github.com/apache/spark/commit/1a62563960d3571379ba2e25f5a98de7f64dc78f).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207821708
  
**[Test build #55434 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55434/consoleFull)**
 for PR 12274 at commit 
[`42bda51`](https://github.com/apache/spark/commit/42bda51968ba6ea0cef6df41451c025f94b63bd3).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207821055
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55433/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207821045
  
**[Test build #55433 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55433/consoleFull)**
 for PR 12274 at commit 
[`3f765dd`](https://github.com/apache/spark/commit/3f765dd75df8afb444bb433a209cf4237f584b29).
 * This patch **fails Scala style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207821051
  
Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/12274#issuecomment-207820609
  
**[Test build #55433 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55433/consoleFull)**
 for PR 12274 at commit 
[`3f765dd`](https://github.com/apache/spark/commit/3f765dd75df8afb444bb433a209cf4237f584b29).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-14500] [ML] Accept Dataset[_] ins...

2016-04-09 Thread mengxr
GitHub user mengxr opened a pull request:

https://github.com/apache/spark/pull/12274

[WIP] [SPARK-14500] [ML] Accept Dataset[_] instead of DataFrame in MLlib 
APIs

## What changes were proposed in this pull request?

This PR updates MLlib APIs to accept `Dataset[_]` as input where 
`DataFrame` was the input type. This PR doesn't change the output type. In 
Java, `Dataset[_]` maps to `Dataset`, which includes `Dataset`. Some 
implementations were changed to return `DataFrame`. Tests and examples were 
updated.

TODOs:
- [ ] update MiMaExcludes
- [ ] Python
- [ ] add a new test to accept Dataset[LabeledPoint]

## How was this patch tested?

Exiting unit tests with some modifications.

cc: @rxin @jkbradley 


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/mengxr/spark SPARK-14500

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/12274.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #12274


commit 7b8fe962c90fec92b0c35f911e490aeb358c8c8a
Author: Xiangrui Meng 
Date:   2016-04-09T01:53:16Z

accept Dataset[_] instead of DataFrame in MLlib

commit 67fd643a401544e52fc98e87e7552c7b30460ce2
Author: Xiangrui Meng 
Date:   2016-04-09T16:19:54Z

fix compile

commit 8420014fea9fdaced225c3785908898debb7aff3
Author: Xiangrui Meng 
Date:   2016-04-09T16:54:40Z

fix tests

commit 82ee0d9c23a403b635b88b58cbd2f3e2cb5a6321
Author: Xiangrui Meng 
Date:   2016-04-09T17:01:09Z

Merge remote-tracking branch 'apache/master' into SPARK-14500

commit 3f765dd75df8afb444bb433a209cf4237f584b29
Author: Xiangrui Meng 
Date:   2016-04-09T17:27:40Z

fix examples




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org