[GitHub] spark pull request #14082: [SPARK-16381][SQL][SparkR] Update SQL examples an...

2016-07-07 Thread keypointt
Github user keypointt commented on a diff in the pull request: https://github.com/apache/spark/pull/14082#discussion_r70008864 --- Diff: examples/src/main/r/RSparkSQLExample.R --- @@ -0,0 +1,175 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark issue #14098: [SPARK-16380][SQL][Example]:Update SQL examples and prog...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14098 **[Test build #61940 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61940/consoleFull)** for PR 14098 at commit [`d92d933`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14096 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61941/ Test FAILed. ---

[GitHub] spark issue #14098: [SPARK-16380][SQL][Example]:Update SQL examples and prog...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14098 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61940/ Test PASSed. ---

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14096 **[Test build #61941 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61941/consoleFull)** for PR 14096 at commit [`08672d9`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14096 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14098: [SPARK-16380][SQL][Example]:Update SQL examples and prog...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14098 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark pull request #14082: [SPARK-16381][SQL][SparkR] Update SQL examples an...

2016-07-07 Thread shivaram
Github user shivaram commented on a diff in the pull request: https://github.com/apache/spark/pull/14082#discussion_r70009456 --- Diff: examples/src/main/r/RSparkSQLExample.R --- @@ -0,0 +1,175 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +

[GitHub] spark issue #14028: [SPARK-16351][SQL] Avoid per-record type dispatch in JSO...

2016-07-07 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/14028 Thanks @yhuai! I just addressed your comments. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #14028: [SPARK-16351][SQL] Avoid per-record type dispatch in JSO...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14028 **[Test build #61943 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61943/consoleFull)** for PR 14028 at commit [`6570a98`](https://github.com/apache/spark/commit/6

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14096 **[Test build #61942 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61942/consoleFull)** for PR 14096 at commit [`c332c52`](https://github.com/apache/spark/commit/c

[GitHub] spark pull request #14099: [SPARK-16432] Empty blocks fail to serialize due ...

2016-07-07 Thread ericl
GitHub user ericl opened a pull request: https://github.com/apache/spark/pull/14099 [SPARK-16432] Empty blocks fail to serialize due to assert in ChunkedByteBuffer ## What changes were proposed in this pull request? It's possible to also change the callers to not pass in em

[GitHub] spark issue #14099: [SPARK-16432] Empty blocks fail to serialize due to asse...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14099 **[Test build #61944 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61944/consoleFull)** for PR 14099 at commit [`9ce8146`](https://github.com/apache/spark/commit/9

[GitHub] spark pull request #13680: [SPARK-15962][SQL] Introduce implementation with ...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13680#discussion_r70010528 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeArrayData.java --- @@ -341,63 +327,115 @@ public UnsafeArrayData copy

[GitHub] spark pull request #13680: [SPARK-15962][SQL] Introduce implementation with ...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13680#discussion_r70010656 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/UnsafeArrayDataBenchmark.scala --- @@ -0,0 +1,215 @@ +/* + * Licensed t

[GitHub] spark pull request #14082: [SPARK-16381][SQL][SparkR] Update SQL examples an...

2016-07-07 Thread keypointt
Github user keypointt commented on a diff in the pull request: https://github.com/apache/spark/pull/14082#discussion_r70011061 --- Diff: examples/src/main/r/RSparkSQLExample.R --- @@ -0,0 +1,175 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark pull request #14082: [SPARK-16381][SQL][SparkR] Update SQL examples an...

2016-07-07 Thread keypointt
Github user keypointt commented on a diff in the pull request: https://github.com/apache/spark/pull/14082#discussion_r70011133 --- Diff: examples/src/main/r/RSparkSQLExample.R --- @@ -0,0 +1,175 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark issue #13680: [SPARK-15962][SQL] Introduce implementation with a dense...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13680 LGTM except some minor comments, it's pretty close! One easy-to-ignore comment: https://github.com/apache/spark/pull/13680/files#r69849567 --- If your project is set up for it, you can reply to t

[GitHub] spark issue #13374: [SPARK-13638][SQL] Add escapeAll option to CSV DataFrame...

2016-07-07 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/13374 @jurriaan should this be called quoteAll rather than escapeAll? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark issue #13969: [SPARK-16284][SQL] Implement reflect SQL function

2016-07-07 Thread petermaxlee
Github user petermaxlee commented on the issue: https://github.com/apache/spark/pull/13969 Ping! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14096 **[Test build #61942 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61942/consoleFull)** for PR 14096 at commit [`c332c52`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14096 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14096 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61942/ Test PASSed. ---

[GitHub] spark pull request #14082: [SPARK-16381][SQL][SparkR] Update SQL examples an...

2016-07-07 Thread keypointt
Github user keypointt commented on a diff in the pull request: https://github.com/apache/spark/pull/14082#discussion_r70011796 --- Diff: examples/src/main/r/RSparkSQLExample.R --- @@ -0,0 +1,175 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14096 Hi, @shivaram . Now, it's ready for review again. Please let me know if there is something to do more. Thank you! --- If your project is set up for it, you can reply to this email

[GitHub] spark pull request #14100: [SPARK-16433][SQL]Improve StreamingQuery.explain ...

2016-07-07 Thread zsxwing
GitHub user zsxwing opened a pull request: https://github.com/apache/spark/pull/14100 [SPARK-16433][SQL]Improve StreamingQuery.explain when no data arrives ## What changes were proposed in this pull request? Display `No physical plan. Waiting for data.` instead of `N/A` for

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread shivaram
Github user shivaram commented on the issue: https://github.com/apache/spark/pull/14096 LGTM. Thanks @dongjoon-hyun -- Merging this to master, branch-2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project doe

[GitHub] spark pull request #14096: [SPARK-16425][R] `describe()` should not fail wit...

2016-07-07 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14096 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is ena

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13991 **[Test build #61947 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61947/consoleFull)** for PR 13991 at commit [`2d48ae5`](https://github.com/apache/spark/commit/2

[GitHub] spark issue #14082: [SPARK-16381][SQL][SparkR] Update SQL examples and progr...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14082 **[Test build #61946 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61946/consoleFull)** for PR 14082 at commit [`1af09f3`](https://github.com/apache/spark/commit/1

[GitHub] spark issue #14100: [SPARK-16433][SQL]Improve StreamingQuery.explain when no...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14100 **[Test build #61945 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61945/consoleFull)** for PR 14100 at commit [`e00bc53`](https://github.com/apache/spark/commit/e

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/13991 For this one I think we should consider supporting only foldable literals for the path component. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHu

[GitHub] spark pull request #14082: [SPARK-16381][SQL][SparkR] Update SQL examples an...

2016-07-07 Thread shivaram
Github user shivaram commented on a diff in the pull request: https://github.com/apache/spark/pull/14082#discussion_r70012365 --- Diff: examples/src/main/r/RSparkSQLExample.R --- @@ -0,0 +1,175 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more +

[GitHub] spark issue #14094: [SPARK-16430][SQL][STREAMING] Add option maxFilesPerTrig...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14094 **[Test build #61936 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61936/consoleFull)** for PR 14094 at commit [`9663b42`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14094: [SPARK-16430][SQL][STREAMING] Add option maxFilesPerTrig...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14094 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61936/ Test PASSed. ---

[GitHub] spark issue #14094: [SPARK-16430][SQL][STREAMING] Add option maxFilesPerTrig...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14094 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14096: [SPARK-16425][R] `describe()` should not fail with non-n...

2016-07-07 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14096 Thank you! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark pull request #14071: [SPARK-16397][SQL] make CatalogTable more general...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/14071#discussion_r70012893 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/createDataSourceTables.scala --- @@ -403,17 +400,18 @@ object CreateDataSourceTab

[GitHub] spark pull request #14101: [SPARK-16431] [ML] Add a unified method that acce...

2016-07-07 Thread husseinhazimeh
GitHub user husseinhazimeh opened a pull request: https://github.com/apache/spark/pull/14101 [SPARK-16431] [ML] Add a unified method that accepts single instances to feature transformers and predictors ## What changes were proposed in this pull request? Current feature transform

[GitHub] spark issue #14065: [SPARK-14743][YARN][WIP] Add a configurable token manage...

2016-07-07 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/14065 Thanks a lot @tgravescs and @vanzin for your suggestions, I will change the codes accordingly, greatly appreciate your comments. --- If your project is set up for it, you can reply to this email

[GitHub] spark issue #14101: [SPARK-16431] [ML] Add a unified method that accepts sin...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14101 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feat

[GitHub] spark pull request #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dis...

2016-07-07 Thread HyukjinKwon
GitHub user HyukjinKwon opened a pull request: https://github.com/apache/spark/pull/14102 [SPARK-16434][SQL][WIP] Avoid record-per type dispatch in JSON when reading ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix)

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14102 **[Test build #61948 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61948/consoleFull)** for PR 14102 at commit [`74fa944`](https://github.com/apache/spark/commit/7

[GitHub] spark issue #14101: [SPARK-16431] [ML] Add a unified method that accepts sin...

2016-07-07 Thread husseinhazimeh
Github user husseinhazimeh commented on the issue: https://github.com/apache/spark/pull/14101 @mengxr @sethah can you review this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this f

[GitHub] spark issue #14082: [SPARK-16381][SQL][SparkR] Update SQL examples and progr...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14082 **[Test build #61946 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61946/consoleFull)** for PR 14082 at commit [`1af09f3`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14082: [SPARK-16381][SQL][SparkR] Update SQL examples and progr...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14082 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14082: [SPARK-16381][SQL][SparkR] Update SQL examples and progr...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14082 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61946/ Test PASSed. ---

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/13991 Also - rather than having concrete implementations for all of these, why don't we use RuntimeReplaceable? --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14102 **[Test build #61949 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61949/consoleFull)** for PR 14102 at commit [`2d77f66`](https://github.com/apache/spark/commit/2

[GitHub] spark issue #14028: [SPARK-16351][SQL] Avoid per-record type dispatch in JSO...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14028 **[Test build #61943 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61943/consoleFull)** for PR 14028 at commit [`6570a98`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14028: [SPARK-16351][SQL] Avoid per-record type dispatch in JSO...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14028 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14028: [SPARK-16351][SQL] Avoid per-record type dispatch in JSO...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14028 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61943/ Test PASSed. ---

[GitHub] spark pull request #14071: [SPARK-16397][SQL] make CatalogTable more general...

2016-07-07 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14071#discussion_r70017144 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/MetastoreRelation.scala --- @@ -162,25 +147,28 @@ private[hive] case class MetastoreRelation(

[GitHub] spark issue #14099: [SPARK-16432] Empty blocks fail to serialize due to asse...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14099 **[Test build #61944 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61944/consoleFull)** for PR 14099 at commit [`9ce8146`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14099: [SPARK-16432] Empty blocks fail to serialize due to asse...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14099 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14099: [SPARK-16432] Empty blocks fail to serialize due to asse...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14099 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61944/ Test FAILed. ---

[GitHub] spark issue #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan for inpu...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13890 **[Test build #61950 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61950/consoleFull)** for PR 13890 at commit [`e218f5f`](https://github.com/apache/spark/commit/e

[GitHub] spark pull request #14008: [SPARK-16281][SQL] Implement parse_url SQL functi...

2016-07-07 Thread janplus
Github user janplus commented on a diff in the pull request: https://github.com/apache/spark/pull/14008#discussion_r70018330 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -652,6 +654,145 @@ case class StringRPad(str:

[GitHub] spark issue #13701: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-07-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13701 @gatorsmile This is the benchmark results. No significant difference. Before this patch: Java HotSpot(TM) 64-Bit Server VM 1.8.0_71-b15 on Linux 3.19.0-25-generic Intel(R

[GitHub] spark issue #14100: [SPARK-16433][SQL]Improve StreamingQuery.explain when no...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14100 **[Test build #61945 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61945/consoleFull)** for PR 14100 at commit [`e00bc53`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14100: [SPARK-16433][SQL]Improve StreamingQuery.explain when no...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14100 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61945/ Test PASSed. ---

[GitHub] spark issue #14100: [SPARK-16433][SQL]Improve StreamingQuery.explain when no...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14100 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14102 **[Test build #61948 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61948/consoleFull)** for PR 14102 at commit [`74fa944`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14102 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61948/ Test FAILed. ---

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14102 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13991 **[Test build #61947 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61947/consoleFull)** for PR 13991 at commit [`2d48ae5`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13991 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61947/ Test PASSed. ---

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13991 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark pull request #14065: [SPARK-14743][YARN][WIP] Add a configurable token...

2016-07-07 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/14065#discussion_r70020146 --- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -390,8 +390,9 @@ private[spark] class Client( // Upload Spark and th

[GitHub] spark issue #14008: [SPARK-16281][SQL] Implement parse_url SQL function

2016-07-07 Thread janplus
Github user janplus commented on the issue: https://github.com/apache/spark/pull/14008 cc @cloud-fan @rxin @liancheng I did optimization for Literal `part`, so we don't need to check for every row. But since we may not assume in all circumstances the `part` is Literal, I keep the

[GitHub] spark issue #14100: [SPARK-16433][SQL]Improve StreamingQuery.explain when no...

2016-07-07 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/14100 cc @tdas --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14102 **[Test build #61949 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61949/consoleFull)** for PR 14102 at commit [`2d77f66`](https://github.com/apache/spark/commit/

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14102 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61949/ Test FAILed. ---

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14102 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark pull request #14008: [SPARK-16281][SQL] Implement parse_url SQL functi...

2016-07-07 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/14008#discussion_r70020485 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -652,6 +654,152 @@ case class StringRPad(str: Ex

[GitHub] spark issue #14071: [SPARK-16397][SQL] make CatalogTable more general and le...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14071 **[Test build #61951 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61951/consoleFull)** for PR 14071 at commit [`9efe46e`](https://github.com/apache/spark/commit/9

[GitHub] spark pull request #14071: [SPARK-16397][SQL] make CatalogTable more general...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/14071#discussion_r70020600 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala --- @@ -45,35 +45,28 @@ case class CatalogFunction( */

[GitHub] spark issue #13778: [SPARK-16062][SPARK-15989][SQL] Fix two bugs of Python-o...

2016-07-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13778 Python UDT in python side only serializes the python data to sql type defined in the Python UDT. The problem now is happened at the serialization to row in java side on the serialized python data. I

[GitHub] spark pull request #14008: [SPARK-16281][SQL] Implement parse_url SQL functi...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/14008#discussion_r70020929 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -652,6 +654,152 @@ case class StringRPad(st

[GitHub] spark pull request #14008: [SPARK-16281][SQL] Implement parse_url SQL functi...

2016-07-07 Thread janplus
Github user janplus commented on a diff in the pull request: https://github.com/apache/spark/pull/14008#discussion_r70021325 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -652,6 +654,152 @@ case class StringRPad(str:

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13991 **[Test build #61953 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61953/consoleFull)** for PR 13991 at commit [`d30d891`](https://github.com/apache/spark/commit/d

[GitHub] spark issue #14083: [SPARK-16406][SQL] Improve performance of LogicalPlan.re...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14083 **[Test build #61952 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61952/consoleFull)** for PR 14083 at commit [`a5d1a4a`](https://github.com/apache/spark/commit/a

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread petermaxlee
Github user petermaxlee commented on the issue: https://github.com/apache/spark/pull/13991 I pushed a new change to this. We now have better error messages and test coverage for those. These expressions also now require foldable paths. I also changed the test values to make su

[GitHub] spark issue #13701: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-07-07 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/13701 Great! Around 5% performance penalty looks OK to me. Maybe we can send the code changes to the performance team for seeing TPC-DS improvement. CC @jfchen --- If your project is set up for it, y

[GitHub] spark issue #13991: [SPARK-16318][SQL] Implement various xpath functions

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13991 **[Test build #61954 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61954/consoleFull)** for PR 13991 at commit [`02df488`](https://github.com/apache/spark/commit/0

[GitHub] spark issue #14103: [SPARK-16436][SQL] checkEvaluation support NaN and Runti...

2016-07-07 Thread petermaxlee
Github user petermaxlee commented on the issue: https://github.com/apache/spark/pull/14103 cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, o

[GitHub] spark pull request #14103: [SPARK-16436][SQL] checkEvaluation support NaN an...

2016-07-07 Thread petermaxlee
GitHub user petermaxlee opened a pull request: https://github.com/apache/spark/pull/14103 [SPARK-16436][SQL] checkEvaluation support NaN and RuntimeReplaceable ## What changes were proposed in this pull request? This small patch modifies ExpressionEvalHelper. checkEvaluation to s

[GitHub] spark issue #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan for inpu...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13890 **[Test build #61950 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61950/consoleFull)** for PR 13890 at commit [`e218f5f`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan for inpu...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13890 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61950/ Test PASSed. ---

[GitHub] spark issue #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan for inpu...

2016-07-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13890 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #14103: [SPARK-16436][SQL] checkEvaluation support NaN and Runti...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14103 **[Test build #61955 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61955/consoleFull)** for PR 14103 at commit [`0278dea`](https://github.com/apache/spark/commit/0

[GitHub] spark issue #13701: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-07-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13701 @gatorsmile You know that the benchmark results will not the same every time even you run it with the same codes. If the difference is under a small range, we can assume they have no significant diff

[GitHub] spark issue #13701: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-07-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13701 @gatorsmile BTW, if the TPC-DS performance is measured with 2.0 codebase, this should benefit the performance. --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] spark issue #14095: [SPARK-16429][SQL] Include `StringType` columns in `desc...

2016-07-07 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/14095 Can you fix Python? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #13701: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-07-07 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/13701 @viirya If you run multiple times and still see 5% performance difference, you can confirm the penalty is around 5%. However, this might also depend on the other factors, e.g., the total time. Th

[GitHub] spark issue #14102: [SPARK-16434][SQL][WIP] Avoid record-per type dispatch i...

2016-07-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14102 **[Test build #61956 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61956/consoleFull)** for PR 14102 at commit [`9dbcec7`](https://github.com/apache/spark/commit/9

[GitHub] spark issue #13701: [SPARK-15639][SQL] Try to push down filter at RowGroups ...

2016-07-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13701 @gatorsmile I think the times I run the benchmark is not enough to confirm there is 5% performance difference. But I think it is not important here because we don't want to measure the exact performa

[GitHub] spark pull request #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan f...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13890#discussion_r70024013 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/ExistingRDD.scala --- @@ -74,13 +74,71 @@ object RDDConversions { } }

[GitHub] spark issue #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan for inpu...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13890 LGTM, cc @liancheng to take another look --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #14103: [SPARK-16436][SQL] checkEvaluation support NaN an...

2016-07-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/14103#discussion_r70024354 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ExpressionEvalHelper.scala --- @@ -41,7 +41,12 @@ trait ExpressionEvalHelp

<    1   2   3   4   5   6   7   >