[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user guoxu1231 commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-162417597 BTW, There is a JIRA to suggest user to prefer using c['column'] instead of c.column. @davies, we encountered similar issue, could you paste the JIRA number for reference? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-157095257 @viirya This PR only help a two corner cases, and they are not blocker (could be easily workaround), I'd like to not fix these (avoid the complexity). If we merge this one, some one may continue to submit similar PR to address other corner cases. BTW, There is a JIRA to suggest user to prefer using `c['column']` instead of `c.column`. Does this make sense? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-157030709 ping @davies Any more comments? Or should we close this pr? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya closed the pull request at: https://github.com/apache/spark/pull/8934 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-157230557 @davies Yes. Thanks. I close this one now. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-147303344 ping @davies How about this updated version? Or you still think it is not worth due to the possible performance regressions? As in this version it only looks for `count` and `index` in the list `__fields__` in `__setattr__`, the performance impact can be ignored? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-147027117 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/8934#discussion_r41692322 --- Diff: python/pyspark/sql/types.py --- @@ -1209,6 +1219,12 @@ def __new__(self, *args, **kwargs): else: raise ValueError("No args or kwargs") +def __init__(self, *args, **kwargs): +if hasattr(self, "__fields__") and "count" in self.__fields__: --- End diff -- Now I updated it to looking for special column names in dictionary instead of list. It should be better. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-147027121 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-147028783 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43513/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-147028766 [Test build #43513 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43513/console) for PR 8934 at commit [`ded6853`](https://github.com/apache/spark/commit/ded6853a97c5616ff021a63b7968cc9e6eb33d1f). * This patch **passes all tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-147028782 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-147027398 [Test build #43513 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43513/consoleFull) for PR 8934 at commit [`ded6853`](https://github.com/apache/spark/commit/ded6853a97c5616ff021a63b7968cc9e6eb33d1f). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146302967 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146302941 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146303673 [Test build #43338 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43338/consoleFull) for PR 8934 at commit [`f48ad0d`](https://github.com/apache/spark/commit/f48ad0d4614b040324b69448b195cb583880872f). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146308987 [Test build #43338 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43338/console) for PR 8934 at commit [`f48ad0d`](https://github.com/apache/spark/commit/f48ad0d4614b040324b69448b195cb583880872f). * This patch **passes all tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146309175 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146309211 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43338/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146138449 [Test build #43325 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43325/consoleFull) for PR 8934 at commit [`d454512`](https://github.com/apache/spark/commit/d4545123fd573eca29dcf5aaee2476e8a36029e9). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146143887 [Test build #43325 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43325/console) for PR 8934 at commit [`d454512`](https://github.com/apache/spark/commit/d4545123fd573eca29dcf5aaee2476e8a36029e9). * This patch **fails PySpark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146143957 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146143958 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43325/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146136395 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146136421 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146260317 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146260284 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146264091 [Test build #43328 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43328/console) for PR 8934 at commit [`05fb9e6`](https://github.com/apache/spark/commit/05fb9e67d9b07c71c8629d31742b445ecbc58522). * This patch **fails Python style tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146264095 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146262065 [Test build #43328 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43328/consoleFull) for PR 8934 at commit [`05fb9e6`](https://github.com/apache/spark/commit/05fb9e67d9b07c71c8629d31742b445ecbc58522). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146264096 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43328/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146071744 @davies Yes. Currently as reported in the JIRA, using `x.asDict()["count"]` instead of `x.count` can be to work around this. But I think it would be better to avoid this. I updated this patch that should not introduce performance regressions. Please review it if you have time. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146072217 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146072238 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146072983 [Test build #43317 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43317/console) for PR 8934 at commit [`f7bdde3`](https://github.com/apache/spark/commit/f7bdde386482a2d60a7d7727cf3f9b9d4db27e0e). * This patch **fails Python style tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146072943 [Test build #43317 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43317/consoleFull) for PR 8934 at commit [`f7bdde3`](https://github.com/apache/spark/commit/f7bdde386482a2d60a7d7727cf3f9b9d4db27e0e). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146072985 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43317/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-146072984 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/8934#discussion_r41346674 --- Diff: python/pyspark/sql/types.py --- @@ -1209,6 +1219,12 @@ def __new__(self, *args, **kwargs): else: raise ValueError("No args or kwargs") +def __init__(self, *args, **kwargs): +if hasattr(self, "__fields__") and "count" in self.__fields__: --- End diff -- To do this check once per class might be not possible because we can only know the field names during initialise a row. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/8934#discussion_r41347200 --- Diff: python/pyspark/sql/types.py --- @@ -1189,6 +1189,16 @@ class Row(tuple):>>> Person("Alice", 11) Row(name='Alice', age=11) + +Some special column names such as aggregated column count, should --- End diff -- ok. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-145609363 @viirya It will be great we can fix it magically. I'm worried that the current approach will introduce some performance regressions. As we always have a way to workaround it using `row["count"]` (similar to escape column names in SQL), so it's not a blocker for uses. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/8934#discussion_r41173178 --- Diff: python/pyspark/sql/types.py --- @@ -1189,6 +1189,16 @@ class Row(tuple):>>> Person("Alice", 11) Row(name='Alice', age=11) + +Some special column names such as aggregated column count, should --- End diff -- These kind of tests should be in sql/tests.py --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/8934#discussion_r41172960 --- Diff: python/pyspark/sql/types.py --- @@ -1209,6 +1219,12 @@ def __new__(self, *args, **kwargs): else: raise ValueError("No args or kwargs") +def __init__(self, *args, **kwargs): +if hasattr(self, "__fields__") and "count" in self.__fields__: --- End diff -- Should we check all the names of method? `self.__fields__` is an list, the `in` could be expensive. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-145489665 ping @davies --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144103835 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43080/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144112736 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144101807 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144101851 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144103683 [Test build #43080 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43080/consoleFull) for PR 8934 at commit [`0140148`](https://github.com/apache/spark/commit/0140148b02969f6db8eea0144333a2b9cb0d8e11). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144112765 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144119798 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43096/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144103826 [Test build #43080 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43080/console) for PR 8934 at commit [`0140148`](https://github.com/apache/spark/commit/0140148b02969f6db8eea0144333a2b9cb0d8e11). * This patch **fails Python style tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144103832 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144113041 [Test build #43096 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43096/consoleFull) for PR 8934 at commit [`62c2b44`](https://github.com/apache/spark/commit/62c2b44777685cd33641bd9f0f49da0342f3d1ce). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144119797 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8934#issuecomment-144119646 [Test build #43096 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/43096/console) for PR 8934 at commit [`62c2b44`](https://github.com/apache/spark/commit/62c2b44777685cd33641bd9f0f49da0342f3d1ce). * This patch **passes all tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-10852][PySpark][SQL] Override built-in ...
GitHub user viirya opened a pull request: https://github.com/apache/spark/pull/8934 [SPARK-10852][PySpark][SQL] Override built-in methods for special column names JIRA: https://issues.apache.org/jira/browse/SPARK-10852 For few special columns such as `count` and `index`, because they are built-in methods of `Row(tuple)` in Python. We should override them in order to properly access the values. You can merge this pull request into a Git repository by running: $ git pull https://github.com/viirya/spark-1 fix-py-special-column Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/8934.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #8934 commit 017b7ce7beea266670c1f645950b0b7e8777ab27 Author: Liang-Chi HsiehDate: 2015-09-29T04:44:48Z Override built-in methods for special column names. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org