dbtsai commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902475893
> Everything depends on the data lifecycle. For safety, we can control
it by reducing `spark.sql.fileMetaCache.ttlSinceLastAccessSec` to `10 secs` or
less which is still eff
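To make the time-to-live idea in the quoted comment concrete, here is a minimal Python sketch of a cache whose entries expire a fixed number of seconds after their last access. `AccessTtlCache`, `load_footer`, and the injectable `clock` are hypothetical names chosen for illustration; this is not Spark's implementation, and the setting name is known here only from the comment above.

```python
import time
from typing import Any, Callable, Dict, Tuple


class AccessTtlCache:
    """Toy cache: an entry expires ttl_sec seconds after its last access."""

    def __init__(self, ttl_sec: float, clock: Callable[[], float] = time.monotonic):
        self._ttl = ttl_sec
        self._clock = clock  # injectable so the example can use a fake clock
        self._entries: Dict[str, Tuple[float, Any]] = {}  # key -> (last access, value)

    def get(self, key: str, load: Callable[[str], Any]) -> Any:
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and now - entry[0] <= self._ttl:
            value = entry[1]   # still fresh: reuse the cached value
        else:
            value = load(key)  # missing or expired: reload
        self._entries[key] = (now, value)  # every access resets the timer
        return value


# Usage with a fake clock so the behaviour is deterministic.
fake_now = [0.0]
loads = []


def load_footer(path: str) -> str:
    # Hypothetical loader standing in for reading file metadata.
    loads.append(path)
    return f"footer-of-{path}"


cache = AccessTtlCache(ttl_sec=10, clock=lambda: fake_now[0])
cache.get("part-0.orc", load_footer)  # first access loads
fake_now[0] = 8.0
cache.get("part-0.orc", load_footer)  # 8 s since last access: cached
fake_now[0] = 17.0
cache.get("part-0.orc", load_footer)  # 9 s since last access: cached
fake_now[0] = 30.0
result = cache.get("part-0.orc", load_footer)  # 13 s since last access: reloaded
```

A shorter TTL narrows the window in which a stale entry can be served, at the cost of more reloads.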
Ngone51 closed pull request #33782:
URL: https://github.com/apache/spark/pull/33782
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: reviews-unsubs
Ngone51 commented on pull request #33782:
URL: https://github.com/apache/spark/pull/33782#issuecomment-902470428
GA passed. Merged to branch-3.0, thanks!
SparkQA commented on pull request #33795:
URL: https://github.com/apache/spark/pull/33795#issuecomment-902469200
**[Test build #142665 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142665/testReport)**
for PR 33795 at commit
[`3ccc0f8`](https://github.com
Ngone51 commented on pull request #33795:
URL: https://github.com/apache/spark/pull/33795#issuecomment-902468291
cc @cloud-fan
Ngone51 opened a new pull request #33795:
URL: https://github.com/apache/spark/pull/33795
### What changes were proposed in this pull request?
Instead of exiting the executor within the RpcEnv's thread, exit the
executor in a separate thread.
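The pattern described above can be sketched in a few lines. This is a generic Python illustration, not the Scala change in the pull request: `exit_in_separate_thread`, `slow_exit`, and `on_fatal_message` are hypothetical names, and the blocking `release.wait()` merely stands in for whatever work an exit may perform. The point shown is only that the handler returns without waiting for the exit to finish.

```python
import threading
from typing import Callable, List


def exit_in_separate_thread(exit_fn: Callable[[], None],
                            name: str = "executor-self-exit") -> threading.Thread:
    """Run exit_fn on a dedicated daemon thread and return immediately."""
    thread = threading.Thread(target=exit_fn, name=name, daemon=True)
    thread.start()
    return thread


events: List[str] = []
release = threading.Event()


def slow_exit() -> None:
    # Stand-in for an exit path that may block (assumption for illustration).
    release.wait()
    events.append("exited")


def on_fatal_message() -> threading.Thread:
    # Imagine this running on the message-handling thread: it hands the
    # exit off and returns without waiting for it.
    thread = exit_in_separate_thread(slow_exit)
    events.append("handler returned")
    return thread


exit_thread = on_fatal_message()
release.set()       # let the simulated exit proceed
exit_thread.join()  # wait only so the example finishes deterministically
```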
### Why are the changes needed?
AmplabJenkins removed a comment on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902450754
AmplabJenkins commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902466437
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47164/
itholic commented on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902452005
LGTM.
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902452047
Kubernetes integration test status success
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47164/
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902450734
Kubernetes integration test status success
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47163/
AmplabJenkins commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902450754
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47163/
yoda-mon commented on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902450179
Updated.
LuciferYang commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902447683
> Yea, but it adds complexity and more memory consumption like you mentioned
earlier, and you'll need to have the driver be a long-running process like a
Presto coordinator
LuciferYang commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902447020
> which I'm not sure how many people are using Spark this way.
There should be many. We could run a survey, haha ~
sunchao commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-90200
Yea, but it adds complexity and more memory consumption like you mentioned
earlier, and you'll need to have the driver be a long-running process like a
Presto coordinator, which I'
AmplabJenkins removed a comment on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902443736
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47165/
AmplabJenkins commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902443736
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47165/
AmplabJenkins commented on pull request #33790:
URL: https://github.com/apache/spark/pull/33790#issuecomment-902443745
Can one of the admins verify this patch?
itholic edited a comment on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902442698
Nice!! Could you also add the screen captures to the PR description?
LuciferYang edited a comment on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902439693
> I understand you want to avoid the duplicate footer lookup. In Parquet at
least we can just pass the footer from either ParquetFileFormat or
ParquetPartitionReaderF
itholic commented on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902442698
> @gengliangwang @HyukjinKwon
> Thank you for your advice. I have put screenshots of the areas around the images below.
>
> Environment
>
> * Windows 10
> * Google Chrome 9
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902442032
Kubernetes integration test unable to build dist.
exiting with code: 1
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47165/
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902439939
Kubernetes integration test starting
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47164/
LuciferYang commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902439693
> I understand you want to avoid the duplicate footer lookup. In Parquet at
least we can just pass the footer from either ParquetFileFormat or
ParquetPartitionReaderFactory
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902438633
Kubernetes integration test starting
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47163/
sunchao commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902435389
> Can we add ctime or mtime of the file to the PartitionedFile and use this
information for the check?
Yea, file path + modification time seem like a good way to validate the c
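As a rough illustration of validating a cached entry by file path plus modification time, here is a small Python sketch. It is not the approach implemented in the pull request: `MtimeValidatedCache` and `read_all` are hypothetical names, and calling `os.stat` at lookup time is a simplification, since the quoted suggestion is to carry the timestamp on the `PartitionedFile` rather than to stat the file again.

```python
import os
import tempfile
from typing import Any, Callable, Dict, List, Tuple


class MtimeValidatedCache:
    """Toy cache keyed by path; an entry is reused only if the mtime matches."""

    def __init__(self, load: Callable[[str], Any]):
        self._load = load
        self._entries: Dict[str, Tuple[int, Any]] = {}  # path -> (mtime_ns, value)

    def get(self, path: str) -> Any:
        mtime_ns = os.stat(path).st_mtime_ns
        cached = self._entries.get(path)
        if cached is not None and cached[0] == mtime_ns:
            return cached[1]      # same path and same mtime: treat as valid
        value = self._load(path)  # new or rewritten file: reload
        self._entries[path] = (mtime_ns, value)
        return value


loads: List[str] = []


def read_all(path: str) -> bytes:
    # Hypothetical loader standing in for reading file metadata.
    loads.append(path)
    with open(path, "rb") as handle:
        return handle.read()


cache = MtimeValidatedCache(read_all)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "part_0")  # hypothetical file name
    with open(path, "wb") as handle:
        handle.write(b"v1")
    os.utime(path, ns=(1_000_000_000, 1_000_000_000))  # pin mtime for determinism
    first = cache.get(path)
    second = cache.get(path)   # unchanged mtime: served from the cache

    # Overwrite the file under the same name, as an insert overwrite might.
    with open(path, "wb") as handle:
        handle.write(b"v2")
    os.utime(path, ns=(2_000_000_000, 2_000_000_000))
    third = cache.get(path)    # mtime changed: reloaded
```

A check like this only helps when a rewrite actually changes the recorded modification time; a rewrite that leaves the timestamp unchanged would still be served from the cache.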
ulysses-you commented on a change in pull request #32816:
URL: https://github.com/apache/spark/pull/32816#discussion_r692656713
##
File path:
sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AdaptiveSparkPlanExec.scala
##
@@ -100,24 +100,34 @@ case class Adaptiv
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902426885
**[Test build #142664 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142664/testReport)**
for PR 32816 at commit
[`8058fe9`](https://github.com
xuanyuanking commented on pull request #33763:
URL: https://github.com/apache/spark/pull/33763#issuecomment-902426850
cc @viirya
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902425654
**[Test build #142663 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142663/testReport)**
for PR 32816 at commit
[`f5ad40e`](https://github.com
yoda-mon commented on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902425495
@gengliangwang @HyukjinKwon
Thank you for your advice. I have put screenshots of the areas around the images below.
Environment
- Windows 10
- Google Chrome 92.0.4515.159
SparkQA commented on pull request #32816:
URL: https://github.com/apache/spark/pull/32816#issuecomment-902424243
**[Test build #142662 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142662/testReport)**
for PR 32816 at commit
[`b54e9c2`](https://github.com
AmplabJenkins removed a comment on pull request #33673:
URL: https://github.com/apache/spark/pull/33673#issuecomment-902423683
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142657/
AmplabJenkins removed a comment on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902423685
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47161/
AmplabJenkins removed a comment on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902423684
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47162/
AmplabJenkins removed a comment on pull request #33644:
URL: https://github.com/apache/spark/pull/33644#issuecomment-902423682
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142651/
AmplabJenkins commented on pull request #33644:
URL: https://github.com/apache/spark/pull/33644#issuecomment-902423682
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142651/
AmplabJenkins commented on pull request #33673:
URL: https://github.com/apache/spark/pull/33673#issuecomment-902423683
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142657/
AmplabJenkins commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902423684
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47162/
AmplabJenkins commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902423685
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47161/
ulysses-you commented on a change in pull request #32816:
URL: https://github.com/apache/spark/pull/32816#discussion_r692653369
##
File path:
sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/AdaptiveSparkPlanExec.scala
##
@@ -656,13 +687,54 @@ case class Adaptiv
SparkQA removed a comment on pull request #33673:
URL: https://github.com/apache/spark/pull/33673#issuecomment-902320624
**[Test build #142657 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142657/testReport)**
for PR 33673 at commit
[`c089a25`](https://gi
SparkQA removed a comment on pull request #33644:
URL: https://github.com/apache/spark/pull/33644#issuecomment-902186886
**[Test build #142651 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142651/testReport)**
for PR 33644 at commit
[`f728a02`](https://gi
SparkQA commented on pull request #33673:
URL: https://github.com/apache/spark/pull/33673#issuecomment-902418529
**[Test build #142657 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142657/testReport)**
for PR 33673 at commit
[`c089a25`](https://github.co
AngersZh commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902415615
> @AngersZh the change seems fine, but let's make sure to have a detailed
PR description, e.g., with an example request and output, how you tested,
etc.
How ab
SparkQA commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902413809
Kubernetes integration test status success
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47162/
SparkQA commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902411553
Kubernetes integration test status success
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47161/
ulysses-you commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902411428
Thank you all for the approvals. Also FYI @dongjoon-hyun
SparkQA commented on pull request #33644:
URL: https://github.com/apache/spark/pull/33644#issuecomment-902411198
**[Test build #142651 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142651/testReport)**
for PR 33644 at commit
[`f728a02`](https://github.co
AmplabJenkins removed a comment on pull request #33791:
URL: https://github.com/apache/spark/pull/33791#issuecomment-902405026
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142649/
AmplabJenkins commented on pull request #33791:
URL: https://github.com/apache/spark/pull/33791#issuecomment-902405026
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142649/
SparkQA commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902402284
Kubernetes integration test starting
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47162/
HyukjinKwon commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902402175
@AngersZh the change seems fine, but let's make sure to have a detailed PR
description, e.g., with an example request and output, how you tested, etc.
SparkQA removed a comment on pull request #33791:
URL: https://github.com/apache/spark/pull/33791#issuecomment-902148745
**[Test build #142649 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142649/testReport)**
for PR 33791 at commit
[`9ad5f55`](https://gi
SparkQA commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902401157
Kubernetes integration test starting
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47161/
HyukjinKwon commented on a change in pull request #32184:
URL: https://github.com/apache/spark/pull/32184#discussion_r692624659
##
File path: docs/job-scheduling.md
##
@@ -252,10 +252,11 @@ properties:
The pool properties can be set by creating an XML file, similar to
`conf
SparkQA commented on pull request #33791:
URL: https://github.com/apache/spark/pull/33791#issuecomment-902391748
**[Test build #142649 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142649/testReport)**
for PR 33791 at commit
[`9ad5f55`](https://github.co
SparkQA removed a comment on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902386507
**[Test build #142661 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142661/testReport)**
for PR 33794 at commit
[`1497e27`](https://gi
AmplabJenkins removed a comment on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902390150
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142661/
AmplabJenkins commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902390150
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142661/
SparkQA commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902390038
**[Test build #142661 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142661/testReport)**
for PR 33794 at commit
[`1497e27`](https://github.co
AmplabJenkins removed a comment on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902388917
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142660/
AmplabJenkins commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902388917
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/142660/
SparkQA removed a comment on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902385021
**[Test build #142660 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142660/testReport)**
for PR 33793 at commit
[`9ce909f`](https://gi
SparkQA commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902388816
**[Test build #142660 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142660/testReport)**
for PR 33793 at commit
[`9ce909f`](https://github.co
HyukjinKwon commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902388665
cc @gengliangwang and @Ngone51 FYI
gengliangwang closed pull request #33791:
URL: https://github.com/apache/spark/pull/33791
gengliangwang commented on pull request #33791:
URL: https://github.com/apache/spark/pull/33791#issuecomment-902388151
Merging to master/3.2
SparkQA commented on pull request #33794:
URL: https://github.com/apache/spark/pull/33794#issuecomment-902386507
**[Test build #142661 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142661/testReport)**
for PR 33794 at commit
[`1497e27`](https://github.com
ulysses-you opened a new pull request #33794:
URL: https://github.com/apache/spark/pull/33794
### What changes were proposed in this pull request?
* improve docs in `docs/job-scheduling.md`
* add migration guide docs in `docs/core-migration-guide.md`
### Why are the
SparkQA commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902385021
**[Test build #142660 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/142660/testReport)**
for PR 33793 at commit
[`9ce909f`](https://github.com
AmplabJenkins removed a comment on pull request #32397:
URL: https://github.com/apache/spark/pull/32397#issuecomment-902384464
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47160/
AmplabJenkins commented on pull request #32397:
URL: https://github.com/apache/spark/pull/32397#issuecomment-902384464
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/47160/
HyukjinKwon commented on pull request #33784:
URL: https://github.com/apache/spark/pull/33784#issuecomment-902384034
I am fine with reverting if somebody feels strongly about this.
LuciferYang edited a comment on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902383334
> In Hive it's common that the same file name (e.g., 00_0) gets used
when doing insert overwrite. Even if we check file size and other stuff, it
can't completely
AngersZh commented on pull request #33793:
URL: https://github.com/apache/spark/pull/33793#issuecomment-902383422
ping @zsxwing @srowen @HyukjinKwon
LuciferYang commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902383334
> In Hive it's common that the same file name (e.g., 00_0) gets used
when doing insert overwrite. Even if we check file size and other stuff, it
can't completely prevent
AngersZh opened a new pull request #33793:
URL: https://github.com/apache/spark/pull/33793
### What changes were proposed in this pull request?
Document in the monitoring doc that `taskStatus` supports multiple values.
### Why are the changes needed?
Make the doc clearer.
### Does thi
HyukjinKwon commented on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902382928
Yeah, it would be great to have some screenshots in the PR description.
LuciferYang edited a comment on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902378554
> Since the metadata is cached in the executor, does it mean the task
reading the same ORC file has to be scheduled on the same executor? How can we
guarantee this?
LuciferYang commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902378554
> Since the metadata is cached in the executor, does it mean the task
reading the same ORC file has to be scheduled on the same executor? How can we
guarantee this?
A
AngersZh commented on a change in pull request #31165:
URL: https://github.com/apache/spark/pull/31165#discussion_r692609348
##
File path: docs/monitoring.md
##
@@ -479,11 +479,14 @@ can be identified by their `[attempt-id]`. In the API
listed below, when running
/app
SparkQA commented on pull request #32397:
URL: https://github.com/apache/spark/pull/32397#issuecomment-902378474
Kubernetes integration test status failure
URL:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/47160/
beliefer commented on pull request #33787:
URL: https://github.com/apache/spark/pull/33787#issuecomment-902376762
@gengliangwang Thanks.
itholic edited a comment on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902372078
> Also, should we make it `pandas-APIs-on-Spark` instead of
`pandas-on-Spark`?
We use "pandas APIs on Spark" as an official name, but sometimes we use
"pandas-on-S
itholic edited a comment on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902372078
We use "pandas APIs on Spark" as the official name, but sometimes we use
"pandas-on-Spark" as an abbreviation when the full name reads unnaturally.
For example, l
itholic edited a comment on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902372078
We use "pandas APIs on Spark" as the official name, but sometimes we use
"pandas-on-Spark" as an abbreviation when the full name reads unnaturally.
For example, i
itholic commented on pull request #33786:
URL: https://github.com/apache/spark/pull/33786#issuecomment-902372078
We use "pandas APIs on Spark" as the official name, but sometimes we use
"pandas-on-Spark" as a shortened name, since sentences can otherwise look unnatural.
For example, in the ca
HeartSaVioR edited a comment on pull request #33784:
URL: https://github.com/apache/spark/pull/33784#issuecomment-902369462
It looks like not everyone agrees on the performance benefit, so it cannot
be the rationale for introducing the new dependency.
In ot
HeartSaVioR commented on pull request #33784:
URL: https://github.com/apache/spark/pull/33784#issuecomment-902369462
It looks like not everyone agrees on the performance benefit, so it cannot
be the rationale for introducing the new dependency.
In other per
ulysses-you commented on a change in pull request #32184:
URL: https://github.com/apache/spark/pull/32184#discussion_r692600086
##
File path: docs/job-scheduling.md
##
@@ -252,10 +252,11 @@ properties:
The pool properties can be set by creating an XML file, similar to
`conf
dongjoon-hyun commented on pull request #33748:
URL: https://github.com/apache/spark/pull/33748#issuecomment-902367102
@dbtsai and @sunchao .
Everything depends on the data lifecycle. For safety, we can control it
by reducing `spark.sql.fileMetaCache.ttlSinceLastAccessSec` to `10
ulysses-you commented on a change in pull request #32184:
URL: https://github.com/apache/spark/pull/32184#discussion_r692598890
##
File path: docs/job-scheduling.md
##
@@ -252,10 +252,11 @@ properties:
The pool properties can be set by creating an XML file, similar to
`conf
HeartSaVioR closed pull request #33792:
URL: https://github.com/apache/spark/pull/33792
HeartSaVioR commented on pull request #33792:
URL: https://github.com/apache/spark/pull/33792#issuecomment-902365811
Thanks! Merging to master/3.2.
dgd-contributor commented on a change in pull request #33752:
URL: https://github.com/apache/spark/pull/33752#discussion_r692597349
##
File path: python/pyspark/pandas/tests/test_ops_on_diff_frames.py
##
@@ -1955,6 +1955,28 @@ def test_pow_and_rpow(self):
with self.ass
dgd-contributor commented on a change in pull request #33752:
URL: https://github.com/apache/spark/pull/33752#discussion_r692597267
##
File path: python/pyspark/pandas/series.py
##
@@ -944,6 +944,50 @@ def between(self, left: Any, right: Any, inclusive: bool =
True) -> "Series