Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/20265
Thank you so much, @cloud-fan and @gatorsmile !
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For
Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/20265
thanks, merging to master/2.3!
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands,
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86257/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86257 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86257/testReport)**
for PR 20265 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86251/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86251 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86251/testReport)**
for PR 20265 at commit
Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/20265
There might be many questions about ORC (or Parquet) performance
benchmarks. We can do that later. We cannot enumerate all cases. Also, users
can do that for their own workload. In fact,
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86257 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86257/testReport)**
for PR 20265 at commit
Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/20265
@gatorsmile . The number of rows are also changed. Why do you think so?
---
-
To unsubscribe, e-mail:
Github user gatorsmile commented on the issue:
https://github.com/apache/spark/pull/20265
ORC performs further better when the number of columns is small. Maybe also
add test cases back to show this observations?
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86251 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86251/testReport)**
for PR 20265 at commit
Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/20265
LGTM except one comment. Let's worry about row group/stripe size later,
since both parquet and orc use default settings, I think it's still fair.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86200/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86200 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86200/testReport)**
for PR 20265 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86200 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86200/testReport)**
for PR 20265 at commit
Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/20265
I updated the PR (except one RowGroupSize/OrcStripeSize part).
---
-
To unsubscribe, e-mail:
Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/20265
I'll update the PR tomorrow.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands,
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86143/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86143 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86143/testReport)**
for PR 20265 at commit
Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/20265
I think we need to make sure parquet row group size and orc strip size is
same, to make this benchmark fair.
---
-
To
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86143 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86143/testReport)**
for PR 20265 at commit
Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/20265
Hi, @cloud-fan and @gatorsmile .
Your questions are valid for all PPD cases. According to the comments, I
added the following expressions (positive and negative) for both ORC/Parquet.
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/20265
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86116/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86116 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86116/testReport)**
for PR 20265 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/20265
**[Test build #86116 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86116/testReport)**
for PR 20265 at commit
Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/20265
cc @cloud-fan , @gatorsmile .
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands,
31 matches
Mail list logo