[GitHub] [spark] SparkQA commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672107180 **[Test build #127341 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127341/testReport)** for PR 28841 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29399: [SPARK-32581][CORE] update duration property for live ui application list and application api

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29399: URL: https://github.com/apache/spark/pull/29399#issuecomment-671325783 Can one of the admins verify this patch? This is an automated message from the Apache Git

[GitHub] [spark] mridulm commented on pull request #29399: [SPARK-32581][CORE] update duration property for live ui application list and application api

2020-08-11 Thread GitBox
mridulm commented on pull request #29399: URL: https://github.com/apache/spark/pull/29399#issuecomment-672103211 ok to test

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672095098 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672091551 **[Test build #127339 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127339/testReport)** for PR 28841 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672095088 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672093140 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672095088

[GitHub] [spark] SparkQA commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672095051 **[Test build #127339 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127339/testReport)** for PR 28841 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28804: [SPARK-31973][SQL] Skip partial aggregates if grouping keys have high cardinality

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28804: URL: https://github.com/apache/spark/pull/28804#issuecomment-672092513

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672093114 Merged build finished. Test FAILed.

[GitHub] [spark] SparkQA removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671985182 **[Test build #127330 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127330/testReport)** for PR 28841 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672093114

[GitHub] [spark] AmplabJenkins commented on pull request #28804: [SPARK-31973][SQL] Skip partial aggregates if grouping keys have high cardinality

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28804: URL: https://github.com/apache/spark/pull/28804#issuecomment-672092513

[GitHub] [spark] SparkQA commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672092463 **[Test build #127330 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127330/testReport)** for PR 28841 at commit

[GitHub] [spark] SparkQA commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672091551 **[Test build #127339 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127339/testReport)** for PR 28841 at commit

[GitHub] [spark] SparkQA commented on pull request #28804: [SPARK-31973][SQL] Skip partial aggregates if grouping keys have high cardinality

2020-08-11 Thread GitBox
SparkQA commented on pull request #28804: URL: https://github.com/apache/spark/pull/28804#issuecomment-672091607 **[Test build #127340 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127340/testReport)** for PR 28804 at commit

[GitHub] [spark] karuppayya commented on a change in pull request #28804: [SPARK-31973][SQL] Skip partial aggregates if grouping keys have high cardinality

2020-08-11 Thread GitBox
karuppayya commented on a change in pull request #28804: URL: https://github.com/apache/spark/pull/28804#discussion_r468725811 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/HashAggregateExec.scala ## @@ -838,13 +880,17 @@ case class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672086902

[GitHub] [spark] AmplabJenkins commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-672086902

[GitHub] [spark] tinhto-000 edited a comment on pull request #29383: [SPARK-31703][SQL] Parquet RLE float/double are read incorrectly on big endian platforms

2020-08-11 Thread GitBox
tinhto-000 edited a comment on pull request #29383: URL: https://github.com/apache/spark/pull/29383#issuecomment-672072900

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29407: [SPARK-32588][TEST] Fix SizeEstimator initialization in tests.

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29407: URL: https://github.com/apache/spark/pull/29407#issuecomment-672081413

[GitHub] [spark] AmplabJenkins commented on pull request #29407: [SPARK-32588][TEST] Fix SizeEstimator initialization in tests.

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29407: URL: https://github.com/apache/spark/pull/29407#issuecomment-672081413

[GitHub] [spark] SparkQA removed a comment on pull request #29407: [SPARK-32588][TEST] Fix SizeEstimator initialization in tests.

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #29407: URL: https://github.com/apache/spark/pull/29407#issuecomment-671967193 **[Test build #127327 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127327/testReport)** for PR 29407 at commit

[GitHub] [spark] SparkQA commented on pull request #29407: [SPARK-32588][TEST] Fix SizeEstimator initialization in tests.

2020-08-11 Thread GitBox
SparkQA commented on pull request #29407: URL: https://github.com/apache/spark/pull/29407#issuecomment-672080116 **[Test build #127327 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127327/testReport)** for PR 29407 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-672077137

[GitHub] [spark] AmplabJenkins commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-672077137

[GitHub] [spark] SparkQA commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
SparkQA commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-672075384 **[Test build #127338 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127338/testReport)** for PR 28617 at commit

[GitHub] [spark] tinhto-000 commented on pull request #29383: [SPARK-31703][SQL] Parquet RLE float/double are read incorrectly on big endian platforms

2020-08-11 Thread GitBox
tinhto-000 commented on pull request #29383: URL: https://github.com/apache/spark/pull/29383#issuecomment-672072900 @kiszk thanks for the great suggestion and for pointing that out. It turned out that ColumnarBatchSuite.scala failed on big endian as well. The "Double APIs" and "Float APIs" are using

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29383: [SPARK-31703][SQL] Parquet RLE float/double are read incorrectly on big endian platforms

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29383: URL: https://github.com/apache/spark/pull/29383#issuecomment-672071427

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-672071382

[GitHub] [spark] AmplabJenkins commented on pull request #29383: [SPARK-31703][SQL] Parquet RLE float/double are read incorrectly on big endian platforms

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29383: URL: https://github.com/apache/spark/pull/29383#issuecomment-672071427

[GitHub] [spark] AmplabJenkins commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-672071382

[GitHub] [spark] SparkQA commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-11 Thread GitBox
SparkQA commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-672070572 **[Test build #127337 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127337/testReport)** for PR 29334 at commit

[GitHub] [spark] SparkQA commented on pull request #29383: [SPARK-31703][SQL] Parquet RLE float/double are read incorrectly on big endian platforms

2020-08-11 Thread GitBox
SparkQA commented on pull request #29383: URL: https://github.com/apache/spark/pull/29383#issuecomment-672070547 **[Test build #127336 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127336/testReport)** for PR 29383 at commit

[GitHub] [spark] ScrapCodes commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-11 Thread GitBox
ScrapCodes commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-672067365 Jenkins, retest this please.

[GitHub] [spark] stczwd commented on a change in pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
stczwd commented on a change in pull request #28617: URL: https://github.com/apache/spark/pull/28617#discussion_r468705991 ## File path: sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsAtomicPartitionManagement.java ## @@ -0,0 +1,87 @@ +/* + *

[GitHub] [spark] cloud-fan closed pull request #29369: [SPARK-32540][SQL] Eliminate the filter clause in aggregate

2020-08-11 Thread GitBox
cloud-fan closed pull request #29369: URL: https://github.com/apache/spark/pull/29369

[GitHub] [spark] cloud-fan edited a comment on pull request #29369: [SPARK-32540][SQL] Eliminate the filter clause in aggregate

2020-08-11 Thread GitBox
cloud-fan edited a comment on pull request #29369: URL: https://github.com/apache/spark/pull/29369#issuecomment-672065014 thanks, merging to master!

[GitHub] [spark] cloud-fan commented on pull request #29369: [SPARK-32540][SQL] Eliminate the filter clause in aggregate

2020-08-11 Thread GitBox
cloud-fan commented on pull request #29369: URL: https://github.com/apache/spark/pull/29369#issuecomment-672065014 thanks, merging to master/3.0!

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29395: [3.0][SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29395: URL: https://github.com/apache/spark/pull/29395#issuecomment-672060253

[GitHub] [spark] AmplabJenkins commented on pull request #29395: [3.0][SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29395: URL: https://github.com/apache/spark/pull/29395#issuecomment-672060253

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672059164

[GitHub] [spark] AmplabJenkins commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672059164

[GitHub] [spark] SparkQA removed a comment on pull request #29395: [3.0][SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #29395: URL: https://github.com/apache/spark/pull/29395#issuecomment-671949201 **[Test build #127324 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127324/testReport)** for PR 29395 at commit

[GitHub] [spark] SparkQA commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
SparkQA commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672057796 **[Test build #127335 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127335/testReport)** for PR 29360 at commit

[GitHub] [spark] SparkQA commented on pull request #29395: [3.0][SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-11 Thread GitBox
SparkQA commented on pull request #29395: URL: https://github.com/apache/spark/pull/29395#issuecomment-672058069 **[Test build #127324 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127324/testReport)** for PR 29395 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
cloud-fan commented on a change in pull request #28617: URL: https://github.com/apache/spark/pull/28617#discussion_r468692685 ## File path: sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsAtomicPartitionManagement.java ## @@ -0,0 +1,87 @@ +/* + *

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29408: [WIP][SPARK-32250][CORE] Check whether the MasterSuite.test(SPARK-27510...) is still flaky in Github Action

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29408: URL: https://github.com/apache/spark/pull/29408#issuecomment-672040573

[GitHub] [spark] AmplabJenkins commented on pull request #29408: [WIP][SPARK-32250][CORE] Check whether the MasterSuite.test(SPARK-27510...) is still flaky in Github Action

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29408: URL: https://github.com/apache/spark/pull/29408#issuecomment-672040573

[GitHub] [spark] tgravescs commented on pull request #28972: [SPARK-30794][CORE] Stage Level scheduling: Add ability to set off heap memory

2020-08-11 Thread GitBox
tgravescs commented on pull request #28972: URL: https://github.com/apache/spark/pull/28972#issuecomment-672040063 Maybe we need to document it better. It's just the programmatic API to set the config spark.memory.offHeap.size when spark.memory.offHeap.enabled=true.

[GitHub] [spark] SparkQA commented on pull request #29408: [WIP][SPARK-32250][CORE] Check whether the MasterSuite.test(SPARK-27510...) is still flaky in Github Action

2020-08-11 Thread GitBox
SparkQA commented on pull request #29408: URL: https://github.com/apache/spark/pull/29408#issuecomment-672036626 **[Test build #127323 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127323/testReport)** for PR 29408 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29408: [WIP][SPARK-32250][CORE] Check whether the MasterSuite.test(SPARK-27510...) is still flaky in Github Action

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #29408: URL: https://github.com/apache/spark/pull/29408#issuecomment-671940797 **[Test build #127323 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127323/testReport)** for PR 29408 at commit

[GitHub] [spark] mridulm commented on pull request #28972: [SPARK-30794][CORE] Stage Level scheduling: Add ability to set off heap memory

2020-08-11 Thread GitBox
mridulm commented on pull request #28972: URL: https://github.com/apache/spark/pull/28972#issuecomment-672031888 I was interpreting this config to mean that these tasks necessarily require up to `X mb` of off-heap memory for the tasks themselves to run (for example, native code/library making

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29366: [SPARK-32550][SQL] Make SpecificInternalRow constructors faster by using while loops instead of maps

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29366: URL: https://github.com/apache/spark/pull/29366#issuecomment-672029353

[GitHub] [spark] AmplabJenkins commented on pull request #29366: [SPARK-32550][SQL] Make SpecificInternalRow constructors faster by using while loops instead of maps

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29366: URL: https://github.com/apache/spark/pull/29366#issuecomment-672029353

[GitHub] [spark] SparkQA removed a comment on pull request #29366: [SPARK-32550][SQL] Make SpecificInternalRow constructors faster by using while loops instead of maps

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #29366: URL: https://github.com/apache/spark/pull/29366#issuecomment-671886135 **[Test build #127322 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127322/testReport)** for PR 29366 at commit

[GitHub] [spark] SparkQA commented on pull request #29366: [SPARK-32550][SQL] Make SpecificInternalRow constructors faster by using while loops instead of maps

2020-08-11 Thread GitBox
SparkQA commented on pull request #29366: URL: https://github.com/apache/spark/pull/29366#issuecomment-672028198 **[Test build #127322 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127322/testReport)** for PR 29366 at commit

[GitHub] [spark] mridulm commented on a change in pull request #28939: [SPARK-32119][CORE] ExecutorPlugin doesn't work with Standalone Cluster and Kubernetes with --jars

2020-08-11 Thread GitBox
mridulm commented on a change in pull request #28939: URL: https://github.com/apache/spark/pull/28939#discussion_r468677043 ## File path: core/src/main/scala/org/apache/spark/executor/Executor.scala ## @@ -220,6 +220,21 @@ private[spark] class Executor(

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-672020096

[GitHub] [spark] AmplabJenkins commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-672020096

[GitHub] [spark] SparkQA commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
SparkQA commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-672019338 **[Test build #127334 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127334/testReport)** for PR 28617 at commit

[GitHub] [spark] tgravescs commented on pull request #28972: [SPARK-30794][CORE] Stage Level scheduling: Add ability to set off heap memory

2020-08-11 Thread GitBox
tgravescs commented on pull request #28972: URL: https://github.com/apache/spark/pull/28972#issuecomment-672019171 This PR doesn't change how it's actually used; it just allows you to get it via ResourceProfile. The code has always just added it to the container size requested from YARN.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672012063 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672008431 **[Test build #127333 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127333/testReport)** for PR 29360 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672012047 Merged build finished. Test FAILed.

[GitHub] [spark] SparkQA commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
SparkQA commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672012029 **[Test build #127333 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127333/testReport)** for PR 29360 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672012047

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672009312

[GitHub] [spark] AmplabJenkins commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672009312

[GitHub] [spark] mridulm edited a comment on pull request #28972: [SPARK-30794][CORE] Stage Level scheduling: Add ability to set off heap memory

2020-08-11 Thread GitBox
mridulm edited a comment on pull request #28972: URL: https://github.com/apache/spark/pull/28972#issuecomment-672007797 @tgravescs Had a query regarding this PR: do we configure off-heap memory (-XX:MaxDirectMemorySize, etc.) to ensure that what is set via the flag can be honoured by the JVM? Or is

[GitHub] [spark] SparkQA commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
SparkQA commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672008431 **[Test build #127333 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127333/testReport)** for PR 29360 at commit

[GitHub] [spark] mridulm commented on pull request #28972: [SPARK-30794][CORE] Stage Level scheduling: Add ability to set off heap memory

2020-08-11 Thread GitBox
mridulm commented on pull request #28972: URL: https://github.com/apache/spark/pull/28972#issuecomment-672007797 @tgravescs Had a query regarding this PR: do we configure off-heap memory (-XX:MaxDirectMemorySize, etc.) to ensure that what is set via the flag can be honoured by the JVM? Or is this

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672004353

[GitHub] [spark] AmplabJenkins commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672004353

[GitHub] [spark] SparkQA commented on pull request #29360: [SPARK-32542][SQL]Add a Batch in Optimizer to improve performance in multidimensional analysis

2020-08-11 Thread GitBox
SparkQA commented on pull request #29360: URL: https://github.com/apache/spark/pull/29360#issuecomment-672003736 **[Test build #127332 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127332/testReport)** for PR 29360 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-671994718 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-671971592 **[Test build #127328 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127328/testReport)** for PR 28617 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-671994707 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-671994707 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28617: [SPARK-31694][SQL] Add SupportsPartitions APIs on DataSourceV2

2020-08-11 Thread GitBox
SparkQA commented on pull request #28617: URL: https://github.com/apache/spark/pull/28617#issuecomment-671994599 **[Test build #127328 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127328/testReport)** for PR 28617 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671990635 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671990635 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] cchighman commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671990105 > Ah, one more; could you add fine-grained tests for the path filter implementations? Probably, you can create a dedicated test suite like

[GitHub] [spark] SparkQA commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671989831 **[Test build #127331 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127331/testReport)** for PR 28841 at commit

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468636352 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/pathfilters/ModifiedAfterFilter.scala ## @@ -0,0 +1,59 @@ +/* + *

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468636840 ## File path: python/pyspark/sql/readwriter.py ## @@ -114,6 +114,14 @@ def option(self, key, value): * ``pathGlobFilter``: an optional

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468636699 ## File path: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala ## @@ -467,6 +467,14 @@ class DataFrameReader

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468636142 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/pathfilters/FileIndexFilter.scala ## @@ -0,0 +1,26 @@ +/* + *

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468636017 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/pathfilters/FileIndexFilter.scala ## @@ -0,0 +1,26 @@ +/* + *

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468635076 ## File path: sql/core/src/test/scala/org/apache/spark/sql/FileBasedDataSourceSuite.scala ## @@ -561,6 +564,367 @@ class FileBasedDataSourceSuite

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468634664 ## File path: sql/core/src/test/scala/org/apache/spark/sql/FileBasedDataSourceSuite.scala ## @@ -561,6 +564,367 @@ class FileBasedDataSourceSuite

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671985860 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671985860 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
SparkQA commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-671985182 **[Test build #127330 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127330/testReport)** for PR 28841 at commit

[GitHub] [spark] cchighman commented on a change in pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-08-11 Thread GitBox
cchighman commented on a change in pull request #28841: URL: https://github.com/apache/spark/pull/28841#discussion_r468630906 ## File path: docs/sql-data-sources-generic-options.md ## @@ -119,3 +119,37 @@ To load all files recursively, you can use: {% include_example

[GitHub] [spark] AmplabJenkins commented on pull request #29406: [SPARK-32352][SQL] Partially push down support data filter if it mixed in partition filters

2020-08-11 Thread GitBox
AmplabJenkins commented on pull request #29406: URL: https://github.com/apache/spark/pull/29406#issuecomment-671982538 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29406: [SPARK-32352][SQL] Partially push down support data filter if it mixed in partition filters

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29406: URL: https://github.com/apache/spark/pull/29406#issuecomment-671982538 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #29406: [SPARK-32352][SQL] Partially push down support data filter if it mixed in partition filters

2020-08-11 Thread GitBox
SparkQA removed a comment on pull request #29406: URL: https://github.com/apache/spark/pull/29406#issuecomment-671856304 **[Test build #127321 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127321/testReport)** for PR 29406 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29270: [SPARK-32466][TEST][SQL] Add PlanStabilitySuite to detect SparkPlan regression

2020-08-11 Thread GitBox
AmplabJenkins removed a comment on pull request #29270: URL: https://github.com/apache/spark/pull/29270#issuecomment-671981538 This is an automated message from the Apache Git Service. To respond to the message, please log on
