[GitHub] [spark] AmplabJenkins commented on pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #29078: URL: https://github.com/apache/spark/pull/29078#issuecomment-657289386 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
SparkQA commented on pull request #29078: URL: https://github.com/apache/spark/pull/29078#issuecomment-657289370 **[Test build #125735 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125735/testReport)** for PR 29078 at commit

[GitHub] [spark] c21 commented on pull request #28123: [SPARK-31350][SQL] Coalesce bucketed tables for sort merge join if applicable

2020-07-12 Thread GitBox
c21 commented on pull request #28123: URL: https://github.com/apache/spark/pull/28123#issuecomment-657289241 Thank you @maropu and @imback82 > As for (1) and (3), IMO its worth digging into it for more improvements. For (1): I created a PR to cover shuffled hash join as well

[GitHub] [spark] maropu commented on pull request #29079: [SPARK-32286][SQL] Coalesce bucketed table for shuffled hash join if applicable

2020-07-12 Thread GitBox
maropu commented on pull request #29079: URL: https://github.com/apache/spark/pull/29079#issuecomment-657289124 Could you show us performance numbers in the PR description, first? Thanks. This is an automated message from

[GitHub] [spark] c21 commented on pull request #29079: [SPARK-32286][SQL] Coalesce bucketed table for shuffled hash join if applicable

2020-07-12 Thread GitBox
c21 commented on pull request #29079: URL: https://github.com/apache/spark/pull/29079#issuecomment-657288265 @maropu, @cloud-fan @gatorsmile @sameeragarwal Could you help check this PR? Thanks. This is an automated message

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29079: [SPARK-32286][SQL] Coalesce bucketed table for shuffled hash join if applicable

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #29079: URL: https://github.com/apache/spark/pull/29079#issuecomment-657288056 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29079: [SPARK-32286][SQL] Coalesce bucketed table for shuffled hash join if applicable

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #29079: URL: https://github.com/apache/spark/pull/29079#issuecomment-657288056 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657287698 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #29079: [SPARK-32286][SQL] Coalesce bucketed table for shuffled hash join if applicable

2020-07-12 Thread GitBox
SparkQA commented on pull request #29079: URL: https://github.com/apache/spark/pull/29079#issuecomment-657287917 **[Test build #125736 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125736/testReport)** for PR 29079 at commit

[GitHub] [spark] c21 opened a new pull request #29079: [SPARK-32286][SQL] Coalesce bucketed table for shuffled hash join if applicable

2020-07-12 Thread GitBox
c21 opened a new pull request #29079: URL: https://github.com/apache/spark/pull/29079 ### What changes were proposed in this pull request? Based on a follow up comment in https://github.com/apache/spark/pull/28123, where we can coalesce buckets for shuffled hash join as well.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657287696 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
SparkQA removed a comment on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657271641 **[Test build #125726 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125726/testReport)** for PR 27694 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657287696 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
SparkQA commented on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657287574 **[Test build #125726 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125726/testReport)** for PR 27694 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #29078: URL: https://github.com/apache/spark/pull/29078#issuecomment-657287258 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #29078: URL: https://github.com/apache/spark/pull/29078#issuecomment-657287258 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] dongjoon-hyun commented on pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
dongjoon-hyun commented on pull request #29078: URL: https://github.com/apache/spark/pull/29078#issuecomment-657287265 Thank you, @srowen . This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-12 Thread GitBox
dongjoon-hyun commented on a change in pull request #29053: URL: https://github.com/apache/spark/pull/29053#discussion_r453373125 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/PropagateEmptyRelation.scala ## @@ -50,8 +50,24 @@ object

[GitHub] [spark] SparkQA commented on pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
SparkQA commented on pull request #29078: URL: https://github.com/apache/spark/pull/29078#issuecomment-657287127 **[Test build #125735 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125735/testReport)** for PR 29078 at commit

[GitHub] [spark] srowen commented on a change in pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
srowen commented on a change in pull request #29078: URL: https://github.com/apache/spark/pull/29078#discussion_r453373000 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/HadoopFileLinesReader.scala ## @@ -46,7 +46,7 @@ class

[GitHub] [spark] srowen opened a new pull request #29078: [SPARK-29292][STREAMING][SQL][BUILD] Get streaming, catalyst, sql compiling for Scala 2.13

2020-07-12 Thread GitBox
srowen opened a new pull request #29078: URL: https://github.com/apache/spark/pull/29078 ### What changes were proposed in this pull request? Continuation of https://github.com/apache/spark/pull/28971 which lets streaming, catalyst and sql compile for 2.13. Same idea. ### Why

[GitHub] [spark] dongjoon-hyun closed pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
dongjoon-hyun closed pull request #29061: URL: https://github.com/apache/spark/pull/29061 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
dongjoon-hyun commented on a change in pull request #29061: URL: https://github.com/apache/spark/pull/29061#discussion_r453371417 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NormalizeFloatingNumbers.scala ## @@ -116,6 +116,15 @@ object

[GitHub] [spark] viirya commented on a change in pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
viirya commented on a change in pull request #29061: URL: https://github.com/apache/spark/pull/29061#discussion_r453368110 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NormalizeFloatingNumbers.scala ## @@ -116,6 +116,15 @@ object

[GitHub] [spark] dongjoon-hyun closed pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
dongjoon-hyun closed pull request #29076: URL: https://github.com/apache/spark/pull/29076 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] dongjoon-hyun commented on pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
dongjoon-hyun commented on pull request #29076: URL: https://github.com/apache/spark/pull/29076#issuecomment-657280588 Yes. Right. Thanks, @viirya . Merged to master. This is an automated message from the Apache

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
dongjoon-hyun commented on a change in pull request #29061: URL: https://github.com/apache/spark/pull/29061#discussion_r453367243 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NormalizeFloatingNumbers.scala ## @@ -116,6 +116,15 @@ object

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due t

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-657279975 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due t

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-657279972 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due to spar

2020-07-12 Thread GitBox
SparkQA removed a comment on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-657263279 **[Test build #125723 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125723/testReport)** for PR 28287 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due to spark'

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-657279972 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due to spark's blac

2020-07-12 Thread GitBox
SparkQA commented on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-657279814 **[Test build #125723 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125723/testReport)** for PR 28287 at commit

[GitHub] [spark] viirya commented on a change in pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
viirya commented on a change in pull request #29061: URL: https://github.com/apache/spark/pull/29061#discussion_r453366149 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NormalizeFloatingNumbers.scala ## @@ -116,6 +116,15 @@ object

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #29061: URL: https://github.com/apache/spark/pull/29061#issuecomment-657277984 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #29061: URL: https://github.com/apache/spark/pull/29061#issuecomment-657277984 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
SparkQA removed a comment on pull request #29061: URL: https://github.com/apache/spark/pull/29061#issuecomment-657242968 **[Test build #125719 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125719/testReport)** for PR 29061 at commit

[GitHub] [spark] SparkQA commented on pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
SparkQA commented on pull request #29061: URL: https://github.com/apache/spark/pull/29061#issuecomment-657277811 **[Test build #125719 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125719/testReport)** for PR 29061 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-657277525 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-657277525 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-12 Thread GitBox
SparkQA removed a comment on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-657242990 **[Test build #125720 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125720/testReport)** for PR 29053 at commit

[GitHub] [spark] SparkQA commented on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-12 Thread GitBox
SparkQA commented on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-657277324 **[Test build #125720 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125720/testReport)** for PR 29053 at commit

[GitHub] [spark] dongjoon-hyun commented on pull request #29075: [SPARK-32284][SQL] Avoid expanding too many CNF predicates in partition pruning

2020-07-12 Thread GitBox
dongjoon-hyun commented on pull request #29075: URL: https://github.com/apache/spark/pull/29075#issuecomment-657275435 Thank you for updating, @gengliangwang . Shall we adjust this test case name accordingly together? ``` test("SPARK-32284: Avoid pushing down too many predicates in

[GitHub] [spark] dongjoon-hyun edited a comment on pull request #29075: [SPARK-32284][SQL] Avoid expanding too many CNF predicates in partition pruning

2020-07-12 Thread GitBox
dongjoon-hyun edited a comment on pull request #29075: URL: https://github.com/apache/spark/pull/29075#issuecomment-657275435 Thank you for updating, @gengliangwang . Shall we adjust this test case name accordingly together? ``` test("SPARK-32284: Avoid pushing down too many

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
dongjoon-hyun commented on a change in pull request #29061: URL: https://github.com/apache/spark/pull/29061#discussion_r453362715 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NormalizeFloatingNumbers.scala ## @@ -116,6 +116,15 @@ object

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #29061: [SPARK-32258][SQL] NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-12 Thread GitBox
dongjoon-hyun commented on a change in pull request #29061: URL: https://github.com/apache/spark/pull/29061#discussion_r453362715 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/NormalizeFloatingNumbers.scala ## @@ -116,6 +116,15 @@ object

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29077: [SPARk-31985][SS] Remove incomplete/undocumented stateful aggregation in continuous mode

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #29077: URL: https://github.com/apache/spark/pull/29077#issuecomment-657273250 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29077: [SPARk-31985][SS] Remove incomplete/undocumented stateful aggregation in continuous mode

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #29077: URL: https://github.com/apache/spark/pull/29077#issuecomment-657273250 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29077: [SPARk-31985][SS] Remove incomplete/undocumented stateful aggregation in continuous mode

2020-07-12 Thread GitBox
SparkQA commented on pull request #29077: URL: https://github.com/apache/spark/pull/29077#issuecomment-657273147 **[Test build #125734 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125734/testReport)** for PR 29077 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #24990: [SPARK-28191][SS] New data source - state - reader part

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #24990: URL: https://github.com/apache/spark/pull/24990#issuecomment-657272433 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
dongjoon-hyun commented on a change in pull request #29076: URL: https://github.com/apache/spark/pull/29076#discussion_r453361348 ## File path: .github/workflows/master.yml ## @@ -1,6 +1,9 @@ name: master on: + push: +branches: +- master Review comment: We

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
dongjoon-hyun commented on a change in pull request #29076: URL: https://github.com/apache/spark/pull/29076#discussion_r453361348 ## File path: .github/workflows/master.yml ## @@ -1,6 +1,9 @@ name: master on: + push: +branches: +- master Review comment: We

[GitHub] [spark] SparkQA removed a comment on pull request #24990: [SPARK-28191][SS] New data source - state - reader part

2020-07-12 Thread GitBox
SparkQA removed a comment on pull request #24990: URL: https://github.com/apache/spark/pull/24990#issuecomment-657271690 **[Test build #125732 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125732/testReport)** for PR 24990 at commit

[GitHub] [spark] HeartSaVioR opened a new pull request #29077: [SPARk-31985][SS] Remove incomplete/undocumented stateful aggregation in continuous mode

2020-07-12 Thread GitBox
HeartSaVioR opened a new pull request #29077: URL: https://github.com/apache/spark/pull/29077 ### What changes were proposed in this pull request? This removes the undocumented and incomplete feature of "stateful aggregation" in continuous mode, which would reduce 1100+ lines of

[GitHub] [spark] AmplabJenkins removed a comment on pull request #24990: [SPARK-28191][SS] New data source - state - reader part

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #24990: URL: https://github.com/apache/spark/pull/24990#issuecomment-657272431 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #24990: [SPARK-28191][SS] New data source - state - reader part

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #24990: URL: https://github.com/apache/spark/pull/24990#issuecomment-657272431 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #24990: [SPARK-28191][SS] New data source - state - reader part

2020-07-12 Thread GitBox
SparkQA commented on pull request #24990: URL: https://github.com/apache/spark/pull/24990#issuecomment-657272426 **[Test build #125732 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125732/testReport)** for PR 24990 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-657271860 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-657271835 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27620: [SPARK-30866][SS] FileStreamSource: Cache fetched list of files beyond maxFilesPerTrigger as unread files

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #27620: URL: https://github.com/apache/spark/pull/27620#issuecomment-657271828 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-657271823 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28422: [SPARK-17604][SS] FileStreamSource: provide a new option to have retention on input files

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28422: URL: https://github.com/apache/spark/pull/28422#issuecomment-657271786 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-657271806 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657271816 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-657271860 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-657271835 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-657271823 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657271814 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #27620: [SPARK-30866][SS] FileStreamSource: Cache fetched list of files beyond maxFilesPerTrigger as unread files

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #27620: URL: https://github.com/apache/spark/pull/27620#issuecomment-657271828 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-657271806 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-07-12 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-657271700 **[Test build #125733 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125733/testReport)** for PR 24173 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28422: [SPARK-17604][SS] FileStreamSource: provide a new option to have retention on input files

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #28422: URL: https://github.com/apache/spark/pull/28422#issuecomment-657271786 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #24990: [SPARK-28191][SS] New data source - state - reader part

2020-07-12 Thread GitBox
SparkQA commented on pull request #24990: URL: https://github.com/apache/spark/pull/24990#issuecomment-657271690 **[Test build #125732 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125732/testReport)** for PR 24990 at commit

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-07-12 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-657271658 **[Test build #125729 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125729/testReport)** for PR 27333 at commit

[GitHub] [spark] SparkQA commented on pull request #28422: [SPARK-17604][SS] FileStreamSource: provide a new option to have retention on input files

2020-07-12 Thread GitBox
SparkQA commented on pull request #28422: URL: https://github.com/apache/spark/pull/28422#issuecomment-657271684 **[Test build #125724 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125724/testReport)** for PR 28422 at commit

[GitHub] [spark] SparkQA commented on pull request #27620: [SPARK-30866][SS] FileStreamSource: Cache fetched list of files beyond maxFilesPerTrigger as unread files

2020-07-12 Thread GitBox
SparkQA commented on pull request #27620: URL: https://github.com/apache/spark/pull/27620#issuecomment-657271647 **[Test build #125728 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125728/testReport)** for PR 27620 at commit

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-07-12 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-657271639 **[Test build #125725 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125725/testReport)** for PR 28363 at commit

[GitHub] [spark] SparkQA commented on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
SparkQA commented on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657271641 **[Test build #125726 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125726/testReport)** for PR 27694 at commit

[GitHub] [spark] SparkQA commented on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-07-12 Thread GitBox
SparkQA commented on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-657271662 **[Test build #125730 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125730/testReport)** for PR 26935 at commit

[GitHub] [spark] SparkQA commented on pull request #25965: [SPARK-26425][SS] Add more constraint checks to avoid checkpoint corruption

2020-07-12 Thread GitBox
SparkQA commented on pull request #25965: URL: https://github.com/apache/spark/pull/25965#issuecomment-657271688 **[Test build #125731 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125731/testReport)** for PR 25965 at commit

[GitHub] [spark] SparkQA commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-07-12 Thread GitBox
SparkQA commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-657271683 **[Test build #125727 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125727/testReport)** for PR 27649 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-657270985 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] HeartSaVioR commented on pull request #25965: [SPARK-26425][SS] Add more constraint checks to avoid checkpoint corruption

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #25965: URL: https://github.com/apache/spark/pull/25965#issuecomment-657271117 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-657271102 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #27620: [SPARK-30866][SS] FileStreamSource: Cache fetched list of files beyond maxFilesPerTrigger as unread files

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #27620: URL: https://github.com/apache/spark/pull/27620#issuecomment-657271100 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #27694: [SPARK-30946][SS] Serde entry via DataInputStream/DataOutputStream with LZ4 compression on FileStream(Source/Sink)Log

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #27694: URL: https://github.com/apache/spark/pull/27694#issuecomment-657271094 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #28422: [SPARK-17604][SS] FileStreamSource: provide a new option to have retention on input files

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #28422: URL: https://github.com/apache/spark/pull/28422#issuecomment-657271083 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-657271138 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #24990: [SPARK-28191][SS] New data source - state - reader part

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #24990: URL: https://github.com/apache/spark/pull/24990#issuecomment-657271133 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-657271097 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins commented on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-657270985 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] HeartSaVioR commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-657270999 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-657270944 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins commented on pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #29076: URL: https://github.com/apache/spark/pull/29076#issuecomment-657269623 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #29076: URL: https://github.com/apache/spark/pull/29076#issuecomment-657269623 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
SparkQA removed a comment on pull request #29076: URL: https://github.com/apache/spark/pull/29076#issuecomment-657246675 **[Test build #125721 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125721/testReport)** for PR 29076 at commit

[GitHub] [spark] SparkQA commented on pull request #29076: [SPARK-32245][INFRA][FOLLOWUP] Reenable Github Actions on commit

2020-07-12 Thread GitBox
SparkQA commented on pull request #29076: URL: https://github.com/apache/spark/pull/29076#issuecomment-657269445 **[Test build #125721 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125721/testReport)** for PR 29076 at commit

[GitHub] [spark] HeartSaVioR commented on pull request #29069: [SPARK-31831][SQL][TESTS] Use subclasses for mock in HiveSessionImplSuite

2020-07-12 Thread GitBox
HeartSaVioR commented on pull request #29069: URL: https://github.com/apache/spark/pull/29069#issuecomment-657268918 Thanks! Merged into master. This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] HeartSaVioR closed pull request #29069: [SPARK-31831][SQL][TESTS] Use subclasses for mock in HiveSessionImplSuite

2020-07-12 Thread GitBox
HeartSaVioR closed pull request #29069: URL: https://github.com/apache/spark/pull/29069 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due t

2020-07-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-657263420 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due to spark'

2020-07-12 Thread GitBox
AmplabJenkins commented on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-657263420 This is an automated message from the Apache Git Service. To respond to the message, please log on to

<    1   2   3   4   5   >