[jira] [Created] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread L. C. Hsieh (Jira)
L. C. Hsieh created SPARK-37371: --- Summary: UnionExec should support columnar if all children support columnar Key: SPARK-37371 URL: https://issues.apache.org/jira/browse/SPARK-37371 Project: Spark

[jira] [Assigned] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37371: Assignee: Apache Spark > UnionExec should support columnar if all children support column

[jira] [Commented] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445752#comment-17445752 ] Apache Spark commented on SPARK-37371: -- User 'viirya' has created a pull request fo

[jira] [Assigned] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37371: Assignee: (was: Apache Spark) > UnionExec should support columnar if all children sup

[jira] [Assigned] (SPARK-37277) Support DayTimeIntervalType in Arrow

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-37277: Assignee: Hyukjin Kwon > Support DayTimeIntervalType in Arrow > -

[jira] [Resolved] (SPARK-37277) Support DayTimeIntervalType in Arrow

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-37277. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34631 [https://gi

[jira] [Resolved] (SPARK-37275) Support ANSI intervals in PySpark

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-37275. -- Assignee: Hyukjin Kwon Resolution: Done > Support ANSI intervals in PySpark > --

[jira] [Assigned] (SPARK-37155) Inline type hints for python/pyspark/statcounter.py

2021-11-18 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz reassigned SPARK-37155: -- Assignee: Byron Hsu > Inline type hints for python/pyspark/statcounter.py > -

[jira] [Resolved] (SPARK-37155) Inline type hints for python/pyspark/statcounter.py

2021-11-18 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz resolved SPARK-37155. Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34435

[jira] [Created] (SPARK-37372) Remove redundant Pod label editition

2021-11-18 Thread Yikun Jiang (Jira)
Yikun Jiang created SPARK-37372: --- Summary: Remove redundant Pod label editition Key: SPARK-37372 URL: https://issues.apache.org/jira/browse/SPARK-37372 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-37357) Create skew partition specs should respect min partition size

2021-11-18 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Summary: Create skew partition specs should respect min partition size (was: Add merged last partitio

[jira] [Updated] (SPARK-37357) Create skew partition specs should respect min partition size

2021-11-18 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: For example `Rebalance` provide a functionality that split the large reduce partition in

[jira] [Assigned] (SPARK-37372) Remove redundant Pod label editition

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37372: Assignee: Apache Spark > Remove redundant Pod label editition > -

[jira] [Commented] (SPARK-37372) Remove redundant Pod label editition

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445909#comment-17445909 ] Apache Spark commented on SPARK-37372: -- User 'Yikun' has created a pull request for

[jira] [Assigned] (SPARK-37372) Remove redundant Pod label editition

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37372: Assignee: (was: Apache Spark) > Remove redundant Pod label editition > --

[jira] [Commented] (SPARK-37372) Remove redundant Pod label editition

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445911#comment-17445911 ] Apache Spark commented on SPARK-37372: -- User 'Yikun' has created a pull request for

[jira] [Updated] (SPARK-36180) Support TimestampNTZ type in Hive

2021-11-18 Thread jiaan.geng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiaan.geng updated SPARK-36180: --- Summary: Support TimestampNTZ type in Hive (was: HMS can not recognize timestamp_ntz) > Support Ti

[jira] [Commented] (SPARK-36180) Support TimestampNTZ type in Hive

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445925#comment-17445925 ] Apache Spark commented on SPARK-36180: -- User 'beliefer' has created a pull request

[jira] [Commented] (SPARK-37282) Add ExtendedLevelDBTest and disable LevelDB tests on Apple Silicon

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445935#comment-17445935 ] Apache Spark commented on SPARK-37282: -- User 'LuciferYang' has created a pull reque

[jira] [Commented] (SPARK-37189) pyspark.pandas histogram accepts the range option but does not use it

2021-11-18 Thread pralabhkumar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445960#comment-17445960 ] pralabhkumar commented on SPARK-37189: -- IMHO , the issue is in pyspark.pandas.plot

[jira] (SPARK-37189) pyspark.pandas histogram accepts the range option but does not use it

2021-11-18 Thread pralabhkumar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37189 ] pralabhkumar deleted comment on SPARK-37189: -- was (Author: pralabhkumar): > pyspark.pandas histogram accepts the range option but does not use it >

[jira] [Comment Edited] (SPARK-37189) pyspark.pandas histogram accepts the range option but does not use it

2021-11-18 Thread pralabhkumar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445960#comment-17445960 ] pralabhkumar edited comment on SPARK-37189 at 11/18/21, 2:52 PM: -

[jira] [Commented] (SPARK-37188) pyspark.pandas histogram accepts the title option but does not add a title to the plot

2021-11-18 Thread pralabhkumar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445978#comment-17445978 ] pralabhkumar commented on SPARK-37188: -- IMHO , the issue is in pyspark.pandas.plot

[jira] [Comment Edited] (SPARK-37188) pyspark.pandas histogram accepts the title option but does not add a title to the plot

2021-11-18 Thread pralabhkumar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445978#comment-17445978 ] pralabhkumar edited comment on SPARK-37188 at 11/18/21, 2:56 PM: -

[jira] [Commented] (SPARK-37348) PySpark pmod function

2021-11-18 Thread Tim Schwab (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445988#comment-17445988 ] Tim Schwab commented on SPARK-37348: Fair enough. The reasoning to add a function as

[jira] [Commented] (SPARK-35672) Spark fails to launch executors with very large user classpath lists on YARN

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446044#comment-17446044 ] Apache Spark commented on SPARK-35672: -- User 'sarutak' has created a pull request f

[jira] [Created] (SPARK-37373) Collect LocalSparkContext worker logs in case of test failure

2021-11-18 Thread Attila Zsolt Piros (Jira)
Attila Zsolt Piros created SPARK-37373: -- Summary: Collect LocalSparkContext worker logs in case of test failure Key: SPARK-37373 URL: https://issues.apache.org/jira/browse/SPARK-37373 Project: Sp

[jira] [Commented] (SPARK-36664) Log time spent waiting for cluster resources

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446068#comment-17446068 ] Apache Spark commented on SPARK-36664: -- User 'holdenk' has created a pull request f

[jira] [Commented] (SPARK-36664) Log time spent waiting for cluster resources

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446069#comment-17446069 ] Apache Spark commented on SPARK-36664: -- User 'holdenk' has created a pull request f

[jira] [Assigned] (SPARK-36664) Log time spent waiting for cluster resources

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36664: Assignee: (was: Apache Spark) > Log time spent waiting for cluster resources > --

[jira] [Assigned] (SPARK-36664) Log time spent waiting for cluster resources

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36664: Assignee: Apache Spark > Log time spent waiting for cluster resources > -

[jira] [Assigned] (SPARK-37373) Collect LocalSparkContext worker logs in case of test failure

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37373: Assignee: Apache Spark (was: Attila Zsolt Piros) > Collect LocalSparkContext worker logs

[jira] [Assigned] (SPARK-37373) Collect LocalSparkContext worker logs in case of test failure

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37373: Assignee: Attila Zsolt Piros (was: Apache Spark) > Collect LocalSparkContext worker logs

[jira] [Commented] (SPARK-37373) Collect LocalSparkContext worker logs in case of test failure

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446090#comment-17446090 ] Apache Spark commented on SPARK-37373: -- User 'attilapiros' has created a pull reque

[jira] [Updated] (SPARK-37373) Collect LocalSparkContext worker logs in case of test failure

2021-11-18 Thread Attila Zsolt Piros (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-37373: --- Affects Version/s: 3.3.0 (was: 3.2.0) > Collect LocalSpar

[jira] [Commented] (SPARK-37224) Optimize write path on RocksDB state store provider

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446142#comment-17446142 ] Apache Spark commented on SPARK-37224: -- User 'HeartSaVioR' has created a pull reque

[jira] [Resolved] (SPARK-37356) Add fine grained locking to BlockInfoManager

2021-11-18 Thread Herman van Hövell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell resolved SPARK-37356. --- Fix Version/s: 3.3.0 Resolution: Fixed > Add fine grained locking to BlockInf

[jira] [Created] (SPARK-37374) StatCounter should use mergeStats when merging with self.

2021-11-18 Thread Takuya Ueshin (Jira)
Takuya Ueshin created SPARK-37374: - Summary: StatCounter should use mergeStats when merging with self. Key: SPARK-37374 URL: https://issues.apache.org/jira/browse/SPARK-37374 Project: Spark I

[jira] [Assigned] (SPARK-37374) StatCounter should use mergeStats when merging with self.

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37374: Assignee: Apache Spark > StatCounter should use mergeStats when merging with self. >

[jira] [Commented] (SPARK-37374) StatCounter should use mergeStats when merging with self.

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446171#comment-17446171 ] Apache Spark commented on SPARK-37374: -- User 'ueshin' has created a pull request fo

[jira] [Assigned] (SPARK-37374) StatCounter should use mergeStats when merging with self.

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37374: Assignee: (was: Apache Spark) > StatCounter should use mergeStats when merging with s

[jira] [Resolved] (SPARK-37166) SPIP: Storage Partitioned Join

2021-11-18 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun resolved SPARK-37166. -- Fix Version/s: 3.3.0 Assignee: Chao Sun Resolution: Fixed > SPIP: Storage Partitioned

[jira] [Created] (SPARK-37375) Umbrella: Storage Partitioned Join

2021-11-18 Thread Chao Sun (Jira)
Chao Sun created SPARK-37375: Summary: Umbrella: Storage Partitioned Join Key: SPARK-37375 URL: https://issues.apache.org/jira/browse/SPARK-37375 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-37166) SPIP: Storage Partitioned Join

2021-11-18 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun updated SPARK-37166: - Parent: SPARK-37375 Issue Type: Sub-task (was: New Feature) > SPIP: Storage Partitioned Join >

[jira] [Created] (SPARK-37376) Introduce a new DataSource V2 interface HasPartitionKey

2021-11-18 Thread Chao Sun (Jira)
Chao Sun created SPARK-37376: Summary: Introduce a new DataSource V2 interface HasPartitionKey Key: SPARK-37376 URL: https://issues.apache.org/jira/browse/SPARK-37376 Project: Spark Issue Type:

[jira] [Created] (SPARK-37377) Refactor V2 Partitioning interface and remove deprecated usage of Distribution

2021-11-18 Thread Chao Sun (Jira)
Chao Sun created SPARK-37377: Summary: Refactor V2 Partitioning interface and remove deprecated usage of Distribution Key: SPARK-37377 URL: https://issues.apache.org/jira/browse/SPARK-37377 Project: Spark

[jira] [Created] (SPARK-37378) Convert V2 Transform expressions into catalyst expressions and load their associated functions from V2 FunctionCatalog

2021-11-18 Thread Chao Sun (Jira)
Chao Sun created SPARK-37378: Summary: Convert V2 Transform expressions into catalyst expressions and load their associated functions from V2 FunctionCatalog Key: SPARK-37378 URL: https://issues.apache.org/jira/browse

[jira] [Created] (SPARK-37379) Add tree pattern pruning to CTESubstitution rule

2021-11-18 Thread Josh Rosen (Jira)
Josh Rosen created SPARK-37379: -- Summary: Add tree pattern pruning to CTESubstitution rule Key: SPARK-37379 URL: https://issues.apache.org/jira/browse/SPARK-37379 Project: Spark Issue Type: Sub-

[jira] [Created] (SPARK-37380) Miscellaneous Python lint infra cleanup

2021-11-18 Thread Nicholas Chammas (Jira)
Nicholas Chammas created SPARK-37380: Summary: Miscellaneous Python lint infra cleanup Key: SPARK-37380 URL: https://issues.apache.org/jira/browse/SPARK-37380 Project: Spark Issue Type: I

[jira] [Assigned] (SPARK-37380) Miscellaneous Python lint infra cleanup

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37380: Assignee: (was: Apache Spark) > Miscellaneous Python lint infra cleanup > ---

[jira] [Assigned] (SPARK-37380) Miscellaneous Python lint infra cleanup

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37380: Assignee: Apache Spark > Miscellaneous Python lint infra cleanup > --

[jira] [Commented] (SPARK-37380) Miscellaneous Python lint infra cleanup

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446218#comment-17446218 ] Apache Spark commented on SPARK-37380: -- User 'nchammas' has created a pull request

[jira] [Commented] (SPARK-37380) Miscellaneous Python lint infra cleanup

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446219#comment-17446219 ] Apache Spark commented on SPARK-37380: -- User 'nchammas' has created a pull request

[jira] [Commented] (SPARK-37188) pyspark.pandas histogram accepts the title option but does not add a title to the plot

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446220#comment-17446220 ] Hyukjin Kwon commented on SPARK-37188: -- Yeah, [~pralabhkumar]. I think we should fi

[jira] [Commented] (SPARK-37376) Introduce a new DataSource V2 interface HasPartitionKey

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446222#comment-17446222 ] Apache Spark commented on SPARK-37376: -- User 'sunchao' has created a pull request f

[jira] [Assigned] (SPARK-37376) Introduce a new DataSource V2 interface HasPartitionKey

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37376: Assignee: Apache Spark > Introduce a new DataSource V2 interface HasPartitionKey > -

[jira] [Assigned] (SPARK-37376) Introduce a new DataSource V2 interface HasPartitionKey

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37376: Assignee: (was: Apache Spark) > Introduce a new DataSource V2 interface HasPartitionK

[jira] [Commented] (SPARK-37376) Introduce a new DataSource V2 interface HasPartitionKey

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446223#comment-17446223 ] Apache Spark commented on SPARK-37376: -- User 'sunchao' has created a pull request f

[jira] [Created] (SPARK-37381) Unify v1 and v2 SHOW CREATE TABLE tests

2021-11-18 Thread PengLei (Jira)
PengLei created SPARK-37381: --- Summary: Unify v1 and v2 SHOW CREATE TABLE tests Key: SPARK-37381 URL: https://issues.apache.org/jira/browse/SPARK-37381 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-37379) Add tree pattern pruning to CTESubstitution rule

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446232#comment-17446232 ] Apache Spark commented on SPARK-37379: -- User 'JoshRosen' has created a pull request

[jira] [Commented] (SPARK-37379) Add tree pattern pruning to CTESubstitution rule

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446234#comment-17446234 ] Apache Spark commented on SPARK-37379: -- User 'JoshRosen' has created a pull request

[jira] [Assigned] (SPARK-37336) Migrate _java2py to SparkSession

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-37336: Assignee: Nicholas Chammas > Migrate _java2py to SparkSession > -

[jira] [Resolved] (SPARK-37336) Migrate _java2py to SparkSession

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-37336. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34606 [https://gi

[jira] [Assigned] (SPARK-37270) Incorect result of filter using isNull condition

2021-11-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-37270: --- Assignee: Yuming Wang > Incorect result of filter using isNull condition >

[jira] [Resolved] (SPARK-37270) Incorect result of filter using isNull condition

2021-11-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-37270. - Fix Version/s: 3.2.1 Resolution: Fixed Issue resolved by pull request 34627 [https://gith

[jira] [Assigned] (SPARK-37374) StatCounter should use mergeStats when merging with self.

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-37374: Assignee: Takuya Ueshin > StatCounter should use mergeStats when merging with self. > ---

[jira] [Resolved] (SPARK-37374) StatCounter should use mergeStats when merging with self.

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-37374. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34653 [https://gi

[jira] [Commented] (SPARK-34863) Support nested column in Spark Parquet vectorized readers

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446248#comment-17446248 ] Apache Spark commented on SPARK-34863: -- User 'sunchao' has created a pull request f

[jira] [Created] (SPARK-37382) `with as` clause got inconsistent results

2021-11-18 Thread caican (Jira)
caican created SPARK-37382: -- Summary: `with as` clause got inconsistent results Key: SPARK-37382 URL: https://issues.apache.org/jira/browse/SPARK-37382 Project: Spark Issue Type: Bug Compo

[jira] [Assigned] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh reassigned SPARK-37371: --- Assignee: L. C. Hsieh > UnionExec should support columnar if all children support columnar

[jira] [Resolved] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-37371. - Resolution: Resolved > UnionExec should support columnar if all children support columnar >

[jira] [Commented] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446259#comment-17446259 ] L. C. Hsieh commented on SPARK-37371: - The issue was resolved at https://github.com/

[jira] [Updated] (SPARK-37371) UnionExec should support columnar if all children support columnar

2021-11-18 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh updated SPARK-37371: Fix Version/s: 3.3.0 > UnionExec should support columnar if all children support columnar > --

[jira] [Updated] (SPARK-37382) `with as` clause got inconsistent results

2021-11-18 Thread caican (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] caican updated SPARK-37382: --- Description: In Spark3.1, the `with as` clause in the same SQL is executed multiple times,  got different r

[jira] [Commented] (SPARK-37350) EventLoggingListener keep logging errors after hdfs restart all datanodes

2021-11-18 Thread Shefron Yudy (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446266#comment-17446266 ] Shefron Yudy commented on SPARK-37350: -- [~hyukjin.kwon] The issue still persists in

[jira] [Commented] (SPARK-37350) EventLoggingListener keep logging errors after hdfs restart all datanodes

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446271#comment-17446271 ] Hyukjin Kwon commented on SPARK-37350: -- which version of 3.x did you try? > EventL

[jira] [Commented] (SPARK-37038) Sample push down in DS v2

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446274#comment-17446274 ] Apache Spark commented on SPARK-37038: -- User 'huaxingao' has created a pull request

[jira] [Created] (SPARK-37383) Prints the parsing time for each phase of a SQL

2021-11-18 Thread caican (Jira)
caican created SPARK-37383: -- Summary: Prints the parsing time for each phase of a SQL Key: SPARK-37383 URL: https://issues.apache.org/jira/browse/SPARK-37383 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-37370) Add SQL configs to control newly added join code-gen in 3.3

2021-11-18 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-37370. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34643 [https://gith

[jira] [Created] (SPARK-37384) Flay test: HealthTrackerIntegrationSuite.If preferred node is bad, without excludeOnFailure job will fail

2021-11-18 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-37384: Summary: Flay test: HealthTrackerIntegrationSuite.If preferred node is bad, without excludeOnFailure job will fail Key: SPARK-37384 URL: https://issues.apache.org/jira/browse/SPAR

[jira] [Commented] (SPARK-37384) Flay test: HealthTrackerIntegrationSuite.If preferred node is bad, without excludeOnFailure job will fail

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446279#comment-17446279 ] Apache Spark commented on SPARK-37384: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-37384) Flay test: HealthTrackerIntegrationSuite.If preferred node is bad, without excludeOnFailure job will fail

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37384: Assignee: (was: Apache Spark) > Flay test: HealthTrackerIntegrationSuite.If preferred

[jira] [Assigned] (SPARK-37384) Flay test: HealthTrackerIntegrationSuite.If preferred node is bad, without excludeOnFailure job will fail

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37384: Assignee: Apache Spark > Flay test: HealthTrackerIntegrationSuite.If preferred node is ba

[jira] [Assigned] (SPARK-37370) Add SQL configs to control newly added join code-gen in 3.3

2021-11-18 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-37370: --- Assignee: Cheng Su > Add SQL configs to control newly added join code-gen in 3.3 >

[jira] [Created] (SPARK-37385) Add tests for TimestampNTZ and TimestampLTZ for Parquet data source

2021-11-18 Thread Ivan Sadikov (Jira)
Ivan Sadikov created SPARK-37385: Summary: Add tests for TimestampNTZ and TimestampLTZ for Parquet data source Key: SPARK-37385 URL: https://issues.apache.org/jira/browse/SPARK-37385 Project: Spark

[jira] [Created] (SPARK-37386) simplify OptimizeSkewedJoin to not run the cost evaluator

2021-11-18 Thread Wenchen Fan (Jira)
Wenchen Fan created SPARK-37386: --- Summary: simplify OptimizeSkewedJoin to not run the cost evaluator Key: SPARK-37386 URL: https://issues.apache.org/jira/browse/SPARK-37386 Project: Spark Issue

[jira] [Assigned] (SPARK-37386) simplify OptimizeSkewedJoin to not run the cost evaluator

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37386: Assignee: (was: Apache Spark) > simplify OptimizeSkewedJoin to not run the cost evalu

[jira] [Assigned] (SPARK-37386) simplify OptimizeSkewedJoin to not run the cost evaluator

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37386: Assignee: Apache Spark > simplify OptimizeSkewedJoin to not run the cost evaluator >

[jira] [Updated] (SPARK-37383) Print the parsing time for each phase of a SQL

2021-11-18 Thread caican (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] caican updated SPARK-37383: --- Summary: Print the parsing time for each phase of a SQL (was: Prints the parsing time for each phase of a S

[jira] [Updated] (SPARK-37383) Print the parsing time for each phase of a SQL

2021-11-18 Thread caican (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] caican updated SPARK-37383: --- Affects Version/s: 2.4.0 (was: 3.2.0) > Print the parsing time for each phase of

[jira] [Commented] (SPARK-37385) Add tests for TimestampNTZ and TimestampLTZ for Parquet data source

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446286#comment-17446286 ] Apache Spark commented on SPARK-37385: -- User 'sadikovi' has created a pull request

[jira] [Assigned] (SPARK-37385) Add tests for TimestampNTZ and TimestampLTZ for Parquet data source

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37385: Assignee: Apache Spark > Add tests for TimestampNTZ and TimestampLTZ for Parquet data sou

[jira] [Assigned] (SPARK-37385) Add tests for TimestampNTZ and TimestampLTZ for Parquet data source

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37385: Assignee: (was: Apache Spark) > Add tests for TimestampNTZ and TimestampLTZ for Parqu

[jira] [Assigned] (SPARK-37384) Flaky test: HealthTrackerIntegrationSuite.If preferred node is bad, without excludeOnFailure job will fail

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-37384: Assignee: Hyukjin Kwon > Flaky test: HealthTrackerIntegrationSuite.If preferred node is ba

[jira] [Resolved] (SPARK-37384) Flaky test: HealthTrackerIntegrationSuite.If preferred node is bad, without excludeOnFailure job will fail

2021-11-18 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-37384. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34661 [https://gi

[jira] [Commented] (SPARK-37188) pyspark.pandas histogram accepts the title option but does not add a title to the plot

2021-11-18 Thread pralabhkumar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446305#comment-17446305 ] pralabhkumar commented on SPARK-37188: -- [~hyukjin.kwon] Working on it. Thx > py

[jira] [Commented] (SPARK-35672) Spark fails to launch executors with very large user classpath lists on YARN

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446323#comment-17446323 ] Apache Spark commented on SPARK-35672: -- User 'sarutak' has created a pull request f

[jira] [Commented] (SPARK-37383) Print the parsing time for each phase of a SQL

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446327#comment-17446327 ] Apache Spark commented on SPARK-37383: -- User 'caican00' has created a pull request

[jira] [Assigned] (SPARK-37383) Print the parsing time for each phase of a SQL

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37383: Assignee: Apache Spark > Print the parsing time for each phase of a SQL > ---

[jira] [Assigned] (SPARK-37383) Print the parsing time for each phase of a SQL

2021-11-18 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37383: Assignee: (was: Apache Spark) > Print the parsing time for each phase of a SQL >