[jira] [Created] (SPARK-36134) jackson-databind RCE vulnerability

2021-07-13 Thread Sumit (Jira)
Sumit created SPARK-36134: - Summary: jackson-databind RCE vulnerability Key: SPARK-36134 URL: https://issues.apache.org/jira/browse/SPARK-36134 Project: Spark Issue Type: Task Components:

[jira] [Commented] (SPARK-32915) RPC implementation to support pushing and merging shuffle blocks

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380365#comment-17380365 ] Apache Spark commented on SPARK-32915: -- User 'Victsm' has created a pull request fo

[jira] [Commented] (SPARK-32915) RPC implementation to support pushing and merging shuffle blocks

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380364#comment-17380364 ] Apache Spark commented on SPARK-32915: -- User 'Victsm' has created a pull request fo

[jira] [Assigned] (SPARK-36133) The catalog name keep consistent with the namespace naming rule

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36133: Assignee: Apache Spark > The catalog name keep consistent with the namespace naming rule

[jira] [Commented] (SPARK-36133) The catalog name keep consistent with the namespace naming rule

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380353#comment-17380353 ] Apache Spark commented on SPARK-36133: -- User 'Peng-Lei' has created a pull request

[jira] [Assigned] (SPARK-36133) The catalog name keep consistent with the namespace naming rule

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36133: Assignee: (was: Apache Spark) > The catalog name keep consistent with the namespace n

[jira] [Commented] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380350#comment-17380350 ] Apache Spark commented on SPARK-36129: -- User 'sarutak' has created a pull request f

[jira] [Commented] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380349#comment-17380349 ] Apache Spark commented on SPARK-36129: -- User 'sarutak' has created a pull request f

[jira] [Commented] (SPARK-35334) Spark should be more resilient to intermittent K8s flakiness

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380347#comment-17380347 ] Dongjoon Hyun commented on SPARK-35334: --- Hi, [~attilapiros]. This is reported as a

[jira] [Assigned] (SPARK-36132) Support initial state for flatMapGroupsWithState in batch mode

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36132: Assignee: (was: Apache Spark) > Support initial state for flatMapGroupsWithState in b

[jira] [Commented] (SPARK-36132) Support initial state for flatMapGroupsWithState in batch mode

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380346#comment-17380346 ] Apache Spark commented on SPARK-36132: -- User 'rahulsmahadev' has created a pull req

[jira] [Assigned] (SPARK-36132) Support initial state for flatMapGroupsWithState in batch mode

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36132: Assignee: Apache Spark > Support initial state for flatMapGroupsWithState in batch mode >

[jira] [Commented] (SPARK-36132) Support initial state for flatMapGroupsWithState in batch mode

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380345#comment-17380345 ] Apache Spark commented on SPARK-36132: -- User 'rahulsmahadev' has created a pull req

[jira] [Created] (SPARK-36133) The catalog name keep consistent with the namespace naming rule

2021-07-13 Thread PengLei (Jira)
PengLei created SPARK-36133: --- Summary: The catalog name keep consistent with the namespace naming rule Key: SPARK-36133 URL: https://issues.apache.org/jira/browse/SPARK-36133 Project: Spark Issue

[jira] [Created] (SPARK-36132) Support initial state for flatMapGroupsWithState in batch mode

2021-07-13 Thread Rahul Shivu Mahadev (Jira)
Rahul Shivu Mahadev created SPARK-36132: --- Summary: Support initial state for flatMapGroupsWithState in batch mode Key: SPARK-36132 URL: https://issues.apache.org/jira/browse/SPARK-36132 Project:

[jira] [Assigned] (SPARK-35640) Refactor Parquet vectorized reader to remove duplicated code paths

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-35640: - Assignee: Chao Sun > Refactor Parquet vectorized reader to remove duplicated code paths

[jira] [Assigned] (SPARK-35743) Improve Parquet vectorized reader

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-35743: - Assignee: Chao Sun > Improve Parquet vectorized reader > --

[jira] [Assigned] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-36123: - Assignee: Chao Sun > Parquet vectorized reader doesn't skip null values correctly > ---

[jira] [Resolved] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-36129. --- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 3 [https://

[jira] [Resolved] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-36131. --- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 4 [https://

[jira] [Assigned] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-36131: - Assignee: Chao Sun > Refactor ParquetColumnIndexSuite > ---

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380334#comment-17380334 ] Apache Spark commented on SPARK-36130: -- User 'cfmcgrady' has created a pull request

[jira] [Assigned] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36130: Assignee: Apache Spark > UnwrapCastInBinaryComparison fail when in.list contain CheckOver

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380333#comment-17380333 ] Apache Spark commented on SPARK-36130: -- User 'cfmcgrady' has created a pull request

[jira] [Assigned] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36130: Assignee: (was: Apache Spark) > UnwrapCastInBinaryComparison fail when in.list contai

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380326#comment-17380326 ] Chao Sun commented on SPARK-36128: -- Thanks, I'm slightly inclined to reuse the existing

[jira] [Updated] (SPARK-36034) Incorrect datetime filter when reading Parquet files written in legacy mode

2021-07-13 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-36034: Priority: Blocker (was: Major) > Incorrect datetime filter when reading Parquet files written in legacy m

[jira] [Updated] (SPARK-36034) Incorrect datetime filter when reading Parquet files written in legacy mode

2021-07-13 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-36034: Target Version/s: 3.2.0 > Incorrect datetime filter when reading Parquet files written in legacy mode > --

[jira] [Assigned] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36131: Assignee: (was: Apache Spark) > Refactor ParquetColumnIndexSuite > --

[jira] [Assigned] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36131: Assignee: Apache Spark > Refactor ParquetColumnIndexSuite > -

[jira] [Commented] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380315#comment-17380315 ] Apache Spark commented on SPARK-36131: -- User 'sunchao' has created a pull request f

[jira] [Commented] (SPARK-35743) Improve Parquet vectorized reader

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380313#comment-17380313 ] Apache Spark commented on SPARK-35743: -- User 'sunchao' has created a pull request f

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380307#comment-17380307 ] Hyukjin Kwon commented on SPARK-36128: -- That's okay. I was just thinking that we mi

[jira] [Created] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Chao Sun (Jira)
Chao Sun created SPARK-36131: Summary: Refactor ParquetColumnIndexSuite Key: SPARK-36131 URL: https://issues.apache.org/jira/browse/SPARK-36131 Project: Spark Issue Type: Sub-task Compo

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380299#comment-17380299 ] Chao Sun commented on SPARK-36128: -- [~hyukjin.kwon] you are right - I didn't know this

[jira] [Comment Edited] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380299#comment-17380299 ] Chao Sun edited comment on SPARK-36128 at 7/14/21, 4:24 AM:

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Fu Chen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380291#comment-17380291 ] Fu Chen commented on SPARK-36130: - Hi, [~hyukjin.kwon], Copy from [https://github.com/a

[jira] [Commented] (SPARK-36121) Write data loss caused by stage retry when enable v2 FileOutputCommitter

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380290#comment-17380290 ] Hyukjin Kwon commented on SPARK-36121: -- Can you see if this is fixed in Spark 3.1?

[jira] [Commented] (SPARK-36121) Write data loss caused by stage retry when enable v2 FileOutputCommitter

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380289#comment-17380289 ] Hyukjin Kwon commented on SPARK-36121: -- did you enable speculation? > Write data l

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380287#comment-17380287 ] Hyukjin Kwon commented on SPARK-36128: -- hm, isn't {{spark.sql.hive.metastorePartiti

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380282#comment-17380282 ] Hyukjin Kwon commented on SPARK-36130: -- cc [~sunchao] FYI > UnwrapCastInBinaryComp

[jira] [Commented] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380273#comment-17380273 ] Apache Spark commented on SPARK-36129: -- User 'sarutak' has created a pull request f

[jira] [Assigned] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36129: Assignee: Kousuke Saruta (was: Apache Spark) > Upgrade commons-compress to 1.21 to deal

[jira] [Assigned] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36129: Assignee: Apache Spark (was: Kousuke Saruta) > Upgrade commons-compress to 1.21 to deal

[jira] [Commented] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380272#comment-17380272 ] Apache Spark commented on SPARK-36129: -- User 'sarutak' has created a pull request f

[jira] [Created] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Fu Chen (Jira)
Fu Chen created SPARK-36130: --- Summary: UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression Key: SPARK-36130 URL: https://issues.apache.org/jira/browse/SPARK-36130 Project: Spark

[jira] [Created] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Kousuke Saruta (Jira)
Kousuke Saruta created SPARK-36129: -- Summary: Upgrade commons-compress to 1.21 to deal with CVEs Key: SPARK-36129 URL: https://issues.apache.org/jira/browse/SPARK-36129 Project: Spark Issue

[jira] [Created] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
Chao Sun created SPARK-36128: Summary: CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning Key: SPARK-36128 URL: https://issues.apache.org/jira/browse/SPARK-36128 Pro

[jira] [Assigned] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36125: Assignee: Apache Spark > Implement non-equality comparison operators between two Categori

[jira] [Assigned] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36125: Assignee: (was: Apache Spark) > Implement non-equality comparison operators between t

[jira] [Commented] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380222#comment-17380222 ] Apache Spark commented on SPARK-36125: -- User 'xinrong-databricks' has created a pul

[jira] [Reopened] (SPARK-36127) Adjust non-equality comparison operators to accept scalar

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reopened SPARK-36127: -- > Adjust non-equality comparison operators to accept scalar >

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators of Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Summary: Implement non-equality comparison operators of Categoricals (was: Implement non-equali

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Summary: Implement non-equality comparison operators between two Categoricals (was: Implement n

[jira] [Resolved] (SPARK-36127) Adjust non-equality comparison operators to accept scalar

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-36127. -- Resolution: Duplicate > Adjust non-equality comparison operators to accept scalar > --

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Description: Implement non-equality comparison operators between two Categoricals (was: Impleme

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Summary: Implement non-equality comparison operators between two Categoricals (was: Implement n

[jira] [Created] (SPARK-36127) Adjust non-equality comparison operators to accept scalar

2021-07-13 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-36127: Summary: Adjust non-equality comparison operators to accept scalar Key: SPARK-36127 URL: https://issues.apache.org/jira/browse/SPARK-36127 Project: Spark Iss

[jira] [Created] (SPARK-36126) Adjust equality comparison operators of Categorical to follow pandas

2021-07-13 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-36126: Summary: Adjust equality comparison operators of Categorical to follow pandas Key: SPARK-36126 URL: https://issues.apache.org/jira/browse/SPARK-36126 Project: Spark

[jira] [Created] (SPARK-36125) Implement non-equality comparison operators between two categories

2021-07-13 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-36125: Summary: Implement non-equality comparison operators between two categories Key: SPARK-36125 URL: https://issues.apache.org/jira/browse/SPARK-36125 Project: Spark

[jira] [Commented] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380214#comment-17380214 ] Apache Spark commented on SPARK-36123: -- User 'sunchao' has created a pull request f

[jira] [Assigned] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36123: Assignee: Apache Spark > Parquet vectorized reader doesn't skip null values correctly > -

[jira] [Commented] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380213#comment-17380213 ] Apache Spark commented on SPARK-36123: -- User 'sunchao' has created a pull request f

[jira] [Assigned] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36123: Assignee: (was: Apache Spark) > Parquet vectorized reader doesn't skip null values co

[jira] [Updated] (SPARK-36124) Support set operators to be on correlation paths

2021-07-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-36124: - Description: A correlation path is defined as the sub-tree of all the operators that are on the

[jira] [Commented] (SPARK-35917) Disable push-based shuffle until the feature is complete

2021-07-13 Thread shubhangi priya (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380189#comment-17380189 ] shubhangi priya commented on SPARK-35917: - How user 'otterc' creates a pull requ

[jira] [Assigned] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-28266: Assignee: Apache Spark > data duplication when `path` serde property is present > ---

[jira] [Assigned] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-28266: Assignee: (was: Apache Spark) > data duplication when `path` serde property is presen

[jira] [Updated] (SPARK-36124) Support set operators to be on correlation paths

2021-07-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-36124: - Summary: Support set operators to be on correlation paths (was: Support set operators to be on

[jira] [Updated] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-36109: -- Fix Version/s: 3.0.4 3.1.3 > Fix flaky KafkaSourceStressSuite > ---

[jira] [Updated] (SPARK-35553) Improve correlated subqueries

2021-07-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-35553: - Summary: Improve correlated subqueries (was: Improve correlated subquery) > Improve correlated

[jira] [Comment Edited] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Shardul Mahadik (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380170#comment-17380170 ] Shardul Mahadik edited comment on SPARK-28266 at 7/13/21, 9:27 PM: ---

[jira] [Commented] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Shardul Mahadik (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380170#comment-17380170 ] Shardul Mahadik commented on SPARK-28266: - I would like to propose another angle

[jira] [Reopened] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Krogen reopened SPARK-28266: - Re-opening this issue based on [~shardulm]'s example above demonstrating that this is indeed a real

[jira] [Updated] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun updated SPARK-36123: - Labels: correctness (was: ) > Parquet vectorized reader doesn't skip null values correctly > --

[jira] [Created] (SPARK-36124) Support set operators to be on a correlation path

2021-07-13 Thread Allison Wang (Jira)
Allison Wang created SPARK-36124: Summary: Support set operators to be on a correlation path Key: SPARK-36124 URL: https://issues.apache.org/jira/browse/SPARK-36124 Project: Spark Issue Type:

[jira] [Created] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Chao Sun (Jira)
Chao Sun created SPARK-36123: Summary: Parquet vectorized reader doesn't skip null values correctly Key: SPARK-36123 URL: https://issues.apache.org/jira/browse/SPARK-36123 Project: Spark Issue T

[jira] [Commented] (SPARK-35917) Disable push-based shuffle until the feature is complete

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380099#comment-17380099 ] Apache Spark commented on SPARK-35917: -- User 'otterc' has created a pull request fo

[jira] [Commented] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380093#comment-17380093 ] Apache Spark commented on SPARK-28266: -- User 'shardulm94' has created a pull reques

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380083#comment-17380083 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request fo

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380081#comment-17380081 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request fo

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380079#comment-17380079 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request fo

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380080#comment-17380080 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request fo

[jira] [Commented] (SPARK-36065) date_trunc returns incorrect output

2021-07-13 Thread Sumeet (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17380050#comment-17380050 ] Sumeet commented on SPARK-36065: cc [~maxgekk] > date_trunc returns incorrect output >

[jira] [Updated] (SPARK-36065) date_trunc returns incorrect output

2021-07-13 Thread Sumeet (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sumeet updated SPARK-36065: --- Affects Version/s: 3.2.0 > date_trunc returns incorrect output > --- > >

[jira] [Updated] (SPARK-36108) Add error classes to QueryParsingErrors

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36108: --- Description: Add error classes to [QueryParsingErrors|https://github.com/apache/spark/blob/master/s

[jira] [Updated] (SPARK-36107) Add error classes to QueryExecutionErrors

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36107: --- Description: Add error classes to [QueryExecutionErrors|https://github.com/apache/spark/blob/master

[jira] [Updated] (SPARK-36106) Add error classes to QueryCompilationErrors

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36106: --- Description: Add error classes to [QueryCompilationErrors|https://github.com/apache/spark/blob/mast

[jira] [Updated] (SPARK-36094) Group SQL component error messages in Spark error class JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Description: To improve auditing, reduce duplication, and improve quality of error messages thrown

[jira] [Updated] (SPARK-36094) Group SQL component error messages in Spark error class JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Summary: Group SQL component error messages in Spark error class JSON file (was: Group error messag

[jira] [Updated] (SPARK-36094) Group error messages in JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Description: To improve auditing, reduce duplication, and improve quality of error messages thrown

[jira] [Updated] (SPARK-36094) Group error messages in JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Description: To improve auditing, reduce duplication, and improve quality of error messages thrown

[jira] [Resolved] (SPARK-34891) Introduce state store manager for session window in streaming query

2021-07-13 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-34891. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31989 [https://gith

[jira] [Assigned] (SPARK-34891) Introduce state store manager for session window in streaming query

2021-07-13 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh reassigned SPARK-34891: --- Assignee: Jungtaek Lim > Introduce state store manager for session window in streaming quer

[jira] [Updated] (SPARK-35739) [Spark Sql] Add Java-comptable Dataset.join overloads

2021-07-13 Thread Brandon Dahler (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brandon Dahler updated SPARK-35739: --- Description: h2. Problem When using Spark SQL with Java, the required syntax to utilize the

[jira] [Commented] (SPARK-35957) Cannot convert Avro schema to catalyst type because schema at path is not compatible

2021-07-13 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17379978#comment-17379978 ] Erik Krogen commented on SPARK-35957: - [~jkdll] would it be possible for you to try

[jira] [Commented] (SPARK-36076) [SQL] ArrayIndexOutOfBounds in CAST string to timestamp

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17379954#comment-17379954 ] Apache Spark commented on SPARK-36076: -- User 'dgd-contributor' has created a pull r

[jira] [Created] (SPARK-36122) Spark does not passon needClientAuth to Jetty SSLContextFactory. Does not allow to configure mTLS authentication.

2021-07-13 Thread Seetharama Khandrika (Jira)
Seetharama Khandrika created SPARK-36122: Summary: Spark does not passon needClientAuth to Jetty SSLContextFactory. Does not allow to configure mTLS authentication. Key: SPARK-36122 URL: https://issues.apa

[jira] [Resolved] (SPARK-36120) Support TimestampNTZ type in cache table

2021-07-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-36120. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33322 [https://github.com
