[jira] [Created] (SPARK-38895) Unify the AQE shuffle read canonicalized

2022-04-13 Thread XiDuo You (Jira)
XiDuo You created SPARK-38895: - Summary: Unify the AQE shuffle read canonicalized Key: SPARK-38895 URL: https://issues.apache.org/jira/browse/SPARK-38895 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-38725) Test the error class: DUPLICATE_KEY

2022-04-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-38725. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36188 [https://github.com

[jira] [Assigned] (SPARK-38725) Test the error class: DUPLICATE_KEY

2022-04-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-38725: Assignee: panbingkun > Test the error class: DUPLICATE_KEY > ---

[jira] [Commented] (SPARK-38724) Test the error class: DIVIDE_BY_ZERO

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522090#comment-17522090 ] Apache Spark commented on SPARK-38724: -- User 'panbingkun' has created a pull reques

[jira] [Assigned] (SPARK-38724) Test the error class: DIVIDE_BY_ZERO

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38724: Assignee: Apache Spark > Test the error class: DIVIDE_BY_ZERO > -

[jira] [Assigned] (SPARK-38724) Test the error class: DIVIDE_BY_ZERO

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38724: Assignee: (was: Apache Spark) > Test the error class: DIVIDE_BY_ZERO > --

[jira] [Commented] (SPARK-38724) Test the error class: DIVIDE_BY_ZERO

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522089#comment-17522089 ] Apache Spark commented on SPARK-38724: -- User 'panbingkun' has created a pull reques

[jira] [Assigned] (SPARK-38550) Use a disk-based store to save more information in live UI to help debug

2022-04-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-38550: --- Assignee: Linhong Liu > Use a disk-based store to save more information in live UI to help

[jira] [Resolved] (SPARK-38550) Use a disk-based store to save more information in live UI to help debug

2022-04-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-38550. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 35856 [https://gith

[jira] [Comment Edited] (SPARK-38888) Add `RocksDBProvider` similar to `LevelDBProvider`

2022-04-13 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522070#comment-17522070 ] Yang Jie edited comment on SPARK-38888 at 4/14/22 5:44 AM: --- Ok

[jira] [Commented] (SPARK-38888) Add `RocksDBProvider` similar to `LevelDBProvider`

2022-04-13 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522070#comment-17522070 ] Yang Jie commented on SPARK-38888: -- Ok ~ This may involve pre writing refactoring. I wi

[jira] [Commented] (SPARK-38721) Test the error class: CANNOT_PARSE_DECIMAL

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522066#comment-17522066 ] Apache Spark commented on SPARK-38721: -- User 'panbingkun' has created a pull reques

[jira] [Commented] (SPARK-38721) Test the error class: CANNOT_PARSE_DECIMAL

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522065#comment-17522065 ] Apache Spark commented on SPARK-38721: -- User 'panbingkun' has created a pull reques

[jira] [Commented] (SPARK-38894) Exclude pyspark.cloudpickle in test coverage report

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522063#comment-17522063 ] Apache Spark commented on SPARK-38894: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-38894) Exclude pyspark.cloudpickle in test coverage report

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38894: Assignee: (was: Apache Spark) > Exclude pyspark.cloudpickle in test coverage report >

[jira] [Commented] (SPARK-38894) Exclude pyspark.cloudpickle in test coverage report

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522062#comment-17522062 ] Apache Spark commented on SPARK-38894: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-38894) Exclude pyspark.cloudpickle in test coverage report

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38894: Assignee: Apache Spark > Exclude pyspark.cloudpickle in test coverage report > --

[jira] [Created] (SPARK-38894) Exclude pyspark.cloudpickle in test coverage report

2022-04-13 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-38894: Summary: Exclude pyspark.cloudpickle in test coverage report Key: SPARK-38894 URL: https://issues.apache.org/jira/browse/SPARK-38894 Project: Spark Issue Typ

[jira] [Updated] (SPARK-38894) Exclude pyspark.cloudpickle in test coverage report

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-38894: - Priority: Minor (was: Major) > Exclude pyspark.cloudpickle in test coverage report > --

[jira] [Assigned] (SPARK-38893) Test SourceProgress in PySpark

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38893: Assignee: (was: Apache Spark) > Test SourceProgress in PySpark >

[jira] [Commented] (SPARK-38893) Test SourceProgress in PySpark

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522055#comment-17522055 ] Apache Spark commented on SPARK-38893: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-38893) Test SourceProgress in PySpark

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38893: Assignee: Apache Spark > Test SourceProgress in PySpark > --

[jira] [Updated] (SPARK-38893) Test SourceProgress in PySpark

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-38893: - Issue Type: Test (was: Bug) > Test SourceProgress in PySpark > -- >

[jira] [Created] (SPARK-38893) Test SourceProgress in PySpark

2022-04-13 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-38893: Summary: Test SourceProgress in PySpark Key: SPARK-38893 URL: https://issues.apache.org/jira/browse/SPARK-38893 Project: Spark Issue Type: Bug Comp

[jira] [Assigned] (SPARK-38889) Invalid column name while querying bit type column in MSSQL

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38889: Assignee: Allison Wang > Invalid column name while querying bit type column in MSSQL > --

[jira] [Resolved] (SPARK-38889) Invalid column name while querying bit type column in MSSQL

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38889. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 36182 [https://gi

[jira] [Commented] (SPARK-38892) Fix the UT of schema equal assert

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522027#comment-17522027 ] Apache Spark commented on SPARK-38892: -- User 'fhygh' has created a pull request for

[jira] [Assigned] (SPARK-38892) Fix the UT of schema equal assert

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38892: Assignee: Apache Spark > Fix the UT of schema equal assert >

[jira] [Commented] (SPARK-38892) Fix the UT of schema equal assert

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522026#comment-17522026 ] Apache Spark commented on SPARK-38892: -- User 'fhygh' has created a pull request for

[jira] [Assigned] (SPARK-38892) Fix the UT of schema equal assert

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38892: Assignee: (was: Apache Spark) > Fix the UT of schema equal assert > -

[jira] [Created] (SPARK-38892) Fix the UT of schema equal assert

2022-04-13 Thread YuanGuanhu (Jira)
YuanGuanhu created SPARK-38892: -- Summary: Fix the UT of schema equal assert Key: SPARK-38892 URL: https://issues.apache.org/jira/browse/SPARK-38892 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-38725) Test the error class: DUPLICATE_KEY

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522008#comment-17522008 ] Apache Spark commented on SPARK-38725: -- User 'panbingkun' has created a pull reques

[jira] [Commented] (SPARK-38725) Test the error class: DUPLICATE_KEY

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522007#comment-17522007 ] Apache Spark commented on SPARK-38725: -- User 'panbingkun' has created a pull reques

[jira] [Commented] (SPARK-38884) java.util.NoSuchElementException: key not found: numPartitions

2022-04-13 Thread chopperChen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17522000#comment-17522000 ] chopperChen commented on SPARK-38884: - [~hyukjin.kwon] what`s self-contained reprodu

[jira] [Commented] (SPARK-36604) timestamp type column analyze result is wrong

2022-04-13 Thread YuanGuanhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521999#comment-17521999 ] YuanGuanhu commented on SPARK-36604: [~senthh] what's the session time zone? i test

[jira] [Assigned] (SPARK-38857) series name should be preserved in series.mode()

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38857: Assignee: Yikun Jiang > series name should be preserved in series.mode() > --

[jira] [Resolved] (SPARK-38857) series name should be preserved in series.mode()

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38857. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36159 [https://gi

[jira] [Commented] (SPARK-37643) when charVarcharAsString is true, char datatype partition table query incorrect

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521994#comment-17521994 ] Apache Spark commented on SPARK-37643: -- User 'fhygh' has created a pull request for

[jira] [Commented] (SPARK-37643) when charVarcharAsString is true, char datatype partition table query incorrect

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521993#comment-17521993 ] Apache Spark commented on SPARK-37643: -- User 'fhygh' has created a pull request for

[jira] [Commented] (SPARK-38884) java.util.NoSuchElementException: key not found: numPartitions

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521989#comment-17521989 ] Hyukjin Kwon commented on SPARK-38884: -- [~chopperChen] do you have a self-contained

[jira] [Assigned] (SPARK-38890) Implement `ignore_index` of `DataFrame.sort_index`.

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38890: Assignee: Xinrong Meng > Implement `ignore_index` of `DataFrame.sort_index`. > --

[jira] [Resolved] (SPARK-38797) Runtime Filter support pushdown through window

2022-04-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-38797. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36080 [https://gith

[jira] [Resolved] (SPARK-38890) Implement `ignore_index` of `DataFrame.sort_index`.

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38890. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36184 [https://gi

[jira] [Assigned] (SPARK-38797) Runtime Filter support pushdown through window

2022-04-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-38797: --- Assignee: Yuming Wang > Runtime Filter support pushdown through window > --

[jira] [Resolved] (SPARK-37014) Inline type hints for python/pyspark/streaming/context.py

2022-04-13 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz resolved SPARK-37014. Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34293

[jira] [Assigned] (SPARK-37014) Inline type hints for python/pyspark/streaming/context.py

2022-04-13 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz reassigned SPARK-37014: -- Assignee: dch nguyen > Inline type hints for python/pyspark/streaming/context

[jira] [Commented] (SPARK-36664) Log time spent waiting for cluster resources

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521930#comment-17521930 ] Apache Spark commented on SPARK-36664: -- User 'holdenk' has created a pull request f

[jira] [Commented] (SPARK-38812) when i clean data ,I hope one rdd spill two rdd according clean data rule

2022-04-13 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521923#comment-17521923 ] Erik Krogen commented on SPARK-38812: - You may want to check the discussion on SPARK

[jira] [Assigned] (SPARK-38890) Implement `ignore_index` of `DataFrame.sort_index`.

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38890: Assignee: (was: Apache Spark) > Implement `ignore_index` of `DataFrame.sort_index`. >

[jira] [Assigned] (SPARK-38890) Implement `ignore_index` of `DataFrame.sort_index`.

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38890: Assignee: Apache Spark > Implement `ignore_index` of `DataFrame.sort_index`. > --

[jira] [Commented] (SPARK-38890) Implement `ignore_index` of `DataFrame.sort_index`.

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521913#comment-17521913 ] Apache Spark commented on SPARK-38890: -- User 'xinrong-databricks' has created a pul

[jira] [Created] (SPARK-38891) Skipping allocating vector for repetition & definition levels when possible

2022-04-13 Thread Chao Sun (Jira)
Chao Sun created SPARK-38891: Summary: Skipping allocating vector for repetition & definition levels when possible Key: SPARK-38891 URL: https://issues.apache.org/jira/browse/SPARK-38891 Project: Spark

[jira] [Created] (SPARK-38890) Implement `ignore_index` of `DataFrame.sort_index`.

2022-04-13 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-38890: Summary: Implement `ignore_index` of `DataFrame.sort_index`. Key: SPARK-38890 URL: https://issues.apache.org/jira/browse/SPARK-38890 Project: Spark Issue Typ

[jira] [Commented] (SPARK-38823) Incorrect result of dataset reduceGroups in java

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521909#comment-17521909 ] Apache Spark commented on SPARK-38823: -- User 'bersprockets' has created a pull requ

[jira] [Assigned] (SPARK-38823) Incorrect result of dataset reduceGroups in java

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38823: Assignee: (was: Apache Spark) > Incorrect result of dataset reduceGroups in java > --

[jira] [Assigned] (SPARK-38823) Incorrect result of dataset reduceGroups in java

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38823: Assignee: Apache Spark > Incorrect result of dataset reduceGroups in java > -

[jira] [Commented] (SPARK-38823) Incorrect result of dataset reduceGroups in java

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521907#comment-17521907 ] Apache Spark commented on SPARK-38823: -- User 'bersprockets' has created a pull requ

[jira] [Commented] (SPARK-38823) Incorrect result of dataset reduceGroups in java

2022-04-13 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521894#comment-17521894 ] Bruce Robbins commented on SPARK-38823: --- By the way, here is some code that demos

[jira] [Resolved] (SPARK-38835) Refactor FsHistoryProviderSuite to test rocks db

2022-04-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-38835. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36119 [https://

[jira] [Assigned] (SPARK-38835) Refactor FsHistoryProviderSuite to test rocks db

2022-04-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-38835: - Assignee: Yang Jie > Refactor FsHistoryProviderSuite to test rocks db > ---

[jira] [Assigned] (SPARK-38889) Invalid column name while querying bit type column in MSSQL

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38889: Assignee: Apache Spark > Invalid column name while querying bit type column in MSSQL > --

[jira] [Assigned] (SPARK-38889) Invalid column name while querying bit type column in MSSQL

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38889: Assignee: (was: Apache Spark) > Invalid column name while querying bit type column in

[jira] [Commented] (SPARK-38889) Invalid column name while querying bit type column in MSSQL

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521874#comment-17521874 ] Apache Spark commented on SPARK-38889: -- User 'allisonwang-db' has created a pull re

[jira] [Updated] (SPARK-38889) Invalid column name while querying bit type column in MSSQL

2022-04-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-38889: - Description: After https://issues.apache.org/jira/browse/SPARK-36644 boolean column filters can

[jira] [Created] (SPARK-38889) Invalid column name while querying bit type column in MSSQL

2022-04-13 Thread Allison Wang (Jira)
Allison Wang created SPARK-38889: Summary: Invalid column name while querying bit type column in MSSQL Key: SPARK-38889 URL: https://issues.apache.org/jira/browse/SPARK-38889 Project: Spark

[jira] [Assigned] (SPARK-34659) Web UI does not correctly get appId

2022-04-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-34659: - Assignee: Gengliang Wang > Web UI does not correctly get appId > --

[jira] [Resolved] (SPARK-34659) Web UI does not correctly get appId

2022-04-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-34659. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36176 [https://

[jira] [Commented] (SPARK-38792) Regression in time executor takes to do work sometime after v3.0.1 ?

2022-04-13 Thread Danny Guinther (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521852#comment-17521852 ] Danny Guinther commented on SPARK-38792: I'm getting the impression that the pro

[jira] [Comment Edited] (SPARK-38852) Better Data Source V2 operator pushdown framework

2022-04-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521815#comment-17521815 ] Max Gekk edited comment on SPARK-38852 at 4/13/22 4:22 PM: --- SP

[jira] [Commented] (SPARK-38852) Better Data Source V2 operator pushdown framework

2022-04-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521815#comment-17521815 ] Max Gekk commented on SPARK-38852: -- SPARK-38788 was created specifically for the releas

[jira] [Updated] (SPARK-38852) Better Data Source V2 operator pushdown framework

2022-04-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk updated SPARK-38852: - Epic Link: SPARK-38788 > Better Data Source V2 operator pushdown framework > ---

[jira] [Commented] (SPARK-38788) More comprehensive DSV2 push down capabilities

2022-04-13 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521807#comment-17521807 ] Erik Krogen commented on SPARK-38788: - Yeah, I got that, but isn't SPARK-38852 tryin

[jira] [Commented] (SPARK-38888) Add `RocksDBProvider` similar to `LevelDBProvider`

2022-04-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521756#comment-17521756 ] Dongjoon Hyun commented on SPARK-38888: --- If you need, please proceed it. Technical

[jira] [Assigned] (SPARK-38745) Move the tests for `NON_PARTITION_COLUMN` to QueryCompilationErrorsSuite

2022-04-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-38745: Assignee: Max Gekk > Move the tests for `NON_PARTITION_COLUMN` to QueryCompilationErrorsSuite > -

[jira] [Resolved] (SPARK-38745) Move the tests for `NON_PARTITION_COLUMN` to QueryCompilationErrorsSuite

2022-04-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-38745. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36175 [https://github.com

[jira] [Commented] (SPARK-38888) Add `RocksDBProvider` similar to `LevelDBProvider`

2022-04-13 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521701#comment-17521701 ] Yang Jie commented on SPARK-38888: -- cc [~dongjoon] should we do this? > Add `RocksD

[jira] [Created] (SPARK-38888) Add `RocksDBProvider` similar to `LevelDBProvider`

2022-04-13 Thread Yang Jie (Jira)
Yang Jie created SPARK-38888: Summary: Add `RocksDBProvider` similar to `LevelDBProvider` Key: SPARK-38888 URL: https://issues.apache.org/jira/browse/SPARK-38888 Project: Spark Issue Type: Sub-ta

[jira] [Assigned] (SPARK-38887) Support switch inner join side for sort merge join

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38887: Assignee: Apache Spark > Support switch inner join side for sort merge join > ---

[jira] [Assigned] (SPARK-38887) Support switch inner join side for sort merge join

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38887: Assignee: (was: Apache Spark) > Support switch inner join side for sort merge join >

[jira] [Commented] (SPARK-38887) Support switch inner join side for sort merge join

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521660#comment-17521660 ] Apache Spark commented on SPARK-38887: -- User 'ulysses-you' has created a pull reque

[jira] [Updated] (SPARK-38887) Support switch inner join side for sort merge join

2022-04-13 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38887: -- Summary: Support switch inner join side for sort merge join (was: Support swtich inner join side for

[jira] [Created] (SPARK-38887) Support swtich inner join side for sort merge join

2022-04-13 Thread XiDuo You (Jira)
XiDuo You created SPARK-38887: - Summary: Support swtich inner join side for sort merge join Key: SPARK-38887 URL: https://issues.apache.org/jira/browse/SPARK-38887 Project: Spark Issue Type: Impr

[jira] [Resolved] (SPARK-38844) impl Series.interpolate and DataFrame.interpolate

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38844. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36127 [https://gi

[jira] [Assigned] (SPARK-38844) impl Series.interpolate and DataFrame.interpolate

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38844: Assignee: zhengruifeng > impl Series.interpolate and DataFrame.interpolate >

[jira] [Resolved] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-38832. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36117 [https://gith

[jira] [Commented] (SPARK-38867) Avoid OOM when bufferedPlan has a lot of duplicate keys in SortMergeJoin codegen

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521567#comment-17521567 ] Apache Spark commented on SPARK-38867: -- User 'mcdull-zhang' has created a pull requ

[jira] [Assigned] (SPARK-38867) Avoid OOM when bufferedPlan has a lot of duplicate keys in SortMergeJoin codegen

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38867: Assignee: Apache Spark > Avoid OOM when bufferedPlan has a lot of duplicate keys in SortM

[jira] [Assigned] (SPARK-38867) Avoid OOM when bufferedPlan has a lot of duplicate keys in SortMergeJoin codegen

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38867: Assignee: (was: Apache Spark) > Avoid OOM when bufferedPlan has a lot of duplicate ke

[jira] [Assigned] (SPARK-38774) impl Series.autocorr

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38774: Assignee: zhengruifeng > impl Series.autocorr > > >

[jira] [Resolved] (SPARK-38774) impl Series.autocorr

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38774. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36048 [https://gi

[jira] [Resolved] (SPARK-38829) New configuration for controlling timestamp inference of Parquet

2022-04-13 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-38829. Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 36137 [https:

[jira] [Commented] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521535#comment-17521535 ] Apache Spark commented on SPARK-38886: -- User 'ulysses-you' has created a pull reque

[jira] [Assigned] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38886: Assignee: (was: Apache Spark) > Remove outer join if aggregate functions are duplicat

[jira] [Commented] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521536#comment-17521536 ] Apache Spark commented on SPARK-38886: -- User 'ulysses-you' has created a pull reque

[jira] [Assigned] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38886: Assignee: Apache Spark > Remove outer join if aggregate functions are duplicate agnostic

[jira] [Updated] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side

2022-04-13 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38886: -- Description: If aggregate child is outer join, and the aggregate references are all coming from the s

[jira] [Assigned] (SPARK-38833) PySpark applyInPandas should allow to return empty DataFrame without columns

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38833: Assignee: Enrico Minack > PySpark applyInPandas should allow to return empty DataFrame wi

[jira] [Resolved] (SPARK-38833) PySpark applyInPandas should allow to return empty DataFrame without columns

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38833. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 36120 [https://gi

[jira] [Resolved] (SPARK-38883) smaller pyspark install if not using streaming?

2022-04-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38883. -- Resolution: Invalid Let's interact with Spark mailing list for questions. > smaller pyspark i

[jira] [Commented] (SPARK-34659) Web UI does not correctly get appId

2022-04-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521520#comment-17521520 ] Apache Spark commented on SPARK-34659: -- User 'gengliangwang' has created a pull req
