[jira] [Created] (SPARK-41046) Support CreateView in Connect DSL

2022-11-07 Thread Rui Wang (Jira)
Rui Wang created SPARK-41046: Summary: Support CreateView in Connect DSL Key: SPARK-41046 URL: https://issues.apache.org/jira/browse/SPARK-41046 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-41045) Pre-compute to eliminate ScalaReflection calls after deserializer is created

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630224#comment-17630224 ] Apache Spark commented on SPARK-41045: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41045) Pre-compute to eliminate ScalaReflection calls after deserializer is created

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41045: Assignee: Shixiong Zhu (was: Apache Spark) > Pre-compute to eliminate ScalaReflection

[jira] [Assigned] (SPARK-41045) Pre-compute to eliminate ScalaReflection calls after deserializer is created

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41045: Assignee: Apache Spark (was: Shixiong Zhu) > Pre-compute to eliminate ScalaReflection

[jira] [Commented] (SPARK-41045) Pre-compute to eliminate ScalaReflection calls after deserializer is created

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630223#comment-17630223 ] Apache Spark commented on SPARK-41045: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41045) Pre-compute to eliminate ScalaReflection calls after deserializer is created

2022-11-07 Thread Shixiong Zhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-41045: Assignee: Shixiong Zhu > Pre-compute to eliminate ScalaReflection calls after

[jira] [Created] (SPARK-41045) Pre-compute to eliminate ScalaReflection calls after deserializer is created

2022-11-07 Thread Shixiong Zhu (Jira)
Shixiong Zhu created SPARK-41045: Summary: Pre-compute to eliminate ScalaReflection calls after deserializer is created Key: SPARK-41045 URL: https://issues.apache.org/jira/browse/SPARK-41045

[jira] [Commented] (SPARK-41044) Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630215#comment-17630215 ] Apache Spark commented on SPARK-41044: -- User 'panbingkun' has created a pull request for this

[jira] [Assigned] (SPARK-41044) Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41044: Assignee: Apache Spark > Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR >

[jira] [Commented] (SPARK-41044) Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630214#comment-17630214 ] Apache Spark commented on SPARK-41044: -- User 'panbingkun' has created a pull request for this

[jira] [Assigned] (SPARK-41044) Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41044: Assignee: (was: Apache Spark) > Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to

[jira] [Updated] (SPARK-41017) Support column pruning with multiple nondeterministic Filters

2022-11-07 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-41017: Summary: Support column pruning with multiple nondeterministic Filters (was: Do not push Filter

[jira] [Created] (SPARK-41044) Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR

2022-11-07 Thread BingKun Pan (Jira)
BingKun Pan created SPARK-41044: --- Summary: Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR Key: SPARK-41044 URL: https://issues.apache.org/jira/browse/SPARK-41044 Project: Spark

[jira] [Commented] (SPARK-41042) Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630207#comment-17630207 ] Apache Spark commented on SPARK-41042: -- User 'itholic' has created a pull request for this issue:

[jira] [Commented] (SPARK-41042) Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630206#comment-17630206 ] Apache Spark commented on SPARK-41042: -- User 'itholic' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41042) Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41042: Assignee: Apache Spark > Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE >

[jira] [Assigned] (SPARK-41042) Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41042: Assignee: (was: Apache Spark) > Rename PARSE_CHAR_MISSING_LENGTH to

[jira] [Assigned] (SPARK-41040) Self-union streaming query may fail when using readStream.table

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41040: Assignee: Apache Spark (was: Shixiong Zhu) > Self-union streaming query may fail when

[jira] [Assigned] (SPARK-41040) Self-union streaming query may fail when using readStream.table

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41040: Assignee: Shixiong Zhu (was: Apache Spark) > Self-union streaming query may fail when

[jira] [Commented] (SPARK-41040) Self-union streaming query may fail when using readStream.table

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630205#comment-17630205 ] Apache Spark commented on SPARK-41040: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Commented] (SPARK-41040) Self-union streaming query may fail when using readStream.table

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630203#comment-17630203 ] Apache Spark commented on SPARK-41040: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Commented] (SPARK-41043) Assign a name to the legacy error class _LEGACY_ERROR_TEMP_2429

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630201#comment-17630201 ] Apache Spark commented on SPARK-41043: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41043) Assign a name to the legacy error class _LEGACY_ERROR_TEMP_2429

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41043: Assignee: Max Gekk (was: Apache Spark) > Assign a name to the legacy error class

[jira] [Commented] (SPARK-41043) Assign a name to the legacy error class _LEGACY_ERROR_TEMP_2429

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630200#comment-17630200 ] Apache Spark commented on SPARK-41043: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41043) Assign a name to the legacy error class _LEGACY_ERROR_TEMP_2429

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41043: Assignee: Apache Spark (was: Max Gekk) > Assign a name to the legacy error class

[jira] [Commented] (SPARK-40351) Spark Sum increases the precision of DecimalType arguments by 10

2022-11-07 Thread Tymofii (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630199#comment-17630199 ] Tymofii commented on SPARK-40351: - [~dwsmith1983] make sense. Thank you for pointing out to this doc >

[jira] [Created] (SPARK-41043) Assign a name to the legacy error class _LEGACY_ERROR_TEMP_2429

2022-11-07 Thread Max Gekk (Jira)
Max Gekk created SPARK-41043: Summary: Assign a name to the legacy error class _LEGACY_ERROR_TEMP_2429 Key: SPARK-41043 URL: https://issues.apache.org/jira/browse/SPARK-41043 Project: Spark

[jira] [Commented] (SPARK-41042) Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE

2022-11-07 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630197#comment-17630197 ] Haejoon Lee commented on SPARK-41042: - I'm working on this > Rename PARSE_CHAR_MISSING_LENGTH to

[jira] [Created] (SPARK-41042) Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE

2022-11-07 Thread Haejoon Lee (Jira)
Haejoon Lee created SPARK-41042: --- Summary: Rename PARSE_CHAR_MISSING_LENGTH to DATA_TYPE_MISSING_SIZE Key: SPARK-41042 URL: https://issues.apache.org/jira/browse/SPARK-41042 Project: Spark

[jira] [Commented] (SPARK-41041) Integrate _LEGACY_ERROR_TEMP_1279 into TABLE_OR_VIEW_ALREADY_EXISTS

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630192#comment-17630192 ] Apache Spark commented on SPARK-41041: -- User 'itholic' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41041) Integrate _LEGACY_ERROR_TEMP_1279 into TABLE_OR_VIEW_ALREADY_EXISTS

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41041: Assignee: (was: Apache Spark) > Integrate _LEGACY_ERROR_TEMP_1279 into

[jira] [Commented] (SPARK-41041) Integrate _LEGACY_ERROR_TEMP_1279 into TABLE_OR_VIEW_ALREADY_EXISTS

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630191#comment-17630191 ] Apache Spark commented on SPARK-41041: -- User 'itholic' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41041) Integrate _LEGACY_ERROR_TEMP_1279 into TABLE_OR_VIEW_ALREADY_EXISTS

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41041: Assignee: Apache Spark > Integrate _LEGACY_ERROR_TEMP_1279 into

[jira] [Updated] (SPARK-32082) Project Zen: Improving Python usability

2022-11-07 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-32082: Description: The importance of Python and PySpark has grown radically in the last few years. The number

[jira] [Updated] (SPARK-41041) Integrate _LEGACY_ERROR_TEMP_1279 into TABLE_OR_VIEW_ALREADY_EXISTS

2022-11-07 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haejoon Lee updated SPARK-41041: Summary: Integrate _LEGACY_ERROR_TEMP_1279 into TABLE_OR_VIEW_ALREADY_EXISTS (was: Integrate

[jira] [Created] (SPARK-41041) Integrate VIEW_ALREADY_EXISTS into TABLE_OR_VIEW_ALREADY_EXISTS

2022-11-07 Thread Haejoon Lee (Jira)
Haejoon Lee created SPARK-41041: --- Summary: Integrate VIEW_ALREADY_EXISTS into TABLE_OR_VIEW_ALREADY_EXISTS Key: SPARK-41041 URL: https://issues.apache.org/jira/browse/SPARK-41041 Project: Spark

[jira] [Commented] (SPARK-41041) Integrate VIEW_ALREADY_EXISTS into TABLE_OR_VIEW_ALREADY_EXISTS

2022-11-07 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630189#comment-17630189 ] Haejoon Lee commented on SPARK-41041: - I'm working on it > Integrate VIEW_ALREADY_EXISTS into

[jira] [Assigned] (SPARK-41040) Self-union streaming query may fail when using readStream.table

2022-11-07 Thread Shixiong Zhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-41040: Assignee: Shixiong Zhu > Self-union streaming query may fail when using readStream.table

[jira] [Created] (SPARK-41040) Self-union streaming query may fail when using readStream.table

2022-11-07 Thread Shixiong Zhu (Jira)
Shixiong Zhu created SPARK-41040: Summary: Self-union streaming query may fail when using readStream.table Key: SPARK-41040 URL: https://issues.apache.org/jira/browse/SPARK-41040 Project: Spark

[jira] [Commented] (SPARK-41038) Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS `

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630187#comment-17630187 ] Apache Spark commented on SPARK-41038: -- User 'itholic' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41038) Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS `

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41038: Assignee: Apache Spark > Rename `MULTI_VALUE_SUBQUERY_ERROR` to

[jira] [Assigned] (SPARK-41038) Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS `

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41038: Assignee: (was: Apache Spark) > Rename `MULTI_VALUE_SUBQUERY_ERROR` to

[jira] [Assigned] (SPARK-41039) Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41039: Assignee: (was: Apache Spark) > Upgrade `scala-parallel-collections` to 1.0.4 for

[jira] [Commented] (SPARK-41039) Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630186#comment-17630186 ] Apache Spark commented on SPARK-41039: -- User 'LuciferYang' has created a pull request for this

[jira] [Assigned] (SPARK-41039) Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41039: Assignee: Apache Spark > Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13 >

[jira] [Updated] (SPARK-41039) Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13

2022-11-07 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-41039: - Summary: Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13 (was: Upgrade

[jira] [Created] (SPARK-41039) Upgrade `scala-parallel-collections` to 1.0.4

2022-11-07 Thread Yang Jie (Jira)
Yang Jie created SPARK-41039: Summary: Upgrade `scala-parallel-collections` to 1.0.4 Key: SPARK-41039 URL: https://issues.apache.org/jira/browse/SPARK-41039 Project: Spark Issue Type:

[jira] [Commented] (SPARK-41026) Support Repartition in Connect DSL

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630177#comment-17630177 ] Apache Spark commented on SPARK-41026: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Commented] (SPARK-41026) Support Repartition in Connect DSL

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630178#comment-17630178 ] Apache Spark commented on SPARK-41026: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-40948) Introduce new error class: PATH_NOT_FOUND

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40948: Assignee: Apache Spark > Introduce new error class: PATH_NOT_FOUND >

[jira] [Assigned] (SPARK-40948) Introduce new error class: PATH_NOT_FOUND

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40948: Assignee: (was: Apache Spark) > Introduce new error class: PATH_NOT_FOUND >

[jira] [Updated] (SPARK-40948) Introduce new error class: PATH_NOT_FOUND

2022-11-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-40948: - Fix Version/s: (was: 3.4.0) > Introduce new error class: PATH_NOT_FOUND >

[jira] [Reopened] (SPARK-40948) Introduce new error class: PATH_NOT_FOUND

2022-11-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reopened SPARK-40948: -- Assignee: (was: Haejoon Lee) Reverted in

[jira] [Assigned] (SPARK-39883) Add DataFrame function parity check

2022-11-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-39883: Assignee: Andrew Ray > Add DataFrame function parity check >

[jira] [Resolved] (SPARK-39883) Add DataFrame function parity check

2022-11-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-39883. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 37303

[jira] [Created] (SPARK-41038) Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS `

2022-11-07 Thread Haejoon Lee (Jira)
Haejoon Lee created SPARK-41038: --- Summary: Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS ` Key: SPARK-41038 URL: https://issues.apache.org/jira/browse/SPARK-41038 Project: Spark

[jira] [Commented] (SPARK-41038) Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS `

2022-11-07 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630156#comment-17630156 ] Haejoon Lee commented on SPARK-41038: - I'm working on it > Rename `MULTI_VALUE_SUBQUERY_ERROR` to

[jira] [Commented] (SPARK-41037) Fix pandas_udf when return type is array of MapType working properly.

2022-11-07 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630153#comment-17630153 ] Haejoon Lee commented on SPARK-41037: - I'm working on it > Fix pandas_udf when return type is array

[jira] [Created] (SPARK-41037) Fix pandas_udf when return type is array of MapType working properly.

2022-11-07 Thread Haejoon Lee (Jira)
Haejoon Lee created SPARK-41037: --- Summary: Fix pandas_udf when return type is array of MapType working properly. Key: SPARK-41037 URL: https://issues.apache.org/jira/browse/SPARK-41037 Project: Spark

[jira] [Commented] (SPARK-40663) Migrate execution errors onto error classes

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630150#comment-17630150 ] Apache Spark commented on SPARK-40663: -- User 'itholic' has created a pull request for this issue:

[jira] [Commented] (SPARK-40663) Migrate execution errors onto error classes

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630149#comment-17630149 ] Apache Spark commented on SPARK-40663: -- User 'itholic' has created a pull request for this issue:

[jira] [Commented] (SPARK-40798) Alter partition should verify value

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630147#comment-17630147 ] Apache Spark commented on SPARK-40798: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-40798) Alter partition should verify value

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630146#comment-17630146 ] Apache Spark commented on SPARK-40798: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-41008) Isotonic regression result differs from sklearn implementation

2022-11-07 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630143#comment-17630143 ] Sean R. Owen commented on SPARK-41008: -- Yeah that doesn't look right. I tried understanding the

[jira] [Resolved] (SPARK-41032) Spark Web UI

2022-11-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-41032. -- Resolution: Invalid > Spark Web UI > - > > Key: SPARK-41032 >

[jira] [Commented] (SPARK-41032) Spark Web UI

2022-11-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630142#comment-17630142 ] Hyukjin Kwon commented on SPARK-41032: -- [~pgranat] for questions, let's interact with Spark

[jira] [Commented] (SPARK-41036) `columns` API should use `schema` API to avoid data fetching

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630139#comment-17630139 ] Apache Spark commented on SPARK-41036: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Commented] (SPARK-41036) `columns` API should use `schema` API to avoid data fetching

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630138#comment-17630138 ] Apache Spark commented on SPARK-41036: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41036) `columns` API should use `schema` API to avoid data fetching

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41036: Assignee: (was: Apache Spark) > `columns` API should use `schema` API to avoid data

[jira] [Assigned] (SPARK-41036) `columns` API should use `schema` API to avoid data fetching

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41036: Assignee: Apache Spark > `columns` API should use `schema` API to avoid data fetching >

[jira] [Created] (SPARK-41036) `columns` API should use `schema` API to avoid data fetching

2022-11-07 Thread Rui Wang (Jira)
Rui Wang created SPARK-41036: Summary: `columns` API should use `schema` API to avoid data fetching Key: SPARK-41036 URL: https://issues.apache.org/jira/browse/SPARK-41036 Project: Spark Issue

[jira] [Updated] (SPARK-41035) Incorrect results or NPE when a literal is reused across distinct aggregations

2022-11-07 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-41035: -- Description: This query produces incorrect results: {noformat} select a, count(distinct 100)

[jira] [Commented] (SPARK-40815) SymlinkTextInputFormat returns incorrect result due to enabled spark.hadoopRDD.ignoreEmptySplits

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630127#comment-17630127 ] Apache Spark commented on SPARK-40815: -- User 'sadikovi' has created a pull request for this issue:

[jira] [Commented] (SPARK-41035) Incorrect results or NPE when a literal is reused across distinct aggregations

2022-11-07 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630125#comment-17630125 ] Bruce Robbins commented on SPARK-41035: --- This is a bug in {{RewriteDistinctAggregates}}. I will

[jira] [Created] (SPARK-41035) Incorrect results or NPE when a literal is reused across distinct aggregations

2022-11-07 Thread Bruce Robbins (Jira)
Bruce Robbins created SPARK-41035: - Summary: Incorrect results or NPE when a literal is reused across distinct aggregations Key: SPARK-41035 URL: https://issues.apache.org/jira/browse/SPARK-41035

[jira] [Resolved] (SPARK-41031) Upgrade `org.tukaani:xz` to 1.9

2022-11-07 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-41031. -- Fix Version/s: 3.3.2 3.4.0 Resolution: Fixed Issue resolved by pull

[jira] [Assigned] (SPARK-41031) Upgrade `org.tukaani:xz` to 1.9

2022-11-07 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-41031: Assignee: Yang Jie > Upgrade `org.tukaani:xz` to 1.9 > --- >

[jira] [Updated] (SPARK-41031) Upgrade `org.tukaani:xz` to 1.9

2022-11-07 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen updated SPARK-41031: - Priority: Minor (was: Major) > Upgrade `org.tukaani:xz` to 1.9 >

[jira] [Assigned] (SPARK-41007) BigInteger Serialization doesn't work with JavaBean Encoder

2022-11-07 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-41007: Assignee: Daniel Fiterma > BigInteger Serialization doesn't work with JavaBean Encoder >

[jira] [Resolved] (SPARK-41007) BigInteger Serialization doesn't work with JavaBean Encoder

2022-11-07 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-41007. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38500

[jira] [Commented] (SPARK-40281) Memory Profiler on Executors

2022-11-07 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630120#comment-17630120 ] Xinrong Meng commented on SPARK-40281: -- Thanks [~alfiewdavidson] for the feedback! I am currently

[jira] [Resolved] (SPARK-41026) Support Repartition in Connect DSL

2022-11-07 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-41026. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38529

[jira] [Assigned] (SPARK-41026) Support Repartition in Connect DSL

2022-11-07 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-41026: --- Assignee: Rui Wang > Support Repartition in Connect DSL >

[jira] [Assigned] (SPARK-41002) Compatible `take`, `head` and `first` API in Python client

2022-11-07 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng reassigned SPARK-41002: - Assignee: Rui Wang > Compatible `take`, `head` and `first` API in Python client >

[jira] [Resolved] (SPARK-41002) Compatible `take`, `head` and `first` API in Python client

2022-11-07 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-41002. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38488

[jira] [Assigned] (SPARK-41030) Upgrade Apache Ivy to 2.5.1

2022-11-07 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-41030: - Assignee: Bjørn Jørgensen > Upgrade Apache Ivy to 2.5.1 > ---

[jira] [Resolved] (SPARK-41030) Upgrade Apache Ivy to 2.5.1

2022-11-07 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-41030. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38539

[jira] [Commented] (SPARK-40281) Memory Profiler on Executors

2022-11-07 Thread Alfred Davidson (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630080#comment-17630080 ] Alfred Davidson commented on SPARK-40281: - +1 in general this would be good regardless of

[jira] [Updated] (SPARK-41018) Koalas.idxmin() is not picking the minimum value from a dataframe, but pandas.idxmin() gives

2022-11-07 Thread Nikesh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nikesh updated SPARK-41018: --- Attachment: screenshot-1.png > Koalas.idxmin() is not picking the minimum value from a dataframe, but >

[jira] [Commented] (SPARK-38550) Use a disk-based store to save more information in live UI to help debug

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630062#comment-17630062 ] Apache Spark commented on SPARK-38550: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-41034) Connect DataFrame should require RemoteSparkSession

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630049#comment-17630049 ] Apache Spark commented on SPARK-41034: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-41034) Connect DataFrame should require RemoteSparkSession

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41034: Assignee: Apache Spark > Connect DataFrame should require RemoteSparkSession >

[jira] [Assigned] (SPARK-41034) Connect DataFrame should require RemoteSparkSession

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41034: Assignee: (was: Apache Spark) > Connect DataFrame should require RemoteSparkSession

[jira] [Commented] (SPARK-41034) Connect DataFrame should require RemoteSparkSession

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630047#comment-17630047 ] Apache Spark commented on SPARK-41034: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Created] (SPARK-41034) Connect DataFrame should require RemoteSparkSession

2022-11-07 Thread Rui Wang (Jira)
Rui Wang created SPARK-41034: Summary: Connect DataFrame should require RemoteSparkSession Key: SPARK-41034 URL: https://issues.apache.org/jira/browse/SPARK-41034 Project: Spark Issue Type:

[jira] [Updated] (SPARK-41030) Upgrade Apache Ivy to 2.5.1

2022-11-07 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-41030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-41030: Description: [CVE-2022-37865|https://www.cve.org/CVERecord?id=CVE-2022-37865] and

[jira] [Commented] (SPARK-40791) The semantics of `F` in `DateTimeFormatter` have changed

2022-11-07 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630010#comment-17630010 ] Dongjoon Hyun commented on SPARK-40791: --- Got it. Thank you for the confirmation, [~LuciferYang].

[jira] [Comment Edited] (SPARK-40351) Spark Sum increases the precision of DecimalType arguments by 10

2022-11-07 Thread Dustin Smith (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630001#comment-17630001 ] Dustin Smith edited comment on SPARK-40351 at 11/7/22 8:14 PM: ---

[jira] [Comment Edited] (SPARK-40351) Spark Sum increases the precision of DecimalType arguments by 10

2022-11-07 Thread Dustin Smith (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630001#comment-17630001 ] Dustin Smith edited comment on SPARK-40351 at 11/7/22 8:14 PM: ---

[jira] [Commented] (SPARK-41033) RemoteSparkSession should only accept one `user_id`

2022-11-07 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630008#comment-17630008 ] Apache Spark commented on SPARK-41033: -- User 'amaliujia' has created a pull request for this issue:

  1   2   >