[jira] [Commented] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527893#comment-17527893 ] Apache Spark commented on SPARK-39015: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527892#comment-17527892 ] Apache Spark commented on SPARK-39015: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39015: Assignee: Apache Spark > SparkRuntimeException when trying to get non-existent key in a

[jira] [Assigned] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39015: Assignee: (was: Apache Spark) > SparkRuntimeException when trying to get

[jira] [Updated] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-39015: - Component/s: SQL (was: Spark Core) > SparkRuntimeException when trying to

[jira] [Resolved] (SPARK-39014) Respect ignoreMissingFiles from Data Source options in InMemoryFileIndex

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-39014. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36348

[jira] [Assigned] (SPARK-39014) Respect ignoreMissingFiles from Data Source options in InMemoryFileIndex

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-39014: Assignee: Yaohua Cui > Respect ignoreMissingFiles from Data Source options in

[jira] [Resolved] (SPARK-38976) spark-sql. overwrite. hive table-duplicate records

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38976. -- Resolution: Invalid > spark-sql. overwrite. hive table-duplicate records >

[jira] [Commented] (SPARK-38976) spark-sql. overwrite. hive table-duplicate records

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527865#comment-17527865 ] Hyukjin Kwon commented on SPARK-38976: -- [~wesharn] I think it's best to interact in dev mailing

[jira] [Created] (SPARK-39017) Change Java8 datetime support to configurable

2022-04-25 Thread Weicheng Wang (Jira)
Weicheng Wang created SPARK-39017: - Summary: Change Java8 datetime support to configurable Key: SPARK-39017 URL: https://issues.apache.org/jira/browse/SPARK-39017 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-38820) Support Index can hold arbitrary ExtensionArrays

2022-04-25 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17525648#comment-17525648 ] Yikun Jiang edited comment on SPARK-38820 at 4/26/22 3:18 AM: --

[jira] [Assigned] (SPARK-38700) Use error classes in the execution errors of save mode

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38700: Assignee: (was: Apache Spark) > Use error classes in the execution errors of save

[jira] [Commented] (SPARK-38700) Use error classes in the execution errors of save mode

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527862#comment-17527862 ] Apache Spark commented on SPARK-38700: -- User 'panbingkun' has created a pull request for this

[jira] [Commented] (SPARK-38700) Use error classes in the execution errors of save mode

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527863#comment-17527863 ] Apache Spark commented on SPARK-38700: -- User 'panbingkun' has created a pull request for this

[jira] [Assigned] (SPARK-38700) Use error classes in the execution errors of save mode

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38700: Assignee: Apache Spark > Use error classes in the execution errors of save mode >

[jira] [Updated] (SPARK-39016) Fix compilation warnings related to "`enum` will become a keyword in Scala 3"

2022-04-25 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-39016: - Summary: Fix compilation warnings related to "`enum` will become a keyword in Scala 3" (was: Fix

[jira] [Created] (SPARK-39016) Fix compilation warnings related to "Wrap `enum` in backticks to use it as an identifier"

2022-04-25 Thread Yang Jie (Jira)
Yang Jie created SPARK-39016: Summary: Fix compilation warnings related to "Wrap `enum` in backticks to use it as an identifier" Key: SPARK-39016 URL: https://issues.apache.org/jira/browse/SPARK-39016

[jira] [Resolved] (SPARK-38989) Implement `ignore_index` of `DataFrame/Series.sample`

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38989. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36306

[jira] [Assigned] (SPARK-38989) Implement `ignore_index` of `DataFrame/Series.sample`

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38989: Assignee: Xinrong Meng > Implement `ignore_index` of `DataFrame/Series.sample` >

[jira] [Updated] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Raza Jafri (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raza Jafri updated SPARK-39015: --- Description: [~maxgekk] submitted a

[jira] [Commented] (SPARK-39014) Respect ignoreMissingFiles from Data Source options in InMemoryFileIndex

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527829#comment-17527829 ] Apache Spark commented on SPARK-39014: -- User 'Yaohua628' has created a pull request for this issue:

[jira] [Assigned] (SPARK-39014) Respect ignoreMissingFiles from Data Source options in InMemoryFileIndex

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39014: Assignee: (was: Apache Spark) > Respect ignoreMissingFiles from Data Source options

[jira] [Assigned] (SPARK-39014) Respect ignoreMissingFiles from Data Source options in InMemoryFileIndex

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39014: Assignee: Apache Spark > Respect ignoreMissingFiles from Data Source options in

[jira] [Updated] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Raza Jafri (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raza Jafri updated SPARK-39015: --- Description: [~maxgekk] submitted a

[jira] [Created] (SPARK-39015) SparkRuntimeException when trying to get non-existent key in a map

2022-04-25 Thread Raza Jafri (Jira)
Raza Jafri created SPARK-39015: -- Summary: SparkRuntimeException when trying to get non-existent key in a map Key: SPARK-39015 URL: https://issues.apache.org/jira/browse/SPARK-39015 Project: Spark

[jira] [Created] (SPARK-39014) Respect ignoreMissingFiles from Data Source options in InMemoryFileIndex

2022-04-25 Thread Yaohua Zhao (Jira)
Yaohua Zhao created SPARK-39014: --- Summary: Respect ignoreMissingFiles from Data Source options in InMemoryFileIndex Key: SPARK-39014 URL: https://issues.apache.org/jira/browse/SPARK-39014 Project:

[jira] [Commented] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527811#comment-17527811 ] Apache Spark commented on SPARK-39001: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527812#comment-17527812 ] Apache Spark commented on SPARK-39001: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-39008) Change ASF as a single author in Spark distribution

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-39008: Assignee: Hyukjin Kwon > Change ASF as a single author in Spark distribution >

[jira] [Resolved] (SPARK-39008) Change ASF as a single author in Spark distribution

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-39008. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 36337

[jira] [Commented] (SPARK-39013) Parser changes to enforce `()` for creating table without any columns

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527789#comment-17527789 ] Apache Spark commented on SPARK-39013: -- User 'jackierwzhang' has created a pull request for this

[jira] [Assigned] (SPARK-39013) Parser changes to enforce `()` for creating table without any columns

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39013: Assignee: (was: Apache Spark) > Parser changes to enforce `()` for creating table

[jira] [Commented] (SPARK-39013) Parser changes to enforce `()` for creating table without any columns

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527788#comment-17527788 ] Apache Spark commented on SPARK-39013: -- User 'jackierwzhang' has created a pull request for this

[jira] [Assigned] (SPARK-39013) Parser changes to enforce `()` for creating table without any columns

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39013: Assignee: Apache Spark > Parser changes to enforce `()` for creating table without any

[jira] [Updated] (SPARK-39013) Parser changes to enforce `()` for creating table without any columns

2022-04-25 Thread Jackie Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jackie Zhang updated SPARK-39013: - Summary: Parser changes to enforce `()` for creating table without any columns (was: Parse

[jira] [Created] (SPARK-39013) Parse changes to enforce `()` for creating table without any columns

2022-04-25 Thread Jackie Zhang (Jira)
Jackie Zhang created SPARK-39013: Summary: Parse changes to enforce `()` for creating table without any columns Key: SPARK-39013 URL: https://issues.apache.org/jira/browse/SPARK-39013 Project: Spark

[jira] [Updated] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Rui Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Wang updated SPARK-39012: - Description: When Spark needs to infer schema, it needs to parse string to a type. Not all data types

[jira] [Updated] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Rui Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Wang updated SPARK-39012: - Description: When Spark needs to infer schema, it needs to parse string to a type. Not all data types

[jira] [Commented] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527770#comment-17527770 ] Apache Spark commented on SPARK-39012: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39012: Assignee: Apache Spark > SparkSQL infer schema does not support all data types >

[jira] [Assigned] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39012: Assignee: (was: Apache Spark) > SparkSQL infer schema does not support all data

[jira] [Commented] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527768#comment-17527768 ] Apache Spark commented on SPARK-39012: -- User 'amaliujia' has created a pull request for this issue:

[jira] [Commented] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Rui Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527767#comment-17527767 ] Rui Wang commented on SPARK-39012: -- PR is ready to support binary type

[jira] [Created] (SPARK-39012) SparkSQL Infer schema path does not support all data types

2022-04-25 Thread Rui Wang (Jira)
Rui Wang created SPARK-39012: Summary: SparkSQL Infer schema path does not support all data types Key: SPARK-39012 URL: https://issues.apache.org/jira/browse/SPARK-39012 Project: Spark Issue

[jira] [Updated] (SPARK-39012) SparkSQL infer schema does not support all data types

2022-04-25 Thread Rui Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Wang updated SPARK-39012: - Summary: SparkSQL infer schema does not support all data types (was: SparkSQL Infer schema path does

[jira] [Commented] (SPARK-35739) [Spark Sql] Add Java-comptable Dataset.join overloads

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527751#comment-17527751 ] Apache Spark commented on SPARK-35739: -- User 'brandondahler' has created a pull request for this

[jira] [Updated] (SPARK-38954) Implement sharing of cloud credentials among driver and executors

2022-04-25 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-38954: -- Affects Version/s: 3.4.0 (was: 3.2.1) > Implement sharing of cloud

[jira] [Resolved] (SPARK-38742) Move the tests `MISSING_COLUMN` to QueryCompilationErrorsSuite

2022-04-25 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-38742. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36280

[jira] [Assigned] (SPARK-38742) Move the tests `MISSING_COLUMN` to QueryCompilationErrorsSuite

2022-04-25 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-38742: Assignee: panbingkun > Move the tests `MISSING_COLUMN` to QueryCompilationErrorsSuite >

[jira] [Updated] (SPARK-38939) Support ALTER TABLE ... DROP [IF EXISTS] COLUMN .. syntax

2022-04-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-38939: Fix Version/s: 3.3.0 (was: 3.4.0) > Support ALTER TABLE ... DROP [IF

[jira] [Resolved] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-39001. -- Fix Version/s: 3.3.0 3.4.0 Resolution: Fixed Issue resolved by pull request

[jira] [Assigned] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-39001: Assignee: Hyukjin Kwon > Document which options are unsupported in CSV and JSON functions >

[jira] [Updated] (SPARK-39007) Use double quotes for SQL configs in error messages

2022-04-25 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk updated SPARK-39007: - Fix Version/s: 3.3.0 > Use double quotes for SQL configs in error messages >

[jira] [Resolved] (SPARK-38939) Support ALTER TABLE ... DROP [IF EXISTS] COLUMN .. syntax

2022-04-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-38939. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36252

[jira] [Assigned] (SPARK-38939) Support ALTER TABLE ... DROP [IF EXISTS] COLUMN .. syntax

2022-04-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-38939: --- Assignee: Jackie Zhang > Support ALTER TABLE ... DROP [IF EXISTS] COLUMN .. syntax >

[jira] [Commented] (SPARK-25355) Support --proxy-user for Spark on K8s

2022-04-25 Thread jagadeesh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527648#comment-17527648 ] jagadeesh commented on SPARK-25355: --- [~pedro.rossi]  , we are running into problem with this feature

[jira] [Commented] (SPARK-38879) Improve the test coverage for pyspark/rddsampler.py

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527611#comment-17527611 ] Apache Spark commented on SPARK-38879: -- User 'pralabhkumar' has created a pull request for this

[jira] [Assigned] (SPARK-38879) Improve the test coverage for pyspark/rddsampler.py

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38879: Assignee: Apache Spark > Improve the test coverage for pyspark/rddsampler.py >

[jira] [Assigned] (SPARK-38879) Improve the test coverage for pyspark/rddsampler.py

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38879: Assignee: (was: Apache Spark) > Improve the test coverage for pyspark/rddsampler.py

[jira] [Updated] (SPARK-37696) Optimizer exceeds max iterations

2022-04-25 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-37696: - Affects Version/s: 3.2.1 > Optimizer exceeds max iterations >

[jira] [Commented] (SPARK-38868) `assert_true` fails unconditionnaly after `left_outer` joins

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527585#comment-17527585 ] Apache Spark commented on SPARK-38868: -- User 'bersprockets' has created a pull request for this

[jira] [Updated] (SPARK-39011) V2 Filter to ORC Predicate support

2022-04-25 Thread Huaxin Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Huaxin Gao updated SPARK-39011: --- Summary: V2 Filter to ORC Predicate support (was: V2 Filter to ORC Filter support) > V2 Filter to

[jira] [Created] (SPARK-39011) V2 Filter to ORC Filter support

2022-04-25 Thread Huaxin Gao (Jira)
Huaxin Gao created SPARK-39011: -- Summary: V2 Filter to ORC Filter support Key: SPARK-39011 URL: https://issues.apache.org/jira/browse/SPARK-39011 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-39010) V2 Filter to Parquet Predicate support

2022-04-25 Thread Huaxin Gao (Jira)
Huaxin Gao created SPARK-39010: -- Summary: V2 Filter to Parquet Predicate support Key: SPARK-39010 URL: https://issues.apache.org/jira/browse/SPARK-39010 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-38667) Optimizer generates error when using inner join along with sequence

2022-04-25 Thread Lars (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars resolved SPARK-38667. -- Resolution: Resolved > Optimizer generates error when using inner join along with sequence >

[jira] [Commented] (SPARK-38667) Optimizer generates error when using inner join along with sequence

2022-04-25 Thread Lars (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527547#comment-17527547 ] Lars commented on SPARK-38667: -- Thanks all for pointing this out. Changed the affected version to 3.1.2 and

[jira] [Updated] (SPARK-38667) Optimizer generates error when using inner join along with sequence

2022-04-25 Thread Lars (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars updated SPARK-38667: - Affects Version/s: 3.1.2 (was: 3.2.1) > Optimizer generates error when using inner

[jira] [Commented] (SPARK-37222) Max iterations reached in Operator Optimization w/left_anti or left_semi join and nested structures

2022-04-25 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527527#comment-17527527 ] Nicholas Chammas commented on SPARK-37222: -- Thanks for the detailed report, [~ssmith]. I am

[jira] [Updated] (SPARK-37222) Max iterations reached in Operator Optimization w/left_anti or left_semi join and nested structures

2022-04-25 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-37222: - Affects Version/s: 3.2.1 > Max iterations reached in Operator Optimization w/left_anti

[jira] [Commented] (SPARK-38983) Pyspark throws AnalysisException with incorrect error message when using .grouping() or .groupingId() (AnalysisException: grouping() can only be used with GroupingSets

2022-04-25 Thread Chris Kimmel (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527512#comment-17527512 ] Chris Kimmel commented on SPARK-38983: -- Thanks for your comment, [~hyukjin.kwon] . This issue is

[jira] [Updated] (SPARK-38983) Pyspark throws AnalysisException with incorrect error message when using .grouping() or .groupingId() (AnalysisException: grouping() can only be used with GroupingSets/C

2022-04-25 Thread Chris Kimmel (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Kimmel updated SPARK-38983: - Description: h1. In a nutshell Pyspark emits an incorrect error message when committing a type

[jira] [Commented] (SPARK-39007) Use double quotes for SQL configs in error messages

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527501#comment-17527501 ] Apache Spark commented on SPARK-39007: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Commented] (SPARK-39007) Use double quotes for SQL configs in error messages

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527498#comment-17527498 ] Apache Spark commented on SPARK-39007: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Resolved] (SPARK-39009) Spark Log4j vul - CVE-2021-44228

2022-04-25 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-39009. -- Resolution: Duplicate https://issues.apache.org/jira/browse/SPARK-6305 But this is not how to

[jira] [Updated] (SPARK-37174) WARN WindowExec: No Partition Defined is being printed 4 times.

2022-04-25 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-37174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-37174: Attachment: (was: info.txt) > WARN WindowExec: No Partition Defined is being printed

[jira] [Commented] (SPARK-38988) Pandas API - "PerformanceWarning: DataFrame is highly fragmented." get printed many times.

2022-04-25 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-38988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527432#comment-17527432 ] Bjørn Jørgensen commented on SPARK-38988: - I add a new fil "warning printed.txt" it show that it

[jira] [Updated] (SPARK-38988) Pandas API - "PerformanceWarning: DataFrame is highly fragmented." get printed many times.

2022-04-25 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-38988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-38988: Attachment: warning printed.txt > Pandas API - "PerformanceWarning: DataFrame is highly

[jira] [Updated] (SPARK-38965) Optimize RemoteBlockPushResolver with a memory pool

2022-04-25 Thread Wan Kun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wan Kun updated SPARK-38965: Summary: Optimize RemoteBlockPushResolver with a memory pool (was: Retry transfer blocks for exceptions

[jira] [Commented] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527402#comment-17527402 ] Apache Spark commented on SPARK-39001: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39001: Assignee: Apache Spark > Document which options are unsupported in CSV and JSON

[jira] [Assigned] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39001: Assignee: (was: Apache Spark) > Document which options are unsupported in CSV and

[jira] [Commented] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527401#comment-17527401 ] Apache Spark commented on SPARK-39001: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-39001) Document which options are unsupported in CSV and JSON functions

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527400#comment-17527400 ] Hyukjin Kwon commented on SPARK-39001: -- Actually this is pretty straightforward. let me just make a

[jira] [Updated] (SPARK-38965) Retry transfer blocks for exceptions listed in the error handler

2022-04-25 Thread Wan Kun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wan Kun updated SPARK-38965: Description: For push-based shuffle service, there are many  {{BLOCK_APPEND_COLLISION_DETECTED}} when

[jira] [Resolved] (SPARK-39007) Use double quotes for SQL configs in error messages

2022-04-25 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-39007. -- Resolution: Fixed Issue resolved by pull request 36335 [https://github.com/apache/spark/pull/36335]

[jira] [Assigned] (SPARK-38999) Refactor DataSourceScanExec code to

2022-04-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-38999: --- Assignee: Utkarsh Agarwal > Refactor DataSourceScanExec code to >

[jira] [Resolved] (SPARK-38999) Refactor DataSourceScanExec code to

2022-04-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-38999. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36327

[jira] [Updated] (SPARK-38981) Unexpected commutative property of udf/pandas_udf and filters

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-38981: - Priority: Major (was: Critical) > Unexpected commutative property of udf/pandas_udf and

[jira] [Updated] (SPARK-38981) Unexpected commutative property of udf/pandas_udf and filters

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-38981: - Component/s: PySpark > Unexpected commutative property of udf/pandas_udf and filters >

[jira] [Updated] (SPARK-38981) Unexpected commutative property of udf/pandas_udf and filters

2022-04-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-38981: - Labels: (was: beginner) > Unexpected commutative property of udf/pandas_udf and filters >

[jira] [Created] (SPARK-39009) Spark Log4j vul - CVE-2021-44228

2022-04-25 Thread Prakash Shankar (Jira)
Prakash Shankar created SPARK-39009: --- Summary: Spark Log4j vul - CVE-2021-44228 Key: SPARK-39009 URL: https://issues.apache.org/jira/browse/SPARK-39009 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-38981) Unexpected commutative property of udf/pandas_udf and filters

2022-04-25 Thread Maximilian Sackel (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17527322#comment-17527322 ] Maximilian Sackel commented on SPARK-38981: --- To fill the minimal working example with some