[jira] [Updated] (SPARK-46769) Fix inferring of TIMESTAMP_NTZ in CSV/JSON

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46769: --- Labels: pull-request-available (was: ) > Fix inferring of TIMESTAMP_NTZ in CSV/JSON >

[jira] [Created] (SPARK-46769) Fix inferring of TIMESTAMP_NTZ in CSV/JSON

2024-01-18 Thread Max Gekk (Jira)
Max Gekk created SPARK-46769: Summary: Fix inferring of TIMESTAMP_NTZ in CSV/JSON Key: SPARK-46769 URL: https://issues.apache.org/jira/browse/SPARK-46769 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-46765) make `shuffle` specify the datatype of `seed`

2024-01-18 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-46765. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 44793

[jira] [Assigned] (SPARK-46765) make `shuffle` specify the datatype of `seed`

2024-01-18 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng reassigned SPARK-46765: - Assignee: Ruifeng Zheng > make `shuffle` specify the datatype of `seed` >

[jira] [Updated] (SPARK-46768) Upgrade the Guava version used by the connect module to 33.0-jre

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46768: --- Labels: pull-request-available (was: ) > Upgrade the Guava version used by the connect

[jira] [Created] (SPARK-46768) Upgrade the Guava version used by the connect module to 33.0-jre

2024-01-18 Thread Yang Jie (Jira)
Yang Jie created SPARK-46768: Summary: Upgrade the Guava version used by the connect module to 33.0-jre Key: SPARK-46768 URL: https://issues.apache.org/jira/browse/SPARK-46768 Project: Spark

[jira] [Updated] (SPARK-46767) Refine docstring of `abs/acos/acosh`

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46767: --- Labels: pull-request-available (was: ) > Refine docstring of `abs/acos/acosh` >

[jira] [Created] (SPARK-46767) Refine docstring of `abs/acos/acosh`

2024-01-18 Thread Yang Jie (Jira)
Yang Jie created SPARK-46767: Summary: Refine docstring of `abs/acos/acosh` Key: SPARK-46767 URL: https://issues.apache.org/jira/browse/SPARK-46767 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-46765) make `shuffle` specify the datatype of `seed`

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46765: --- Labels: pull-request-available (was: ) > make `shuffle` specify the datatype of `seed` >

[jira] [Updated] (SPARK-46765) make `shuffle` specify the datatype of `seed`

2024-01-18 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng updated SPARK-46765: -- Summary: make `shuffle` specify the datatype of `seed` (was: Support upcasting for

[jira] [Updated] (SPARK-46765) Support upcasting for unregistered functions

2024-01-18 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng updated SPARK-46765: -- Priority: Major (was: Minor) > Support upcasting for unregistered functions >

[jira] [Updated] (SPARK-46765) Support upcasting for unregistered functions

2024-01-18 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng updated SPARK-46765: -- Summary: Support upcasting for unregistered functions (was: make `shuffle` specify the

[jira] [Updated] (SPARK-46766) ZSTD Buffer Pool Support For AVRO datasource

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46766: --- Labels: pull-request-available (was: ) > ZSTD Buffer Pool Support For AVRO datasource >

[jira] [Created] (SPARK-46766) ZSTD Buffer Pool Support For AVRO datasource

2024-01-18 Thread Kent Yao (Jira)
Kent Yao created SPARK-46766: Summary: ZSTD Buffer Pool Support For AVRO datasource Key: SPARK-46766 URL: https://issues.apache.org/jira/browse/SPARK-46766 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-46676) dropDuplicatesWithinWatermark throws error on canonicalizing plan

2024-01-18 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim reassigned SPARK-46676: Assignee: Jungtaek Lim > dropDuplicatesWithinWatermark throws error on canonicalizing

[jira] [Resolved] (SPARK-46676) dropDuplicatesWithinWatermark throws error on canonicalizing plan

2024-01-18 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-46676. -- Fix Version/s: 3.5.1 4.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Created] (SPARK-46765) make `shuffle` specify the datatype of `seed`

2024-01-18 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-46765: - Summary: make `shuffle` specify the datatype of `seed` Key: SPARK-46765 URL: https://issues.apache.org/jira/browse/SPARK-46765 Project: Spark Issue Type:

[jira] [Updated] (SPARK-46764) Reorganize Ruby script to build API docs

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46764: --- Labels: pull-request-available (was: ) > Reorganize Ruby script to build API docs >

[jira] [Created] (SPARK-46764) Reorganize Ruby script to build API docs

2024-01-18 Thread Nicholas Chammas (Jira)
Nicholas Chammas created SPARK-46764: Summary: Reorganize Ruby script to build API docs Key: SPARK-46764 URL: https://issues.apache.org/jira/browse/SPARK-46764 Project: Spark Issue Type:

[jira] [Created] (SPARK-46763) ReplaceDeduplicateWithAggregate fails when non-grouping keys have duplicate attributes

2024-01-18 Thread Nikhil Sheoran (Jira)
Nikhil Sheoran created SPARK-46763: -- Summary: ReplaceDeduplicateWithAggregate fails when non-grouping keys have duplicate attributes Key: SPARK-46763 URL: https://issues.apache.org/jira/browse/SPARK-46763

[jira] [Commented] (SPARK-45282) Join loses records for cached datasets

2024-01-18 Thread Rob Russo (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17808393#comment-17808393 ] Rob Russo commented on SPARK-45282: --- Is it possible that this also affects spark 3.3.2? I have an

[jira] [Updated] (SPARK-46762) Spark Connect 3.5 Classloading issue

2024-01-18 Thread nirav patel (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] nirav patel updated SPARK-46762: Description: *Affected version:* spark 3.5 and spark-connect_2.12:3.5.0   *Not affected

[jira] [Created] (SPARK-46762) Spark Connect 3.5 Classloading issue

2024-01-18 Thread nirav patel (Jira)
nirav patel created SPARK-46762: --- Summary: Spark Connect 3.5 Classloading issue Key: SPARK-46762 URL: https://issues.apache.org/jira/browse/SPARK-46762 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-46762) Spark Connect 3.5 Classloading issue

2024-01-18 Thread nirav patel (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] nirav patel updated SPARK-46762: Description: *Affected version:* spark 3.5 and spark-connect_2.12:3.5.0   *Not affected

[jira] [Created] (SPARK-46761) quoted strings in a JSON path should support ? characters

2024-01-18 Thread Robert Joseph Evans (Jira)
Robert Joseph Evans created SPARK-46761: --- Summary: quoted strings in a JSON path should support ? characters Key: SPARK-46761 URL: https://issues.apache.org/jira/browse/SPARK-46761 Project:

[jira] [Resolved] (SPARK-46759) Codec xz and zstandard support compression level for avro files

2024-01-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-46759. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 44786

[jira] [Assigned] (SPARK-46759) Codec xz and zstandard support compression level for avro files

2024-01-18 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-46759: - Assignee: Kent Yao > Codec xz and zstandard support compression level for avro files >

[jira] [Commented] (SPARK-46247) Invalid bucket file error when reading from bucketed table created with PathOutputCommitProtocol

2024-01-18 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-46247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17808235#comment-17808235 ] Никита Соколов commented on SPARK-46247: No, there was no trailing dot at the end of the

[jira] [Commented] (SPARK-46247) Invalid bucket file error when reading from bucketed table created with PathOutputCommitProtocol

2024-01-18 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17808227#comment-17808227 ] Steve Loughran commented on SPARK-46247: why is the file invalid? any more stack trace? # try

[jira] [Updated] (SPARK-46759) Codec xz and zstandard support compression level for avro files

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46759: --- Labels: pull-request-available (was: ) > Codec xz and zstandard support compression level

[jira] [Updated] (SPARK-46760) Make the document of spark.sql.adaptive.coalescePartitions.parallelismFirst clearer

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46760: --- Labels: pull-request-available (was: ) > Make the document of

[jira] [Created] (SPARK-46760) Make the document of spark.sql.adaptive.coalescePartitions.parallelismFirst clearer

2024-01-18 Thread Jiaan Geng (Jira)
Jiaan Geng created SPARK-46760: -- Summary: Make the document of spark.sql.adaptive.coalescePartitions.parallelismFirst clearer Key: SPARK-46760 URL: https://issues.apache.org/jira/browse/SPARK-46760

[jira] [Created] (SPARK-46759) Codec xz and zstandard support compression level for avro files

2024-01-18 Thread Kent Yao (Jira)
Kent Yao created SPARK-46759: Summary: Codec xz and zstandard support compression level for avro files Key: SPARK-46759 URL: https://issues.apache.org/jira/browse/SPARK-46759 Project: Spark

[jira] [Assigned] (SPARK-39910) DataFrameReader API cannot read files from hadoop archives (.har)

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-39910: -- Assignee: Apache Spark > DataFrameReader API cannot read files from hadoop archives

[jira] [Assigned] (SPARK-39910) DataFrameReader API cannot read files from hadoop archives (.har)

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-39910: -- Assignee: (was: Apache Spark) > DataFrameReader API cannot read files from

[jira] [Commented] (SPARK-46623) Replace SimpleDateFormat with DateTimeFormatter

2024-01-18 Thread Mridul Muralidharan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17808080#comment-17808080 ] Mridul Muralidharan commented on SPARK-46623: - Issue resolved by pull request 44616

[jira] [Resolved] (SPARK-46623) Replace SimpleDateFormat with DateTimeFormatter

2024-01-18 Thread Mridul Muralidharan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan resolved SPARK-46623. - Fix Version/s: 4.0.0 Assignee: Jiaan Geng Resolution: Fixed >

[jira] [Resolved] (SPARK-46696) In ResourceProfileManager, function calls should occur after variable declarations.

2024-01-18 Thread Mridul Muralidharan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan resolved SPARK-46696. - Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request

[jira] [Assigned] (SPARK-46696) In ResourceProfileManager, function calls should occur after variable declarations.

2024-01-18 Thread Mridul Muralidharan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan reassigned SPARK-46696: --- Assignee: liangyongyuan > In ResourceProfileManager, function calls should

[jira] [Resolved] (SPARK-46754) Fix compression code resolution in avro table definition

2024-01-18 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao resolved SPARK-46754. -- Fix Version/s: 4.0.0 Assignee: Kent Yao Resolution: Fixed resolved by  [GitHub Pull

[jira] [Updated] (SPARK-46708) Support error message format in Spark Connect service

2024-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-46708: --- Labels: pull-request-available (was: ) > Support error message format in Spark Connect