[jira] [Updated] (SPARK-48921) ScalaUDF in subquery should run through analyzer

2024-07-16 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh updated SPARK-48921: Affects Version/s: 3.5.1 4.0.0 > ScalaUDF in subquery should run through an

[jira] [Resolved] (SPARK-48920) Upgrade ORC to 1.9.4

2024-07-16 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-48920. --- Fix Version/s: 3.5.2 Resolution: Fixed Issue resolved by pull request 47379 [https://

[jira] [Updated] (SPARK-48923) Fix the incorrect logic of `CollationFactorySuite`

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-48923: --- Labels: pull-request-available (was: ) > Fix the incorrect logic of `CollationFactorySuite`

[jira] [Created] (SPARK-48923) Fix the incorrect logic of `CollationFactorySuite`

2024-07-16 Thread BingKun Pan (Jira)
BingKun Pan created SPARK-48923: --- Summary: Fix the incorrect logic of `CollationFactorySuite` Key: SPARK-48923 URL: https://issues.apache.org/jira/browse/SPARK-48923 Project: Spark Issue Type:

[jira] [Created] (SPARK-48922) Optimize complex type insertion performance

2024-07-16 Thread Zhen Wang (Jira)
Zhen Wang created SPARK-48922: - Summary: Optimize complex type insertion performance Key: SPARK-48922 URL: https://issues.apache.org/jira/browse/SPARK-48922 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-48921) ScalaUDF in subquery should run through analyzer

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-48921: --- Labels: pull-request-available (was: ) > ScalaUDF in subquery should run through analyzer >

[jira] [Created] (SPARK-48921) ScalaUDF in subquery should run through analyzer

2024-07-16 Thread L. C. Hsieh (Jira)
L. C. Hsieh created SPARK-48921: --- Summary: ScalaUDF in subquery should run through analyzer Key: SPARK-48921 URL: https://issues.apache.org/jira/browse/SPARK-48921 Project: Spark Issue Type: Bu

[jira] [Assigned] (SPARK-48921) ScalaUDF in subquery should run through analyzer

2024-07-16 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh reassigned SPARK-48921: --- Assignee: L. C. Hsieh > ScalaUDF in subquery should run through analyzer >

[jira] [Updated] (SPARK-48918) Create a unified SQL Scala interface shared by regular SQL and Connect.

2024-07-16 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-48918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell updated SPARK-48918: -- Description: *Motivation* Current the scala sql/core and connect API share the same A

[jira] [Assigned] (SPARK-48903) Update latestSnapshot version correctly on remote load

2024-07-16 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim reassigned SPARK-48903: Assignee: Anish Shrigondekar > Update latestSnapshot version correctly on remote load > -

[jira] [Resolved] (SPARK-48903) Update latestSnapshot version correctly on remote load

2024-07-16 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-48903. -- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47363 [https://gi

[jira] [Created] (SPARK-48918) Create a unified SQL Scala interface shared by regular SQL and Connect.

2024-07-16 Thread Jira
Herman van Hövell created SPARK-48918: - Summary: Create a unified SQL Scala interface shared by regular SQL and Connect. Key: SPARK-48918 URL: https://issues.apache.org/jira/browse/SPARK-48918 Pro

[jira] [Resolved] (SPARK-48883) In spark ML, replace RDD read / write API invocation with Dataframe read / write API

2024-07-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-48883. -- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47341 [https://gi

[jira] [Updated] (SPARK-48917) Upgrade tink to 1.14.0

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-48917: --- Labels: pull-request-available (was: ) > Upgrade tink to 1.14.0 > -- >

[jira] [Assigned] (SPARK-48892) Avoid per-row param read in `Tokenizer`

2024-07-16 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng reassigned SPARK-48892: - Assignee: Ruifeng Zheng > Avoid per-row param read in `Tokenizer` > ---

[jira] [Resolved] (SPARK-48892) Avoid per-row param read in `Tokenizer`

2024-07-16 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-48892. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47342 [https://

[jira] [Updated] (SPARK-48901) Add clusterBy DataStreamWriter API for Scala

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-48901: --- Labels: pull-request-available (was: ) > Add clusterBy DataStreamWriter API for Scala > ---

[jira] [Created] (SPARK-48916) Add clusterBy DataStreamWriter API for Python

2024-07-16 Thread Chirag Singh (Jira)
Chirag Singh created SPARK-48916: Summary: Add clusterBy DataStreamWriter API for Python Key: SPARK-48916 URL: https://issues.apache.org/jira/browse/SPARK-48916 Project: Spark Issue Type: Sub

[jira] [Created] (SPARK-48915) Add inequality (!=, <, <=, >, >=) predicates for correlation in GeneratedSubquerySuite

2024-07-16 Thread Nick Young (Jira)
Nick Young created SPARK-48915: -- Summary: Add inequality (!=, <, <=, >, >=) predicates for correlation in GeneratedSubquerySuite Key: SPARK-48915 URL: https://issues.apache.org/jira/browse/SPARK-48915 Pr

[jira] [Created] (SPARK-48913) Avoid `com.sun.xml.txw2.output.IndentingXMLStreamWriter` usage

2024-07-16 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-48913: - Summary: Avoid `com.sun.xml.txw2.output.IndentingXMLStreamWriter` usage Key: SPARK-48913 URL: https://issues.apache.org/jira/browse/SPARK-48913 Project: Spark

[jira] [Updated] (SPARK-48911) Improve collation support testing for various expressions

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-48911: --- Labels: pull-request-available (was: ) > Improve collation support testing for various expr

[jira] [Created] (SPARK-48912) Improve collation support testing for various collations

2024-07-16 Thread Jira
Uroš Bojanić created SPARK-48912: Summary: Improve collation support testing for various collations Key: SPARK-48912 URL: https://issues.apache.org/jira/browse/SPARK-48912 Project: Spark Issu

[jira] [Created] (SPARK-48911) Improve collation support testing for various expressions

2024-07-16 Thread Jira
Uroš Bojanić created SPARK-48911: Summary: Improve collation support testing for various expressions Key: SPARK-48911 URL: https://issues.apache.org/jira/browse/SPARK-48911 Project: Spark Iss

[jira] [Assigned] (SPARK-48909) Uses SparkSession over SparkContext when writing metadata

2024-07-16 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-48909: - Assignee: Hyukjin Kwon > Uses SparkSession over SparkContext when writing metadata > --

[jira] [Resolved] (SPARK-48909) Uses SparkSession over SparkContext when writing metadata

2024-07-16 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-48909. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47366 [https://

[jira] [Resolved] (SPARK-48896) Remove repartition(1) in writing metadata ML

2024-07-16 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-48896. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47347 [https://

[jira] [Assigned] (SPARK-48896) Remove repartition(1) in writing metadata ML

2024-07-16 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-48896: - Assignee: Hyukjin Kwon > Remove repartition(1) in writing metadata ML > ---

[jira] [Resolved] (SPARK-48908) GitHub API Rate Limit Exceeded Problem in spark-rm Dockerfile

2024-07-16 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao resolved SPARK-48908. -- Resolution: Not A Problem Only 3.5 has such an issue > GitHub API Rate Limit Exceeded Problem in spar

[jira] [Updated] (SPARK-48910) Slow linear searches in PreprocessTableCreation

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-48910: --- Labels: pull-request-available (was: ) > Slow linear searches in PreprocessTableCreation >

[jira] [Created] (SPARK-48910) Slow linear searches in PreprocessTableCreation

2024-07-16 Thread Vladimir Golubev (Jira)
Vladimir Golubev created SPARK-48910: Summary: Slow linear searches in PreprocessTableCreation Key: SPARK-48910 URL: https://issues.apache.org/jira/browse/SPARK-48910 Project: Spark Issue

[jira] [Updated] (SPARK-48909) Uses SparkSession over SparkContext when writing metadata

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-48909: --- Labels: pull-request-available (was: ) > Uses SparkSession over SparkContext when writing m

[jira] [Created] (SPARK-48909) Uses SparkSession over SparkContext when writing metadata

2024-07-16 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-48909: Summary: Uses SparkSession over SparkContext when writing metadata Key: SPARK-48909 URL: https://issues.apache.org/jira/browse/SPARK-48909 Project: Spark Iss

[jira] [Created] (SPARK-48908) GitHub API Rate Limit Exceeded Problem in spark-rm Dockerfile

2024-07-16 Thread Kent Yao (Jira)
Kent Yao created SPARK-48908: Summary: GitHub API Rate Limit Exceeded Problem in spark-rm Dockerfile Key: SPARK-48908 URL: https://issues.apache.org/jira/browse/SPARK-48908 Project: Spark Issue

[jira] [Assigned] (SPARK-48761) Add clusterBy DataFrameWriter API for Scala

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-48761: -- Assignee: Apache Spark > Add clusterBy DataFrameWriter API for Scala > --

[jira] [Assigned] (SPARK-48761) Add clusterBy DataFrameWriter API for Scala

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-48761: -- Assignee: (was: Apache Spark) > Add clusterBy DataFrameWriter API for Scala > ---

[jira] [Assigned] (SPARK-48761) Add clusterBy DataFrameWriter API for Scala

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-48761: -- Assignee: Apache Spark > Add clusterBy DataFrameWriter API for Scala > --

[jira] [Assigned] (SPARK-48761) Add clusterBy DataFrameWriter API for Scala

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-48761: -- Assignee: (was: Apache Spark) > Add clusterBy DataFrameWriter API for Scala > ---

[jira] [Assigned] (SPARK-48761) Add clusterBy DataFrameWriter API for Scala

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-48761: -- Assignee: Apache Spark > Add clusterBy DataFrameWriter API for Scala > --

[jira] [Assigned] (SPARK-48761) Add clusterBy DataFrameWriter API for Scala

2024-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-48761: -- Assignee: (was: Apache Spark) > Add clusterBy DataFrameWriter API for Scala > ---

[jira] [Assigned] (SPARK-48873) Use UnsafeRow in JSON parser.

2024-07-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-48873: Assignee: Chenhao Li > Use UnsafeRow in JSON parser. > - > >

[jira] [Resolved] (SPARK-48873) Use UnsafeRow in JSON parser.

2024-07-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-48873. -- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47310 [https://gi