[jira] [Resolved] (SPARK-49050) Enabling deleteIfExists operator in TWS with Virtual Column Families

2024-08-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-49050. -- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47880 [https://gi

[jira] [Assigned] (SPARK-49050) Enabling deleteIfExists operator in TWS with Virtual Column Families

2024-08-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim reassigned SPARK-49050: Assignee: Eric Marnadi > Enabling deleteIfExists operator in TWS with Virtual Column Fami

[jira] [Commented] (SPARK-49442) Complete Metadata requests on each micro batch causing Kafka issues

2024-08-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877256#comment-17877256 ] Jungtaek Lim commented on SPARK-49442: -- Could you give a try with setting SQL confi

[jira] [Updated] (SPARK-49442) Complete Metadata requests on each micro batch causing Kafka issues

2024-08-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-49442: - Target Version/s: (was: 3.3.2) > Complete Metadata requests on each micro batch causing Kafka

[jira] [Updated] (SPARK-49442) Complete Metadata requests on each micro batch causing Kafka issues

2024-08-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-49442: - Priority: Major (was: Blocker) > Complete Metadata requests on each micro batch causing Kafka i

[jira] [Updated] (SPARK-49442) Complete Metadata requests on each micro batch causing Kafka issues

2024-08-27 Thread vipin Kumar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vipin Kumar updated SPARK-49442: Target Version/s: 3.3.2 Labels: Kafka spark-streaming-kafka (was: ) Prio

[jira] [Resolved] (SPARK-49412) Compute all box plot metrics in single job

2024-08-27 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-49412. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47897 [https://

[jira] [Created] (SPARK-49442) Complete Metadata requests on each micro batch causing Kafka issues

2024-08-27 Thread vipin Kumar (Jira)
vipin Kumar created SPARK-49442: --- Summary: Complete Metadata requests on each micro batch causing Kafka issues Key: SPARK-49442 URL: https://issues.apache.org/jira/browse/SPARK-49442 Project: Spark

[jira] [Commented] (SPARK-49069) Kafka Offsets Not Committed to Consumer Group in Spark Structured Streaming

2024-08-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877247#comment-17877247 ] Jungtaek Lim commented on SPARK-49069: -- I changed this ticket to Wish. As commented

[jira] [Updated] (SPARK-49069) Kafka Offsets Not Committed to Consumer Group in Spark Structured Streaming

2024-08-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-49069: - Issue Type: Wish (was: Bug) > Kafka Offsets Not Committed to Consumer Group in Spark Structured

[jira] [Updated] (SPARK-49441) StringIndexer sort arrays in executors

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49441: --- Labels: pull-request-available (was: ) > StringIndexer sort arrays in executors > -

[jira] [Created] (SPARK-49441) StringIndexer sort arrays in executors

2024-08-27 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-49441: - Summary: StringIndexer sort arrays in executors Key: SPARK-49441 URL: https://issues.apache.org/jira/browse/SPARK-49441 Project: Spark Issue Type: Improvem

[jira] [Updated] (SPARK-49440) Clean up unused LogKey definitions

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49440: --- Labels: pull-request-available (was: ) > Clean up unused LogKey definitions > -

[jira] [Created] (SPARK-49440) Clean up unused LogKey definitions

2024-08-27 Thread Yang Jie (Jira)
Yang Jie created SPARK-49440: Summary: Clean up unused LogKey definitions Key: SPARK-49440 URL: https://issues.apache.org/jira/browse/SPARK-49440 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-49439) Fix the pretty name of the `FromProtobuf` & `ToProtobuf` expression

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49439: --- Labels: pull-request-available (was: ) > Fix the pretty name of the `FromProtobuf` & `ToPro

[jira] [Created] (SPARK-49439) Fix the pretty name of the `FromProtobuf` & `ToProtobuf` expression

2024-08-27 Thread BingKun Pan (Jira)
BingKun Pan created SPARK-49439: --- Summary: Fix the pretty name of the `FromProtobuf` & `ToProtobuf` expression Key: SPARK-49439 URL: https://issues.apache.org/jira/browse/SPARK-49439 Project: Spark

[jira] [Updated] (SPARK-49438) Fix the pretty name of the `FromAvro` & `ToAvro` expression

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49438: --- Labels: pull-request-available (was: ) > Fix the pretty name of the `FromAvro` & `ToAvro`

[jira] [Updated] (SPARK-49050) Enabling deleteIfExists operator in TWS with Virtual Column Families

2024-08-27 Thread Eric Marnadi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Marnadi updated SPARK-49050: - Summary: Enabling deleteIfExists operator in TWS with Virtual Column Families (was: nabling del

[jira] [Updated] (SPARK-49050) nabling deleteIfExists operator in TWS with Virtual Column Families

2024-08-27 Thread Eric Marnadi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Marnadi updated SPARK-49050: - Description: Fully integrating the TransformWithState operator with Virtual Column Families by a

[jira] [Resolved] (SPARK-49357) [PYTHON] SparkConnectClient._truncate is not effective on deeply nested messages

2024-08-27 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-49357. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47891 [https://

[jira] [Assigned] (SPARK-49357) [PYTHON] SparkConnectClient._truncate is not effective on deeply nested messages

2024-08-27 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng reassigned SPARK-49357: - Assignee: Changgyoo Park > [PYTHON] SparkConnectClient._truncate is not effective on de

[jira] [Updated] (SPARK-49241) Add OpenTelemetryPush Sink

2024-08-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-49241: -- Parent: SPARK-44111 Issue Type: Sub-task (was: Improvement) > Add OpenTelemetryPush S

[jira] [Assigned] (SPARK-49241) Add OpenTelemetryPush Sink

2024-08-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-49241: - Assignee: Qi Tan > Add OpenTelemetryPush Sink > -- > >

[jira] [Resolved] (SPARK-49241) Add OpenTelemetryPush Sink

2024-08-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-49241. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47763 [https://

[jira] [Assigned] (SPARK-49248) Scala Client Parity with existing Dataset/DataFrame API

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell reassigned SPARK-49248: - Assignee: Pengfei Xu > Scala Client Parity with existing Dataset/DataFrame API

[jira] [Created] (SPARK-49438) Fix the pretty name of the `FromAvro` & `ToAvro` expression

2024-08-27 Thread BingKun Pan (Jira)
BingKun Pan created SPARK-49438: --- Summary: Fix the pretty name of the `FromAvro` & `ToAvro` expression Key: SPARK-49438 URL: https://issues.apache.org/jira/browse/SPARK-49438 Project: Spark I

[jira] [Updated] (SPARK-49419) Create a shared DataFrameStatFunctions interface

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell updated SPARK-49419: -- Environment: (was: Not sure if we should do this. Connect and Classic have differe

[jira] [Updated] (SPARK-49419) Create a shared DataFrameStatFunctions interface

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49419: --- Labels: pull-request-available (was: ) > Create a shared DataFrameStatFunctions interface >

[jira] [Updated] (SPARK-49430) Add SparkResult to Classic

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell updated SPARK-49430: -- Description: Add collectResult() and the SparkResult class to classic. By doing this t

[jira] [Created] (SPARK-49437) Create interface implementation tests

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49437: - Summary: Create interface implementation tests Key: SPARK-49437 URL: https://issues.apache.org/jira/browse/SPARK-49437 Project: Spark Issue Type: N

[jira] [Assigned] (SPARK-49421) Create a shared RelationalGroupedDataset interface

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell reassigned SPARK-49421: - Assignee: (was: Herman van Hövell) > Create a shared RelationalGroupedDatas

[jira] [Assigned] (SPARK-49422) Create a shared KeyValueGroupedDataset interface

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell reassigned SPARK-49422: - Assignee: (was: Herman van Hövell) > Create a shared KeyValueGroupedDataset

[jira] [Assigned] (SPARK-49419) Create a shared DataFrameStatFunctions interface

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell reassigned SPARK-49419: - Assignee: (was: Herman van Hövell) > Create a shared DataFrameStatFunctions

[jira] [Assigned] (SPARK-49420) Create a shared DataFrameNaFunctions interface

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell reassigned SPARK-49420: - Assignee: (was: Herman van Hövell) > Create a shared DataFrameNaFunctions i

[jira] [Created] (SPARK-49436) Consider adding shared SQLContext interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49436: - Summary: Consider adding shared SQLContext interface Key: SPARK-49436 URL: https://issues.apache.org/jira/browse/SPARK-49436 Project: Spark Issue T

[jira] [Created] (SPARK-49435) Move ReduceAggregator to sql/api

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49435: - Summary: Move ReduceAggregator to sql/api Key: SPARK-49435 URL: https://issues.apache.org/jira/browse/SPARK-49435 Project: Spark Issue Type: New Fe

[jira] [Created] (SPARK-49434) Move org.apache.spark.sql.expressions.javalang.typed/scalalang.type to sql/api

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49434: - Summary: Move org.apache.spark.sql.expressions.javalang.typed/scalalang.type to sql/api Key: SPARK-49434 URL: https://issues.apache.org/jira/browse/SPARK-49434

[jira] [Updated] (SPARK-49428) Move Connect Scala Client to connect/common

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell updated SPARK-49428: -- Description: Move the actual client to connect/common in the package org.apache.spark

[jira] [Updated] (SPARK-49428) Move Connect Scala Client to connect/common

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell updated SPARK-49428: -- Summary: Move Connect Scala Client to connect/common (was: Move Connect implementatio

[jira] [Resolved] (SPARK-49405) Restrict charsets in JsonOptions

2024-08-27 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao resolved SPARK-49405. -- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47887 [https://github.com

[jira] [Created] (SPARK-49433) Consolidate a version of connect UdfUtils in sql/api

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49433: - Summary: Consolidate a version of connect UdfUtils in sql/api Key: SPARK-49433 URL: https://issues.apache.org/jira/browse/SPARK-49433 Project: Spark

[jira] [Created] (SPARK-49432) Consolidate StreamingQuery in sql/api

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49432: - Summary: Consolidate StreamingQuery in sql/api Key: SPARK-49432 URL: https://issues.apache.org/jira/browse/SPARK-49432 Project: Spark Issue Type: N

[jira] [Created] (SPARK-49431) Consolidate ForeachWriter into sql/api

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49431: - Summary: Consolidate ForeachWriter into sql/api Key: SPARK-49431 URL: https://issues.apache.org/jira/browse/SPARK-49431 Project: Spark Issue Type:

[jira] [Created] (SPARK-49430) Add SparkResult to Classic

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49430: - Summary: Add SparkResult to Classic Key: SPARK-49430 URL: https://issues.apache.org/jira/browse/SPARK-49430 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-49429) Create a shared DataStreamWriter interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49429: - Summary: Create a shared DataStreamWriter interface Key: SPARK-49429 URL: https://issues.apache.org/jira/browse/SPARK-49429 Project: Spark Issue Ty

[jira] [Updated] (SPARK-49415) Create a shared interface for SQLImplicits

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell updated SPARK-49415: -- Description: This includes DatasetHolder > Create a shared interface for SQLImplicits

[jira] [Created] (SPARK-49428) Move Connect implementation to connect/common

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49428: - Summary: Move Connect implementation to connect/common Key: SPARK-49428 URL: https://issues.apache.org/jira/browse/SPARK-49428 Project: Spark Issue

[jira] [Created] (SPARK-49427) Create a shared MergeIntoWriter interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49427: - Summary: Create a shared MergeIntoWriter interface Key: SPARK-49427 URL: https://issues.apache.org/jira/browse/SPARK-49427 Project: Spark Issue Typ

[jira] [Created] (SPARK-49426) Create a shared DataFrameWriterV2 interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49426: - Summary: Create a shared DataFrameWriterV2 interface Key: SPARK-49426 URL: https://issues.apache.org/jira/browse/SPARK-49426 Project: Spark Issue T

[jira] [Created] (SPARK-49425) Create a shared DataFrameWriter interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49425: - Summary: Create a shared DataFrameWriter interface Key: SPARK-49425 URL: https://issues.apache.org/jira/browse/SPARK-49425 Project: Spark Issue Typ

[jira] [Created] (SPARK-49424) Consolidate Encoders in sql/api

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49424: - Summary: Consolidate Encoders in sql/api Key: SPARK-49424 URL: https://issues.apache.org/jira/browse/SPARK-49424 Project: Spark Issue Type: New Fea

[jira] [Assigned] (SPARK-49028) Create a shared interface for SparkSession

2024-08-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-49028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell reassigned SPARK-49028: - Assignee: Herman van Hövell > Create a shared interface for SparkSession >

[jira] [Created] (SPARK-49423) Consolidate Observation into a single class in sql/api

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49423: - Summary: Consolidate Observation into a single class in sql/api Key: SPARK-49423 URL: https://issues.apache.org/jira/browse/SPARK-49423 Project: Spark

[jira] [Created] (SPARK-49420) Create a shared DataFrameNaFunctions interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49420: - Summary: Create a shared DataFrameNaFunctions interface Key: SPARK-49420 URL: https://issues.apache.org/jira/browse/SPARK-49420 Project: Spark Issu

[jira] [Created] (SPARK-49422) Create a shared KeyValueGroupedDataset interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49422: - Summary: Create a shared KeyValueGroupedDataset interface Key: SPARK-49422 URL: https://issues.apache.org/jira/browse/SPARK-49422 Project: Spark Is

[jira] [Created] (SPARK-49421) Create a shared RelationalGroupedDataset interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49421: - Summary: Create a shared RelationalGroupedDataset interface Key: SPARK-49421 URL: https://issues.apache.org/jira/browse/SPARK-49421 Project: Spark

[jira] [Created] (SPARK-49419) Create a shared DataFrameStatFunctions interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49419: - Summary: Create a shared DataFrameStatFunctions interface Key: SPARK-49419 URL: https://issues.apache.org/jira/browse/SPARK-49419 Project: Spark Is

[jira] [Created] (SPARK-49418) Active/default ThreadLocals for shared SparkSession

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49418: - Summary: Active/default ThreadLocals for shared SparkSession Key: SPARK-49418 URL: https://issues.apache.org/jira/browse/SPARK-49418 Project: Spark

[jira] [Created] (SPARK-49417) Create a shared interface for StreamingQueryManager

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49417: - Summary: Create a shared interface for StreamingQueryManager Key: SPARK-49417 URL: https://issues.apache.org/jira/browse/SPARK-49417 Project: Spark

[jira] [Created] (SPARK-49415) Create a shared interface for SQLImplicits

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49415: - Summary: Create a shared interface for SQLImplicits Key: SPARK-49415 URL: https://issues.apache.org/jira/browse/SPARK-49415 Project: Spark Issue Ty

[jira] [Created] (SPARK-49416) Create a shared interface for DataStreamReader

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49416: - Summary: Create a shared interface for DataStreamReader Key: SPARK-49416 URL: https://issues.apache.org/jira/browse/SPARK-49416 Project: Spark Issu

[jira] [Created] (SPARK-49414) Create a shared DataFrameReader interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49414: - Summary: Create a shared DataFrameReader interface Key: SPARK-49414 URL: https://issues.apache.org/jira/browse/SPARK-49414 Project: Spark Issue Typ

[jira] [Created] (SPARK-49413) Create shared RuntimeConf interface

2024-08-27 Thread Jira
Herman van Hövell created SPARK-49413: - Summary: Create shared RuntimeConf interface Key: SPARK-49413 URL: https://issues.apache.org/jira/browse/SPARK-49413 Project: Spark Issue Type: New

[jira] [Updated] (SPARK-49412) Compute all box plot metrics in single job

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49412: --- Labels: pull-request-available (was: ) > Compute all box plot metrics in single job > -

[jira] [Updated] (SPARK-49383) Support Transpose DataFrame API

2024-08-27 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-49383: - Description: Support Transpose as Scala/Python DataFrame API in both Spark Connect and Classic

[jira] [Created] (SPARK-49412) Compute all box plot metrics in single job

2024-08-27 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-49412: - Summary: Compute all box plot metrics in single job Key: SPARK-49412 URL: https://issues.apache.org/jira/browse/SPARK-49412 Project: Spark Issue Type: Sub-

[jira] [Updated] (SPARK-49374) RocksDB State Store Checkpoint Structure V2

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49374: Description: h2. Design Doc: [https://docs.google.com/document/d/1uWRMbN927cRXhSm5oeV3pbwb6o73am4

[jira] [Updated] (SPARK-49374) RocksDB State Store Checkpoint Structure V2

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49374: Description: h2. Design Doc: [https://docs.google.com/document/d/1uWRMbN927cRXhSm5oeV3pbwb6o73am4

[jira] [Resolved] (SPARK-49393) fail by default in deprecated catalog plugin APIs

2024-08-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-49393. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 47874 [https://

[jira] [Comment Edited] (SPARK-49069) Kafka Offsets Not Committed to Consumer Group in Spark Structured Streaming

2024-08-27 Thread Anish Shrigondekar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877190#comment-17877190 ] Anish Shrigondekar edited comment on SPARK-49069 at 8/27/24 9:27 PM: -

[jira] [Commented] (SPARK-49069) Kafka Offsets Not Committed to Consumer Group in Spark Structured Streaming

2024-08-27 Thread Anish Shrigondekar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877190#comment-17877190 ] Anish Shrigondekar commented on SPARK-49069: AFAIK - we don't support commit

[jira] [Resolved] (SPARK-40706) IllegalStateException when querying array values inside a nested struct

2024-08-27 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins resolved SPARK-40706. --- Resolution: Duplicate > IllegalStateException when querying array values inside a nested str

[jira] [Commented] (SPARK-45745) Extremely slow execution of sum of columns in Spark 3.4.1

2024-08-27 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877165#comment-17877165 ] Bruce Robbins commented on SPARK-45745: --- I will close as a duplicate of SPARK-4507

[jira] [Commented] (SPARK-45278) Make Yarn executor's bindAddress configurable

2024-08-27 Thread Hendra Saputra (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877149#comment-17877149 ] Hendra Saputra commented on SPARK-45278: Hi [~nishchal] I created new PR for thi

[jira] [Updated] (SPARK-49374) RocksDB State Store Checkpoint Structure V2

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49374: Description: h2. Motivation We expect the new checkpoint structure would be beneficial by establi

[jira] [Updated] (SPARK-49374) RocksDB State Store Checkpoint Structure V2

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49374: Description: h2. Motivation We expect the new checkpoint structure would be beneficial by establi

[jira] [Updated] (SPARK-49411) Communicate RocksDB State Store CheckpointID Between Driver and Executor

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49411: Description: A incremental step to implement RocksDB state store checkpoint format V2. Once conf

[jira] [Updated] (SPARK-49411) Communicate RocksDB State Store CheckpointID Between Driver and Executor

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49411: Summary: Communicate RocksDB State Store CheckpointID Between Driver and Executor (was: Communica

[jira] [Updated] (SPARK-49411) Communicate State Store CheckpointID Between Driver and Executor

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49411: Description: Once conf STATE_STORE_CHECKPOINT_FORMAT_VERSION is set to be higher than version 2, t

[jira] [Updated] (SPARK-49374) RocksDB State Store Checkpoint Structure V2

2024-08-27 Thread Siying Dong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siying Dong updated SPARK-49374: Epic Name: RocksDB State Store Checkpoint Structure V2 > RocksDB State Store Checkpoint Structure

[jira] [Updated] (SPARK-49028) Create a shared interface for SparkSession

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49028: --- Labels: pull-request-available (was: ) > Create a shared interface for SparkSession > -

[jira] [Updated] (SPARK-49410) Update collation benchmarks

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49410: --- Labels: pull-request-available (was: ) > Update collation benchmarks >

[jira] [Created] (SPARK-49410) Update collation benchmarks

2024-08-27 Thread Jira
Uroš Bojanić created SPARK-49410: Summary: Update collation benchmarks Key: SPARK-49410 URL: https://issues.apache.org/jira/browse/SPARK-49410 Project: Spark Issue Type: Sub-task Co

[jira] [Updated] (SPARK-49261) Correlation between lit and round during grouping

2024-08-27 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-49261: -- Target Version/s: (was: 3.5.0) > Correlation between lit and round during grouping > ---

[jira] [Updated] (SPARK-49261) Correlation between lit and round during grouping

2024-08-27 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-49261: -- Fix Version/s: (was: 3.5.0) > Correlation between lit and round during grouping >

[jira] [Created] (SPARK-49409) CONNECT_SESSION_PLAN_CACHE_SIZE is too small for certain programming patterns

2024-08-27 Thread Changgyoo Park (Jira)
Changgyoo Park created SPARK-49409: -- Summary: CONNECT_SESSION_PLAN_CACHE_SIZE is too small for certain programming patterns Key: SPARK-49409 URL: https://issues.apache.org/jira/browse/SPARK-49409 Pro

[jira] [Resolved] (SPARK-49404) Adjust `ERROR`-level log messages

2024-08-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-49404. --- Fix Version/s: kubernetes-operator-0.1.0 Resolution: Fixed Issue resolved by pull req

[jira] [Resolved] (SPARK-49350) FoldablePropagation rule and ConstantFolding rule leads to wrong aggregated result

2024-08-27 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins resolved SPARK-49350. --- Resolution: Duplicate > FoldablePropagation rule and ConstantFolding rule leads to wrong agg

[jira] [Commented] (SPARK-49350) FoldablePropagation rule and ConstantFolding rule leads to wrong aggregated result

2024-08-27 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17877052#comment-17877052 ] Bruce Robbins commented on SPARK-49350: --- [~Wayne Guo] Thanks for the update. Closi

[jira] [Updated] (SPARK-49357) [PYTHON] SparkConnectClient._truncate is not effective on deeply nested messages

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated SPARK-49357: --- Labels: pull-request-available (was: ) > [PYTHON] SparkConnectClient._truncate is not effec

[jira] [Updated] (SPARK-49408) Poor performance in ProjectingInternalRow

2024-08-27 Thread ASF GitHub Bot (Jira)
Reporter: Frank Wong >Priority: Major > Labels: pull-request-available > Attachments: 20240827-172739.html > > > In {*}ProjectingInternalRow{*}, the *colOrdinals* is passed as a {_}List{_}. > According to the Scala documentation, th

[jira] [Updated] (SPARK-49408) Poor performance in ProjectingInternalRow

2024-08-27 Thread Frank Wong (Jira)
Components: SQL >Affects Versions: 3.5.2 >Reporter: Frank Wong >Priority: Major > Attachments: 20240827-172739.html > > > In {*}ProjectingInternalRow{*}, the *colOrdinals* is passed as a {_}List{_}. > According to the Scala documentation

[jira] [Updated] (SPARK-49408) Low performance in ProjectingInternalRow

2024-08-27 Thread Frank Wong (Jira)
considerable amount of time was spent on {{{}List.apply{}}}. Changing this to  _{{IndexedSeq}}_ would improve the performance.   [^20240827-172739.html] [https://docs.scala-lang.org/overviews/collections-2.13/performance-characteristics.html] was: In {*}ProjectingInternalRow{*}, the *colOrdinals* is

[jira] [Updated] (SPARK-49408) Low performance in ProjectingInternalRow

2024-08-27 Thread Frank Wong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Wong updated SPARK-49408: --- Attachment: 20240827-172739.html > Low performance in ProjectingInternal

[jira] [Updated] (SPARK-49408) Low performance in ProjectingInternalRow

2024-08-27 Thread Frank Wong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Wong updated SPARK-49408: --- Description: In {*}ProjectingInternalRow{*}, the *colOrdinals* is passed as a {_}List{_}. According

[jira] [Created] (SPARK-49408) Low performance in ProjectingInternalRow

2024-08-27 Thread Frank Wong (Jira)
Frank Wong created SPARK-49408: -- Summary: Low performance in ProjectingInternalRow Key: SPARK-49408 URL: https://issues.apache.org/jira/browse/SPARK-49408 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-49407) Support Custom configuration of exposed types: ClusterIP, Nodeport, LoadBalancer, Ingress

2024-08-27 Thread melin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17876975#comment-17876975 ] melin commented on SPARK-49407: --- cc [~dongjoon]  > Support Custom configuration of expose

[jira] [Updated] (SPARK-49407) Support Custom configuration of exposed types: ClusterIP, Nodeport, LoadBalancer, Ingress

2024-08-27 Thread melin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] melin updated SPARK-49407: -- Description: The exposed rest service could be used to access the Spark’s Web UI,Similar to flink kubernetes.

[jira] [Updated] (SPARK-49407) Support Custom configuration of exposed types: ClusterIP, Nodeport, LoadBalancer, Ingress

2024-08-27 Thread melin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] melin updated SPARK-49407: -- Fix Version/s: 4.0.0 > Support Custom configuration of exposed types: ClusterIP, Nodeport, > LoadBalancer, In

[jira] [Assigned] (SPARK-49392) Catch errors when failing to write to external data source

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot reassigned SPARK-49392: -- Assignee: (was: Apache Spark) > Catch errors when failing to write to external da

  1   2   >