[jira] [Created] (SPARK-40247) Fix BitSet equality check

2022-08-28 Thread Peter Toth (Jira)
Peter Toth created SPARK-40247: -- Summary: Fix BitSet equality check Key: SPARK-40247 URL: https://issues.apache.org/jira/browse/SPARK-40247 Project: Spark Issue Type: Bug Components: S

[jira] [Created] (SPARK-40248) Use larger number of bits to build bloom filter

2022-08-28 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40248: --- Summary: Use larger number of bits to build bloom filter Key: SPARK-40248 URL: https://issues.apache.org/jira/browse/SPARK-40248 Project: Spark Issue Type: Im

[jira] [Updated] (SPARK-40248) Use larger number of bits to build bloom filter

2022-08-28 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40248: Component/s: SQL (was: Optimizer) > Use larger number of bits to build bloom

[jira] [Created] (SPARK-40249) Some cache miss cases in MLLib

2022-08-28 Thread Mingchao Wu (Jira)
Mingchao Wu created SPARK-40249: --- Summary: Some cache miss cases in MLLib Key: SPARK-40249 URL: https://issues.apache.org/jira/browse/SPARK-40249 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-40247) Fix BitSet equality check

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17589595#comment-17589595 ] Apache Spark commented on SPARK-40247: -- User 'peter-toth' has created a pull reques

[jira] [Assigned] (SPARK-40247) Fix BitSet equality check

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40247: Assignee: Apache Spark > Fix BitSet equality check > - > >

[jira] [Assigned] (SPARK-40247) Fix BitSet equality check

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40247: Assignee: (was: Apache Spark) > Fix BitSet equality check > -

[jira] [Assigned] (SPARK-40248) Use larger number of bits to build bloom filter

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40248: Assignee: Apache Spark > Use larger number of bits to build bloom filter > -

[jira] [Commented] (SPARK-40248) Use larger number of bits to build bloom filter

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17590411#comment-17590411 ] Apache Spark commented on SPARK-40248: -- User 'wangyum' has created a pull request f

[jira] [Assigned] (SPARK-40248) Use larger number of bits to build bloom filter

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40248: Assignee: (was: Apache Spark) > Use larger number of bits to build bloom filter > --

[jira] [Commented] (SPARK-40248) Use larger number of bits to build bloom filter

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17590435#comment-17590435 ] Apache Spark commented on SPARK-40248: -- User 'wangyum' has created a pull request f

[jira] [Commented] (SPARK-39184) ArrayIndexOutOfBoundsException for some date/time sequences in some time-zones

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17596153#comment-17596153 ] Apache Spark commented on SPARK-39184: -- User 'bersprockets' has created a pull requ

[jira] [Assigned] (SPARK-40039) Introducing a streaming checkpoint file manager based on Hadoop's Abortable interface

2022-08-28 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40039: Assignee: Attila Zsolt Piros > Introducing a streaming checkpoint file manager based on H

[jira] [Resolved] (SPARK-40039) Introducing a streaming checkpoint file manager based on Hadoop's Abortable interface

2022-08-28 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40039. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 37687 [https://gi

[jira] [Updated] (SPARK-39184) ArrayIndexOutOfBoundsException for some date/time sequences in some time-zones

2022-08-28 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-39184: - Fix Version/s: 3.1.4 > ArrayIndexOutOfBoundsException for some date/time sequences in some time-

[jira] [Updated] (SPARK-40212) SparkSQL castPartValue does not properly handle byte & short

2022-08-28 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-40212: - Fix Version/s: 3.3.1 3.2.3 > SparkSQL castPartValue does not properly handle

[jira] [Assigned] (SPARK-40212) SparkSQL castPartValue does not properly handle byte & short

2022-08-28 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40212: Assignee: Brennan Stein > SparkSQL castPartValue does not properly handle byte & short >

[jira] [Resolved] (SPARK-40212) SparkSQL castPartValue does not properly handle byte & short

2022-08-28 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40212. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 37659 [https://gi

[jira] [Created] (SPARK-40250) Further check the availability of RemoteBlockPushResolver API after call close

2022-08-28 Thread Yang Jie (Jira)
Yang Jie created SPARK-40250: Summary: Further check the availability of RemoteBlockPushResolver API after call close Key: SPARK-40250 URL: https://issues.apache.org/jira/browse/SPARK-40250 Project: Spark

[jira] [Created] (SPARK-40251) Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2

2022-08-28 Thread BingKun Pan (Jira)
BingKun Pan created SPARK-40251: --- Summary: Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 Key: SPARK-40251 URL: https://issues.apache.org/jira/browse/SPARK-40251 Project: Spark Issue Type: Impr

[jira] [Assigned] (SPARK-40251) Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40251: Assignee: Apache Spark > Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 > ---

[jira] [Commented] (SPARK-40251) Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17596947#comment-17596947 ] Apache Spark commented on SPARK-40251: -- User 'panbingkun' has created a pull reques

[jira] [Assigned] (SPARK-40251) Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40251: Assignee: (was: Apache Spark) > Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 >

[jira] [Commented] (SPARK-40251) Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17596946#comment-17596946 ] Apache Spark commented on SPARK-40251: -- User 'panbingkun' has created a pull reques

[jira] [Created] (SPARK-40252) Replcace `Stream.collect(Collectors.joining(delimiter))` to `StringJoiner` Api

2022-08-28 Thread Yang Jie (Jira)
Yang Jie created SPARK-40252: Summary: Replcace `Stream.collect(Collectors.joining(delimiter))` to `StringJoiner` Api Key: SPARK-40252 URL: https://issues.apache.org/jira/browse/SPARK-40252 Project: Spark

[jira] [Updated] (SPARK-40252) Replcace `Stream.collect(Collectors.joining(delimiter))` with `StringJoiner` Api

2022-08-28 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-40252: - Summary: Replcace `Stream.collect(Collectors.joining(delimiter))` with `StringJoiner` Api (was: Replcac

[jira] [Updated] (SPARK-40252) Replace `Stream.collect(Collectors.joining(delimiter))` with `StringJoiner` Api

2022-08-28 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-40252: - Summary: Replace `Stream.collect(Collectors.joining(delimiter))` with `StringJoiner` Api (was: Replcace

[jira] [Assigned] (SPARK-40252) Replace `Stream.collect(Collectors.joining(delimiter))` with `StringJoiner` Api

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40252: Assignee: Apache Spark > Replace `Stream.collect(Collectors.joining(delimiter))` with `St

[jira] [Assigned] (SPARK-40252) Replace `Stream.collect(Collectors.joining(delimiter))` with `StringJoiner` Api

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40252: Assignee: (was: Apache Spark) > Replace `Stream.collect(Collectors.joining(delimiter)

[jira] [Commented] (SPARK-40252) Replace `Stream.collect(Collectors.joining(delimiter))` with `StringJoiner` Api

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17596984#comment-17596984 ] Apache Spark commented on SPARK-40252: -- User 'LuciferYang' has created a pull reque

[jira] [Commented] (SPARK-40012) Make pyspark.sql.dataframe examples self-contained

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17596992#comment-17596992 ] Apache Spark commented on SPARK-40012: -- User 'HyukjinKwon' has created a pull reque

[jira] [Commented] (SPARK-40012) Make pyspark.sql.dataframe examples self-contained

2022-08-28 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17596993#comment-17596993 ] Apache Spark commented on SPARK-40012: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-39607) DataSourceV2: Distribution and ordering support V2 function in writing

2022-08-28 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-39607: --- Assignee: Cheng Pan > DataSourceV2: Distribution and ordering support V2 function in writin

[jira] [Resolved] (SPARK-39607) DataSourceV2: Distribution and ordering support V2 function in writing

2022-08-28 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-39607. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36995 [https://gith

[jira] [Created] (SPARK-40253) Data read exception in orc format

2022-08-28 Thread yihangqiao (Jira)
yihangqiao created SPARK-40253: -- Summary: Data read exception in orc format Key: SPARK-40253 URL: https://issues.apache.org/jira/browse/SPARK-40253 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-40253) Data read exception in orc format

2022-08-28 Thread yihangqiao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yihangqiao updated SPARK-40253: --- Issue Type: Bug (was: Improvement) > Data read exception in orc format > -

[jira] [Updated] (SPARK-40253) Data read exception in orc format

2022-08-28 Thread yihangqiao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yihangqiao updated SPARK-40253: --- Description: {code:java} // code placeholder {code} When running batches using spark-sql and using the create ta

[jira] [Updated] (SPARK-40253) Data read exception in orc format

2022-08-28 Thread yihangqiao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yihangqiao updated SPARK-40253: --- Description: Caused by: java.io.EOFException: Read past end of RLE integer from compressed stream S

[jira] [Commented] (SPARK-40253) Data read exception in orc format

2022-08-28 Thread yihangqiao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17597002#comment-17597002 ] yihangqiao commented on SPARK-40253: does not fundamentally solve the problem > Da