[jira] [Resolved] (SPARK-40590) Fix `ps.read_parquet` when pandas_metadata is True

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40590. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38055 [https://gi

[jira] [Assigned] (SPARK-40590) Fix `ps.read_parquet` when pandas_metadata is True

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40590: Assignee: Haejoon Lee > Fix `ps.read_parquet` when pandas_metadata is True >

[jira] [Assigned] (SPARK-40674) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40674: Assignee: Hyukjin Kwon > Use unittest's asserts instead of built-in assert >

[jira] [Resolved] (SPARK-40674) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40674. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38120 [https://gi

[jira] [Resolved] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-40670. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38115 [https://gi

[jira] [Assigned] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim reassigned SPARK-40670: Assignee: Jungtaek Lim > NPE in applyInPandasWithState when the input schema has "non-nul

[jira] [Assigned] (SPARK-40672) Run Scala side tests in GitHub Actions

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40672: Assignee: Hyukjin Kwon > Run Scala side tests in GitHub Actions > ---

[jira] [Resolved] (SPARK-40672) Run Scala side tests in GitHub Actions

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40672. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38119 [https://gi

[jira] [Created] (SPARK-40675) Add missing spark configuration to documentation

2022-10-05 Thread Qian Sun (Jira)
Qian Sun created SPARK-40675: Summary: Add missing spark configuration to documentation Key: SPARK-40675 URL: https://issues.apache.org/jira/browse/SPARK-40675 Project: Spark Issue Type: Improvem

[jira] [Comment Edited] (SPARK-40651) Drop Hadoop2 binary distribution from release process

2022-10-05 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613306#comment-17613306 ] Yang Jie edited comment on SPARK-40651 at 10/6/22 5:43 AM: --- Th

[jira] [Commented] (SPARK-40651) Drop Hadoop2 binary distribution from release process

2022-10-05 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613306#comment-17613306 ] Yang Jie commented on SPARK-40651: -- Thank you for your answer. I will push my company t

[jira] [Commented] (SPARK-40584) Incorrect Count when reading CSV file

2022-10-05 Thread Ivan Sadikov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613304#comment-17613304 ] Ivan Sadikov commented on SPARK-40584: -- Disabling "multiLine" also fixes the issue.

[jira] [Updated] (SPARK-40660) Switch to XORShiftRandom to distribute elements

2022-10-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40660: Fix Version/s: 3.3.1 3.2.3 > Switch to XORShiftRandom to distribute elements >

[jira] [Commented] (SPARK-40663) Migrate execution errors onto error classes

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613301#comment-17613301 ] Apache Spark commented on SPARK-40663: -- User 'itholic' has created a pull request f

[jira] [Commented] (SPARK-40663) Migrate execution errors onto error classes

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613300#comment-17613300 ] Apache Spark commented on SPARK-40663: -- User 'itholic' has created a pull request f

[jira] [Resolved] (SPARK-40643) Implement `min_count` in `GroupBy.last`

2022-10-05 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-40643. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38087 [https://

[jira] [Assigned] (SPARK-40643) Implement `min_count` in `GroupBy.last`

2022-10-05 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng reassigned SPARK-40643: - Assignee: Ruifeng Zheng > Implement `min_count` in `GroupBy.last` > ---

[jira] [Assigned] (SPARK-40674) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40674: Assignee: Apache Spark > Use unittest's asserts instead of built-in assert >

[jira] [Commented] (SPARK-40674) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613294#comment-17613294 ] Apache Spark commented on SPARK-40674: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-40674) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40674: Assignee: (was: Apache Spark) > Use unittest's asserts instead of built-in assert > -

[jira] [Resolved] (SPARK-40644) Show the versions of dependencies in PySpark REPL

2022-10-05 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-40644. --- Resolution: Not A Problem > Show the versions of dependencies in PySpark REPL >

[jira] [Deleted] (SPARK-40673) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon deleted SPARK-40673: - > Use unittest's asserts instead of built-in assert > -

[jira] [Created] (SPARK-40673) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-40673: Summary: Use unittest's asserts instead of built-in assert Key: SPARK-40673 URL: https://issues.apache.org/jira/browse/SPARK-40673 Project: Spark Issue Type:

[jira] [Created] (SPARK-40674) Use unittest's asserts instead of built-in assert

2022-10-05 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-40674: Summary: Use unittest's asserts instead of built-in assert Key: SPARK-40674 URL: https://issues.apache.org/jira/browse/SPARK-40674 Project: Spark Issue Type:

[jira] [Commented] (SPARK-40672) Run Scala side tests in GitHub Actions

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613293#comment-17613293 ] Apache Spark commented on SPARK-40672: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-40672) Run Scala side tests in GitHub Actions

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40672: Assignee: (was: Apache Spark) > Run Scala side tests in GitHub Actions >

[jira] [Assigned] (SPARK-40672) Run Scala side tests in GitHub Actions

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40672: Assignee: Apache Spark > Run Scala side tests in GitHub Actions > ---

[jira] [Commented] (SPARK-40537) Re-enable mypy support

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613291#comment-17613291 ] Apache Spark commented on SPARK-40537: -- User 'HyukjinKwon' has created a pull reque

[jira] [Created] (SPARK-40672) Run Scala side tests in GitHub Actions

2022-10-05 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-40672: Summary: Run Scala side tests in GitHub Actions Key: SPARK-40672 URL: https://issues.apache.org/jira/browse/SPARK-40672 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-40537) Re-enable mypy support

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613292#comment-17613292 ] Apache Spark commented on SPARK-40537: -- User 'HyukjinKwon' has created a pull reque

[jira] [Resolved] (SPARK-40665) Avoid embedding Spark Connect in the Apache Spark binary release

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40665. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38109 [https://gi

[jira] [Assigned] (SPARK-40665) Avoid embedding Spark Connect in the Apache Spark binary release

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40665: Assignee: Hyukjin Kwon > Avoid embedding Spark Connect in the Apache Spark binary release

[jira] [Created] (SPARK-40671) Configurability on driver service labels

2022-10-05 Thread Shiqi Sun (Jira)
Shiqi Sun created SPARK-40671: - Summary: Configurability on driver service labels Key: SPARK-40671 URL: https://issues.apache.org/jira/browse/SPARK-40671 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-40616) Loss of precision using SparkSQL shell on high-precision DECIMAL types

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40616. -- Resolution: Invalid > Loss of precision using SparkSQL shell on high-precision DECIMAL types >

[jira] [Resolved] (SPARK-40614) Job aborted due to stage failure: Task 165 in stage 292.0 failed 4 times, most recent failure: Lost task 165.3 in stage 292.0 (TID 122333) (x.x.x.x executor 0): java.la

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40614. -- Resolution: Invalid > Job aborted due to stage failure: Task 165 in stage 292.0 failed 4 times

[jira] [Resolved] (SPARK-40629) FLOAT/DOUBLE division by 0 gives Infinity/-Infinity/NaN in DataFrame but NULL in SparkSQL

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40629. -- Resolution: Invalid > FLOAT/DOUBLE division by 0 gives Infinity/-Infinity/NaN in DataFrame but

[jira] [Resolved] (SPARK-40630) Both SparkSQL and DataFrame insert invalid DATE/TIMESTAMP as NULL

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40630. -- Resolution: Invalid > Both SparkSQL and DataFrame insert invalid DATE/TIMESTAMP as NULL >

[jira] [Resolved] (SPARK-40624) A DECIMAL value with division by 0 errors in DataFrame but evaluates to NULL in SparkSQL

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40624. -- Resolution: Invalid > A DECIMAL value with division by 0 errors in DataFrame but evaluates to

[jira] [Resolved] (SPARK-40638) RpcOutboxMessage: Ask terminated before connecting successfully

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40638. -- Resolution: Invalid > RpcOutboxMessage: Ask terminated before connecting successfully > --

[jira] [Commented] (SPARK-40638) RpcOutboxMessage: Ask terminated before connecting successfully

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613282#comment-17613282 ] Hyukjin Kwon commented on SPARK-40638: -- [~test2022123] it sounds more like a questi

[jira] [Commented] (SPARK-40663) Migrate execution errors onto error classes

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613281#comment-17613281 ] Apache Spark commented on SPARK-40663: -- User 'itholic' has created a pull request f

[jira] [Updated] (SPARK-40668) "Cannot use an UnspecifiedFrame" error for User Defined Aggregation Function over Window

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-40668: - Component/s: SQL (was: Spark Core) > "Cannot use an UnspecifiedFrame" error

[jira] [Commented] (SPARK-40663) Migrate execution errors onto error classes

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613280#comment-17613280 ] Apache Spark commented on SPARK-40663: -- User 'itholic' has created a pull request f

[jira] [Commented] (SPARK-40663) Migrate execution errors onto error classes

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613275#comment-17613275 ] Apache Spark commented on SPARK-40663: -- User 'itholic' has created a pull request f

[jira] [Assigned] (SPARK-40669) Parameterize InMemoryColumnarBenchmark

2022-10-05 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-40669: - Assignee: Dongjoon Hyun > Parameterize InMemoryColumnarBenchmark >

[jira] [Resolved] (SPARK-40669) Parameterize InMemoryColumnarBenchmark

2022-10-05 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-40669. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38114 [https://

[jira] [Resolved] (SPARK-40537) Re-enable mypy support

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40537. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38037 [https://gi

[jira] [Assigned] (SPARK-40537) Re-enable mypy support

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40537: Assignee: Martin Grund > Re-enable mypy support > -- > >

[jira] [Commented] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613261#comment-17613261 ] Apache Spark commented on SPARK-40670: -- User 'HeartSaVioR' has created a pull reque

[jira] [Assigned] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40670: Assignee: (was: Apache Spark) > NPE in applyInPandasWithState when the input schema h

[jira] [Assigned] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40670: Assignee: Apache Spark > NPE in applyInPandasWithState when the input schema has "non-nul

[jira] [Commented] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613260#comment-17613260 ] Apache Spark commented on SPARK-40670: -- User 'HeartSaVioR' has created a pull reque

[jira] [Created] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Jungtaek Lim (Jira)
Jungtaek Lim created SPARK-40670: Summary: NPE in applyInPandasWithState when the input schema has "non-nullable" column(s) Key: SPARK-40670 URL: https://issues.apache.org/jira/browse/SPARK-40670 Proj

[jira] [Commented] (SPARK-40670) NPE in applyInPandasWithState when the input schema has "non-nullable" column(s)

2022-10-05 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613257#comment-17613257 ] Jungtaek Lim commented on SPARK-40670: -- Will submit a PR soon. > NPE in applyInPan

[jira] [Commented] (SPARK-40669) Parameterize InMemoryColumnarBenchmark

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613256#comment-17613256 ] Apache Spark commented on SPARK-40669: -- User 'dongjoon-hyun' has created a pull req

[jira] [Assigned] (SPARK-40669) Parameterize InMemoryColumnarBenchmark

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40669: Assignee: (was: Apache Spark) > Parameterize InMemoryColumnarBenchmark >

[jira] [Assigned] (SPARK-40669) Parameterize InMemoryColumnarBenchmark

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40669: Assignee: Apache Spark > Parameterize InMemoryColumnarBenchmark > ---

[jira] [Commented] (SPARK-40669) Parameterize InMemoryColumnarBenchmark

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613255#comment-17613255 ] Apache Spark commented on SPARK-40669: -- User 'dongjoon-hyun' has created a pull req

[jira] [Created] (SPARK-40669) Parameterize InMemoryColumnarBenchmark

2022-10-05 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-40669: - Summary: Parameterize InMemoryColumnarBenchmark Key: SPARK-40669 URL: https://issues.apache.org/jira/browse/SPARK-40669 Project: Spark Issue Type: Test

[jira] [Resolved] (SPARK-40651) Drop Hadoop2 binary distribution from release process

2022-10-05 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-40651. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38099 [https://

[jira] [Assigned] (SPARK-40651) Drop Hadoop2 binary distribution from release process

2022-10-05 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-40651: - Assignee: Dongjoon Hyun > Drop Hadoop2 binary distribution from release process > -

[jira] [Resolved] (SPARK-40607) Remove redundant string interpolator operations

2022-10-05 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-40607. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38043 [https://gi

[jira] [Updated] (SPARK-40607) Remove redundant string interpolator operations

2022-10-05 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen updated SPARK-40607: - Priority: Trivial (was: Minor) > Remove redundant string interpolator operations >

[jira] [Assigned] (SPARK-40607) Remove redundant string interpolator operations

2022-10-05 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-40607: Assignee: Yang Jie > Remove redundant string interpolator operations > --

[jira] [Updated] (SPARK-40667) Refactor File Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaonan Yang updated SPARK-40667: - Description: Currently for each file data source, all options are placed sparsely in the option

[jira] [Updated] (SPARK-40667) Refactor File Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaonan Yang updated SPARK-40667: - Summary: Refactor File Data Source Options (was: Refactor Data Source Options) > Refactor File

[jira] [Assigned] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40667: Assignee: Apache Spark > Refactor Data Source Options > > >

[jira] [Assigned] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40667: Assignee: (was: Apache Spark) > Refactor Data Source Options > --

[jira] [Commented] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613250#comment-17613250 ] Apache Spark commented on SPARK-40667: -- User 'xiaonanyang-db' has created a pull re

[jira] [Created] (SPARK-40668) "Cannot use an UnspecifiedFrame" error for User Defined Aggregation Function over Window

2022-10-05 Thread Harold Hotelling (Jira)
Harold Hotelling created SPARK-40668: Summary: "Cannot use an UnspecifiedFrame" error for User Defined Aggregation Function over Window Key: SPARK-40668 URL: https://issues.apache.org/jira/browse/SPARK-40668

[jira] [Updated] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaonan Yang updated SPARK-40667: - Description: Currently for each data source, all options are placed sparsely in the options cla

[jira] [Updated] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaonan Yang updated SPARK-40667: - Description: Currently for each data source, all options are placed sparsely in the options cla

[jira] [Updated] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaonan Yang updated SPARK-40667: - Description: Currently for each data source, all options are placed sparsely in the options cla

[jira] [Commented] (SPARK-40659) Schema evolution for protobuf (and Avro too?)

2022-10-05 Thread Raghu Angadi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613099#comment-17613099 ] Raghu Angadi commented on SPARK-40659: -- {quote}Regarding application restart, why s

[jira] [Updated] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaonan Yang updated SPARK-40667: - Description: Refactor data source options like `CSVOptions`, `JsonOptions` for better code maint

[jira] [Updated] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaonan Yang updated SPARK-40667: - Description: Refactor data source options like `CSVOptions`, `JsonOptions`. (was: Refactor data

[jira] [Created] (SPARK-40667) Refactor Data Source Options

2022-10-05 Thread Xiaonan Yang (Jira)
Xiaonan Yang created SPARK-40667: Summary: Refactor Data Source Options Key: SPARK-40667 URL: https://issues.apache.org/jira/browse/SPARK-40667 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-40585) Support double-quoted identifiers

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613086#comment-17613086 ] Apache Spark commented on SPARK-40585: -- User 'gengliangwang' has created a pull req

[jira] [Commented] (SPARK-40659) Schema evolution for protobuf (and Avro too?)

2022-10-05 Thread Mohan Parthasarathy (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613073#comment-17613073 ] Mohan Parthasarathy commented on SPARK-40659: - Okay, I see what you are sayi

[jira] [Commented] (SPARK-40651) Drop Hadoop2 binary distribution from release process

2022-10-05 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613064#comment-17613064 ] Dongjoon Hyun commented on SPARK-40651: --- Thank you for asking. We are still in the

[jira] [Commented] (SPARK-40660) Switch to XORShiftRandom to distribute elements

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613007#comment-17613007 ] Apache Spark commented on SPARK-40660: -- User 'wangyum' has created a pull request f

[jira] [Commented] (SPARK-40660) Switch to XORShiftRandom to distribute elements

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613004#comment-17613004 ] Apache Spark commented on SPARK-40660: -- User 'wangyum' has created a pull request f

[jira] [Created] (SPARK-40666) Upgrade FasterXML jackson-databind to 2.14

2022-10-05 Thread Bjørn Jørgensen (Jira)
Bjørn Jørgensen created SPARK-40666: --- Summary: Upgrade FasterXML jackson-databind to 2.14 Key: SPARK-40666 URL: https://issues.apache.org/jira/browse/SPARK-40666 Project: Spark Issue Type:

[jira] [Commented] (SPARK-40664) Union in query can remove cache from the plan

2022-10-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17612970#comment-17612970 ] Yuming Wang commented on SPARK-40664: - This is a known issue, please see comment: ht

[jira] [Commented] (SPARK-40665) Avoid embedding Spark Connect in the Apache Spark binary release

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17612936#comment-17612936 ] Apache Spark commented on SPARK-40665: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-40665) Avoid embedding Spark Connect in the Apache Spark binary release

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40665: Assignee: (was: Apache Spark) > Avoid embedding Spark Connect in the Apache Spark bin

[jira] [Assigned] (SPARK-40665) Avoid embedding Spark Connect in the Apache Spark binary release

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40665: Assignee: Apache Spark > Avoid embedding Spark Connect in the Apache Spark binary release

[jira] [Resolved] (SPARK-40663) Migrate execution errors onto error classes

2022-10-05 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-40663. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38104 [https://github.com

[jira] [Assigned] (SPARK-40663) Migrate execution errors onto error classes

2022-10-05 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-40663: Assignee: Haejoon Lee > Migrate execution errors onto error classes > ---

[jira] [Created] (SPARK-40665) Avoid embedding Spark Connect in the Apache Spark binary release

2022-10-05 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-40665: Summary: Avoid embedding Spark Connect in the Apache Spark binary release Key: SPARK-40665 URL: https://issues.apache.org/jira/browse/SPARK-40665 Project: Spark

[jira] [Resolved] (SPARK-40635) Scala 2.12 + Hadoop 2 + JDK 8 Daily Test failed

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40635. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38079 [https://gi

[jira] [Assigned] (SPARK-40635) Scala 2.12 + Hadoop 2 + JDK 8 Daily Test failed

2022-10-05 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40635: Assignee: Yang Jie > Scala 2.12 + Hadoop 2 + JDK 8 Daily Test failed > -

[jira] [Updated] (SPARK-40664) Union in query can remove cache from the plan

2022-10-05 Thread Tanel Kiis (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tanel Kiis updated SPARK-40664: --- Description: Failing unit test: {code} test("SPARK-40664: Cache with join, union and renames") {

[jira] [Updated] (SPARK-40664) Union in query can remove cache from the plan

2022-10-05 Thread Tanel Kiis (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tanel Kiis updated SPARK-40664: --- Description: Failing unit test: {code} test("SPARK-40664: Cache with join, union and renames") {

[jira] [Commented] (SPARK-40664) Union in query can remove cache from the plan

2022-10-05 Thread Tanel Kiis (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17612881#comment-17612881 ] Tanel Kiis commented on SPARK-40664: I do not think that https://github.com/apache/s

[jira] [Created] (SPARK-40664) Union in query can remove cache from the plan

2022-10-05 Thread Tanel Kiis (Jira)
Tanel Kiis created SPARK-40664: -- Summary: Union in query can remove cache from the plan Key: SPARK-40664 URL: https://issues.apache.org/jira/browse/SPARK-40664 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-40540) Migrate compilation errors onto error classes

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17612879#comment-17612879 ] Apache Spark commented on SPARK-40540: -- User 'itholic' has created a pull request f

[jira] [Commented] (SPARK-40540) Migrate compilation errors onto error classes

2022-10-05 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17612878#comment-17612878 ] Apache Spark commented on SPARK-40540: -- User 'itholic' has created a pull request f
