[jira] [Comment Edited] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599263#comment-17599263 ] Yang Jie edited comment on SPARK-40303 at 9/2/22 5:48 AM: -- If r

[jira] [Comment Edited] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599262#comment-17599262 ] Yang Jie edited comment on SPARK-40303 at 9/2/22 5:15 AM: -- {cod

[jira] [Commented] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599263#comment-17599263 ] Yang Jie commented on SPARK-40303: -- If you run with Java 17, the performance gap will b

[jira] [Commented] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599262#comment-17599262 ] Yang Jie commented on SPARK-40303: -- {code:java}  64265 21820       3       org.apache.

[jira] [Comment Edited] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599254#comment-17599254 ] Yang Jie edited comment on SPARK-40303 at 9/2/22 4:59 AM: -- Run

[jira] [Comment Edited] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599254#comment-17599254 ] Yang Jie edited comment on SPARK-40303 at 9/2/22 4:58 AM: -- Run

[jira] [Comment Edited] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599254#comment-17599254 ] Yang Jie edited comment on SPARK-40303 at 9/2/22 4:56 AM: -- Run

[jira] [Commented] (SPARK-39284) Implement Groupby.mad

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599258#comment-17599258 ] Apache Spark commented on SPARK-39284: -- User 'zhengruifeng' has created a pull requ

[jira] [Comment Edited] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599254#comment-17599254 ] Yang Jie edited comment on SPARK-40303 at 9/2/22 4:49 AM: -- Run

[jira] [Commented] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599254#comment-17599254 ] Yang Jie commented on SPARK-40303: -- Running on Java 8 with `-XX:+PrintCompilation`, I foun

[jira] [Commented] (SPARK-40288) After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression.

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599251#comment-17599251 ] Apache Spark commented on SPARK-40288: -- User 'hgs19921112' has created a pull reque

[jira] [Commented] (SPARK-40288) After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression.

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599249#comment-17599249 ] Apache Spark commented on SPARK-40288: -- User 'hgs19921112' has created a pull reque

[jira] [Commented] (SPARK-40288) After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression.

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599243#comment-17599243 ] Apache Spark commented on SPARK-40288: -- User 'hgs19921112' has created a pull reque

[jira] [Commented] (SPARK-40288) After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression.

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599242#comment-17599242 ] Apache Spark commented on SPARK-40288: -- User 'hgs19921112' has created a pull reque

[jira] [Assigned] (SPARK-40288) After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression.

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40288: Assignee: (was: Apache Spark) > After `RemoveRedundantAggregates`, `PullOutGroupingEx

[jira] [Assigned] (SPARK-40288) After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression.

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40288: Assignee: Apache Spark > After `RemoveRedundantAggregates`, `PullOutGroupingExpressions`

[jira] [Updated] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-01 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-40309: Labels: release-notes (was: ) > Introduce sql_conf context manager for pyspark.sql >

[jira] [Updated] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Santosh Pingale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Santosh Pingale updated SPARK-40311: Description: Add a scala, pyspark, R dataframe API that can rename multiple columns in a

[jira] [Commented] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599194#comment-17599194 ] Apache Spark commented on SPARK-40311: -- User 'santosh-d3vpl3x' has created a pull r

[jira] [Assigned] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40311: Assignee: (was: Apache Spark) > Introduce withColumnsRenamed > --

[jira] [Updated] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Santosh Pingale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Santosh Pingale updated SPARK-40311: Description: Add a scala, pyspark, R dataframe API that can rename multiple columns in a

[jira] [Commented] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599191#comment-17599191 ] Apache Spark commented on SPARK-40311: -- User 'santosh-d3vpl3x' has created a pull r

[jira] [Assigned] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40311: Assignee: Apache Spark > Introduce withColumnsRenamed > > >

[jira] [Updated] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Santosh Pingale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Santosh Pingale updated SPARK-40311: Description: Add a scala, pyspark, R dataframe API that can rename multiple columns in a

[jira] [Created] (SPARK-40311) Introduce withColumnsRenamed

2022-09-01 Thread Santosh Pingale (Jira)
Santosh Pingale created SPARK-40311: --- Summary: Introduce withColumnsRenamed Key: SPARK-40311 URL: https://issues.apache.org/jira/browse/SPARK-40311 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-40310) try_sum() should throw the exceptions from its child

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40310: Assignee: Apache Spark (was: Gengliang Wang) > try_sum() should throw the exceptions fro

[jira] [Assigned] (SPARK-40310) try_sum() should throw the exceptions from its child

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40310: Assignee: Gengliang Wang (was: Apache Spark) > try_sum() should throw the exceptions fro

[jira] [Commented] (SPARK-40310) try_sum() should throw the exceptions from its child

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599185#comment-17599185 ] Apache Spark commented on SPARK-40310: -- User 'gengliangwang' has created a pull req

[jira] [Updated] (SPARK-40310) try_sum() should throw the exceptions from its child

2022-09-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-40310: --- Summary: try_sum() should throw the exceptions from its child (was: try_sum() should throw

[jira] [Created] (SPARK-40310) try_sum() should throw exceptions from its child

2022-09-01 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-40310: -- Summary: try_sum() should throw exceptions from its child Key: SPARK-40310 URL: https://issues.apache.org/jira/browse/SPARK-40310 Project: Spark Issue Ty

[jira] [Updated] (SPARK-35161) Error-handling SQL functions

2022-09-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-35161: --- Epic Link: SPARK-35030 (was: SPARK-38783) > Error-handling SQL functions >

[jira] [Assigned] (SPARK-40308) str_to_map should accept non-foldable delimiter arguments

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40308: Assignee: Apache Spark > str_to_map should accept non-foldable delimiter arguments >

[jira] [Assigned] (SPARK-40308) str_to_map should accept non-foldable delimiter arguments

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40308: Assignee: (was: Apache Spark) > str_to_map should accept non-foldable delimiter argum

[jira] [Commented] (SPARK-40308) str_to_map should accept non-foldable delimiter arguments

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599174#comment-17599174 ] Apache Spark commented on SPARK-40308: -- User 'bersprockets' has created a pull requ

[jira] [Updated] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40309: - Description: [https://github.com/apache/spark/blob/master/python/pyspark/pandas/utils.py#L490]

[jira] [Created] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40309: Summary: Introduce sql_conf context manager for pyspark.sql Key: SPARK-40309 URL: https://issues.apache.org/jira/browse/SPARK-40309 Project: Spark Issue Type

[jira] [Updated] (SPARK-40308) str_to_map should accept non-foldable delimiter arguments

2022-09-01 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-40308: -- Description: Currently, str_to_map requires the delimiter arguments to be foldable expression

[jira] [Updated] (SPARK-40308) str_to_map should accept non-foldable delimiter arguments

2022-09-01 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-40308: -- Summary: str_to_map should accept non-foldable delimiter arguments (was: str_to_map should ac

[jira] [Created] (SPARK-40308) str_to_map should accept non-foldable delimiter parameters

2022-09-01 Thread Bruce Robbins (Jira)
Bruce Robbins created SPARK-40308: - Summary: str_to_map should accept non-foldable delimiter parameters Key: SPARK-40308 URL: https://issues.apache.org/jira/browse/SPARK-40308 Project: Spark

[jira] [Commented] (SPARK-33605) Add GCS FS/connector config (dependencies?) akin to S3

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599162#comment-17599162 ] Dongjoon Hyun commented on SPARK-33605: --- {{My bad. It was Java 8.}} {{- https://

[jira] [Assigned] (SPARK-33605) Add GCS FS/connector config (dependencies?) akin to S3

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33605: Assignee: Apache Spark > Add GCS FS/connector config (dependencies?) akin to S3 > ---

[jira] [Assigned] (SPARK-33605) Add GCS FS/connector config (dependencies?) akin to S3

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33605: Assignee: (was: Apache Spark) > Add GCS FS/connector config (dependencies?) akin to S

[jira] [Reopened] (SPARK-33605) Add GCS FS/connector config (dependencies?) akin to S3

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reopened SPARK-33605: --- > Add GCS FS/connector config (dependencies?) akin to S3 > -

[jira] [Resolved] (SPARK-33605) Add GCS FS/connector config (dependencies?) akin to S3

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-33605. --- Resolution: Won't Do I made a PR and had a discussion, but this issue is closed based on the

[jira] [Comment Edited] (SPARK-33605) Add GCS FS/connector config (dependencies?) akin to S3

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599160#comment-17599160 ] Dongjoon Hyun edited comment on SPARK-33605 at 9/1/22 9:33 PM: ---

[jira] [Created] (SPARK-40307) Optimize (De)Serialization of Python UDF

2022-09-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40307: Summary: Optimize (De)Serialization of Python UDF Key: SPARK-40307 URL: https://issues.apache.org/jira/browse/SPARK-40307 Project: Spark Issue Type: Umbrella

[jira] [Resolved] (SPARK-38310) Support job queue in YuniKorn feature step

2022-09-01 Thread Weiwei Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiwei Yang resolved SPARK-38310. - Resolution: Won't Do YuniKorn will follow the standard Spark API for the integration, there is n

[jira] [Resolved] (SPARK-37809) Add yunikorn feature step

2022-09-01 Thread Weiwei Yang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiwei Yang resolved SPARK-37809. - Resolution: Won't Do Based on the feedback from the community, there is no need to add the extra

[jira] [Assigned] (SPARK-39996) Upgrade postgresql to 42.5.0

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39996: Assignee: (was: Apache Spark) > Upgrade postgresql to 42.5.0 > --

[jira] [Assigned] (SPARK-39996) Upgrade postgresql to 42.5.0

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39996: Assignee: Apache Spark > Upgrade postgresql to 42.5.0 > > >

[jira] [Commented] (SPARK-39996) Upgrade postgresql to 42.5.0

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599103#comment-17599103 ] Apache Spark commented on SPARK-39996: -- User 'bjornjorgensen' has created a pull re

[jira] [Updated] (SPARK-39996) Upgrade postgresql to 42.5.0

2022-09-01 Thread Bjørn Jørgensen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-39996: Summary: Upgrade postgresql to 42.5.0 (was: Upgrade postgresql to 42.4.1) > Upgrade postg

[jira] [Updated] (SPARK-40302) Add YuniKornSuite

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-40302: -- Parent: SPARK-36057 Issue Type: Sub-task (was: Test) > Add YuniKornSuite > --

[jira] [Resolved] (SPARK-40304) Add decomTestTag to K8s Integration Test

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-40304. --- Fix Version/s: 3.4.0 3.3.1 Assignee: Dongjoon Hyun Resolut

[jira] [Resolved] (SPARK-40302) Add YuniKornSuite

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-40302. --- Fix Version/s: 3.3.1 3.4.0 Resolution: Fixed Issue resolved by pul

[jira] [Assigned] (SPARK-40302) Add YuniKornSuite

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-40302: - Assignee: Dongjoon Hyun > Add YuniKornSuite > - > > Key

[jira] [Commented] (SPARK-38404) Spark does not find CTE inside nested CTE

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599027#comment-17599027 ] Apache Spark commented on SPARK-38404: -- User 'peter-toth' has created a pull reques

[jira] [Commented] (SPARK-38404) Spark does not find CTE inside nested CTE

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599028#comment-17599028 ] Apache Spark commented on SPARK-38404: -- User 'peter-toth' has created a pull reques

[jira] [Commented] (SPARK-40306) Support more than Integer.MAX_VALUE of the same join key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599021#comment-17599021 ] Apache Spark commented on SPARK-40306: -- User 'wankunde' has created a pull request

[jira] [Assigned] (SPARK-40306) Support more than Integer.MAX_VALUE of the same join key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40306: Assignee: (was: Apache Spark) > Support more than Integer.MAX_VALUE of the same join

[jira] [Commented] (SPARK-40306) Support more than Integer.MAX_VALUE of the same join key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599023#comment-17599023 ] Apache Spark commented on SPARK-40306: -- User 'wankunde' has created a pull request

[jira] [Assigned] (SPARK-40306) Support more than Integer.MAX_VALUE of the same join key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40306: Assignee: Apache Spark > Support more than Integer.MAX_VALUE of the same join key > -

[jira] [Updated] (SPARK-40306) Support more than Integer.MAX_VALUE of the same join key

2022-09-01 Thread Wan Kun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wan Kun updated SPARK-40306: Description: For SMJ, the number of the same join key records of the right table is greater than Integer.

[jira] [Updated] (SPARK-40306) Support more than Integer.MAX_VALUE of the same join key

2022-09-01 Thread Wan Kun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wan Kun updated SPARK-40306: Attachment: image-2022-09-01-23-02-15-955.png > Support more than Integer.MAX_VALUE of the same join key >

[jira] [Created] (SPARK-40306) Support more than Integer.MAX_VALUE of the same join key

2022-09-01 Thread Wan Kun (Jira)
Wan Kun created SPARK-40306: --- Summary: Support more than Integer.MAX_VALUE of the same join key Key: SPARK-40306 URL: https://issues.apache.org/jira/browse/SPARK-40306 Project: Spark Issue Type: Bu

[jira] [Comment Edited] (SPARK-36862) ERROR CodeGenerator: failed to compile: org.codehaus.commons.compiler.CompileException: File 'generated.java'

2022-09-01 Thread Lukas Waldmann (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599013#comment-17599013 ] Lukas Waldmann edited comment on SPARK-36862 at 9/1/22 2:55 PM: --

[jira] [Commented] (SPARK-36862) ERROR CodeGenerator: failed to compile: org.codehaus.commons.compiler.CompileException: File 'generated.java'

2022-09-01 Thread Lukas Waldmann (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17599013#comment-17599013 ] Lukas Waldmann commented on SPARK-36862: I managed to reproduce the issue in my envi

[jira] [Assigned] (SPARK-40279) Document spark.yarn.report.interval

2022-09-01 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-40279: Assignee: Luca Canali > Document spark.yarn.report.interval > ---

[jira] [Resolved] (SPARK-40279) Document spark.yarn.report.interval

2022-09-01 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-40279. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 37731 [https://gi

[jira] [Assigned] (SPARK-40149) Star expansion after outer join asymmetrically includes joining key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40149: Assignee: (was: Apache Spark) > Star expansion after outer join asymmetrically includ

[jira] [Commented] (SPARK-40149) Star expansion after outer join asymmetrically includes joining key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598939#comment-17598939 ] Apache Spark commented on SPARK-40149: -- User 'cloud-fan' has created a pull request

[jira] [Commented] (SPARK-40149) Star expansion after outer join asymmetrically includes joining key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598938#comment-17598938 ] Apache Spark commented on SPARK-40149: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-40149) Star expansion after outer join asymmetrically includes joining key

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40149: Assignee: Apache Spark > Star expansion after outer join asymmetrically includes joining

[jira] [Commented] (SPARK-40305) Implement Groupby.sem

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598906#comment-17598906 ] Apache Spark commented on SPARK-40305: -- User 'zhengruifeng' has created a pull requ

[jira] [Assigned] (SPARK-40305) Implement Groupby.sem

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40305: Assignee: Apache Spark > Implement Groupby.sem > - > >

[jira] [Commented] (SPARK-40305) Implement Groupby.sem

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598905#comment-17598905 ] Apache Spark commented on SPARK-40305: -- User 'zhengruifeng' has created a pull requ

[jira] [Assigned] (SPARK-40305) Implement Groupby.sem

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40305: Assignee: (was: Apache Spark) > Implement Groupby.sem > - > >

[jira] [Created] (SPARK-40305) Implement Groupby.sem

2022-09-01 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-40305: - Summary: Implement Groupby.sem Key: SPARK-40305 URL: https://issues.apache.org/jira/browse/SPARK-40305 Project: Spark Issue Type: Improvement Com

[jira] [Commented] (SPARK-40304) Add decomTestTag to K8s Integration Test

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598867#comment-17598867 ] Apache Spark commented on SPARK-40304: -- User 'dongjoon-hyun' has created a pull req

[jira] [Assigned] (SPARK-40304) Add decomTestTag to K8s Integration Test

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40304: Assignee: Apache Spark > Add decomTestTag to K8s Integration Test > -

[jira] [Commented] (SPARK-40304) Add decomTestTag to K8s Integration Test

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598866#comment-17598866 ] Apache Spark commented on SPARK-40304: -- User 'dongjoon-hyun' has created a pull req

[jira] [Assigned] (SPARK-40304) Add decomTestTag to K8s Integration Test

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40304: Assignee: (was: Apache Spark) > Add decomTestTag to K8s Integration Test > --

[jira] [Created] (SPARK-40304) Add decomTestTag to K8s Integration Test

2022-09-01 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-40304: - Summary: Add decomTestTag to K8s Integration Test Key: SPARK-40304 URL: https://issues.apache.org/jira/browse/SPARK-40304 Project: Spark Issue Type: Test

[jira] [Updated] (SPARK-40304) Add decomTestTag to K8s Integration Test

2022-09-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-40304: -- Priority: Minor (was: Major) > Add decomTestTag to K8s Integration Test > ---

[jira] [Commented] (SPARK-39906) Eliminate build warnings - 'sbt 0.13 shell syntax is deprecated; use slash syntax instead'

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598847#comment-17598847 ] Apache Spark commented on SPARK-39906: -- User 'panbingkun' has created a pull reques

[jira] [Commented] (SPARK-39906) Eliminate build warnings - 'sbt 0.13 shell syntax is deprecated; use slash syntax instead'

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598845#comment-17598845 ] Apache Spark commented on SPARK-39906: -- User 'panbingkun' has created a pull reques

[jira] [Commented] (SPARK-40286) Load Data from S3 deletes data source file

2022-09-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598837#comment-17598837 ] Steve Loughran commented on SPARK-40286: This is EMR. Can you replicate in an A

[jira] [Commented] (SPARK-40287) Load Data using Spark by a single partition moves entire dataset under same location in S3

2022-09-01 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598835#comment-17598835 ] Steve Loughran commented on SPARK-40287: does this happen when # you switch to a

[jira] [Commented] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598831#comment-17598831 ] Yuming Wang commented on SPARK-40303: - cc [~cloud_fan] [~joshrosen] [~rednaxelafx]

[jira] [Created] (SPARK-40303) The performance will be worse after codegen

2022-09-01 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40303: --- Summary: The performance will be worse after codegen Key: SPARK-40303 URL: https://issues.apache.org/jira/browse/SPARK-40303 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-40302) Add YuniKornSuite

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40302: Assignee: (was: Apache Spark) > Add YuniKornSuite > - > >

[jira] [Assigned] (SPARK-40302) Add YuniKornSuite

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40302: Assignee: Apache Spark > Add YuniKornSuite > - > > Key: S

[jira] [Commented] (SPARK-40302) Add YuniKornSuite

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598826#comment-17598826 ] Apache Spark commented on SPARK-40302: -- User 'dongjoon-hyun' has created a pull req

[jira] [Created] (SPARK-40302) Add YuniKornSuite

2022-09-01 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-40302: - Summary: Add YuniKornSuite Key: SPARK-40302 URL: https://issues.apache.org/jira/browse/SPARK-40302 Project: Spark Issue Type: Test Components: Ku

[jira] [Resolved] (SPARK-39664) RowMatrix(...).computeCovariance() VS Correlation.corr(..., ...)

2022-09-01 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-39664. --- Resolution: Not A Problem > RowMatrix(...).computeCovariance() VS Correlation.corr(..., ...)

[jira] [Commented] (SPARK-39664) RowMatrix(...).computeCovariance() VS Correlation.corr(..., ...)

2022-09-01 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598813#comment-17598813 ] Ruifeng Zheng commented on SPARK-39664: --- [~igaloly] _RowMatrix_ is in .mllib, whil

[jira] [Assigned] (SPARK-40301) Add parameter validation in pyspark.rdd

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40301: Assignee: Apache Spark > Add parameter validation in pyspark.rdd > --

[jira] [Assigned] (SPARK-40301) Add parameter validation in pyspark.rdd

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40301: Assignee: (was: Apache Spark) > Add parameter validation in pyspark.rdd > ---

[jira] [Commented] (SPARK-40301) Add parameter validation in pyspark.rdd

2022-09-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17598784#comment-17598784 ] Apache Spark commented on SPARK-40301: -- User 'zhengruifeng' has created a pull requ

[jira] [Created] (SPARK-40301) Add parameter validation in pyspark.rdd

2022-09-01 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-40301: - Summary: Add parameter validation in pyspark.rdd Key: SPARK-40301 URL: https://issues.apache.org/jira/browse/SPARK-40301 Project: Spark Issue Type: Improve
