[jira] [Assigned] (SPARK-40239) Remove duplicated 'fraction' validation in RDD.sample

2022-08-27 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng reassigned SPARK-40239: - Assignee: Ruifeng Zheng > Remove duplicated 'fraction' validation in RDD.sample > -

[jira] [Resolved] (SPARK-40239) Remove duplicated 'fraction' validation in RDD.sample

2022-08-27 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-40239. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 37682 [https://

[jira] [Updated] (SPARK-40149) Star expansion after outer join asymmetrically includes joining key

2022-08-27 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-40149: Priority: Blocker (was: Major) > Star expansion after outer join asymmetrically includes joining key > --

[jira] [Updated] (SPARK-40149) Star expansion after outer join asymmetrically includes joining key

2022-08-27 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-40149: Target Version/s: 3.4.0 > Star expansion after outer join asymmetrically includes joining key > --

[jira] [Commented] (SPARK-40156) url_decode() exposes a Java error

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17586140#comment-17586140 ] Apache Spark commented on SPARK-40156: -- User 'ming95' has created a pull reques

[jira] [Assigned] (SPARK-40240) PySpark rdd.takeSample should validate `num > maxSampleSize` at first

2022-08-27 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng reassigned SPARK-40240: - Assignee: Ruifeng Zheng > PySpark rdd.takeSample should validate `num > maxSampleSize`

[jira] [Resolved] (SPARK-40240) PySpark rdd.takeSample should validate `num > maxSampleSize` at first

2022-08-27 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng resolved SPARK-40240. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 37683 [https://

[jira] [Updated] (SPARK-40124) Update TPCDS v1.4 q32 for Plan Stability tests

2022-08-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-40124: -- Fix Version/s: 3.2.3 > Update TPCDS v1.4 q32 for Plan Stability tests > --

[jira] [Resolved] (SPARK-40234) Clean only MDC items set by Spark

2022-08-27 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-40234. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 37680 [https://gith

[jira] [Assigned] (SPARK-40246) Logging isn't configurable via log4j2 with hadoop-provided profile

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40246: Assignee: (was: Apache Spark) > Logging isn't configurable via log4j2 with hadoop-pro

[jira] [Commented] (SPARK-40246) Logging isn't configurable via log4j2 with hadoop-provided profile

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17586104#comment-17586104 ] Apache Spark commented on SPARK-40246: -- User 'Kimahriman' has created a pull reques

[jira] [Assigned] (SPARK-40246) Logging isn't configurable via log4j2 with hadoop-provided profile

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40246: Assignee: Apache Spark > Logging isn't configurable via log4j2 with hadoop-provided profi

[jira] [Updated] (SPARK-40246) Logging isn't configurable via log4j2 with hadoop-provided profile

2022-08-27 Thread Adam Binford (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Binford updated SPARK-40246: - Component/s: Build (was: Spark Core) > Logging isn't configurable via log4j

[jira] [Created] (SPARK-40246) Logging isn't configurable via log4j2 with hadoop-provided profile

2022-08-27 Thread Adam Binford (Jira)
Adam Binford created SPARK-40246: Summary: Logging isn't configurable via log4j2 with hadoop-provided profile Key: SPARK-40246 URL: https://issues.apache.org/jira/browse/SPARK-40246 Project: Spark

[jira] [Assigned] (SPARK-40245) Fix FileScan equality check when partition or data filter columns are not read

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40245: Assignee: Apache Spark > Fix FileScan equality check when partition or data filter column

[jira] [Assigned] (SPARK-40245) Fix FileScan equality check when partition or data filter columns are not read

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40245: Assignee: (was: Apache Spark) > Fix FileScan equality check when partition or data fi

[jira] [Commented] (SPARK-40245) Fix FileScan equality check when partition or data filter columns are not read

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17586097#comment-17586097 ] Apache Spark commented on SPARK-40245: -- User 'peter-toth' has created a pull reques

[jira] [Updated] (SPARK-40245) Fix FileScan equality check when partition or data filter columns are not read

2022-08-27 Thread Peter Toth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Toth updated SPARK-40245: --- Summary: Fix FileScan equality check when partition or data filter columns are not read (was: Fix F

[jira] [Created] (SPARK-40245) Fix FileScan canonicalization when partition or data filter columns are not read

2022-08-27 Thread Peter Toth (Jira)
Peter Toth created SPARK-40245: -- Summary: Fix FileScan canonicalization when partition or data filter columns are not read Key: SPARK-40245 URL: https://issues.apache.org/jira/browse/SPARK-40245 Project:

[jira] [Resolved] (SPARK-40241) Correct the link of GenericUDTF

2022-08-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40241. - Fix Version/s: 3.3.1 3.1.4 3.2.3 3.4.0

[jira] [Assigned] (SPARK-40241) Correct the link of GenericUDTF

2022-08-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-40241: --- Assignee: Ruifeng Zheng > Correct the link of GenericUDTF > ---

[jira] [Commented] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585717#comment-17585717 ] Apache Spark commented on SPARK-40244: -- User 'mukever' has created a pull request f

[jira] [Commented] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585716#comment-17585716 ] Apache Spark commented on SPARK-40244: -- User 'mukever' has created a pull request f

[jira] [Commented] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585712#comment-17585712 ] Apache Spark commented on SPARK-40244: -- User 'mukever' has created a pull request f

[jira] [Commented] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585711#comment-17585711 ] Apache Spark commented on SPARK-40244: -- User 'mukever' has created a pull request f

[jira] [Commented] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585710#comment-17585710 ] Apache Spark commented on SPARK-40244: -- User 'mukever' has created a pull request f

[jira] [Assigned] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40244: Assignee: (was: Apache Spark) > Correct the property name of data source option for c

[jira] [Assigned] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40244: Assignee: Apache Spark > Correct the property name of data source option for csv > --

[jira] [Commented] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585706#comment-17585706 ] Apache Spark commented on SPARK-40244: -- User 'mukever' has created a pull request f

[jira] [Created] (SPARK-40244) Correct the property name of data source option for csv

2022-08-27 Thread Jira
陈志祥 created SPARK-40244: --- Summary: Correct the property name of data source option for csv Key: SPARK-40244 URL: https://issues.apache.org/jira/browse/SPARK-40244 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-40243) Enhance Hive UDF support documentation

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585704#comment-17585704 ] Apache Spark commented on SPARK-40243: -- User 'wangyum' has created a pull request f

[jira] [Assigned] (SPARK-40243) Enhance Hive UDF support documentation

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40243: Assignee: (was: Apache Spark) > Enhance Hive UDF support documentation >

[jira] [Assigned] (SPARK-40243) Enhance Hive UDF support documentation

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40243: Assignee: Apache Spark > Enhance Hive UDF support documentation > ---

[jira] [Commented] (SPARK-40243) Enhance Hive UDF support documentation

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585703#comment-17585703 ] Apache Spark commented on SPARK-40243: -- User 'wangyum' has created a pull request f

[jira] [Commented] (SPARK-40039) Introducing a streaming checkpoint file manager based on Hadoop's Abortable interface

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585701#comment-17585701 ] Apache Spark commented on SPARK-40039: -- User 'attilapiros' has created a pull reque

[jira] [Created] (SPARK-40243) Enhance Hive UDF support documentation

2022-08-27 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40243: --- Summary: Enhance Hive UDF support documentation Key: SPARK-40243 URL: https://issues.apache.org/jira/browse/SPARK-40243 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-40242) Only return all physical plans after submitting pyspark script with several spark sql blocks inside

2022-08-27 Thread Liang Fenjie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang Fenjie updated SPARK-40242: - Description: Background: In an industry development environment, we are used to writing several s

[jira] [Updated] (SPARK-40242) Only return all physical plans after submitting pyspark script with several spark sql blocks inside

2022-08-27 Thread Liang Fenjie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang Fenjie updated SPARK-40242: - Description: Background: In an industry development environment, we are used to writing several s

[jira] [Commented] (SPARK-40142) Make pyspark.sql.functions examples self-contained

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585680#comment-17585680 ] Apache Spark commented on SPARK-40142: -- User 'khalidmammadov' has created a pull re

[jira] [Commented] (SPARK-40142) Make pyspark.sql.functions examples self-contained

2022-08-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585681#comment-17585681 ] Apache Spark commented on SPARK-40142: -- User 'khalidmammadov' has created a pull re

[jira] [Created] (SPARK-40242) Only return all physical plans after submitting pyspark script with several spark sql blocks inside

2022-08-27 Thread Liang Fenjie (Jira)
Liang Fenjie created SPARK-40242: Summary: Only return all physical plans after submitting pyspark script with several spark sql blocks inside Key: SPARK-40242 URL: https://issues.apache.org/jira/browse/SPARK-4024