[jira] [Commented] (SPARK-32810) CSV/JSON data sources should avoid globbing paths when inferring schema

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192628#comment-17192628 ] Hyukjin Kwon commented on SPARK-32810: -- Thanks [~dongjoon] for fixing it here and in other JIRAs.

[jira] [Updated] (SPARK-32810) CSV/JSON data sources should avoid globbing paths when inferring schema

2020-09-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32810: -- Affects Version/s: 2.4.6 3.0.0 3.0.1 > CSV/JSON

[jira] [Commented] (SPARK-32827) Add spark.sql.maxMetadataStringLength config

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192621#comment-17192621 ] Apache Spark commented on SPARK-32827: -- User 'ulysses-you' has created a pull request for this

[jira] [Commented] (SPARK-32827) Add spark.sql.maxMetadataStringLength config

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192620#comment-17192620 ] Apache Spark commented on SPARK-32827: -- User 'ulysses-you' has created a pull request for this

[jira] [Assigned] (SPARK-32827) Add spark.sql.maxMetadataStringLength config

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32827: Assignee: Apache Spark > Add spark.sql.maxMetadataStringLength config >

[jira] [Assigned] (SPARK-32827) Add spark.sql.maxMetadataStringLength config

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32827: Assignee: (was: Apache Spark) > Add spark.sql.maxMetadataStringLength config >

[jira] [Created] (SPARK-32827) Add spark.sql.maxMetadataStringLength config

2020-09-08 Thread ulysses you (Jira)
ulysses you created SPARK-32827: --- Summary: Add spark.sql.maxMetadataStringLength config Key: SPARK-32827 URL: https://issues.apache.org/jira/browse/SPARK-32827 Project: Spark Issue Type:

[jira] [Commented] (SPARK-32187) User Guide - Shipping Python Package

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192607#comment-17192607 ] Hyukjin Kwon commented on SPARK-32187: -- Hey [~fhoering] are you back now :-)? > User Guide -

[jira] [Commented] (SPARK-32826) Add test case for get null columns using SparkGetColumnsOperation

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192603#comment-17192603 ] Apache Spark commented on SPARK-32826: -- User 'yaooqinn' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32826) Add test case for get null columns using SparkGetColumnsOperation

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32826: Assignee: (was: Apache Spark) > Add test case for get null columns using

[jira] [Commented] (SPARK-32826) Add test case for get null columns using SparkGetColumnsOperation

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192602#comment-17192602 ] Apache Spark commented on SPARK-32826: -- User 'yaooqinn' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32826) Add test case for get null columns using SparkGetColumnsOperation

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32826: Assignee: Apache Spark > Add test case for get null columns using

[jira] [Created] (SPARK-32826) Add test case for get null columns using SparkGetColumnsOperation

2020-09-08 Thread Kent Yao (Jira)
Kent Yao created SPARK-32826: Summary: Add test case for get null columns using SparkGetColumnsOperation Key: SPARK-32826 URL: https://issues.apache.org/jira/browse/SPARK-32826 Project: Spark

[jira] [Updated] (SPARK-32187) User Guide - Shipping Python Package

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32187: - Description: - Zipped file - Python files - Virtualenv with Yarn - PEX \(?\) (see also

[jira] [Resolved] (SPARK-32813) Reading parquet rdd in non columnar mode fails in multithreaded environment

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32813. -- Fix Version/s: 3.0.2 3.1.0 Resolution: Fixed Issue resolved by pull

[jira] [Assigned] (SPARK-32813) Reading parquet rdd in non columnar mode fails in multithreaded environment

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-32813: Assignee: L. C. Hsieh > Reading parquet rdd in non columnar mode fails in multithreaded

[jira] [Commented] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2020-09-08 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192574#comment-17192574 ] Jungtaek Lim commented on SPARK-24295: -- [~sta...@gmail.com] Thanks for sharing the workaround.

[jira] [Commented] (SPARK-32821) cannot group by with window in sql statement for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192573#comment-17192573 ] Johnny Bai commented on SPARK-32821: Let's take a discussion about watermark with window grammar in

[jira] [Updated] (SPARK-32788) non-partitioned table scan should not have partition filter

2020-09-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32788: -- Affects Version/s: (was: 3.0.0) 3.0.1 > non-partitioned table scan

[jira] [Updated] (SPARK-32788) non-partitioned table scan should not have partition filter

2020-09-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32788: -- Affects Version/s: 3.0.0 > non-partitioned table scan should not have partition filter >

[jira] [Commented] (SPARK-27089) Loss of precision during decimal division

2020-09-08 Thread Daeho Ro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192566#comment-17192566 ] Daeho Ro commented on SPARK-27089: -- It seems that the bug persists on the spark version 3.0.0 > Loss

[jira] [Updated] (SPARK-32821) cannot group by with window in sql statement for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Summary: cannot group by with window in sql statement for structured streaming with watermark

[jira] [Comment Edited] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192563#comment-17192563 ] Johnny Bai edited comment on SPARK-32821 at 9/9/20, 1:45 AM: - [~kabhwan] As

[jira] [Commented] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192563#comment-17192563 ] Johnny Bai commented on SPARK-32821: [~kabhwan] as structured streaming going, I think it is

[jira] [Resolved] (SPARK-32823) Standalone Master UI resources in use wrong

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32823. -- Fix Version/s: 3.0.2 3.1.0 Resolution: Fixed Issue resolved by pull

[jira] [Assigned] (SPARK-32823) Standalone Master UI resources in use wrong

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-32823: Assignee: Thomas Graves > Standalone Master UI resources in use wrong >

[jira] [Resolved] (SPARK-32824) The error is confusing when resource .amount not provided

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32824. -- Fix Version/s: 3.0.2 3.1.0 Resolution: Fixed Issue resolved by pull

[jira] [Assigned] (SPARK-32824) The error is confusing when resource .amount not provided

2020-09-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-32824: Assignee: Thomas Graves > The error is confusing when resource .amount not provided >

[jira] [Updated] (SPARK-32638) WidenSetOperationTypes in subquery attribute missing

2020-09-08 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-32638: - Fix Version/s: 3.0.2 > WidenSetOperationTypes in subquery attribute missing >

[jira] [Updated] (SPARK-32812) Run tests script for Python fails in certain environments

2020-09-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32812: -- Fix Version/s: (was: 2.4.8) 2.4.7 > Run tests script for Python fails

[jira] [Updated] (SPARK-32810) CSV/JSON data sources should avoid globbing paths when inferring schema

2020-09-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32810: -- Fix Version/s: (was: 2.4.8) 2.4.7 > CSV/JSON data sources should avoid

[jira] [Assigned] (SPARK-32810) CSV/JSON data sources should avoid globbing paths when inferring schema

2020-09-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32810: - Assignee: Maxim Gekk (was: Apache Spark) > CSV/JSON data sources should avoid

[jira] [Assigned] (SPARK-32312) Upgrade Apache Arrow to 1.0.0

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32312: Assignee: (was: Apache Spark) > Upgrade Apache Arrow to 1.0.0 >

[jira] [Assigned] (SPARK-32312) Upgrade Apache Arrow to 1.0.0

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32312: Assignee: Apache Spark > Upgrade Apache Arrow to 1.0.0 > - >

[jira] [Commented] (SPARK-32312) Upgrade Apache Arrow to 1.0.0

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192422#comment-17192422 ] Apache Spark commented on SPARK-32312: -- User 'BryanCutler' has created a pull request for this

[jira] [Commented] (SPARK-32824) The error is confusing when resource .amount not provided

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192394#comment-17192394 ] Apache Spark commented on SPARK-32824: -- User 'tgravescs' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32824) The error is confusing when resource .amount not provided

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32824: Assignee: Apache Spark > The error is confusing when resource .amount not provided >

[jira] [Commented] (SPARK-32824) The error is confusing when resource .amount not provided

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192393#comment-17192393 ] Apache Spark commented on SPARK-32824: -- User 'tgravescs' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32824) The error is confusing when resource .amount not provided

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32824: Assignee: (was: Apache Spark) > The error is confusing when resource .amount not

[jira] [Created] (SPARK-32825) CTE support on MSSQL

2020-09-08 Thread Ankit Sinha (Jira)
Ankit Sinha created SPARK-32825: --- Summary: CTE support on MSSQL Key: SPARK-32825 URL: https://issues.apache.org/jira/browse/SPARK-32825 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-32810) CSV/JSON data sources should avoid globbing paths when inferring schema

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192343#comment-17192343 ] Apache Spark commented on SPARK-32810: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Updated] (SPARK-32135) Show Spark Driver name on Spark history web page

2020-09-08 Thread Gaurangi Saxena (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gaurangi Saxena updated SPARK-32135: Description: Our service dynamically creates short-lived YARN clusters in cloud. Spark

[jira] [Updated] (SPARK-32097) Allow reading history log files from multiple directories

2020-09-08 Thread Gaurangi Saxena (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gaurangi Saxena updated SPARK-32097: Description: Our service dynamically creates short-lived YARN clusters in cloud. Spark

[jira] [Assigned] (SPARK-32823) Standalone Master UI resources in use wrong

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32823: Assignee: Apache Spark > Standalone Master UI resources in use wrong >

[jira] [Commented] (SPARK-32823) Standalone Master UI resources in use wrong

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192295#comment-17192295 ] Apache Spark commented on SPARK-32823: -- User 'tgravescs' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32823) Standalone Master UI resources in use wrong

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32823: Assignee: (was: Apache Spark) > Standalone Master UI resources in use wrong >

[jira] [Comment Edited] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2020-09-08 Thread Avner Livne (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192280#comment-17192280 ] Avner Livne edited comment on SPARK-24295 at 9/8/20, 3:45 PM: -- for those

[jira] [Comment Edited] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2020-09-08 Thread Avner Livne (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192280#comment-17192280 ] Avner Livne edited comment on SPARK-24295 at 9/8/20, 3:45 PM: -- for those

[jira] [Comment Edited] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2020-09-08 Thread Avner Livne (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192280#comment-17192280 ] Avner Livne edited comment on SPARK-24295 at 9/8/20, 3:44 PM: -- for those

[jira] [Comment Edited] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2020-09-08 Thread Avner Livne (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192280#comment-17192280 ] Avner Livne edited comment on SPARK-24295 at 9/8/20, 3:42 PM: -- for those

[jira] [Comment Edited] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2020-09-08 Thread Avner Livne (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192280#comment-17192280 ] Avner Livne edited comment on SPARK-24295 at 9/8/20, 3:42 PM: -- for those

[jira] [Commented] (SPARK-24295) Purge Structured streaming FileStreamSinkLog metadata compact file data.

2020-09-08 Thread Avner Livne (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192280#comment-17192280 ] Avner Livne commented on SPARK-24295: - for those looking for a temporary workaround: run this code

[jira] [Created] (SPARK-32824) The error is confusing when resource .amount not provided

2020-09-08 Thread Thomas Graves (Jira)
Thomas Graves created SPARK-32824: - Summary: The error is confusing when resource .amount not provided Key: SPARK-32824 URL: https://issues.apache.org/jira/browse/SPARK-32824 Project: Spark

[jira] [Created] (SPARK-32823) Standalone Master UI resources in use wrong

2020-09-08 Thread Thomas Graves (Jira)
Thomas Graves created SPARK-32823: - Summary: Standalone Master UI resources in use wrong Key: SPARK-32823 URL: https://issues.apache.org/jira/browse/SPARK-32823 Project: Spark Issue Type:

[jira] [Commented] (SPARK-32823) Standalone Master UI resources in use wrong

2020-09-08 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192256#comment-17192256 ] Thomas Graves commented on SPARK-32823: --- I'm looking into this. > Standalone Master UI resources

[jira] [Commented] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192252#comment-17192252 ] Apache Spark commented on SPARK-32753: -- User 'manuzhang' has created a pull request for this issue:

[jira] [Commented] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192251#comment-17192251 ] Apache Spark commented on SPARK-32753: -- User 'manuzhang' has created a pull request for this issue:

[jira] [Updated] (SPARK-32815) Fix LibSVM data source loading error on file paths with glob metacharacters

2020-09-08 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-32815: Fix Version/s: 3.0.2 2.4.8 > Fix LibSVM data source loading error on file

[jira] [Resolved] (SPARK-32815) Fix LibSVM data source loading error on file paths with glob metacharacters

2020-09-08 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-32815. - Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29670

[jira] [Assigned] (SPARK-32815) Fix LibSVM data source loading error on file paths with glob metacharacters

2020-09-08 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-32815: --- Assignee: Maxim Gekk > Fix LibSVM data source loading error on file paths with glob

[jira] [Updated] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-09-08 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-32753: Fix Version/s: 3.0.2 > Deduplicating and repartitioning the same column create duplicate rows

[jira] [Updated] (SPARK-32817) DPP throws error when broadcast side is empty

2020-09-08 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-32817: - Affects Version/s: (was: 3.0.0) 3.1.0 > DPP throws error

[jira] [Resolved] (SPARK-32817) DPP throws error when broadcast side is empty

2020-09-08 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-32817. -- Fix Version/s: 3.1.0 Assignee: Zhenhua Wang Resolution: Fixed

[jira] [Commented] (SPARK-32822) Change the number of partitions to zero when a range is empty with WholeStageCodegen disabled or falled back

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192150#comment-17192150 ] Apache Spark commented on SPARK-32822: -- User 'sarutak' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32822) Change the number of partitions to zero when a range is empty with WholeStageCodegen disabled or falled back

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32822: Assignee: Apache Spark (was: Kousuke Saruta) > Change the number of partitions to zero

[jira] [Assigned] (SPARK-32822) Change the number of partitions to zero when a range is empty with WholeStageCodegen disabled or falled back

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32822: Assignee: Kousuke Saruta (was: Apache Spark) > Change the number of partitions to zero

[jira] [Resolved] (SPARK-32748) Support local property propagation in SubqueryBroadcastExec

2020-09-08 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-32748. -- Resolution: Won't Fix > Support local property propagation in SubqueryBroadcastExec >

[jira] [Reopened] (SPARK-32748) Support local property propagation in SubqueryBroadcastExec

2020-09-08 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro reopened SPARK-32748: -- > Support local property propagation in SubqueryBroadcastExec >

[jira] [Updated] (SPARK-32748) Support local property propagation in SubqueryBroadcastExec

2020-09-08 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-32748: - Fix Version/s: (was: 3.1.0) > Support local property propagation in

[jira] [Updated] (SPARK-32822) Change the number of partitions to zero when a range is empty with WholeStageCodegen disabled or falled back

2020-09-08 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-32822: --- Summary: Change the number of partitions to zero when a range is empty with

[jira] [Updated] (SPARK-32822) Change the number of partitions to zero when a range is empty with WholeStageCodegen disabled or falled back

2020-09-08 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-32822: --- Description: If WholeStageCodegen effects, the number of partitions of an empty range will

[jira] [Created] (SPARK-32822) Change the number of partition to zero when a range is empty with WholeStageCodegen disabled or falled back

2020-09-08 Thread Kousuke Saruta (Jira)
Kousuke Saruta created SPARK-32822: -- Summary: Change the number of partition to zero when a range is empty with WholeStageCodegen disabled or falled back Key: SPARK-32822 URL:

[jira] [Commented] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192138#comment-17192138 ] Jungtaek Lim commented on SPARK-32821: -- Let's leave the fix version field be empty - the field will

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-32821: - Fix Version/s: (was: 3.0.1) > cannot group by with window in sql sentence for structured

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-32821: - Labels: (was: 2.1.0) > cannot group by with window in sql sentence for structured streaming

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Description: current only support dsl style as below:  import spark.implicits._ val words = ...

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Affects Version/s: 2.2.0 2.3.0 2.4.0 > cannot group

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Labels: 2.1.0 (was: ) > cannot group by with window in sql sentence for structured streaming with

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Affects Version/s: (was: 3.0.0) 2.1.0 > cannot group by with window in

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Target Version/s: (was: 2.4.3) > cannot group by with window in sql sentence for structured

[jira] [Updated] (SPARK-32811) Replace IN predicate of continuous range with boundary checks

2020-09-08 Thread Vu Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vu Ho updated SPARK-32811: -- Description: This expression  {code:java} select a from t where a in (1, 2, 3, 3, 4){code} can be translated

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Description:   import spark.implicits._ val words = ... // streaming DataFrame of schema {

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Description:   import spark.implicits._ val words = ... // streaming DataFrame of schema {

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Description:   import spark.implicits._ val words = ... // streaming DataFrame of schema {

[jira] [Updated] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Bai updated SPARK-32821: --- Description:     import spark.implicits._ val words = ... // streaming DataFrame of

[jira] [Commented] (SPARK-32638) WidenSetOperationTypes in subquery attribute missing

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192111#comment-17192111 ] Apache Spark commented on SPARK-32638: -- User 'maropu' has created a pull request for this issue:

[jira] [Commented] (SPARK-32638) WidenSetOperationTypes in subquery attribute missing

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192110#comment-17192110 ] Apache Spark commented on SPARK-32638: -- User 'maropu' has created a pull request for this issue:

[jira] [Commented] (SPARK-32182) Getting Started - Quickstart

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192103#comment-17192103 ] Apache Spark commented on SPARK-32182: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-32182) Getting Started - Quickstart

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192102#comment-17192102 ] Apache Spark commented on SPARK-32182: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-32821) cannot group by with window in sql sentence for structured streaming with watermark

2020-09-08 Thread Johnny Bai (Jira)
Johnny Bai created SPARK-32821: -- Summary: cannot group by with window in sql sentence for structured streaming with watermark Key: SPARK-32821 URL: https://issues.apache.org/jira/browse/SPARK-32821

[jira] [Commented] (SPARK-32182) Getting Started - Quickstart

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192101#comment-17192101 ] Apache Spark commented on SPARK-32182: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-32204) Binder Integration

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192100#comment-17192100 ] Apache Spark commented on SPARK-32204: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-32204) Binder Integration

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192097#comment-17192097 ] Apache Spark commented on SPARK-32204: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-32815) Fix LibSVM data source loading error on file paths with glob metacharacters

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192093#comment-17192093 ] Apache Spark commented on SPARK-32815: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Updated] (SPARK-32811) Replace IN predicate of continuous range with boundary checks

2020-09-08 Thread Vu Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vu Ho updated SPARK-32811: -- Priority: Minor (was: Major) > Replace IN predicate of continuous range with boundary checks >

[jira] [Assigned] (SPARK-32820) Remove redundant shuffle exchanges inserted by EnsureRequirements

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32820: Assignee: Kousuke Saruta (was: Apache Spark) > Remove redundant shuffle exchanges

[jira] [Assigned] (SPARK-32820) Remove redundant shuffle exchanges inserted by EnsureRequirements

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32820: Assignee: Apache Spark (was: Kousuke Saruta) > Remove redundant shuffle exchanges

[jira] [Commented] (SPARK-32820) Remove redundant shuffle exchanges inserted by EnsureRequirements

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192081#comment-17192081 ] Apache Spark commented on SPARK-32820: -- User 'sarutak' has created a pull request for this issue:

[jira] [Commented] (SPARK-32815) Fix LibSVM data source loading error on file paths with glob metacharacters

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192073#comment-17192073 ] Apache Spark commented on SPARK-32815: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Commented] (SPARK-32815) Fix LibSVM data source loading error on file paths with glob metacharacters

2020-09-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17192071#comment-17192071 ] Apache Spark commented on SPARK-32815: -- User 'MaxGekk' has created a pull request for this issue:

  1   2   >