[jira] [Updated] (SPARK-34341) ./build/mvn error output on aarch64

2021-02-02 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yikun Jiang updated SPARK-34341: Description: You can see the below error output in every spark arm jenkins job [1]:

[jira] [Created] (SPARK-34341) ./build/mvn error output on aarch64

2021-02-02 Thread Yikun Jiang (Jira)
Yikun Jiang created SPARK-34341: --- Summary: ./build/mvn error output on aarch64 Key: SPARK-34341 URL: https://issues.apache.org/jira/browse/SPARK-34341 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-34340) Support ZSTD JNI BufferPool

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277746#comment-17277746 ] Apache Spark commented on SPARK-34340: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-34340) Support ZSTD JNI BufferPool

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34340: Assignee: (was: Apache Spark) > Support ZSTD JNI BufferPool >

[jira] [Assigned] (SPARK-34340) Support ZSTD JNI BufferPool

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34340: Assignee: Apache Spark > Support ZSTD JNI BufferPool > --- > >

[jira] [Commented] (SPARK-34340) Support ZSTD JNI BufferPool

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277745#comment-17277745 ] Apache Spark commented on SPARK-34340: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Created] (SPARK-34340) Support ZSTD JNI BufferPool

2021-02-02 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-34340: - Summary: Support ZSTD JNI BufferPool Key: SPARK-34340 URL: https://issues.apache.org/jira/browse/SPARK-34340 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-34317) Introduce relationTypeMismatchHint to UnresolvedTable for a better error message

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277741#comment-17277741 ] Apache Spark commented on SPARK-34317: -- User 'imback82' has created a pull request for this issue:

[jira] [Commented] (SPARK-34317) Introduce relationTypeMismatchHint to UnresolvedTable for a better error message

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277739#comment-17277739 ] Apache Spark commented on SPARK-34317: -- User 'imback82' has created a pull request for this issue:

[jira] [Resolved] (SPARK-34327) Omit inlining passwords during build process.

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-34327. -- Fix Version/s: 3.1.2 2.4.8 3.0.2 Resolution:

[jira] [Assigned] (SPARK-34327) Omit inlining passwords during build process.

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-34327: Assignee: Prashant Sharma > Omit inlining passwords during build process. >

[jira] [Commented] (SPARK-34338) Report metrics from Datasource v2 scan

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277701#comment-17277701 ] Apache Spark commented on SPARK-34338: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-34338) Report metrics from Datasource v2 scan

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34338: Assignee: Apache Spark (was: L. C. Hsieh) > Report metrics from Datasource v2 scan >

[jira] [Assigned] (SPARK-34338) Report metrics from Datasource v2 scan

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34338: Assignee: L. C. Hsieh (was: Apache Spark) > Report metrics from Datasource v2 scan >

[jira] [Resolved] (SPARK-34313) Migrate ALTER TABLE SET/UNSET TBLPROPERTIES commands to the new resolution framework

2021-02-02 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-34313. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31422

[jira] [Assigned] (SPARK-34313) Migrate ALTER TABLE SET/UNSET TBLPROPERTIES commands to the new resolution framework

2021-02-02 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-34313: --- Assignee: Terry Kim > Migrate ALTER TABLE SET/UNSET TBLPROPERTIES commands to the new

[jira] [Assigned] (SPARK-33763) Add metrics for better tracking of dynamic allocation

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33763: Assignee: Apache Spark > Add metrics for better tracking of dynamic allocation >

[jira] [Commented] (SPARK-33763) Add metrics for better tracking of dynamic allocation

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277694#comment-17277694 ] Apache Spark commented on SPARK-33763: -- User 'attilapiros' has created a pull request for this

[jira] [Assigned] (SPARK-33763) Add metrics for better tracking of dynamic allocation

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33763: Assignee: (was: Apache Spark) > Add metrics for better tracking of dynamic

[jira] [Commented] (SPARK-34339) Expose the number of truncated paths in Utils.buildLocationMetadata()

2021-02-02 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277684#comment-17277684 ] Jungtaek Lim commented on SPARK-34339: -- I have a patch but the patch depends on SPARK-34326, so

[jira] [Created] (SPARK-34339) Expose the number of truncated paths in Utils.buildLocationMetadata()

2021-02-02 Thread Jungtaek Lim (Jira)
Jungtaek Lim created SPARK-34339: Summary: Expose the number of truncated paths in Utils.buildLocationMetadata() Key: SPARK-34339 URL: https://issues.apache.org/jira/browse/SPARK-34339 Project: Spark

[jira] [Resolved] (SPARK-34307) TakeOrderedAndProjectExec avoid shuffle if input rdd has single partition

2021-02-02 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-34307. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31409

[jira] [Assigned] (SPARK-34307) TakeOrderedAndProjectExec avoid shuffle if input rdd has single partition

2021-02-02 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-34307: --- Assignee: zhengruifeng > TakeOrderedAndProjectExec avoid shuffle if input rdd has single

[jira] [Commented] (SPARK-33763) Add metrics for better tracking of dynamic allocation

2021-02-02 Thread Attila Zsolt Piros (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277675#comment-17277675 ] Attila Zsolt Piros commented on SPARK-33763: I am ready with the executor removals (1 and 4

[jira] [Commented] (SPARK-31793) Reduce the memory usage in file scan location metadata

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277672#comment-17277672 ] Apache Spark commented on SPARK-31793: -- User 'HeartSaVioR' has created a pull request for this

[jira] [Commented] (SPARK-31793) Reduce the memory usage in file scan location metadata

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277673#comment-17277673 ] Apache Spark commented on SPARK-31793: -- User 'HeartSaVioR' has created a pull request for this

[jira] [Commented] (SPARK-34326) "SPARK-31793: FileSourceScanExec metadata should contain limited file paths" fails in some edge-case

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277671#comment-17277671 ] Apache Spark commented on SPARK-34326: -- User 'HeartSaVioR' has created a pull request for this

[jira] [Assigned] (SPARK-28137) Data Type Formatting Functions: `to_number`

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-28137: Assignee: Apache Spark > Data Type Formatting Functions: `to_number` >

[jira] [Assigned] (SPARK-28137) Data Type Formatting Functions: `to_number`

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-28137: Assignee: (was: Apache Spark) > Data Type Formatting Functions: `to_number` >

[jira] [Reopened] (SPARK-28137) Data Type Formatting Functions: `to_number`

2021-02-02 Thread jiaan.geng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiaan.geng reopened SPARK-28137: to_number is very useful for formatted currency to number conversion. > Data Type Formatting

[jira] [Updated] (SPARK-34338) Report metrics from Datasource v2 scan

2021-02-02 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh updated SPARK-34338: Description: This is related to SPARK-34297. In SPARK-34297, we want to add a couple of useful

[jira] [Created] (SPARK-34338) Report metrics from Datasource v2 scan

2021-02-02 Thread L. C. Hsieh (Jira)
L. C. Hsieh created SPARK-34338: --- Summary: Report metrics from Datasource v2 scan Key: SPARK-34338 URL: https://issues.apache.org/jira/browse/SPARK-34338 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-28137) Data Type Formatting Functions: `to_number`

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277662#comment-17277662 ] Apache Spark commented on SPARK-28137: -- User 'beliefer' has created a pull request for this issue:

[jira] [Assigned] (SPARK-34326) "SPARK-31793: FileSourceScanExec metadata should contain limited file paths" fails in some edge-case

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34326: Assignee: Apache Spark (was: Jungtaek Lim) > "SPARK-31793: FileSourceScanExec metadata

[jira] [Assigned] (SPARK-34326) "SPARK-31793: FileSourceScanExec metadata should contain limited file paths" fails in some edge-case

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34326: Assignee: Jungtaek Lim (was: Apache Spark) > "SPARK-31793: FileSourceScanExec metadata

[jira] [Reopened] (SPARK-34326) "SPARK-31793: FileSourceScanExec metadata should contain limited file paths" fails in some edge-case

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reopened SPARK-34326: -- Reverted https://github.com/apache/spark/commit/e927bf90e0e035a5103e029f2524239ee11c2961 >

[jira] [Updated] (SPARK-34326) "SPARK-31793: FileSourceScanExec metadata should contain limited file paths" fails in some edge-case

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-34326: - Fix Version/s: (was: 3.1.1) (was: 3.2.0) > "SPARK-31793:

[jira] [Commented] (SPARK-33726) Duplicate field names causes wrong answers during aggregation

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277644#comment-17277644 ] Apache Spark commented on SPARK-33726: -- User 'yliou' has created a pull request for this issue:

[jira] [Commented] (SPARK-33726) Duplicate field names causes wrong answers during aggregation

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277643#comment-17277643 ] Apache Spark commented on SPARK-33726: -- User 'yliou' has created a pull request for this issue:

[jira] [Resolved] (SPARK-34308) Escape meta-characters in printSchema

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-34308. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31412

[jira] [Commented] (SPARK-27281) Wrong latest offsets returned by DirectKafkaInputDStream#latestOffsets

2021-02-02 Thread SeaAndHill (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277611#comment-17277611 ] SeaAndHill commented on SPARK-27281: [~yuanyuan.xia] [~vkrot] have you fixed it now? I encounter the

[jira] [Resolved] (SPARK-34325) remove_shuffleBlockResolver_in_SortShuffleWriter

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-34325. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31433

[jira] [Resolved] (SPARK-29594) Create a Dataset from a Sequence of Case class

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29594. -- Resolution: Duplicate > Create a Dataset from a Sequence of Case class >

[jira] [Updated] (SPARK-34331) Speed up DS v2 metadata col resolution

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-34331: - Description: There is a performance regression in Spark 3.1.1. Please refer to the PR

[jira] [Updated] (SPARK-34331) Speed up DS v2 metadata col resolution

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-34331: - Summary: Speed up DS v2 metadata col resolution (was: speed up DS v2 metadata col resolution)

[jira] [Updated] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-34331: - Priority: Blocker (was: Major) > speed up DS v2 metadata col resolution >

[jira] [Updated] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-34331: - Issue Type: Bug (was: Improvement) > speed up DS v2 metadata col resolution >

[jira] [Updated] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-34331: - Target Version/s: 3.1.1 > speed up DS v2 metadata col resolution >

[jira] [Commented] (SPARK-34115) Long runtime on many environment variables

2021-02-02 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277534#comment-17277534 ] Hyukjin Kwon commented on SPARK-34115: -- [~jystephan], it will be included in Spark 3.1.1. I will

[jira] [Resolved] (SPARK-34326) "SPARK-31793: FileSourceScanExec metadata should contain limited file paths" fails in some edge-case

2021-02-02 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-34326. -- Fix Version/s: 3.1.1 3.2.0 Assignee: Jungtaek Lim

[jira] [Created] (SPARK-34337) Reject disk blocks when out of disk space

2021-02-02 Thread Holden Karau (Jira)
Holden Karau created SPARK-34337: Summary: Reject disk blocks when out of disk space Key: SPARK-34337 URL: https://issues.apache.org/jira/browse/SPARK-34337 Project: Spark Issue Type:

[jira] [Commented] (SPARK-32119) ExecutorPlugin doesn't work with Standalone Cluster and Kubernetes with --jars

2021-02-02 Thread John Pugliesi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277499#comment-17277499 ] John Pugliesi commented on SPARK-32119: --- To clarify - this issue and solution address `--packages`

[jira] [Assigned] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34336: Assignee: Apache Spark > Use GenericData as Avro serialization data model can improve

[jira] [Assigned] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34336: Assignee: (was: Apache Spark) > Use GenericData as Avro serialization data model can

[jira] [Commented] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277488#comment-17277488 ] Erik Krogen commented on SPARK-34336: - Thanks for bringing this up [~Baohe Zhang], I came across PR

[jira] [Commented] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277485#comment-17277485 ] Baohe Zhang commented on SPARK-34336: - Full benchmark results are added as txt attachments. > Use

[jira] [Updated] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baohe Zhang updated SPARK-34336: Attachment: generic_data_read.txt > Use GenericData as Avro serialization data model can improve

[jira] [Updated] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baohe Zhang updated SPARK-34336: Attachment: base_read.txt > Use GenericData as Avro serialization data model can improve Avro

[jira] [Updated] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baohe Zhang updated SPARK-34336: Attachment: generic_data_write.txt > Use GenericData as Avro serialization data model can improve

[jira] [Updated] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baohe Zhang updated SPARK-34336: Attachment: read_comparison.png > Use GenericData as Avro serialization data model can improve

[jira] [Commented] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277483#comment-17277483 ] Baohe Zhang commented on SPARK-34336: - Column chart comparison on avg time: Avro write:

[jira] [Updated] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baohe Zhang updated SPARK-34336: Attachment: base_write.txt > Use GenericData as Avro serialization data model can improve Avro

[jira] [Updated] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baohe Zhang updated SPARK-34336: Attachment: write_comparison.png > Use GenericData as Avro serialization data model can improve

[jira] [Created] (SPARK-34336) Use GenericData as Avro serialization data model can improve Avro write/read performance

2021-02-02 Thread Baohe Zhang (Jira)
Baohe Zhang created SPARK-34336: --- Summary: Use GenericData as Avro serialization data model can improve Avro write/read performance Key: SPARK-34336 URL: https://issues.apache.org/jira/browse/SPARK-34336

[jira] [Assigned] (SPARK-34334) ExecutorPodsAllocator fails to identify some excess requests during downscaling

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34334: Assignee: Apache Spark > ExecutorPodsAllocator fails to identify some excess requests

[jira] [Assigned] (SPARK-34334) ExecutorPodsAllocator fails to identify some excess requests during downscaling

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34334: Assignee: (was: Apache Spark) > ExecutorPodsAllocator fails to identify some excess

[jira] [Commented] (SPARK-34334) ExecutorPodsAllocator fails to identify some excess requests during downscaling

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277460#comment-17277460 ] Apache Spark commented on SPARK-34334: -- User 'attilapiros' has created a pull request for this

[jira] [Assigned] (SPARK-34335) Support referencing subquery with column aliases by table alias

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34335: Assignee: Apache Spark > Support referencing subquery with column aliases by table alias

[jira] [Assigned] (SPARK-34335) Support referencing subquery with column aliases by table alias

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34335: Assignee: (was: Apache Spark) > Support referencing subquery with column aliases by

[jira] [Commented] (SPARK-34335) Support referencing subquery with column aliases by table alias

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277454#comment-17277454 ] Apache Spark commented on SPARK-34335: -- User 'allisonwang-db' has created a pull request for this

[jira] [Commented] (SPARK-34335) Support referencing subquery with column aliases by table alias

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277453#comment-17277453 ] Apache Spark commented on SPARK-34335: -- User 'allisonwang-db' has created a pull request for this

[jira] [Created] (SPARK-34335) Support referencing subquery with column aliases by table alias

2021-02-02 Thread Allison Wang (Jira)
Allison Wang created SPARK-34335: Summary: Support referencing subquery with column aliases by table alias Key: SPARK-34335 URL: https://issues.apache.org/jira/browse/SPARK-34335 Project: Spark

[jira] [Created] (SPARK-34334) ExecutorPodsAllocator fails to identify some excess requests during downscaling

2021-02-02 Thread Attila Zsolt Piros (Jira)
Attila Zsolt Piros created SPARK-34334: -- Summary: ExecutorPodsAllocator fails to identify some excess requests during downscaling Key: SPARK-34334 URL: https://issues.apache.org/jira/browse/SPARK-34334

[jira] [Commented] (SPARK-34334) ExecutorPodsAllocator fails to identify some excess requests during downscaling

2021-02-02 Thread Attila Zsolt Piros (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277436#comment-17277436 ] Attila Zsolt Piros commented on SPARK-34334: I am working on this. >

[jira] [Resolved] (SPARK-34324) FileTable should not list TRUNCATE in capabilities by default

2021-02-02 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-34324. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31432

[jira] [Commented] (SPARK-26325) Interpret timestamp fields in Spark while reading json (timestampFormat)

2021-02-02 Thread Daniel Himmelstein (Jira)
[ https://issues.apache.org/jira/browse/SPARK-26325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277405#comment-17277405 ] Daniel Himmelstein commented on SPARK-26325: h1. Solution in pyspark 3.0.1 Turns out there

[jira] [Commented] (SPARK-24497) ANSI SQL: Recursive query

2021-02-02 Thread Peter Toth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277367#comment-17277367 ] Peter Toth commented on SPARK-24497: Thanks [~ilaurens] for your comment. Recursive queries are very

[jira] [Commented] (SPARK-34212) For parquet table, after changing the precision and scale of decimal type in hive, spark reads incorrect value

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277349#comment-17277349 ] Apache Spark commented on SPARK-34212: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-34333) Fix PostgresDialect to handle money types properly

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277342#comment-17277342 ] Apache Spark commented on SPARK-34333: -- User 'sarutak' has created a pull request for this issue:

[jira] [Assigned] (SPARK-34333) Fix PostgresDialect to handle money types properly

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34333: Assignee: Apache Spark (was: Kousuke Saruta) > Fix PostgresDialect to handle money

[jira] [Assigned] (SPARK-34333) Fix PostgresDialect to handle money types properly

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34333: Assignee: Kousuke Saruta (was: Apache Spark) > Fix PostgresDialect to handle money

[jira] [Commented] (SPARK-34333) Fix PostgresDialect to handle money types properly

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277340#comment-17277340 ] Apache Spark commented on SPARK-34333: -- User 'sarutak' has created a pull request for this issue:

[jira] [Created] (SPARK-34333) Fix PostgresDialect to handle money types properly

2021-02-02 Thread Kousuke Saruta (Jira)
Kousuke Saruta created SPARK-34333: -- Summary: Fix PostgresDialect to handle money types properly Key: SPARK-34333 URL: https://issues.apache.org/jira/browse/SPARK-34333 Project: Spark Issue

[jira] [Commented] (SPARK-33591) NULL is recognized as the "null" string in partition specs

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277238#comment-17277238 ] Apache Spark commented on SPARK-33591: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-33591) NULL is recognized as the "null" string in partition specs

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277237#comment-17277237 ] Apache Spark commented on SPARK-33591: -- User 'gengliangwang' has created a pull request for this

[jira] [Resolved] (SPARK-34263) Simplify the code for treating unicode/octal/escaped characters in string literals

2021-02-02 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta resolved SPARK-34263. Resolution: Fixed > Simplify the code for treating unicode/octal/escaped characters in

[jira] [Commented] (SPARK-34263) Simplify the code for treating unicode/octal/escaped characters in string literals

2021-02-02 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277233#comment-17277233 ] Kousuke Saruta commented on SPARK-34263: This issue is resolved by

[jira] [Updated] (SPARK-34263) Simplify the code for treating unicode/octal/escaped characters in string literals

2021-02-02 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-34263: --- Fix Version/s: 3.2.0 > Simplify the code for treating unicode/octal/escaped characters in

[jira] [Assigned] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34331: Assignee: Wenchen Fan (was: Apache Spark) > speed up DS v2 metadata col resolution >

[jira] [Commented] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277219#comment-17277219 ] Apache Spark commented on SPARK-34331: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277218#comment-17277218 ] Apache Spark commented on SPARK-34331: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34331: Assignee: Apache Spark (was: Wenchen Fan) > speed up DS v2 metadata col resolution >

[jira] [Commented] (SPARK-33591) NULL is recognized as the "null" string in partition specs

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277203#comment-17277203 ] Apache Spark commented on SPARK-33591: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-33591) NULL is recognized as the "null" string in partition specs

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277202#comment-17277202 ] Apache Spark commented on SPARK-33591: -- User 'gengliangwang' has created a pull request for this

[jira] [Updated] (SPARK-34332) Unify v1 and v2 ALTER TABLE .. SET LOCATION tests

2021-02-02 Thread Maxim Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Gekk updated SPARK-34332: --- Description: Extract ALTER TABLE .. SET LOCATION tests to the common place to run them for V1 and

[jira] [Created] (SPARK-34332) Unify v1 and v2 ALTER TABLE .. SET LOCATION tests

2021-02-02 Thread Maxim Gekk (Jira)
Maxim Gekk created SPARK-34332: -- Summary: Unify v1 and v2 ALTER TABLE .. SET LOCATION tests Key: SPARK-34332 URL: https://issues.apache.org/jira/browse/SPARK-34332 Project: Spark Issue Type:

[jira] [Commented] (SPARK-34115) Long runtime on many environment variables

2021-02-02 Thread Jean-Yves STEPHAN (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277182#comment-17277182 ] Jean-Yves STEPHAN commented on SPARK-34115: --- Hello - thanks for this fix [~nob13] , we

[jira] [Created] (SPARK-34331) speed up DS v2 metadata col resolution

2021-02-02 Thread Wenchen Fan (Jira)
Wenchen Fan created SPARK-34331: --- Summary: speed up DS v2 metadata col resolution Key: SPARK-34331 URL: https://issues.apache.org/jira/browse/SPARK-34331 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-34282) Unify v1 and v2 TRUNCATE TABLE tests

2021-02-02 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-34282. - Resolution: Fixed Issue resolved by pull request 31387

[jira] [Commented] (SPARK-34330) Literal constructor support UTFString

2021-02-02 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277112#comment-17277112 ] Apache Spark commented on SPARK-34330: -- User 'AngersZh' has created a pull request for this
