[jira] [Created] (SPARK-41200) BytesToBytesMap's longArray size can be up to MAX_CAPACITY

2022-11-18 Thread EdisonWang (Jira)
EdisonWang created SPARK-41200: -- Summary: BytesToBytesMap's longArray size can be up to MAX_CAPACITY Key: SPARK-41200 URL: https://issues.apache.org/jira/browse/SPARK-41200 Project: Spark Issue

[jira] [Created] (SPARK-40035) Avoid apply filter twice when listing files

2022-08-10 Thread EdisonWang (Jira)
EdisonWang created SPARK-40035: -- Summary: Avoid apply filter twice when listing files Key: SPARK-40035 URL: https://issues.apache.org/jira/browse/SPARK-40035 Project: Spark Issue Type:

[jira] [Created] (SPARK-39476) Disable Unwrap cast optimize when casting from Long to Float/ Double or from Integer to Float

2022-06-14 Thread EdisonWang (Jira)
EdisonWang created SPARK-39476: -- Summary: Disable Unwrap cast optimize when casting from Long to Float/ Double or from Integer to Float Key: SPARK-39476 URL: https://issues.apache.org/jira/browse/SPARK-39476

[jira] [Updated] (SPARK-39249) Improve subexpression elimination for conditional expressions

2022-05-21 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-39249: --- Description: Currently we can do subexpression elimination for conditional expressions when the

[jira] [Created] (SPARK-39249) Improve subexpression elimination for conditional expressions

2022-05-21 Thread EdisonWang (Jira)
EdisonWang created SPARK-39249: -- Summary: Improve subexpression elimination for conditional expressions Key: SPARK-39249 URL: https://issues.apache.org/jira/browse/SPARK-39249 Project: Spark

[jira] [Created] (SPARK-39002) StringEndsWith/Contains support push down to Parquet so that we can leverage dictionary filter

2022-04-23 Thread EdisonWang (Jira)
EdisonWang created SPARK-39002: -- Summary: StringEndsWith/Contains support push down to Parquet so that we can leverage dictionary filter Key: SPARK-39002 URL: https://issues.apache.org/jira/browse/SPARK-39002

[jira] [Created] (SPARK-38160) Shuffle by rand could lead to incorrect answers when ShuffleFetchFailed happend

2022-02-09 Thread EdisonWang (Jira)
EdisonWang created SPARK-38160: -- Summary: Shuffle by rand could lead to incorrect answers when ShuffleFetchFailed happend Key: SPARK-38160 URL: https://issues.apache.org/jira/browse/SPARK-38160 Project:

[jira] [Updated] (SPARK-37593) Optimize HeapMemoryAllocator to avoid memory waste when using G1GC

2021-12-15 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-37593: --- Description: Spark's tungsten memory model usually tries to allocate memory by one `page` each

[jira] [Updated] (SPARK-37593) Optimize HeapMemoryAllocator to avoid memory waste when using G1GC

2021-12-15 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-37593: --- Summary: Optimize HeapMemoryAllocator to avoid memory waste when using G1GC (was: Optimize

[jira] [Created] (SPARK-37593) Optimize HeapMmeoryAllocator to avoid memory waste in humongous allocation when using G1GC

2021-12-09 Thread EdisonWang (Jira)
EdisonWang created SPARK-37593: -- Summary: Optimize HeapMmeoryAllocator to avoid memory waste in humongous allocation when using G1GC Key: SPARK-37593 URL: https://issues.apache.org/jira/browse/SPARK-37593

[jira] [Updated] (SPARK-37593) Optimize HeapMemoryAllocator to avoid memory waste in humongous allocation when using G1GC

2021-12-09 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-37593: --- Summary: Optimize HeapMemoryAllocator to avoid memory waste in humongous allocation when using G1GC

[jira] [Updated] (SPARK-34819) MapType supports orderable semantics

2021-03-22 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-34819: --- Issue Type: New Feature (was: Bug) > MapType supports orderable semantics >

[jira] [Created] (SPARK-34819) MapType supports orderable semantics

2021-03-22 Thread EdisonWang (Jira)
EdisonWang created SPARK-34819: -- Summary: MapType supports orderable semantics Key: SPARK-34819 URL: https://issues.apache.org/jira/browse/SPARK-34819 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-34634) Self-join with script transformation failed to resolve attribute correctly

2021-03-05 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-34634: --- Affects Version/s: 2.4.0 2.4.1 2.4.2

[jira] [Updated] (SPARK-34634) Self-join with script transformation failed to resolve attribute correctly

2021-03-04 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-34634: --- Description: To reproduce, ``` create temporary view t as select * from values 0, 1, 2 as t(a);

[jira] [Updated] (SPARK-34634) Self-join with script transformation failed to resolve attribute correctly

2021-03-04 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-34634: --- Description: To reproduce, {code:java} // code placeholder create temporary view t as select * from

[jira] [Updated] (SPARK-34634) Self-join with script transformation failed to resolve attribute correctly

2021-03-04 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-34634: --- Description: To reproduce, {code:java} // code placeholder create temporary view t as select * from

[jira] [Created] (SPARK-34634) Self-join with script transformation failed to resolve attribute correctly

2021-03-04 Thread EdisonWang (Jira)
EdisonWang created SPARK-34634: -- Summary: Self-join with script transformation failed to resolve attribute correctly Key: SPARK-34634 URL: https://issues.apache.org/jira/browse/SPARK-34634 Project:

[jira] [Created] (SPARK-34633) Self-join with script transformation failed to resolve attribute correctly

2021-03-04 Thread EdisonWang (Jira)
EdisonWang created SPARK-34633: -- Summary: Self-join with script transformation failed to resolve attribute correctly Key: SPARK-34633 URL: https://issues.apache.org/jira/browse/SPARK-34633 Project:

[jira] [Updated] (SPARK-33306) TimezoneID is needed when there cast from Date to String

2020-10-31 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-33306: --- Description: A simple way to reproduce this is  ``` spark-shell --conf 

[jira] [Updated] (SPARK-33306) TimezoneID is needed when there cast from Date to String

2020-10-31 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-33306: --- Description: A simple way to reproduce this is  ``` spark-shell --conf 

[jira] [Created] (SPARK-33306) TimezoneID is needed when there cast from Date to String

2020-10-30 Thread EdisonWang (Jira)
EdisonWang created SPARK-33306: -- Summary: TimezoneID is needed when there cast from Date to String Key: SPARK-33306 URL: https://issues.apache.org/jira/browse/SPARK-33306 Project: Spark Issue

[jira] [Updated] (SPARK-32559) Fix the trim logic in UTF8String.toInt/toLong did't handle Chinese characters correctly

2020-08-06 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-32559: --- Description: The trim logic in Cast expression introduced in 

[jira] [Created] (SPARK-32559) Fix the trim logic in UTF8String.toInt/toLong did't handle Chinese characters correctly

2020-08-06 Thread EdisonWang (Jira)
EdisonWang created SPARK-32559: -- Summary: Fix the trim logic in UTF8String.toInt/toLong did't handle Chinese characters correctly Key: SPARK-32559 URL: https://issues.apache.org/jira/browse/SPARK-32559

[jira] [Updated] (SPARK-31952) The metric of MemoryBytesSpill is incorrect when doing Aggregate

2020-06-10 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-31952: --- Description: When doing Aggregate and spill occurs, the Spill(memory) metric is zero while

[jira] [Updated] (SPARK-31952) The metric of MemoryBytesSpill is incorrect when doing Aggregate

2020-06-10 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-31952: --- Attachment: image-2020-06-10-16-35-58-002.png > The metric of MemoryBytesSpill is incorrect when

[jira] [Created] (SPARK-31952) Fix incorrect MemoryBytesSpill metric when doing Aggregate

2020-06-10 Thread EdisonWang (Jira)
EdisonWang created SPARK-31952: -- Summary: Fix incorrect MemoryBytesSpill metric when doing Aggregate Key: SPARK-31952 URL: https://issues.apache.org/jira/browse/SPARK-31952 Project: Spark Issue

[jira] [Updated] (SPARK-31952) The metric of MemoryBytesSpill is incorrect when doing Aggregate

2020-06-10 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-31952: --- Summary: The metric of MemoryBytesSpill is incorrect when doing Aggregate (was: Fix incorrect

[jira] [Created] (SPARK-30806) Evaluate once per group in UnboundedWindowFunctionFrame

2020-02-12 Thread EdisonWang (Jira)
EdisonWang created SPARK-30806: -- Summary: Evaluate once per group in UnboundedWindowFunctionFrame Key: SPARK-30806 URL: https://issues.apache.org/jira/browse/SPARK-30806 Project: Spark Issue

[jira] [Commented] (SPARK-28332) SQLMetric wrong initValue

2019-12-23 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17002148#comment-17002148 ] EdisonWang commented on SPARK-28332: I've taken it [~cloud_fan] > SQLMetric wrong initValue >

[jira] [Created] (SPARK-30088) Adaptive execution should convert SortMergeJoin to BroadcastJoin when plan generates empty result

2019-12-01 Thread EdisonWang (Jira)
EdisonWang created SPARK-30088: -- Summary: Adaptive execution should convert SortMergeJoin to BroadcastJoin when plan generates empty result Key: SPARK-30088 URL: https://issues.apache.org/jira/browse/SPARK-30088

[jira] [Updated] (SPARK-29918) RecordBinaryComparator should check endianness when compared by long

2019-11-15 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-29918: --- Labels: correctness (was: ) > RecordBinaryComparator should check endianness when compared by long

[jira] [Created] (SPARK-29918) RecordBinaryComparator should check endianness when compared by long

2019-11-15 Thread EdisonWang (Jira)
EdisonWang created SPARK-29918: -- Summary: RecordBinaryComparator should check endianness when compared by long Key: SPARK-29918 URL: https://issues.apache.org/jira/browse/SPARK-29918 Project: Spark

[jira] [Resolved] (SPARK-27789) Use stopEarly in codegen of ColumnarBatchScan

2019-10-09 Thread EdisonWang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang resolved SPARK-27789. Resolution: Not A Problem > Use stopEarly in codegen of ColumnarBatchScan >

[jira] [Created] (SPARK-29343) Eliminate sorts without limit in the subquery of Join/Aggregation

2019-10-03 Thread EdisonWang (Jira)
EdisonWang created SPARK-29343: -- Summary: Eliminate sorts without limit in the subquery of Join/Aggregation Key: SPARK-29343 URL: https://issues.apache.org/jira/browse/SPARK-29343 Project: Spark

[jira] [Commented] (SPARK-28123) String Functions: Add support btrim

2019-08-02 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16899349#comment-16899349 ] EdisonWang commented on SPARK-28123: seems it is the same with trim() on both sides? > String

[jira] [Created] (SPARK-28257) Use ConfigEntry for hardcoded configs in SQL module

2019-07-05 Thread EdisonWang (JIRA)
EdisonWang created SPARK-28257: -- Summary: Use ConfigEntry for hardcoded configs in SQL module Key: SPARK-28257 URL: https://issues.apache.org/jira/browse/SPARK-28257 Project: Spark Issue Type:

[jira] [Created] (SPARK-27789) Use stopEarly in codegen of ColumnarBatchScan

2019-05-21 Thread EdisonWang (JIRA)
EdisonWang created SPARK-27789: -- Summary: Use stopEarly in codegen of ColumnarBatchScan Key: SPARK-27789 URL: https://issues.apache.org/jira/browse/SPARK-27789 Project: Spark Issue Type:

[jira] [Closed] (SPARK-26500) Add conf to support ignore hdfs data locality

2019-03-22 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang closed SPARK-26500. -- > Add conf to support ignore hdfs data locality > - > >

[jira] [Updated] (SPARK-27232) Ignore file locality in InMemoryFileIndex if spark.locality.wait is set to

2019-03-21 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-27232: --- Summary: Ignore file locality in InMemoryFileIndex if spark.locality.wait is set to (was: Skip to

[jira] [Updated] (SPARK-27232) Ignore file locality in InMemoryFileIndex if spark.locality.wait is set to

2019-03-21 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-27232: --- Description: `InMemoryFileIndex` needs to request file block location information in order to do

[jira] [Created] (SPARK-27232) Skip to get file block location if locality is ignored

2019-03-21 Thread EdisonWang (JIRA)
EdisonWang created SPARK-27232: -- Summary: Skip to get file block location if locality is ignored Key: SPARK-27232 URL: https://issues.apache.org/jira/browse/SPARK-27232 Project: Spark Issue

[jira] [Created] (SPARK-27202) update comments to keep according with code

2019-03-19 Thread EdisonWang (JIRA)
EdisonWang created SPARK-27202: -- Summary: update comments to keep according with code Key: SPARK-27202 URL: https://issues.apache.org/jira/browse/SPARK-27202 Project: Spark Issue Type:

[jira] [Created] (SPARK-27079) Fix typo & Remove useless imports

2019-03-06 Thread EdisonWang (JIRA)
EdisonWang created SPARK-27079: -- Summary: Fix typo & Remove useless imports Key: SPARK-27079 URL: https://issues.apache.org/jira/browse/SPARK-27079 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-27033) Add rule to optimize binary comparisons to its push down format

2019-03-02 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-27033: --- Affects Version/s: (was: 3.0.0) 2.4.0 > Add rule to optimize binary

[jira] [Updated] (SPARK-27033) Add rule to optimize binary comparisons to its push down format

2019-03-02 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-27033: --- Description: Currently, filters like this "select * from table where a + 1 >= 3" cannot be pushed

[jira] [Updated] (SPARK-27033) Add rule to optimize binary comparisons to its push down format

2019-03-02 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-27033: --- Description: _emphasized text_ > Add rule to optimize binary comparisons to its push down format >

[jira] [Created] (SPARK-27033) Add rule to optimize binary comparisons to its push down format

2019-03-02 Thread EdisonWang (JIRA)
EdisonWang created SPARK-27033: -- Summary: Add rule to optimize binary comparisons to its push down format Key: SPARK-27033 URL: https://issues.apache.org/jira/browse/SPARK-27033 Project: Spark

[jira] [Updated] (SPARK-26544) escape string when serialize map/array make it a valid json (keep alignment with hive)

2019-01-04 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-26544: --- Summary: escape string when serialize map/array make it a valid json (keep alignment with hive)

[jira] [Updated] (SPARK-26544) escape string when serialize map/array to make it a valid json (keep alignment with hive)

2019-01-04 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-26544: --- Summary: escape string when serialize map/array to make it a valid json (keep alignment with hive)

[jira] [Created] (SPARK-26544) the string serialized from map/array type is not a valid json (while hive is)

2019-01-04 Thread EdisonWang (JIRA)
EdisonWang created SPARK-26544: -- Summary: the string serialized from map/array type is not a valid json (while hive is) Key: SPARK-26544 URL: https://issues.apache.org/jira/browse/SPARK-26544 Project:

[jira] [Resolved] (SPARK-26500) Add conf to support ignore hdfs data locality

2018-12-31 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang resolved SPARK-26500. Resolution: Not A Problem > Add conf to support ignore hdfs data locality >

[jira] [Created] (SPARK-26500) Add conf to support ignore hdfs data locality

2018-12-28 Thread EdisonWang (JIRA)
EdisonWang created SPARK-26500: -- Summary: Add conf to support ignore hdfs data locality Key: SPARK-26500 URL: https://issues.apache.org/jira/browse/SPARK-26500 Project: Spark Issue Type: