[jira] [Assigned] (SPARK-42063) Register `byte[][]` to KyroSerializer

2023-01-14 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-42063: --- Assignee: Dongjoon Hyun > Register `byte[][]` to KyroSerializer >

[jira] [Created] (SPARK-42064) Implement bloom filter join hint

2023-01-14 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-42064: --- Summary: Implement bloom filter join hint Key: SPARK-42064 URL: https://issues.apache.org/jira/browse/SPARK-42064 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-39217) Makes DPP support the pruning side has Union

2023-01-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-39217: --- Assignee: Wan Kun > Makes DPP support the pruning side has Union >

[jira] [Resolved] (SPARK-39217) Makes DPP support the pruning side has Union

2023-01-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-39217. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39460

[jira] [Created] (SPARK-41986) Introduce shuffle on SinglePartition

2023-01-11 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-41986: --- Summary: Introduce shuffle on SinglePartition Key: SPARK-41986 URL: https://issues.apache.org/jira/browse/SPARK-41986 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-41741) [SQL] ParquetFilters StringStartsWith push down matching string do not use UTF-8

2023-01-08 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655893#comment-17655893 ] Yuming Wang commented on SPARK-41741: - What is your env? you can put the env in your terminal. >

[jira] [Commented] (SPARK-41741) [SQL] ParquetFilters StringStartsWith push down matching string do not use UTF-8

2023-01-08 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655882#comment-17655882 ] Yuming Wang commented on SPARK-41741: - What is your {{file.encoding}}? > [SQL] ParquetFilters

[jira] [Updated] (SPARK-41498) Union does not propagate Metadata output

2022-12-12 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-41498: Fix Version/s: (was: 3.4.0) > Union does not propagate Metadata output >

[jira] [Commented] (SPARK-41459) Spark Thrift Server operation log output is empty

2022-12-08 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17645111#comment-17645111 ] Yuming Wang commented on SPARK-41459: - cc [~LuciferYang] > Spark Thrift Server operation log output

[jira] [Resolved] (SPARK-34987) AQE improve: change shuffle hash join to sort merge join when skewed shuffle hash join exists

2022-12-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-34987. - Resolution: Not A Problem > AQE improve: change shuffle hash join to sort merge join when

[jira] [Resolved] (SPARK-41167) Optimize LikeSimplification rule to improve multi like performance

2022-12-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-41167. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38682

[jira] [Assigned] (SPARK-41167) Optimize LikeSimplification rule to improve multi like performance

2022-12-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-41167: --- Assignee: Wan Kun > Optimize LikeSimplification rule to improve multi like performance >

[jira] [Commented] (SPARK-41336) BroadcastExchange does not support the execute() code path. when AQE enabled

2022-11-30 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641191#comment-17641191 ] Yuming Wang commented on SPARK-41336: - Could you use Spark 3.3.1? > BroadcastExchange does not

[jira] [Commented] (SPARK-41324) Follow-up on JDK-8180450

2022-11-30 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641155#comment-17641155 ] Yuming Wang commented on SPARK-41324: - cc [~LuciferYang] > Follow-up on JDK-8180450 >

[jira] [Commented] (SPARK-41299) OOM when filter pushdown `last_day` function

2022-11-28 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17640347#comment-17640347 ] Yuming Wang commented on SPARK-41299: - Do you have the query plan? > OOM when filter pushdown

[jira] [Commented] (SPARK-41219) Regression in IntegralDivide returning null instead of 0

2022-11-22 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637102#comment-17637102 ] Yuming Wang commented on SPARK-41219: - cc [~ulysses] > Regression in IntegralDivide returning null

[jira] [Commented] (SPARK-41207) Regression in IntegralDivide

2022-11-20 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17636383#comment-17636383 ] Yuming Wang commented on SPARK-41207: - cc [~ulysses] > Regression in IntegralDivide >

[jira] [Updated] (SPARK-41207) Regression in IntegralDivide

2022-11-20 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-41207: Target Version/s: (was: 3.4.0) > Regression in IntegralDivide > >

[jira] [Updated] (SPARK-41207) Regression in IntegralDivide

2022-11-20 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-41207: Fix Version/s: (was: 3.4.0) > Regression in IntegralDivide > > >

[jira] [Resolved] (SPARK-41017) Support column pruning with multiple nondeterministic Filters

2022-11-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-41017. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38511

[jira] [Updated] (SPARK-41141) avoid introducing a new aggregate expression in the analysis phase when subquery is referencing it

2022-11-14 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-41141: Target Version/s: (was: 3.3.1) > avoid introducing a new aggregate expression in the analysis

[jira] [Created] (SPARK-41088) Add PartialAggregate and FinalAggregate logic operators

2022-11-09 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-41088: --- Summary: Add PartialAggregate and FinalAggregate logic operators Key: SPARK-41088 URL: https://issues.apache.org/jira/browse/SPARK-41088 Project: Spark Issue

[jira] [Resolved] (SPARK-41071) Metaspace OOM when Local run dev/make-distribution.sh

2022-11-09 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-41071. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38577

[jira] [Assigned] (SPARK-41071) Metaspace OOM when Local run dev/make-distribution.sh

2022-11-09 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-41071: --- Assignee: Yang Jie > Metaspace OOM when Local run dev/make-distribution.sh >

[jira] [Commented] (SPARK-41013) spark-3.1.2以cluster模式提交作业报 Could not initialize class com.github.luben.zstd.ZstdOutputStream

2022-11-04 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629240#comment-17629240 ] Yuming Wang commented on SPARK-41013: - Could you test the Spark 3.3.1? > spark-3.1.2以cluster模式提交作业报

[jira] [Updated] (SPARK-40999) Hints on subqueries are not properly propagated

2022-11-03 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40999: Fix Version/s: (was: 3.4.0) > Hints on subqueries are not properly propagated >

[jira] [Resolved] (SPARK-40248) Use larger number of bits to build bloom filter

2022-11-02 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40248. - Fix Version/s: 3.4.0 Assignee: Yuming Wang Resolution: Fixed This is resolved

[jira] [Resolved] (SPARK-40983) Remove Hadoop requirements for zstd mention in Parquet compression codec

2022-11-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40983. - Fix Version/s: 3.3.2 3.2.3 3.4.0 Resolution: Fixed

[jira] [Assigned] (SPARK-40983) Remove Hadoop requirements for zstd mention in Parquet compression codec

2022-11-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-40983: --- Assignee: Cheng Pan > Remove Hadoop requirements for zstd mention in Parquet compression

[jira] [Commented] (SPARK-40972) OptimizeLocalShuffleReader causing data skew

2022-10-31 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17626490#comment-17626490 ] Yuming Wang commented on SPARK-40972: - cc [~michaelzhang-db] > OptimizeLocalShuffleReader causing

[jira] [Resolved] (SPARK-35904) Collapse above RebalancePartitions

2022-10-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-35904. - Resolution: Not A Problem > Collapse above RebalancePartitions >

[jira] [Resolved] (PARQUET-1355) Improvement Binary write performance

2022-10-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved PARQUET-1355. -- Resolution: Won't Fix > Improvement Binary write performance >

[jira] [Resolved] (HIVE-14112) Join a HBase mapped big table shouldn't convert to MapJoin

2022-10-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/HIVE-14112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved HIVE-14112. Resolution: Won't Fix > Join a HBase mapped big table shouldn't convert to MapJoin >

[jira] [Resolved] (SPARK-40929) Add official image dockerfile for Spark v3.3.1

2022-10-26 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40929. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 20

[jira] [Assigned] (SPARK-40929) Add official image dockerfile for Spark v3.3.1

2022-10-26 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-40929: --- Assignee: Yikun Jiang > Add official image dockerfile for Spark v3.3.1 >

[jira] [Assigned] (SPARK-36057) SPIP: Support Customized Kubernetes Schedulers

2022-10-25 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-36057: --- Assignee: Yikun Jiang > SPIP: Support Customized Kubernetes Schedulers >

[jira] [Updated] (SPARK-40874) Fix broadcasts in Python UDFs when encryption is enabled

2022-10-25 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40874: Fix Version/s: 3.3.2 (was: 3.3.1) > Fix broadcasts in Python UDFs when

[jira] [Commented] (SPARK-36057) SPIP: Support Customized Kubernetes Schedulers

2022-10-24 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623505#comment-17623505 ] Yuming Wang commented on SPARK-36057: - Thank you [~dongjoon]. > SPIP: Support Customized Kubernetes

[jira] [Comment Edited] (SPARK-34966) Avoid shuffle if join type do not match

2022-10-24 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17315591#comment-17315591 ] Yuming Wang edited comment on SPARK-34966 at 10/24/22 11:37 AM:

[jira] [Updated] (SPARK-40885) Spark will filter out data field sorting when dynamic partitions and data fields are sorted at the same time

2022-10-22 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40885: Fix Version/s: (was: 3.4.0) > Spark will filter out data field sorting when dynamic

[jira] [Commented] (SPARK-40303) The performance will be worse after codegen

2022-10-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17619978#comment-17619978 ] Yuming Wang commented on SPARK-40303: - How to run benchmark code: # Download latest spark:

[jira] [Resolved] (SPARK-40736) Spark 3.3.0 doesn't works with Hive 3.1.2

2022-10-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40736. - Resolution: Invalid > Spark 3.3.0 doesn't works with Hive 3.1.2 >

[jira] [Commented] (SPARK-40736) Spark 3.3.0 doesn't works with Hive 3.1.2

2022-10-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17619501#comment-17619501 ] Yuming Wang commented on SPARK-40736: - Please do not copy Hive related jars to ${SPARK_HOME}/jars.

[jira] [Commented] (SPARK-40736) Spark 3.3.0 doesn't works with Hive 3.1.2

2022-10-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17619433#comment-17619433 ] Yuming Wang commented on SPARK-40736: - Do you copy hive related jars to ${SPARK_HOME}/jars and then

[jira] [Comment Edited] (SPARK-34966) Avoid shuffle if join type do not match

2022-10-18 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17315591#comment-17315591 ] Yuming Wang edited comment on SPARK-34966 at 10/18/22 7:37 AM: ---

[jira] [Commented] (SPARK-40563) Error at where clause, when sql case executes by else branch

2022-10-17 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17619189#comment-17619189 ] Yuming Wang commented on SPARK-40563: - Thank you [~Zing] > Error at where clause, when sql case

[jira] [Assigned] (SPARK-39951) Support columnar batches with nested fields in Parquet V2

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-39951: --- Assignee: Adam Binford (was: Apache Spark) > Support columnar batches with nested fields

[jira] [Commented] (SPARK-40563) Error at where clause, when sql case executes by else branch

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17618234#comment-17618234 ] Yuming Wang commented on SPARK-40563: - [~Zing] Does branch-3.3 also fixed this issue? > Error at

[jira] [Updated] (SPARK-39200) Stream is corrupted Exception while fetching the blocks from fallback storage system

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-39200: Fix Version/s: 3.3.1 (was: 3.3.2) > Stream is corrupted Exception while

[jira] [Updated] (SPARK-40535) NPE from observe of collect_list

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40535: Fix Version/s: 3.3.1 (was: 3.3.2) > NPE from observe of collect_list >

[jira] [Updated] (SPARK-40547) Fix dead links in sparkr-vignettes.Rmd

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40547: Fix Version/s: 3.3.1 (was: 3.3.2) > Fix dead links in sparkr-vignettes.Rmd

[jira] [Updated] (SPARK-40322) Fix all dead links

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40322: Fix Version/s: 3.4.0 3.3.1 (was: 3.3.2) > Fix all dead

[jira] [Updated] (SPARK-40562) Add spark.sql.legacy.groupingIdWithAppendedUserGroupBy

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40562: Fix Version/s: 3.3.1 (was: 3.3.2) > Add

[jira] [Updated] (SPARK-38717) Handle Hive's bucket spec case preserving behaviour

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-38717: Fix Version/s: 3.3.1 (was: 3.3.2) > Handle Hive's bucket spec case

[jira] [Updated] (SPARK-40583) Documentation error in "Integration with Cloud Infrastructures"

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40583: Fix Version/s: 3.3.1 (was: 3.3.2) > Documentation error in "Integration

[jira] [Updated] (SPARK-40636) Fix wrong remained shuffles log in BlockManagerDecommissioner

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40636: Fix Version/s: 3.3.1 (was: 3.3.2) > Fix wrong remained shuffles log in

[jira] [Updated] (SPARK-40648) Add `@ExtendedLevelDBTest` to the leveldb relevant tests in the yarn module

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40648: Fix Version/s: 3.3.1 (was: 3.3.2) > Add `@ExtendedLevelDBTest` to the

[jira] [Updated] (SPARK-40574) Add PURGE to DROP TABLE doc

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40574: Fix Version/s: 3.3.1 (was: 3.3.2) > Add PURGE to DROP TABLE doc >

[jira] [Updated] (SPARK-40612) On Kubernetes for long running app Spark using an invalid principal to renew the delegation token

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40612: Fix Version/s: 3.3.1 (was: 3.3.2) > On Kubernetes for long running app

[jira] [Updated] (SPARK-39725) Upgrade jetty-http from 9.4.46.v20220331 to 9.4.48.v20220622

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-39725: Fix Version/s: 3.3.1 (was: 3.3.2) > Upgrade jetty-http from

[jira] [Updated] (SPARK-40682) Set spark.driver.maxResultSize to 3g in SqlBasedBenchmark

2022-10-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40682: Fix Version/s: 3.3.1 (was: 3.3.2) > Set spark.driver.maxResultSize to 3g

[jira] [Updated] (SPARK-40801) Upgrade Apache Commons Text to 1.10

2022-10-15 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40801: Fix Version/s: 3.3.2 (was: 3.3.1) > Upgrade Apache Commons Text to 1.10 >

[jira] [Commented] (SPARK-40736) Spark 3.3.0 doesn't works with Hive 3.1.2

2022-10-15 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17618047#comment-17618047 ] Yuming Wang commented on SPARK-40736: - Could you just upgrade Hive metastore to 3.1.2? > Spark

[jira] [Commented] (SPARK-40741) spark项目bin/beeline对于distribute by sort by语句支持不好,输出结果错误

2022-10-15 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17618044#comment-17618044 ] Yuming Wang commented on SPARK-40741: - [~lkqqingcao] How to reproduce this issue? >

[jira] [Resolved] (SPARK-40801) Upgrade Apache Commons Text to 1.10

2022-10-15 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40801. - Fix Version/s: 3.3.1 3.4.0 Resolution: Fixed Issue resolved by pull

[jira] [Assigned] (SPARK-40801) Upgrade Apache Commons Text to 1.10

2022-10-15 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-40801: --- Assignee: Bjørn Jørgensen > Upgrade Apache Commons Text to 1.10 >

[jira] [Updated] (SPARK-8731) Beeline doesn't work with -e option when started in background

2022-10-14 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-8731: --- Fix Version/s: 3.3.1 (was: 3.3.2) > Beeline doesn't work with -e option when

[jira] [Assigned] (SPARK-40703) Performance regression for joins in Spark 3.3 vs Spark 3.2

2022-10-14 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-40703: --- Assignee: Chao Sun > Performance regression for joins in Spark 3.3 vs Spark 3.2 >

[jira] [Resolved] (SPARK-40703) Performance regression for joins in Spark 3.3 vs Spark 3.2

2022-10-14 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40703. - Fix Version/s: 3.3.1 3.4.0 Resolution: Fixed Issue resolved by pull

[jira] [Commented] (SPARK-40764) Extract partitioning through all children output expressions

2022-10-12 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17616303#comment-17616303 ] Yuming Wang commented on SPARK-40764: - Another case if pull out join condition: {code:scala}

[jira] [Commented] (SPARK-40764) Extract partitioning through all children output expressions

2022-10-12 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17616301#comment-17616301 ] Yuming Wang commented on SPARK-40764: - Another case: {code:scala}

[jira] [Updated] (SPARK-40767) Compile Spark example module will hang

2022-10-12 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40767: Description: !screenshot-1.png! {noformat} yumwang@LM-SHC-16508156 spark-release % jcmd | grep

[jira] [Updated] (SPARK-40767) Compile Spark example module will hang

2022-10-12 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40767: Summary: Compile Spark example module will hang (was: Spark example module will hang if built on

[jira] [Updated] (SPARK-40767) Compile Spark example module will hang

2022-10-12 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40767: Attachment: screenshot-1.png > Compile Spark example module will hang >

[jira] [Created] (SPARK-40767) Spark example module will hang if built on JDK 8

2022-10-12 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40767: --- Summary: Spark example module will hang if built on JDK 8 Key: SPARK-40767 URL: https://issues.apache.org/jira/browse/SPARK-40767 Project: Spark Issue Type:

[jira] [Created] (SPARK-40764) Extract partitioning through all children output expressions

2022-10-11 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40764: --- Summary: Extract partitioning through all children output expressions Key: SPARK-40764 URL: https://issues.apache.org/jira/browse/SPARK-40764 Project: Spark

[jira] [Updated] (SPARK-40703) Performance regression for joins in Spark 3.3 vs Spark 3.2

2022-10-07 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40703: Target Version/s: 3.3.1 > Performance regression for joins in Spark 3.3 vs Spark 3.2 >

[jira] [Created] (SPARK-40708) Auto update table statistics based on write metrics

2022-10-07 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40708: --- Summary: Auto update table statistics based on write metrics Key: SPARK-40708 URL: https://issues.apache.org/jira/browse/SPARK-40708 Project: Spark Issue

[jira] [Updated] (SPARK-40660) Switch to XORShiftRandom to distribute elements

2022-10-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40660: Issue Type: Bug (was: Improvement) > Switch to XORShiftRandom to distribute elements >

[jira] [Updated] (SPARK-40660) Switch to XORShiftRandom to distribute elements

2022-10-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40660: Fix Version/s: 3.3.1 3.2.3 > Switch to XORShiftRandom to distribute elements >

[jira] [Commented] (SPARK-40664) Union in query can remove cache from the plan

2022-10-05 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17612970#comment-17612970 ] Yuming Wang commented on SPARK-40664: - This is a know issue, please see comment:

[jira] [Updated] (SPARK-40660) Switch to XORShiftRandom to distribute elements

2022-10-04 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40660: Summary: Switch to XORShiftRandom to distribute elements (was: Switch XORShiftRandom to

[jira] [Created] (SPARK-40660) Switch XORShiftRandom to distribute elements

2022-10-04 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40660: --- Summary: Switch XORShiftRandom to distribute elements Key: SPARK-40660 URL: https://issues.apache.org/jira/browse/SPARK-40660 Project: Spark Issue Type:

[jira] [Created] (SPARK-40632) Do not inject runtime filter if join condition reference non simple expression

2022-10-02 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40632: --- Summary: Do not inject runtime filter if join condition reference non simple expression Key: SPARK-40632 URL: https://issues.apache.org/jira/browse/SPARK-40632

[jira] [Updated] (SPARK-40626) Reorder join keys impact performance

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Attachment: Pull out complex join condition and infer more filters.png > Reorder join keys impact

[jira] [Created] (SPARK-40628) Do not push complex left semi/anti join condition through project

2022-10-01 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40628: --- Summary: Do not push complex left semi/anti join condition through project Key: SPARK-40628 URL: https://issues.apache.org/jira/browse/SPARK-40628 Project: Spark

[jira] [Updated] (SPARK-40626) Reorder join keys impact performance

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Description: {code:scala} sql("CREATE TABLE t1 (itemid BIGINT, eventType STRING, dt STRING) USING

[jira] [Updated] (SPARK-40626) Reorder join keys impact performance

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Summary: Reorder join keys impact performance (was: Do not reorder join keys in

[jira] [Updated] (SPARK-40626) Do not reorder join keys in EnsureRequirements if they are not simple expressions

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Attachment: Pull out complex join condition.png > Do not reorder join keys in EnsureRequirements

[jira] [Updated] (SPARK-40626) Do not reorder join keys in EnsureRequirements if they are not simple expressions

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Attachment: (was: Pull out complex join condition.png) > Do not reorder join keys in

[jira] [Updated] (SPARK-40626) Do not reorder join keys in EnsureRequirements if they are not simple expressions

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Attachment: Pull out complex join condition.png > Do not reorder join keys in EnsureRequirements

[jira] [Updated] (SPARK-40626) Do not reorder join keys in EnsureRequirements if they are not simple expressions

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Attachment: change the aggregate key order.png > Do not reorder join keys in EnsureRequirements

[jira] [Updated] (SPARK-40626) Do not reorder join keys in EnsureRequirements if they are not simple expressions

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Attachment: default(join order will change).png > Do not reorder join keys in EnsureRequirements

[jira] [Updated] (SPARK-40626) Do not reorder join keys in EnsureRequirements if they are not simple expressions

2022-10-01 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40626: Description: {code:scala} sql("CREATE TABLE t1 (itemid BIGINT, eventType STRING, dt STRING)

[jira] [Created] (SPARK-40626) Do not reorder join keys in EnsureRequirements if they are not simple expressions

2022-09-30 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-40626: --- Summary: Do not reorder join keys in EnsureRequirements if they are not simple expressions Key: SPARK-40626 URL: https://issues.apache.org/jira/browse/SPARK-40626

[jira] [Commented] (SPARK-40616) Loss of precision using SparkSQL shell on high-precision DECIMAL types

2022-09-29 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17611328#comment-17611328 ] Yuming Wang commented on SPARK-40616: - Please try to set

[jira] [Updated] (SPARK-40613) Update sbt-protoc to 1.0.6

2022-09-29 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40613: Parent: SPARK-39375 Issue Type: Sub-task (was: Improvement) > Update sbt-protoc to 1.0.6

[jira] [Assigned] (SPARK-40605) Connect module should use log4j2.properties to configure test log output as other modules

2022-09-29 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-40605: --- Assignee: Yang Jie > Connect module should use log4j2.properties to configure test log

[jira] [Resolved] (SPARK-40605) Connect module should use log4j2.properties to configure test log output as other modules

2022-09-29 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40605. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38041

[jira] [Updated] (SPARK-40613) Update sbt-protoc to 1.0.6

2022-09-29 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-40613: Attachment: screenshot-1.png > Update sbt-protoc to 1.0.6 > -- > >

<    1   2   3   4   5   6   7   8   9   10   >