[jira] [Created] (SPARK-3186) Enable parallelism for Reduce Side Join [Spark Branch]

2014-08-22 Thread Szehon Ho (JIRA)
Szehon Ho created SPARK-3186: Summary: Enable parallelism for Reduce Side Join [Spark Branch] Key: SPARK-3186 URL: https://issues.apache.org/jira/browse/SPARK-3186 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3186) Enable parallelism for Reduce Side Join [Spark Branch]

2014-08-22 Thread Szehon Ho (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-3186: - Description: (was: Blocked by SPARK-2978. See parent JIRA for design details.) Enable parallelism

[jira] [Resolved] (SPARK-3186) Enable parallelism for Reduce Side Join [Spark Branch]

2014-08-22 Thread Szehon Ho (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho resolved SPARK-3186. -- Resolution: Invalid Sorry please ignore, meant to file this in Hive project. Enable parallelism for

[jira] [Closed] (SPARK-3186) Enable parallelism for Reduce Side Join [Spark Branch]

2014-08-22 Thread Szehon Ho (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho closed SPARK-3186. Enable parallelism for Reduce Side Join [Spark Branch]

[jira] [Created] (SPARK-44060) Code-gen for build side outer shuffled hash join

2023-06-14 Thread Szehon Ho (Jira)
Szehon Ho created SPARK-44060: - Summary: Code-gen for build side outer shuffled hash join Key: SPARK-44060 URL: https://issues.apache.org/jira/browse/SPARK-44060 Project: Spark Issue Type:

[jira] [Updated] (SPARK-44060) Code-gen for build side outer shuffled hash join

2023-06-14 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-44060: -- Description: Here, build side outer join means LEFT OUTER join with build left, or RIGHT OUTER join

[jira] [Updated] (SPARK-44060) Code-gen for build side outer shuffled hash join

2023-06-14 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-44060: -- Description: Here, build side outer join means LEFT OUTER join with build left, or RIGHT OUTER join

[jira] [Created] (SPARK-44641) Results duplicated when SPJ partial-cluster and pushdown enabled but conditions unmet

2023-08-02 Thread Szehon Ho (Jira)
Szehon Ho created SPARK-44641: - Summary: Results duplicated when SPJ partial-cluster and pushdown enabled but conditions unmet Key: SPARK-44641 URL: https://issues.apache.org/jira/browse/SPARK-44641

[jira] [Updated] (SPARK-44641) Results duplicated when SPJ partial-cluster and pushdown enabled but conditions unmet

2023-08-02 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-44641: -- Description: Adding the following test case in KeyGroupedPartitionSuite demonstrates the problem.  

[jira] [Updated] (SPARK-44641) Results duplicated when SPJ partial-cluster and pushdown enabled but conditions unmet

2023-08-02 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-44641: -- Description: Adding the following test case in KeyGroupedPartitionSuite demonstrates the problem.  

[jira] [Created] (SPARK-44647) Support SPJ when join key is subset of partition keys

2023-08-02 Thread Szehon Ho (Jira)
Szehon Ho created SPARK-44647: - Summary: Support SPJ when join key is subset of partition keys Key: SPARK-44647 URL: https://issues.apache.org/jira/browse/SPARK-44647 Project: Spark Issue Type:

[jira] [Updated] (SPARK-44647) Support SPJ when join key is subset of partition keys

2023-08-05 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-44647: -- Parent: SPARK-37375 Issue Type: Sub-task (was: New Feature) > Support SPJ when join key is

[jira] [Updated] (SPARK-48012) SPJ: Support Transfrom Expressions for One Side Shuffle

2024-04-26 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-48012: -- Parent: SPARK-37375 Issue Type: Sub-task (was: New Feature) > SPJ: Support Transfrom

[jira] [Created] (SPARK-48012) SPJ: Support Transfrom Expressions for One Side Shuffle

2024-04-26 Thread Szehon Ho (Jira)
Szehon Ho created SPARK-48012: - Summary: SPJ: Support Transfrom Expressions for One Side Shuffle Key: SPARK-48012 URL: https://issues.apache.org/jira/browse/SPARK-48012 Project: Spark Issue

[jira] [Created] (SPARK-48065) SPJ: allowJoinKeysSubsetOfPartitionKeys is too strict

2024-04-30 Thread Szehon Ho (Jira)
Szehon Ho created SPARK-48065: - Summary: SPJ: allowJoinKeysSubsetOfPartitionKeys is too strict Key: SPARK-48065 URL: https://issues.apache.org/jira/browse/SPARK-48065 Project: Spark Issue Type:

[jira] [Updated] (SPARK-47094) SPJ : Dynamically rebalance number of buckets when they are not equal

2024-02-26 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated SPARK-47094: -- Parent: SPARK-37375 Issue Type: Sub-task (was: New Feature) > SPJ : Dynamically rebalance

[jira] [Created] (SPARK-48421) SPJ: Add documentation

2024-05-25 Thread Szehon Ho (Jira)
Szehon Ho created SPARK-48421: - Summary: SPJ: Add documentation Key: SPARK-48421 URL: https://issues.apache.org/jira/browse/SPARK-48421 Project: Spark Issue Type: Documentation

[jira] [Commented] (SPARK-48329) Default spark.sql.sources.v2.bucketing.pushPartValues.enabled to true

2024-05-20 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847963#comment-17847963 ] Szehon Ho commented on SPARK-48329: --- I cherry picked your doc change to my pr to be co-author, is it

[jira] [Commented] (SPARK-48329) Default spark.sql.sources.v2.bucketing.pushPartValues.enabled to true

2024-05-20 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847962#comment-17847962 ] Szehon Ho commented on SPARK-48329: --- Oh sorry I just saw this, I was about to make this pr but was

[jira] [Comment Edited] (SPARK-48329) Default spark.sql.sources.v2.bucketing.pushPartValues.enabled to true

2024-05-20 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847962#comment-17847962 ] Szehon Ho edited comment on SPARK-48329 at 5/20/24 6:58 PM: Oh sorry I just

[jira] [Created] (SPARK-48329) Default spark.sql.sources.v2.bucketing.pushPartValues.enabled to true

2024-05-17 Thread Szehon Ho (Jira)
Szehon Ho created SPARK-48329: - Summary: Default spark.sql.sources.v2.bucketing.pushPartValues.enabled to true Key: SPARK-48329 URL: https://issues.apache.org/jira/browse/SPARK-48329 Project: Spark

[jira] [Comment Edited] (SPARK-48329) Default spark.sql.sources.v2.bucketing.pushPartValues.enabled to true

2024-05-20 Thread Szehon Ho (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847963#comment-17847963 ] Szehon Ho edited comment on SPARK-48329 at 5/20/24 10:37 PM: - I cherry