[GitHub] [spark] wangshisan commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-02 Thread GitBox
wangshisan commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-667772074 @cloud-fan @JkSelf Could you have a look? This is an automated message from the Apache Git Service. To respo

[GitHub] [spark] wangshisan commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668926319 > Yea I'm also wondering the approach here. The skew join handling needs to split the skew side, and repeat the other side. I don't think we can split the buckets of bucketed

[GitHub] [spark] wangshisan commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668932395 > this is with AQE? if so can we please add that to description and it might be nice to describe approach taken to handle it in description as well. Added. --