[GitHub] [spark] cloud-fan edited a comment on issue #24442: [SPARK-27547][SQL] fix DataFrame self-join problems

2019-04-23 Thread GitBox
cloud-fan edited a comment on issue #24442: [SPARK-27547][SQL] fix DataFrame self-join problems
URL: https://github.com/apache/spark/pull/24442#issuecomment-485815769

cc @hvanhovell @gatorsmile @viirya @mgaido91 @HyukjinKwon @dongjoon-hyun

[GitHub] [spark] cloud-fan edited a comment on issue #24442: [SPARK-27547][SQL] fix DataFrame self-join problems

2019-04-24 Thread GitBox
cloud-fan edited a comment on issue #24442: [SPARK-27547][SQL] fix DataFrame self-join problems
URL: https://github.com/apache/spark/pull/24442#issuecomment-486149813

The basic idea is the same: assign a globally unique id to each dataset, and carry the dataset id in the column reference (the `A