shrprasa commented on PR #40258: URL: https://github.com/apache/spark/pull/40258#issuecomment-1482368131
> > It works because the resolved column has just one match > > But there are two id columns. Does Spark already do deduplication somewhere? Not sure about the deduplication before, but even if it was doing it at some stage, in the second use case it might not have converted the column name to lowercase by that time, that's why that would still treat the two id and ID columns as different. Only at end result of column match, we see that both column matches are same id#17. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org