Github user datumbox commented on the issue: https://github.com/apache/spark/pull/17059 @srowen I believe that this needs to be fixed for 2 reasons: 1. Casting the ids to double just to convert it back to integer is not an elegant solution and it is rather confusing. 2. The double casting puts more strain on the garbage collector and I've personally measured it in an earlier version with and without the hack. Finally I do not believe the proposed fix slows down things as it does a similar number of comparisons as the original code. If you still believe this is not worth it feel free to close the PR.
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org