[ https://issues.apache.org/jira/browse/SPARK-13304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15154570#comment-15154570 ]
Apache Spark commented on SPARK-13304: -------------------------------------- User 'davies' has created a pull request for this issue: https://github.com/apache/spark/pull/11188 > Broadcast join with two ints could be very slow > ----------------------------------------------- > > Key: SPARK-13304 > URL: https://issues.apache.org/jira/browse/SPARK-13304 > Project: Spark > Issue Type: Bug > Components: SQL > Reporter: Davies Liu > > If the two join columns have the same value, the hash code of them will be (a > ^ b), which is 0, then the HashMap will be very very slow. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org