[ https://issues.apache.org/jira/browse/SPARK-21271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16070961#comment-16070961 ]
Kazuaki Ishizaki commented on SPARK-21271: ------------------------------------------ I see. For var-length part, its regulation (or specification) is a multiple of 4. Should we use {{hashUnsafeBytes}} instead of {{Murmur3_x86_32.hashUnsafeWords}} for {{UnsafeRow.hashCode()}}? > UnsafeRow.hashCode assertion when sizeInBytes not multiple of 8 > --------------------------------------------------------------- > > Key: SPARK-21271 > URL: https://issues.apache.org/jira/browse/SPARK-21271 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 2.2.0 > Reporter: Bogdan Raducanu > > The method is: > {code} > public int hashCode() { > return Murmur3_x86_32.hashUnsafeWords(baseObject, baseOffset, > sizeInBytes, 42); > } > {code} > but sizeInBytes is not always a multiple of 8 (in which case hashUnsafeWords > throws assertion) - for example here: > {code}FixedLengthRowBasedKeyValueBatch.appendRow{code} > The fix could be to use hashUnsafeBytes or to use hashUnsafeWords but on a > prefix that is multiple of 8. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org