[ https://issues.apache.org/jira/browse/HADOOP-6546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12831021#action_12831021 ]
Clark Jefcoat commented on HADOOP-6546: --------------------------------------- The issue is with the call {{ bloomKey.set(buf.getData(), 1.0); }} which appears twice in the BloomMapFile source. The buf variable is a DataOutputBuffer. The documentation for DataOutputBuffer clearly states that getData() is only valid to getLength(). But bloomKey is an o.a.h.util.bloom.Key which expects the entire array that it is getting to be valid. > BloomMapFile can return false negatives > --------------------------------------- > > Key: HADOOP-6546 > URL: https://issues.apache.org/jira/browse/HADOOP-6546 > Project: Hadoop Common > Issue Type: Bug > Components: io > Affects Versions: 0.20.1 > Reporter: Clark Jefcoat > > BloomMapFile can return false negatives when using keys of varying sizes. If > the amount of data written by the write() method of your key class differs > between instance of your key, your BloomMapFile may return false negatives. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.