[ https://issues.apache.org/jira/browse/HADOOP-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12549498 ]
Jim Kellerman commented on HADOOP-2365: --------------------------------------- Andrzej, Do you think it is occurring for the same key? If you could provide your initialization parameters and a test example, that would be very helpful. > Result of HashFunction.hash() contains all identical values > ----------------------------------------------------------- > > Key: HADOOP-2365 > URL: https://issues.apache.org/jira/browse/HADOOP-2365 > Project: Hadoop > Issue Type: Bug > Components: contrib/hbase > Affects Versions: 0.16.0 > Reporter: Andrzej Bialecki > Assignee: Jim Kellerman > Fix For: 0.16.0 > > Attachments: hash-v1.patch, patch.txt, patch.txt > > > There is a small bug in HashFunction:112 - initvalue should be changed > between the loop iterations in order to spread the hash values over the whole > allowed range. Instead the current code uses a fixed initvalue = 0, which > gives all identical hash values in the result array. As a result, > BloomFilter-s have extremely high rate of false positives. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.