[
https://issues.apache.org/jira/browse/HBASE-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13239222#comment-13239222
]
Laxman commented on HBASE-5564:
-------------------------------
Findbugs reported by QA bot are about usage of default encoding. This behavior
is inline with existing code.
bug #1
{noformat}
TEST Unknown bug pattern DM_DEFAULT_ENCODING in
org.apache.hadoop.hbase.mapreduce.ImportTsv$TsvParser$ParsedLine.getTimestamp()
{noformat}
bug #2
{noformat}
TEST Unknown bug pattern DM_DEFAULT_ENCODING in
org.apache.hadoop.hbase.mapreduce.ImportTsv.createSubmittableJob(Configuration,
String[])
{noformat}
bug #2 already existing in code. just included in patch file with no changes.
And test case failures are not because of this patch. Test failures to be
addressed as part of HBASE-5608
> Bulkload is discarding duplicate records
> ----------------------------------------
>
> Key: HBASE-5564
> URL: https://issues.apache.org/jira/browse/HBASE-5564
> Project: HBase
> Issue Type: Bug
> Components: mapreduce
> Affects Versions: 0.90.7, 0.92.2, 0.94.0, 0.96.0
> Environment: HBase 0.92
> Reporter: Laxman
> Assignee: Laxman
> Labels: bulkloader
> Fix For: 0.96.0
>
> Attachments: 5564.lint, HBASE-5564_trunk.1.patch,
> HBASE-5564_trunk.1.patch, HBASE-5564_trunk.2.patch, HBASE-5564_trunk.patch
>
>
> Duplicate records are getting discarded when duplicate records exists in same
> input file and more specifically if they exists in same split.
> Duplicate records are considered if the records are from diffrent different
> splits.
> Version under test: HBase 0.92
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira