Keith Ly created MAPREDUCE-6299: ----------------------------------- Summary: bzip2 codec read duplicate rows Key: MAPREDUCE-6299 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6299 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 2.4.0 Reporter: Keith Ly Assignee: Jason Lowe Priority: Critical
select count(*) from bzip_table shows 36 rows count when there are 18 actual rows in bzip_table. Create table bzip_table2 as select * from bzip_table results in 36 rows in bzip_table2 and so on. -- This message was sent by Atlassian JIRA (v6.3.4#6332)