david created HIVE-12844:
----------------------------
Summary: hive-1.2.1 doesn't return correct value when run select
count query
Key: HIVE-12844
URL: https://issues.apache.org/jira/browse/HIVE-12844
Project: Hive
Issue Type: Bug
Components: Hive
Affects Versions: 1.2.1
Reporter: david
Priority: Critical
in hbase 1.0.2,I created a table 'test1',it has below rows and values:
hbase(main):027:0> scan 'test1'
ROW COLUMN+CELL
a1 column=df1:a2,
timestamp=1452505991743, value=ddd
a1 column=df1:a3,
timestamp=1452506082723, value=eee
a1 column=df1:c2,
timestamp=1452505705391, value=bbb
b1 column=df1:a2,
timestamp=1452505838737, value=ccc
b1 column=df1:a3,
timestamp=1452506149461, value=fff
r1 column=df1:a,
timestamp=1452507261849, value=hhh
r1 column=df1:a1,
timestamp=1452507100774, value=ggg
r1 column=df1:c1,
timestamp=1451221711588, value=aaa
then I created hive-1.2.1 table:
create external table test3(
key string,
coll string,
col2 string,
col3 string,
col4 string,
col5 string,
col6 string)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES
("hbase.columns.mapping" =
":key,df1:a,df1:1,df1:a2,df1:a3,df1:c1,df1:c2")
TBLPROPERTIES("hbase.table.name" = "test1");
when I run query in hive:
hive> select * from test3;
OK
a1 NULL NULL ddd eee NULL bbb
b1 NULL NULL ccc fff NULL NULL
r1 hhh NULL NULL NULL aaa NULL
the result is correct,but when I run:
select count(1) from test3;
Total MapReduce CPU Time Spent: 6 seconds 770 msec
OK
1
it returns "1",I find that it doesn't count the rows where the first column is
null,
Could you help to analyze this?
by the way the hadoop version is 2.6.0
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)