We're seeing some dangerously inconsistent behaviour in retrieving data from
HBase.  In particular circumstances whose conditions are still unclear, get
and scan (without timestamp params) are returning different versions of a
column.  We are running 0.20.2.  See below for evidence.

hbase(main):006:0> scan 'generated_pages',{STARTROW=>'240:
http://com.golflink.www/golf-courses/course.aspx?course=1008656
',LIMIT=>2,COLUMNS=>['attribute:url']}
ROW                          COLUMN+CELL

 240:http://com.golflink.www column=attribute:url, timestamp=*
5429280163307928320*, value=\001http://www.golflin
 /golf-courses/course.aspx?c k.com/golf-courses/course.aspx?course=1008656

 ourse=1008656

2 row(s) in 0.0100 seconds

hbase(main):007:0> get 'generated_pages', '240:
http://com.golflink.www/golf-courses/course.aspx?course=1008656',
COLUMN=>'attribute:url'
timestamp=*5429243797819101088*, value=\001
http://www.golflink.com/golf-courses/course.aspx?course=1008656
1 row(s) in 0.0020 seconds

Any ideas about how this is possible?

joost.

Reply via email to