We're seeing some dangerously inconsistent behaviour in retrieving data from HBase. In particular circumstances whose conditions are still unclear, get and scan (without timestamp params) are returning different versions of a column. We are running 0.20.2. See below for evidence.
hbase(main):006:0> scan 'generated_pages',{STARTROW=>'240: http://com.golflink.www/golf-courses/course.aspx?course=1008656 ',LIMIT=>2,COLUMNS=>['attribute:url']} ROW COLUMN+CELL 240:http://com.golflink.www column=attribute:url, timestamp=* 5429280163307928320*, value=\001http://www.golflin /golf-courses/course.aspx?c k.com/golf-courses/course.aspx?course=1008656 ourse=1008656 2 row(s) in 0.0100 seconds hbase(main):007:0> get 'generated_pages', '240: http://com.golflink.www/golf-courses/course.aspx?course=1008656', COLUMN=>'attribute:url' timestamp=*5429243797819101088*, value=\001 http://www.golflink.com/golf-courses/course.aspx?course=1008656 1 row(s) in 0.0020 seconds Any ideas about how this is possible? joost.