Got Duplicate Records for the Same Row Key from a Snapshot

2018-05-17 Thread shanghaihyj
When we query a table by a particular row key, HBase returns only one row, which is expected. However, when we query a snapshot of that same table by the same row key, five duplicate rows are returned. Why? In the log of the master server, we see some snapshot-rela

Re: Got Duplicate Records for the Same Row Key from a Snapshot

2018-05-17 Thread shanghaihyj
We are loading data from the HBase table or its snapshot with hbase-rdd (https://github.com/unicredit/hbase-rdd). It uses TableInputFormat / TableSnapshotInputFormat as the underlying input format. The scanner has max versions set to 1. At 2018-05-17 15:35:08, "shanghaihyj" wrote: When we query

Re: How to parse data stored in child node of ZK /hbase/region-in-transition

2018-05-17 Thread Debraj Manna
Hi, we posted the same question on Stack Overflow: https://stackoverflow.com/q/50378236/785523 We didn't get any reply there either. Does anyone have any thoughts? On Wed 16 May, 2018, 6:54 PM Nand kishor Bansal wrote: > Hi Users, > > I'm trying to find out if any region is going through split by looking at > data