Hi James, I'm new to HBase too.
How about this:
with "a range of orderIds", select the first id.
Step1 : set this ID as startRow, then checkout the closest id(Only fetch
one),
Step2:then with this fetched ID, setStartRow(fetchedID-startTimestamp),
setEndRow(fetchedID-endTimestamp),
Step3:
Thank you Ian! Yes, the orderIds are ordered.
I might try timeStamp filter. But it still doesn't provide the early
out feature. not sure how the performance it could be. Do you think it
might be worth having a custom filter to do two partial scans?
Thanks again.
James
On Wed, Feb 15, 2012 at 2:0
James,
Are your orderIds ordered? You say "a range of orderIds", which implies that
(i.e. they're sequential numbers like 001, 002, etc, not hashes or random
values). If so, then a single scan can hit the rows for multiple contiguous
orderIds (you'd set the start and stop rows based on a prefix
Hi there,
I am pretty new to HBase and i am trying to understand the best
practice to do the scan based on two/multiple partial scans for the
row key.
For example, I have a row key like: orderId-timeStamp-item. The
orderId has nothing to with the timeStamp and i have a requirement to
scan rows f