Re: multiple partial scans in the row

2012-02-15 Thread NNever
Hi James, I'm new to HBase too. How about this: with "a range of orderIds", select the first id. Step1 : set this ID as startRow, then checkout the closest id(Only fetch one), Step2:then with this fetched ID, setStartRow(fetchedID-startTimestamp), setEndRow(fetchedID-endTimestamp), Step3:

Re: multiple partial scans in the row

2012-02-14 Thread James Young
Thank you Ian! Yes, the orderIds are ordered. I might try timeStamp filter. But it still doesn't provide the early out feature. not sure how the performance it could be. Do you think it might be worth having a custom filter to do two partial scans? Thanks again. James On Wed, Feb 15, 2012 at 2:0

Re: multiple partial scans in the row

2012-02-14 Thread Ian Varley
James, Are your orderIds ordered? You say "a range of orderIds", which implies that (i.e. they're sequential numbers like 001, 002, etc, not hashes or random values). If so, then a single scan can hit the rows for multiple contiguous orderIds (you'd set the start and stop rows based on a prefix

multiple partial scans in the row

2012-02-14 Thread James Young
Hi there, I am pretty new to HBase and i am trying to understand the best practice to do the scan based on two/multiple partial scans for the row key. For example, I have a row key like: orderId-timeStamp-item. The orderId has nothing to with the timeStamp and i have a requirement to scan rows f