Adam, You have to exit when you reach your limit, but you can specify start and stop rows which is usually very useful with well designed row keys.
By default the scanner client fetches rows one by one. You can set scanner caching with Scan.setCaching which improves the performance of the scan by lowering the number of RPCs. J-D On Fri, Nov 20, 2009 at 11:43 AM, Adam Silberstein <[email protected]> wrote: > Hi, > Is there a way to specify a limit on number of returned records for scan? I > don¹t see any way to do this when building the scan. If there is, that > would be great. If not, what about when iterating over the result? If I > exit the loop when I reach my limit, will that approximate this clause? I > guess my real question is about how scan is implemented in the client. I.e. > How many records are returned from Hbase at a time as I iterate through the > scan result? If I want 1,000 records and 100 get returned at a time, then > I¹m in good shape. On the other hand, if I want 10 records and get 100 at a > time, it¹s a bit wasteful, though the waste is bounded. > > Thanks, > Adam >
