[ 
https://issues.apache.org/jira/browse/HADOOP-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508864
 ] 

stack commented on HADOOP-1439:
-------------------------------

This comment applies to this issue and to hadoop-1531.

After the exposition above, I'm now of the opinion that the endRow parameter 
will be little used.  Better for now to have a set of filters available for the 
client to choose from.  If 'performance' becomes an issue, we can backfill the 
endRow parameter later.  

We can divide the work if you'd like. I need the endRow functionality *a tout 
de suite*.   If you add the 'stop condition trigger' to the interface I can 
work on a couple of filter implementations and their tests.


> Add endRow parameter to HClient#obtainScanner
> ---------------------------------------------
>
>                 Key: HADOOP-1439
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1439
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: contrib/hbase
>            Reporter: stack
>            Assignee: stack
>            Priority: Minor
>
> Currently the HClient#obtainScanner looks like this:
> {code}
> public synchronized HScannerInterface obtainScanner(Text[] columns, Text 
> startRow) throws IOException;
> {code}
> Add an overload that allows specification of endRow:
> {code}
> public synchronized HScannerInterface obtainScanner(Text[] columns, Text 
> startRow, Text endRow) throws IOException;
> {code}
> Use Case: Table contains the whole web.  Client just wants to scan google's 
> pages.  Currently, client could cut off the scanner as soon as the row key 
> leaves the google domain but cleaner if {{HScannerInterface#next()}} returns 
> false

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to