[jira] Commented: (HADOOP-1439) Add endRow parameter to HClient#obtainScanner

Jim Kellerman (JIRA) Wed, 27 Jun 2007 17:41:46 -0700

    [ 
https://issues.apache.org/jira/browse/HADOOP-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508698
 ]


Jim Kellerman commented on HADOOP-1439:
---------------------------------------

It seems to me that if I specify a filter that is row key filter, then if the 
filter finds a match, next() keeps returning values so long as the row filter 
matches. Once it stops matching, the filter should close out the scanner since 
there will be no additional rows that match that filter.

In this particular case, I am talking about row key filters based on >, =, < 
and not regexp filters, because a regexp can potentially match any row.


> Add endRow parameter to HClient#obtainScanner
> ---------------------------------------------
>
>                 Key: HADOOP-1439
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1439
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: contrib/hbase
>            Reporter: stack
>            Assignee: stack
>            Priority: Minor
>
> Currently the HClient#obtainScanner looks like this:
> {code}
> public synchronized HScannerInterface obtainScanner(Text[] columns, Text 
> startRow) throws IOException;
> {code}
> Add an overload that allows specification of endRow:
> {code}
> public synchronized HScannerInterface obtainScanner(Text[] columns, Text 
> startRow, Text endRow) throws IOException;
> {code}
> Use Case: Table contains the whole web.  Client just wants to scan google's 
> pages.  Currently, client could cut off the scanner as soon as the row key 
> leaves the google domain but cleaner if {{HScannerInterface#next()}} returns 
> false

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HADOOP-1439) Add endRow parameter to HClient#obtainScanner

Reply via email to