[
https://issues.apache.org/jira/browse/HADOOP-1045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12480283
]
Jim Kellerman commented on HADOOP-1045:
---------------------------------------
TODO: the current HBase scanner interface allows you to scan through multiple
rows for an explicit set of column family members (e.g., contents:firstcolumn
and anchor:secondcolumn), but it doesn't let you iterate over all the members
of a column family for a particular row unless you explicitly enumerate them.
This is a problem as you may not know apriori the names of all the family
members.
The Bigtable paper states that "For example, we could restrict the scan above
to only produce anchors whose columns match the regular expression
anchor:*.cnn.com" (ignore for the moment that if "anchor:*.cnn.com" were
applied as a regular expression, :* means zero or more ':'s and that the '.'
between the '*' and 'cnn' and between 'cnn' and 'com' match any character). You
should be able to say 'anchor:' which means every member of the anchor family
or 'anchor:anchornum-[0-9]+' which would match every anchor family member that
starts with 'anchornum-' and then has one or more digits that follow it.
This was uncovered in unit testing using the tests that were commented out in
HRegion.java
> Code for HBase
> --------------
>
> Key: HADOOP-1045
> URL: https://issues.apache.org/jira/browse/HADOOP-1045
> Project: Hadoop
> Issue Type: New Feature
> Environment: All environments
> Reporter: Mike Cafarella
> Assigned To: Jim Kellerman
> Attachments: hbase.patch, hbase.patch, hbase.patch, hbase.patch,
> hbase.tar.gz
>
>
> I've written some code for HBase, a BigTable-like file store. It's not
> perfect, but it's ready for other people to play with and examine.
> The attached tarball has the source and a README
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.