Re: Secondary indexes

Bryan Duxbury Tue, 22 Apr 2008 12:31:50 -0700

This doesn't have to be all that complicated.

Why not keep another HBase table as the index? The keys would be thecolumn values. There'd be a single matchingRows column family, andthe qualifiers and values would be the rows that match that column.Then, when you want to scan in column order instead of row order, youscan the index table, find the list of rows that match each column,and then do a random read to grab those individually. It'll for surebe slower than scanning a table ordered by rows, but it'll get youwhat you want. It'll also handle the case where the column valuesaren't unique.

If you need custom sorting for those values, then HBASE-82 wouldsolve that problem.


-Bryan

On Apr 22, 2008, at 11:58 AM, stack wrote:

Some questions interlaced below:

Clint Morgan wrote:
All,
We want to put secondary indexes into hbase. The motivation isthat we
are storing data in hbase that we want to serve to users. We would
like to be able to serve rows sorted by column values. Our queries
will be over rows with a given key prefix, so we should not behitting
to many regions.
  I was thinking it would work roughly like this:

- At table creation time, individual columns can be declared as
indexed. By default we could sort the column valueslexicographically,or we can provide a WritableComparatorFactory<T> which has theability
to make values of type T from a byte [], as well as providing a
Comparator<T>. (Better than providing a Comparator<byte[]> as it only
costs once per row insert for deserialization, rather that twice on
each comparison).
I don't follow what the Factory adds.
We're talking about getting HBASE-82 into 0.2. Does that interferewith this proposal? (I'm thinking that along w/ rows becoming bytearrays rather Text with a client-supplied Comparator, columnqualifiers would shift to be byte arrays also -- though yeah,implies that if your sort is not byte-lexicographical, yes, thecompares can be costly involving two deserializations per compare).
- We catch all writes/deletes and maintain a SortedMap<T, HStoreKey>
which keeps the column values in order, and maps them back to row
keys. First cut may just keep all this in memory, but it should be
backed with MapFile(s).
Would be sweet if you could leverage the HBase memcache code andflusher to do the above.
This Map would be global for the table?  Or per Region?

A lucene index wouldn't work for you because you want ordering?
- Add to the hregion the ability to scan through keys in columnorder.Just iterate through the SortedMap, run a filter on the key, andif it
passes do a get on the row.
You'd be random reading rows. You're OK w/ current performance?(For sure it will only improve but....).
- Add a ColumnOrderedClientScanner which will open column order
scanners to all applicable hregions, and continuously pick row with
the lowest column value from each of the client scanners.
This scanner would have a significant client-side component to dothe arbitrage between all regions to figure the lowest columnvalue? If you had a new type of 'region' -- one denoted by lowestand upper column then the client-side logic would fade away andyour scanner would look like current scanners.
- Region splits should be easy enough, just a scan through the
SortedMap to partition.
Splits would not be row-based and run as they currently do, butrather sorted-column based?
Of course, the index could also be used for more efficientquerying on
the indexed column's values.

Do other users have a need for this functionality?
What do developers think about this? I know hbase is more intendedfor
back-end batch style processing, but we have this need.
How are you thinking of adding in this new functionality?Subclassing HRegionServer?
St.Ack
Cheers,
-clint

Re: Secondary indexes

Reply via email to