No real numbers at the moment. HBASE-2167 adds a PerformanceEvaluation for IHBase (Indexed HBase). PE is sort of not the right use-case for IHBase with its largish, random values -- the latter requires RAM and writes are slowed. Nonetheless, search for random values with the IHBase index can be up to two orders of magnitude better in this hostile test: e.g. 20 scans for 20 random values on a single node cluster with 1.5GB of memory allocated to the RS VM.
Without an index: 732989ms at offset 0 for 1048576 rows With an index: 2160ms at offset 0 for 1048576 rows St.Ack On Sun, Jan 24, 2010 at 1:17 AM, Andrew Purtell <[email protected]> wrote: > Stack, any way you might persuade the IHBase guys to post some numbers > publicly? > I'd like to know more. > > - Andy > > > > ----- Original Message ---- >> From: Stack <[email protected]> >> Subject: Re: Support for MultiGet / SQL In clause -- error in patch >> HBASE-1845 > [...] >> Let us know how IHBase works out for you (indexed hbase). Its a RAM >> hog but the speed improvement finding matching cells can be startling. > > > > >
