Re: HBase for Small Key Value Tables

Manish Maheshwari Mon, 29 Aug 2016 07:36:32 -0700

Hi Dima,

Thanks for the suggestion. We can load the data in heap, but Hbase makes it
easier for one to write and another to read. With heap we need to build a
process to handle both processes and also write to log so as to not lose
the updates in case of process failure.


Thanks
Manish

On Aug 29, 2016 2:18 PM, "Dima Spivak" <dspi...@cloudera.com> wrote:

> (Though if it is only 7 GB, why not just store it in memory?)
>
> On Sunday, August 28, 2016, Dima Spivak <dspi...@cloudera.com> wrote:
>
> > If your data can all fit on one machine, HBase is not the best choice. I
> > think you'd be better off using a simpler solution for small data and
> leave
> > HBase for use cases that require proper clusters.
> >
> > On Sunday, August 28, 2016, Manish Maheshwari <mylogi...@gmail.com
> > <javascript:_e(%7B%7D,'cvml','mylogi...@gmail.com');>> wrote:
> >
> >> We dont want to invest into another DB like Dynamo, Cassandra and
> Already
> >> are in the Hadoop Stack. Managing another DB would be a pain. Why HBase
> >> over RDMS, is because we call HBase via Spark Streaming to lookup the
> >> keys.
> >>
> >> Manish
> >>
> >> On Mon, Aug 29, 2016 at 1:47 PM, Dima Spivak <dspi...@cloudera.com>
> >> wrote:
> >>
> >> > Hey Manish,
> >> >
> >> > Just to ask the naive question, why use HBase if the data fits into
> >> such a
> >> > small table?
> >> >
> >> > On Sunday, August 28, 2016, Manish Maheshwari <mylogi...@gmail.com>
> >> wrote:
> >> >
> >> > > Hi,
> >> > >
> >> > > We have a scenario where HBase is used like a Key Value Database to
> >> map
> >> > > Keys to Regions. We have over 5 Million Keys, but the table size is
> >> less
> >> > > than 7 GB. The read volume is pretty high - About 50x of the
> >> put/delete
> >> > > volume. This causes hot spotting on the Data Node and the region is
> >> not
> >> > > split. We cannot change the maxregionsize parameter as that will
> >> impact
> >> > > other tables too.
> >> > >
> >> > > Our idea is to manually inspect the row key ranges and then split
> the
> >> > > region manually and assign them to different region servers. We will
> >> > > continue to then monitor the rows in one region to see if needs to
> be
> >> > > split.
> >> > >
> >> > > Any experience of doing this on HBase. Is this a recommended
> approach?
> >> > >
> >> > > Thanks,
> >> > > Manish
> >> > >
> >> >
> >> >
> >> > --
> >> > -Dima
> >> >
> >>
> >
> >
> > --
> > -Dima
> >
> >
>
> --
> -Dima
>

Re: HBase for Small Key Value Tables

Reply via email to