Ok..but what if I have 2 multivalue dimensions on which I have to analyse
no of users. Say Category can have 50 values and another dimension is
country of user(say 100+ values). I need weekly count on category and
country + I need overall distinct user count on category and country.

How to achieve this in Hbase.


On Mon, May 19, 2014 at 3:11 PM, Michael Segel <michael_se...@hotmail.com>wrote:

> The point is that choosing a field that has a small finite set of values
> is not a good candidate for indexing using an inverted table or b-tree etc …
>
> I’d say that you’re actually going to be better off using a scan with a
> start and stop row, then doing the counts on the client side.
>
> So as you get back your result set… you process the data. (Either in a M/R
> job or single client thread.)
>
> HTH
>
> On May 19, 2014, at 8:48 AM, Shushant Arora <shushantaror...@gmail.com>
> wrote:
>
> > I cannot apply server side filter.
> > 2nd requirement is not just get users with supreme category rather
> > distribution of users category wise.
> >
> > 1.How many of supreme , how many of normal and how many of medium till
> date.
> >
> >
> > On Mon, May 19, 2014 at 12:58 PM, Michael Segel
> > <michael_se...@hotmail.com>wrote:
> >
> >> Whoa!
> >>
> >> BAD BOY. This isn’t a good idea for secondary index.
> >>
> >> You have a row key (primary index) which is time.
> >> The secondary is a filter… with 3 choices.
> >>
> >> HINT: Do you really want a secondary index based on a field that only
> has
> >> 3 choices for a value?
> >>
> >> What are they teaching in school these days?
> >>
> >> How about applying a server side filter?  ;-)
> >>
> >>
> >>
> >> On May 18, 2014, at 12:33 PM, John Hancock <jhancock1...@gmail.com>
> wrote:
> >>
> >>> Shushant,
> >>>
> >>> Here's one idea, there might be better ways.
> >>>
> >>> Take a look at phoenix it supports secondary indexing:
> >>> http://phoenix.incubator.apache.org/secondary_indexing.html
> >>>
> >>> -John
> >>>
> >>>
> >>> On Sat, May 17, 2014 at 8:34 AM, Shushant Arora
> >>> <shushantaror...@gmail.com>wrote:
> >>>
> >>>> Hi
> >>>>
> >>>> I have a requirement to query my data base on date and user category.
> >>>> User category can be Supreme,Normal,Medium.
> >>>>
> >>>> I want to query how many new users are there in my table from date
> range
> >>>> (2014-01-01) to (2014-05-16) category wise.
> >>>>
> >>>> Another requirement is to query how many users of Supreme category are
> >>>> there in my table Broken down wise month in which they came.
> >>>>
> >>>> What should be my key
> >>>> 1.If i take key as combination of date#category. I cannot query based
> on
> >>>> category?
> >>>> 2.If I take key as category#date I cannot query based on date.
> >>>>
> >>>>
> >>>> Thanks
> >>>> Shushant.
> >>>>
> >>
> >>
>
>

Reply via email to