Tough one, in that if your events are keyed on time alone, you will hit a hot spot on write. Reads, not so much...
TSDB would be a good start ...

You may not need 'buckets', just a timestamp in the key, and then set start and stop key values for the scan.

Mike Segel

On Jan 28, 2013, at 7:06 AM, Oleg Ruchovets <oruchov...@gmail.com> wrote:

> Hi,
>
> I have a row data structure like this:
>
> event_id | time
> =========|======
> event1   | 10:07
> event2   | 10:10
> event3   | 10:12
>
> event4   | 10:20
> event5   | 10:23
> event6   | 10:25
>
> The number of records is 50-100 million.
>
> Question:
>
> I need to find the group of events that starts from eventX and falls
> within a time window (bucket) of T.
>
> For example, if T = 7 minutes:
>
> Starting from event1: {event1, event2, event3} were detected during
> 7 minutes.
>
> Starting from event2: {event2, event3} were detected during 7 minutes.
>
> Starting from event4: {event4, event5, event6} were detected during
> 7 minutes.
>
> Is there a way to model the data in HBase to get this?
>
> Thanks
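
To make the start/stop key suggestion in the reply concrete, here is a minimal sketch in Python. It is not HBase client code: a sorted in-memory list stands in for a table, since HBase keeps rows sorted by key. The key layout (zero-padded minutes since midnight, then `|`, then the event id) and the names `row_key`, `scan` and `window` are assumptions made for illustration, as is treating the window as half-open, [start, start + T), to mirror a scan whose stop row is exclusive. A real key would need a full timestamp rather than minutes of the day, and because it leads with time it would still have the write hot spot mentioned in the reply.

```python
from bisect import bisect_left

# Sample data from the original question: (event_id, "HH:MM").
EVENTS = [
    ("event1", "10:07"), ("event2", "10:10"), ("event3", "10:12"),
    ("event4", "10:20"), ("event5", "10:23"), ("event6", "10:25"),
]


def to_minutes(hhmm):
    """Convert "HH:MM" to minutes since midnight."""
    hours, minutes = hhmm.split(":")
    return int(hours) * 60 + int(minutes)


def row_key(minutes, event_id=""):
    """Hypothetical row key: zero-padded time first, so keys sort by time."""
    return f"{minutes:04d}|{event_id}"


# Stand-in for the table: row keys held in sorted order.
ROWS = sorted(row_key(to_minutes(t), e) for e, t in EVENTS)


def scan(start_key, stop_key):
    """Return keys in [start_key, stop_key): start inclusive, stop exclusive."""
    lo = bisect_left(ROWS, start_key)
    hi = bisect_left(ROWS, stop_key)
    return ROWS[lo:hi]


def window(start_hhmm, t_minutes):
    """Event ids seen in the T-minute window that begins at start_hhmm."""
    start = to_minutes(start_hhmm)
    keys = scan(row_key(start), row_key(start + t_minutes))
    return [key.split("|", 1)[1] for key in keys]


if __name__ == "__main__":
    for start in ("10:07", "10:10", "10:20"):
        print(start, window(start, 7))
```

Each lookup is a single range read over contiguous keys, so no precomputed buckets are needed. The start key here carries only the time, so every event sharing the starting minute is returned along with eventX.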