Hi,
I'm interested in learning HBase. What I want to do is store
financial data for analysis, graphing, and display. There are hundreds
of millions of rows and, of course, I want fast responses when retrieving the
data.
If I were to do it in an RDBMS, the table would be
REPORT, MARKET, OPERATING_DATE, OPERATING_INTERVAL, HOUR_ENDING, VALUE
where the bolded column names are the PK. If I were to store this in HBase, would
it look like this?
REPORT.MARKET.OPERATING_DATE.OPERATING_INTERVAL.HOUR_ENDING.TIMESTAMP{
VALUE: 92.29
}
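
To make the composite key above concrete, here is a minimal sketch in plain Python (not the HBase API) of how such a row key could be assembled so that its string order follows the field order. The function name make_row_key, the sample market "EAST", the integer type of OPERATING_INTERVAL, and the fixed-width encodings are all assumptions for illustration. TIMESTAMP is left out of the key here because HBase already stores a timestamp with every cell. The '.' delimiter only works if no field value contains a '.' itself.

```python
from datetime import date

def make_row_key(report, market, operating_date, operating_interval, hour_ending):
    """Build a '.'-delimited row key (hypothetical helper, not an HBase call)."""
    return ".".join([
        report,
        market,
        operating_date.strftime("%Y%m%d"),  # YYYYMMDD sorts chronologically as text
        f"{operating_interval:02d}",         # zero-pad so 2 sorts before 10
        f"{hour_ending:02d}",
    ])

# Hypothetical sample rows, deliberately out of order.
keys = [
    make_row_key("ABC", "EAST", date(2010, 1, 10), 1, 5),
    make_row_key("ABC", "EAST", date(2010, 1, 2), 1, 10),
    make_row_key("ABC", "EAST", date(2010, 1, 2), 1, 5),
]

# HBase keeps rows sorted by key bytes; sorted() mimics that ordering here.
sorted_keys = sorted(keys)
for key in sorted_keys:
    print(key)
```

The point of the fixed-width, zero-padded fields is that a plain byte-wise sort then agrees with the natural order of dates and hours, which is what range scans rely on.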
so that I can run queries like these:
- give me all reports with the name of "ABC"
- give me all the values where OPERATING_DATE is from jan-01-2010 to
jan-10-2010
- give me all the values where OPERATING_DATE is from jan-01-2010 to
jan-10-2010 and HOUR_ENDING is between 5 and 10 (or simply 5 or variations
thereof)
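
As a rough illustration of how these three queries would map onto a key sorted in the order REPORT.MARKET.OPERATING_DATE.OPERATING_INTERVAL.HOUR_ENDING, the sketch below simulates a sorted key space in plain Python (it does not use the HBase client). All rows except the 92.29 value are made-up sample data, and the key encoding (YYYYMMDD dates, zero-padded numbers) is an assumption. It suggests that the first query is a prefix scan, the date range is a contiguous scan only once REPORT and MARKET are fixed, and the HOUR_ENDING condition has to be filtered after the scan because it sits behind the date in the key.

```python
from bisect import bisect_left

# Sorted (row key, value) pairs stand in for an HBase table.
# Everything except the 92.29 value is made-up sample data.
rows = sorted([
    ("ABC.EAST.20100101.01.05", 92.29),
    ("ABC.EAST.20100105.01.12", 88.10),
    ("ABC.EAST.20100110.01.07", 90.00),
    ("ABC.EAST.20100115.01.05", 91.50),
    ("XYZ.EAST.20100105.01.05", 75.00),
])
keys = [k for k, _ in rows]

def range_scan(start, stop):
    """Return rows with start <= key < stop, like a start/stop-row scan."""
    return rows[bisect_left(keys, start):bisect_left(keys, stop)]

# Query 1: all rows for report "ABC" is a prefix scan.
# '/' is the ASCII character right after '.', so it closes the prefix range.
abc = range_scan("ABC.", "ABC/")

# Query 2: Jan 1 to Jan 10, 2010 is contiguous only with report and market fixed.
# The stop key is exclusive, so it is set to Jan 11.
jan = range_scan("ABC.EAST.20100101", "ABC.EAST.20100111")

# Query 3: HOUR_ENDING comes after the date in the key, so the hours are
# filtered from the scanned range instead of being part of the range itself.
hours = [(k, v) for k, v in jan if 5 <= int(k.split(".")[4]) <= 10]

print(len(abc), [k for k, _ in jan], hours)
```

Without a REPORT and MARKET prefix, a date-only query would have to walk the whole key space in this layout, so the order of fields in the key should follow the queries that matter most.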
In short, is HBase the wrong tool for this, or would it yield better
performance than an RDBMS? Also, do you folks happen to know any good
links/articles on HBase table and schema design?
Thanks
--
View this message in context:
http://old.nabble.com/learning-hbase---schema-design-advice-tp27252203p27252203.html
Sent from the HBase User mailing list archive at Nabble.com.