Hi all, I am a newbie doing some research to put-together a system to process large number of log-records.
A) Hbase system with clients executing MR jobs on the data B) there may be some instances where we need to run ad-hoc queries on the data. I am trying to see if this can be done with out users having to resort to writing MR programs. HQL might be enough. But if it is not, I am considering Hive. so that brings to my questions 1) Can both Hive and Hbase co-exist on the same cluster? 2) can they 'share' the same data/schema? or do I need to insert the same data into Hbase table and Hive table? thanks S
