Agreed--as far as I can tell there isn't any support for this currently.
This JIRA (https://issues.apache.org/jira/browse/HIVE-3727, referenced
in http://hortonworks.com/blog/hbase-via-hive-part-1/) seems relevant,
but there's no recent work on it, and I imagine the patch included is
out of date with trunk. Perhaps it's worth resurrecting?
Andrew
On 7/24/14, 4:45 PM, java8964 wrote:
I don't think Hbase-Hive integration part is that smart, be able to
utilize the index existing in the HBase. But I think it depends on the
version you are using.
From my experience, there are a lot of improvement space in the
Hbase-hive integration, especially "push down" logic into HBase engine.
Yong
------------------------------------------------------------------------
From: [email protected]
Date: Thu, 24 Jul 2014 14:03:42 -0700
Subject: does the HBase-Hive integration support using HBase index
(primary key or secondary index) in the JOIN implementatoin?
To: [email protected]
if I do a join of a table based on txt file and a table based on
HBase, and say the latter is very large, is HIVE smart enough to
utilize the HBase table's index to do the join, instead of
implementing this as a regular map reduce job, where each table is
scanned fully, bucketed on join keys, and then the matching items
found out through the reducer?
thanks
Yang