Re: does the HBase-Hive integration support using HBase index (primary key or secondary index) in the JOIN implementatoin?

Andrew Mains Thu, 24 Jul 2014 16:56:37 -0700

Agreed--as far as I can tell there isn't any support for this currently.

This JIRA (https://issues.apache.org/jira/browse/HIVE-3727, referencedin http://hortonworks.com/blog/hbase-via-hive-part-1/) seems relevant,but there's no recent work on it, and I imagine the patch included isout of date with trunk. Perhaps it's worth resurrecting?


Andrew

On 7/24/14, 4:45 PM, java8964 wrote:

I don't think Hbase-Hive integration part is that smart, be able toutilize the index existing in the HBase. But I think it depends on theversion you are using.
From my experience, there are a lot of improvement space in theHbase-hive integration, especially "push down" logic into HBase engine.
Yong

------------------------------------------------------------------------
From: [email protected]
Date: Thu, 24 Jul 2014 14:03:42 -0700
Subject: does the HBase-Hive integration support using HBase index(primary key or secondary index) in the JOIN implementatoin?
To: [email protected]
if I do a join of a table based on txt file and a table based onHBase, and say the latter is very large, is HIVE smart enough toutilize the HBase table's index to do the join, instead ofimplementing this as a regular map reduce job, where each table isscanned fully, bucketed on join keys, and then the matching itemsfound out through the reducer?
thanks
Yang

Re: does the HBase-Hive integration support using HBase index (primary key or secondary index) in the JOIN implementatoin?

Reply via email to