Re: Is Hive Index officially not recommended?

2016-02-08 Thread Gopal Vijayaraghavan
> Is anybody storing there index in a non-native table such as HBase? ... > Can you please point to implementations of HiveIndexHandler or >AbstractIndexHandler > that have usesIndexTable=false I don't think there are any publically available implementations yet. The Hive HBase-metastore project

Re: Is Hive Index officially not recommended?

2016-02-04 Thread Amey Barve
Hi Gopal, As you suggested in your email above that *Part #1 of using hive indexes effectively is to write your ownHiveIndexHandler, with usesIndexTable=false;* *And then write a IndexPredicateAnalyzer, which lets you map arbitrarylookups into other range conditions.* Is anybody storing there

Re: Is Hive Index officially not recommended?

2016-01-05 Thread Lefty Leverenz
I'd like to revise the Indexing and IndexDev docs in the wiki to include this information (as well as information from a previous thread, if I can find it) so peopl

Re: Is Hive Index officially not recommended?

2016-01-05 Thread Gopal Vijayaraghavan
>So in a nutshell in Hive if "external" indexes are not used for improving >query response, what value they add and can we forget them for now? The builtin indexes - those that write data as smaller tables are only useful in a pre-columnar world, where the indexes offer a huge reduction in IO. P

RE: Is Hive Index officially not recommended?

2016-01-05 Thread Mich Talebzadeh
--Original Message- From: Gopal Vijayaraghavan [mailto:go...@hortonworks.com] On Behalf Of Gopal Vijayaraghavan Sent: 05 January 2016 21:49 To: user@hive.apache.org Subject: Re: Is Hive Index officially not recommended? > I am going to run the same query in Hive. However, I only see

Re: Is Hive Index officially not recommended?

2016-01-05 Thread Gopal Vijayaraghavan
> I am going to run the same query in Hive. However, I only see a table >scan below and no mention of that index. May be I am missing something >here? Hive Indexes are an incomplete feature, because they are not maintained over an ACID storage & demand FileSystem access to check for validity.

RE: Is Hive Index officially not recommended?

2016-01-05 Thread Mich Talebzadeh
ree, therefore neither Peridale Ltd, its subsidiaries nor their employees accept any responsibility. From: Jörn Franke [mailto:jornfra...@gmail.com] Sent: 05 January 2016 19:59 To: user@hive.apache.org Subject: Re: Is Hive Index officially not recommended? Btw this is not Hive specific, but also fo

Re: Is Hive Index officially not recommended?

2016-01-05 Thread Ting(Goden) Yao
yes. we tried mr and it works fine. so it's more likely a tez issue. Thanks for your comments. On Tue, Jan 5, 2016 at 11:58 AM Jörn Franke wrote: > You can still use execution Engine mr for maintaining the index. Indeed > with the ORC or parquet format there are min/max indexes and bloom filters

Re: Is Hive Index officially not recommended?

2016-01-05 Thread Jörn Franke
Btw this is not Hive specific, but also for other relational database systems, such as Oracle Exadata. > On 05 Jan 2016, at 20:57, Jörn Franke wrote: > > You can still use execution Engine mr for maintaining the index. Indeed with > the ORC or parquet format there are min/max indexes and bloom

Re: Is Hive Index officially not recommended?

2016-01-05 Thread Jörn Franke
You can still use execution Engine mr for maintaining the index. Indeed with the ORC or parquet format there are min/max indexes and bloom filters, but you need to sort your data appropriately to benefit from performance. Alternatively you can create redundant tables sorted in different order. T

RE: Is Hive Index officially not recommended?

2016-01-05 Thread Mich Talebzadeh
es nor their employees accept any responsibility. From: Ting(Goden) Yao [mailto:t...@pivotal.io] Sent: 05 January 2016 18:18 To: user@hive.apache.org Subject: Is Hive Index officially not recommended? Hi, We hit an issue when doing Hive testing to rebuild index on Tez. We were told by our H

Is Hive Index officially not recommended?

2016-01-05 Thread Ting(Goden) Yao
Hi, We hit an issue when doing Hive testing to rebuild index on Tez. We were told by our Hadoop distro vendor that it's not recommended (or should avoid) using index with Hive. But I don't see an official message on Hive wiki or document