AFAIK, ORC uses “hive.optimize.index.filter” hive config to enable predicate 
pushdown. Can you please try by setting hive.optimize.index.filter to true?

Thanks
Prasanth Jayachandran

On Nov 7, 2013, at 4:04 PM, Avrilia Floratou <avrilia.flora...@gmail.com> wrote:

> Hi all,
> 
> I'm using hive-12. I have a file that contains 10 integer columns stored in 
> ORC format. The ORC file is zlib compressed and indexing is enabled.
> I'm running a simple select count(*) with a predicate of the form (Col1 =0 OR 
> col2 = 0 etc). The predicate touches all 10 columns but its selectivity is 0 
> (none of the records in the file 
> satisfy the predicate). I have set the hive.optimize.ppd property to true. 
> For this query basically none of the stripes should be read. However after 
> the MR job finishes, I can see that the whole file has been read. Why is 
> that? Is there any other parameter that I should enable for predicate 
> pushdown/indexing to work?
> 
> Thanks,
> Avrilia


-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.

Reply via email to