[ https://issues.apache.org/jira/browse/HIVE-5831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xuefu Zhang updated HIVE-5831: ------------------------------ Resolution: Duplicate Status: Resolved (was: Patch Available) > filter input files for bucketed tables > -------------------------------------- > > Key: HIVE-5831 > URL: https://issues.apache.org/jira/browse/HIVE-5831 > Project: Hive > Issue Type: Improvement > Components: Query Processor > Reporter: Rui Li > Attachments: hive-5831.patch > > > When the users query a bucketed table and use the bucketed column in the > predicate, only the buckets that satisfy the predicate need to be scanned, > thus improving the performance. > Given a table test: > CREATE TABLE test (x INT, y STRING) CLUSTERED BY ( x ) INTO 10 BUCKETS; > The following query only has to scan bucket 5: > SELECT * FROM test WHERE x=5; -- This message was sent by Atlassian JIRA (v6.3.15#6346)