[ 
https://issues.apache.org/jira/browse/HIVE-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12884869#action_12884869
 ] 

John Sichi commented on HIVE-417:
---------------------------------

Had a chat with Ashish and Yongqiang offline, and came up with three 
alternatives.

1)  "Shortest path to checkin":  Treat current code as prototype and move it 
into contrib, providing a utility for creating/updating the index, and keeping 
changes to core classes to a minimum.  As Yongqiang pointed out, this makes it 
harder to follow up with automatic use of the index due to the lack of 
metadata.  If we do this, we should create a new JIRA issue for its limited 
scope.

2) "Full-fledged index support":  change the JDO metamodel to add support for 
indexes as first class objects, and come up with a pluggable index 
creation+access design framework which can encompass a variety of index types 
likely to be needed in the future.  Code from this patch would become the first 
such index implementation provided.  If we do this, we should continue on in 
this truly epic JIRA issue.

3) "Rework as materialized view":  keep the JDO metamodel as is (adding a new 
table type for MATERIALIZED_VIEW) but change the DDL to CREATE MATERIALIZED 
VIEW AS SELECT ... and then come up with the system functions needed (e.g. for 
accessing file offsets) in order to be able to express the index construction 
as SQL.  We would then execute view materialization in a fashion similar to 
CREATE TABLE AS SELECT.  This approach best reflects the way the current code 
models an index as an ordinary table, but requires some other changes (e.g. 
CTAS + dynamic partitioning, something we want anyway).  If we do this, we 
should create a new JIRA issue since it's a different feature from the user POV.

We're aiming to reach a decision next week; input is welcome on whether these 
alternatives make sense (and on others we should consider).

Since this JIRA issue is already so overloaded, we would also like to treat the 
following two items as separate followup JIRA issues rather than trying to 
address it all at once:

* rewrite framework
* automatic usage of index or materialized view by optimizer


> Implement Indexing in Hive
> --------------------------
>
>                 Key: HIVE-417
>                 URL: https://issues.apache.org/jira/browse/HIVE-417
>             Project: Hadoop Hive
>          Issue Type: New Feature
>          Components: Metastore, Query Processor
>    Affects Versions: 0.3.0, 0.3.1, 0.4.0, 0.6.0
>            Reporter: Prasad Chakka
>            Assignee: He Yongqiang
>         Attachments: hive-417.proto.patch, hive-417-2009-07-18.patch, 
> hive-indexing.3.patch, hive-indexing.5.thrift.patch, 
> indexing_with_ql_rewrites_trunk_953221.patch
>
>
> Implement indexing on Hive so that lookup and range queries are efficient.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to