[ 
https://issues.apache.org/jira/browse/HIVE-21059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-21059:
---------------------------------
    Description: 
Hive can query data from external sources such as other RDBMSs, Kafka, Druid, 
and HBase.

For example, to query data from an external source such as a MySQL database, 
an external table has to be explicitly created in Hive for every MySQL table 
that needs to be made accessible.
Moreover, the schema and login credentials have to be specified when creating 
each such table.
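
To illustrate the repetition described above, here is a minimal Python sketch 
that generates the per-table DDL. It assumes the tables are exposed through 
Hive's JdbcStorageHandler and uses that handler's table properties as 
documented; the description does not say which handler is meant. The helper 
name, table names, column lists, JDBC URL, and credentials are all made up 
for illustration.

```python
# Sketch only: builds the CREATE EXTERNAL TABLE statement that currently has to
# be written once per MySQL table. Assumes Hive's JdbcStorageHandler.
JDBC_HANDLER = "org.apache.hive.storage.jdbc.JdbcStorageHandler"


def external_table_ddl(hive_table, mysql_table, columns, jdbc_url, user, password):
    """Return the DDL for one external table (hypothetical helper)."""
    # The schema must be spelled out by hand for every table.
    cols = ",\n  ".join(f"{name} {hive_type}" for name, hive_type in columns)
    # The connection details and credentials are repeated in every table.
    props = {
        "hive.sql.database.type": "MYSQL",
        "hive.sql.jdbc.driver": "com.mysql.jdbc.Driver",
        "hive.sql.jdbc.url": jdbc_url,
        "hive.sql.dbcp.username": user,
        "hive.sql.dbcp.password": password,
        "hive.sql.table": mysql_table,
    }
    prop_lines = ",\n  ".join(f'"{k}" = "{v}"' for k, v in props.items())
    return (
        f"CREATE EXTERNAL TABLE {hive_table} (\n  {cols}\n)\n"
        f"STORED BY '{JDBC_HANDLER}'\n"
        f"TBLPROPERTIES (\n  {prop_lines}\n);"
    )


# Made-up MySQL tables and schemas, for illustration only.
MYSQL_TABLES = {
    "customers": [("id", "int"), ("name", "string")],
    "orders": [("id", "int"), ("customer_id", "int"), ("total", "double")],
}

# One statement per MySQL table: the work an external catalog would replace.
statements = [
    external_table_ddl(
        f"mysql_{table}", table, cols, "jdbc:mysql://localhost/sales", "hive", "hive"
    )
    for table, cols in MYSQL_TABLES.items()
]

if __name__ == "__main__":
    print("\n\n".join(statements))
```

Each additional MySQL table adds another statement with its own hand-written 
schema and the same credentials.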

By supporting "external catalogs" in Hive, we can reference all tables in an 
entire MySQL database by creating just one external catalog. The schemas of 
the tables would also be detected automatically from the underlying source.

Where possible, additional information such as table statistics can also be 
imported from the underlying data source, so that Hive's cost-based optimizer 
can create optimized query plans.

Supporting external catalogs also requires some of the work tracked under 
HIVE-18685 for catalog support (including catalogs in Hive's SQL syntax).



  was:
Hive can query data from external sources such as other RDBMSs, Kafka, Druid, 
and HBase.

For example, to query data from an external source such as a MySQL database, 
an external table has to be explicitly created in Hive for every MySQL table 
that needs to be made accessible.
Moreover, the schema and login credentials have to be specified when creating 
each such table.

By supporting "external catalogs" in Hive, we can reference all tables in an 
entire MySQL database by creating just one external catalog. The schemas of 
the tables would also be detected automatically from the underlying source.

Where possible, additional information such as table statistics can also be 
imported from the underlying data source, so that Hive's cost-based optimizer 
can create optimized query plans.



> Support external catalogs
> -------------------------
>
>                 Key: HIVE-21059
>                 URL: https://issues.apache.org/jira/browse/HIVE-21059
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Thejas M Nair
>            Priority: Critical
>
> Hive can query data from external sources such as other RDBMSs, Kafka, 
> Druid, and HBase.
> For example, to query data from an external source such as a MySQL database, 
> an external table has to be explicitly created in Hive for every MySQL table 
> that needs to be made accessible.
> Moreover, the schema and login credentials have to be specified when 
> creating each such table.
> By supporting "external catalogs" in Hive, we can reference all tables in an 
> entire MySQL database by creating just one external catalog. The schemas of 
> the tables would also be detected automatically from the underlying source.
> Where possible, additional information such as table statistics can also be 
> imported from the underlying data source, so that Hive's cost-based 
> optimizer can create optimized query plans.
> Supporting external catalogs also requires some of the work tracked under 
> HIVE-18685 for catalog support (including catalogs in Hive's SQL syntax).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
