[ 
https://issues.apache.org/jira/browse/HIVE-936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12778929#action_12778929
 ] 

Jay Booth commented on HIVE-936:
--------------------------------

This would be super useful. The only issue I can see is inserting very sparse 
data: we've had issues with unbounded MultipleOutputs, where you eventually run 
out of RAM, XCeivers, or both. I suppose it could just be a client 
responsibility to make sure your query doesn't generate enough outputs to 
break it. That's still better than not having the functionality at all.
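
To illustrate the client-side guard suggested above, here is a minimal Python 
sketch. It is not Hive or Hadoop code: it only groups rows by a partition 
column and fails fast once the number of distinct values exceeds a cap. The 
names partition_rows, TooManyPartitionsError, and max_partitions are 
hypothetical and exist only for this example.

```python
from collections import defaultdict


class TooManyPartitionsError(RuntimeError):
    """Raised when the input would create more partitions than the cap allows."""


def partition_rows(rows, partition_key, max_partitions):
    """Group rows by partition value, failing fast once the cap is exceeded."""
    partitions = defaultdict(list)
    for row in rows:
        value = row[partition_key]
        # A membership test does not create the key in a defaultdict,
        # so this check runs before a new partition is opened.
        if value not in partitions and len(partitions) >= max_partitions:
            raise TooManyPartitionsError(
                f"more than {max_partitions} distinct values for {partition_key!r}"
            )
        partitions[value].append(row)
    return dict(partitions)


# Example rows shaped like the table T(a int, b string, ds string) in the issue.
rows = [
    {"a": 1, "b": "x", "ds": "2009-11-01"},
    {"a": 2, "b": "y", "ds": "2009-11-02"},
    {"a": 3, "b": "z", "ds": "2009-11-01"},
]
by_ds = partition_rows(rows, "ds", max_partitions=16)
print(sorted(by_ds))  # ['2009-11-01', '2009-11-02']
```

A real implementation would presumably cap open writers rather than buffered 
rows, but the fail-fast check on the number of distinct values is the same idea.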

> dynamic partitions creation based on values
> -------------------------------------------
>
>                 Key: HIVE-936
>                 URL: https://issues.apache.org/jira/browse/HIVE-936
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Ning Zhang
>
> If a Hive table is created as partitioned, a DML statement can only insert 
> into one partition per query. Ideally, partitions should be created on the 
> fly based on the values of the partition columns. As an example:
> {{{
>   create table T (a int, b string) partitioned by (ds string);
>   insert overwrite table T select a, b, ds from S
>     where ds >= '2009-11-01' and ds <= '2009-11-16';
> }}}
> should be able to execute in one DML statement rather than possibly 16 DML 
> statements, one for each distinct ds value. CTAS and alter table should be 
> able to do the same thing:
> {{{
>   create table T partitioned by (ds string) as select * from S
>     where ds >= '2009-11-01' and ds <= '2009-11-16';
> }}}
>  and
> {{{
>   create table T(a int, b string, ds string);
>   insert overwrite table T select * from S
>     where ds >= '2009-11-01' and ds <= '2009-11-16';
>   alter table T partitioned by (ds);
> }}}
> should all return the same results.

