[ 
https://issues.apache.org/jira/browse/OOZIE-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017657#comment-14017657
 ] 

Satish Mittal commented on OOZIE-1741:
--------------------------------------

Thanks [~thiruvel] for your inputs!

- For 1), we already have the EL dataInPartitionFilter().
- For 2), the EL proposed in this jira, dataInPartitions(), can serve the 
purpose.
- For 3), I think we should add new ELs as requirements arise, since a large 
number of combinations are possible here (comparators: =, !=, <, >, etc.; 
separators: AND, OR). 

The use case of 3) is closer to 1), since it aims to generate a filter 
criterion, whereas 2) aims to generate the value of a partition spec to be 
used in DDL/import/export queries.

I checked that in Hive, the format of the partition spec in the various DDL 
queries (alter, truncate, drop, show, describe) and DML queries (load, insert) 
is: (partcol1=val1, partcol2=val2 ...). The format of the partition spec for 
IMPORT/EXPORT queries is: (partcol1="val1", partcol2="val2" ...). I verified 
that the latter format is a superset, i.e. specifying (partcol1="val1", 
partcol2="val2" ...) works for all of these queries. That is also the format 
generated by the EL dataInPartitions() in this patch.
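
To make the difference between the two output styles concrete, here is a 
minimal sketch. It is not the code in the patch: the class and method names 
are hypothetical, and the single-quoted filter values are an assumption made 
only for illustration. It just shows a filter-style string (key-values joined 
by a separator such as AND) next to the quoted partition-spec string 
described above.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical illustration only; not part of Oozie or of the attached patch.
public class PartitionFormats {

    // Filter style: key<comparator>'value' pairs joined by a separator (AND/OR).
    // The single quotes around values are an assumption for this sketch.
    static String toFilter(Map<String, String> parts, String comparator, String separator) {
        StringJoiner joiner = new StringJoiner(" " + separator + " ");
        for (Map.Entry<String, String> e : parts.entrySet()) {
            joiner.add(e.getKey() + comparator + "'" + e.getValue() + "'");
        }
        return joiner.toString();
    }

    // Partition-spec style: key="value" pairs joined by commas, i.e. the
    // format that works for DDL, DML and IMPORT/EXPORT queries.
    static String toPartitionSpec(Map<String, String> parts) {
        StringJoiner joiner = new StringJoiner(",");
        for (Map.Entry<String, String> e : parts.entrySet()) {
            joiner.add(e.getKey() + "=\"" + e.getValue() + "\"");
        }
        return joiner.toString();
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the partition columns in insertion order.
        Map<String, String> parts = new LinkedHashMap<>();
        parts.put("dt", "20140101");
        parts.put("country", "US");

        System.out.println(toFilter(parts, "=", "AND"));   // dt='20140101' AND country='US'
        System.out.println(toPartitionSpec(parts));        // dt="20140101",country="US"
    }
}
```

The comparator and separator parameters of toFilter() also hint at why 3) 
leads to many combinations, while the partition spec has a single fixed shape.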

Regarding partial vs. full partition specs: even the existing EL 
dataInPartitionSpec writes all the partition cols specified in the HCat URI 
for the dataset. The EL dataInPartitions() introduced in this patch takes the 
same approach.
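
As a rough illustration of "all partition cols specified in the HCat URI", 
the sketch below pulls the key-value pairs out of a URI. It assumes the 
layout hcat://host:port/db/table/key1=val1;key2=val2, and the class and 
method names are hypothetical; this is not Oozie's own URI handling.

```java
import java.net.URI;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; assumes the URI layout
// hcat://host:port/db/table/key1=val1;key2=val2
public class HCatUriPartitions {

    // Returns every partition column named in the URI, in the order written.
    static Map<String, String> partitionsOf(String hcatUri) {
        String path = URI.create(hcatUri).getPath();
        // A leading "/" yields an empty first element: ["", db, table, partitions]
        String[] segments = path.split("/");
        if (segments.length != 4) {
            throw new IllegalArgumentException("Expected /db/table/partitions in: " + hcatUri);
        }
        Map<String, String> parts = new LinkedHashMap<>();
        for (String keyValue : segments[3].split(";")) {
            String[] pair = keyValue.split("=", 2);
            if (pair.length != 2) {
                throw new IllegalArgumentException("Expected key=value but got: " + keyValue);
            }
            parts.put(pair[0], pair[1]);
        }
        return parts;
    }

    public static void main(String[] args) {
        Map<String, String> parts =
                partitionsOf("hcat://localhost:9083/mydb/mytable/dt=20140101;country=US");
        // Whatever columns the dataset URI lists are the ones written out.
        System.out.println(parts.keySet());
    }
}
```

If the dataset URI lists only some of the table's partition columns, only 
those columns would appear in the result, which is how a partial spec arises 
under this approach.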

[~rohini], can you provide your comments here?

> Add new coord EL function to get input partitions value string
> --------------------------------------------------------------
>
>                 Key: OOZIE-1741
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1741
>             Project: Oozie
>          Issue Type: Improvement
>            Reporter: Satish Mittal
>            Assignee: Satish Mittal
>         Attachments: OOZIE-1741-v2.patch, OOZIE-1741-v3.patch, 
> OOZIE-1741.patch.txt
>
>
> Currently oozie provides coord EL functions to get partition filter string 
> (where key-values are separated by AND)  for various action types - 
> java/pig/hive. However this doesn't work in other cases, e.g. hive action 
> that performs export/import of hive partition. In that case, the partition 
> value is expected to be in a different format: 
> (col1="value1",col2="value2"...). We need to add a new EL function that 
> returns partition value in the above format.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
