[ 
https://issues.apache.org/jira/browse/HIVE-27726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17768570#comment-17768570
 ] 

Stamatis Zampetakis commented on HIVE-27726:
--------------------------------------------

HIVE-19770 and HIVE-27399 showcase some examples where enhanced constant 
folding led to "missing lineage" information.

> No lineage for constant literals
> --------------------------------
>
>                 Key: HIVE-27726
>                 URL: https://issues.apache.org/jira/browse/HIVE-27726
>             Project: Hive
>          Issue Type: Improvement
>          Components: HiveServer2
>    Affects Versions: 4.0.0-beta-1
>            Reporter: Stamatis Zampetakis
>            Priority: Minor
>
> Consider the following statement where we create a table based on another 
> table and some constant expressions/literals.
> {code:java}
> CREATE TABLE tbl1 AS (SELECT 'Bob', 'Alice', key, value || 'some' FROM src)
> {code}
> Observe that column 0, and column 1 of tbl1 originate from constant literals. 
> Currently (commit e5a7ce2f091da1f8a324da6e489cda59b9e4bfc6), there is no 
> lineage information for columns originating from constants.
> The {{org.apache.hadoop.hive.ql.hooks.LineageLogger}} will display the 
> following the aforementioned DDL statement.
>   
> {noformat}
> POSTHOOK: Lineage: tbl1._c0 SIMPLE []
> POSTHOOK: Lineage: tbl1._c1 SIMPLE []
> POSTHOOK: Lineage: tbl1._c3 EXPRESSION [(src)src.FieldSchema(name:value, 
> type:string, comment:default), ]
> POSTHOOK: Lineage: tbl1.key SIMPLE [(src)src.FieldSchema(name:key, 
> type:string, comment:default), ]
> {noformat}
> This is not really a bug since we cannot really say that something is broken 
> but maybe there is a way to reflect that {{c0}} and {{c1}} originate from 
> constants to avoid the misconception that lineage is missing.
> Note, that constant folding may also lead to the same behavior since 
> expressions may be simplified to constants and the latter do not appear in 
> the lineage output.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to