[ 
https://issues.apache.org/jira/browse/HIVE-29614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-29614:
----------------------------------
    Labels: pull-request-available  (was: )

> Incorrect column lineage for multiple window functions with identical 
> partition keys
> ------------------------------------------------------------------------------------
>
>                 Key: HIVE-29614
>                 URL: https://issues.apache.org/jira/browse/HIVE-29614
>             Project: Hive
>          Issue Type: Bug
>          Components: lineage
>    Affects Versions: 1.1.0
>         Environment:  
>            Reporter: jinqi long
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.3.0
>
>
> An error occurs in column lineage when a query contains multiple PTFs with 
> identical partition and order keys, for example:
>  
> {code:java}
> create table table_2 as 
> select 
> sum(id1) over(partition by key ) sum1, 
> sum(id2) over(partition by key ) sum2 
> from table_1;{code}
>  
> {code:java}
> The current result is:
>   
>  
> {
>     "version": "1.0",
>     "engine": "tez",
>     "database": "default",
>     "hash": "f81777f9774d12cc77dd583ea9ff99b3",
>     "queryText": "create table table_2 as select\nsum(id1) over(partition by 
> key ) sum1,\nsum(id2) over(partition by key ) sum2\nfrom table_1",
>     "edges": [
>         {
>             "sources": [
>                 2,
>                 3
>             ],
>             "targets": [
>                 0,
>                 1
>             ],
>             "expression": "sum(table_1.id1) over (partition by table_1.key 
> order by table_1.key ROWS between unbounded and unbounded)",
>             "edgeType": "PROJECTION"
>         }
>     ],
>     "vertices": [
>         {
>             "id": 0,
>             "vertexType": "COLUMN",
>             "vertexId": "default.table_2.sum1"
>         },
>         {
>             "id": 1,
>             "vertexType": "COLUMN",
>             "vertexId": "default.table_2.sum2"
>         },
>         {
>             "id": 2,
>             "vertexType": "COLUMN",
>             "vertexId": "default.table_1.id1"
>         },
>         {
>             "id": 3,
>             "vertexType": "COLUMN",
>             "vertexId": "default.table_1.key"
>         }
>     ]
> }{code}
> The correct result should be two PROJECTION edges:
> "sources": [id1,key],"targets": [sum1]
> "sources": [id2,key],"targets": [sum2]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to