[ 
https://issues.apache.org/jira/browse/HIVE-24203?focusedWorklogId=530609&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-530609
 ]

ASF GitHub Bot logged work on HIVE-24203:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 04/Jan/21 11:02
            Start Date: 04/Jan/21 11:02
    Worklog Time Spent: 10m 
      Work Description: okumin opened a new pull request #1531:
URL: https://github.com/apache/hive/pull/1531


   ### What changes were proposed in this pull request?
   
   Estimate statistics of LATERAL VIEW correctly.
   
   StatsRulesProcFactory doesn't have any rules to handle a JOIN by LATERAL 
VIEW.
   This can cause an underestimation in case that UDTF in LATERAL VIEW 
generates multiple rows.
   
   ### Why are the changes needed?
   
   Significant underestimation can happen when LATERAL VIEW increases the 
number of records a lot and the source table has large.
   
   ### Does this PR introduce _any_ user-facing change?
   No.
   
   ### How was this patch tested?
   Added one test case.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 530609)
    Time Spent: 3.5h  (was: 3h 20m)

> Implement stats annotation rule for the LateralViewJoinOperator
> ---------------------------------------------------------------
>
>                 Key: HIVE-24203
>                 URL: https://issues.apache.org/jira/browse/HIVE-24203
>             Project: Hive
>          Issue Type: Improvement
>          Components: Physical Optimizer
>    Affects Versions: 2.3.7, 3.1.2, 4.0.0
>            Reporter: okumin
>            Assignee: okumin
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> StatsRulesProcFactory doesn't have any rules to handle a JOIN by LATERAL VIEW.
> This can cause an underestimation in case that UDTF in LATERAL VIEW generates 
> multiple rows.
> HIVE-20262 has already added the rule for UDTF.
> This issue would add the rule for LateralViewJoinOperator.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to