[ 
https://issues.apache.org/jira/browse/HIVE-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-21866:
----------------------------------
    Labels: pull-request-available  (was: )

> LLAP status service driver may get stuck with wrong Yarn app ID
> ---------------------------------------------------------------
>
>                 Key: HIVE-21866
>                 URL: https://issues.apache.org/jira/browse/HIVE-21866
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Ádám Szita
>            Assignee: Ádám Szita
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0-alpha-1
>
>         Attachments: HIVE-21866.0.patch
>
>
> LLAPStatusDriver might get stuck polling status from Yarn if the following 
> happen in this order:
>  * there was a running LLAP Yarn app previously which is now finished / killed
>  * Yarn was restarted
>  * LLAPStatusDriver is invoked before any new LLAP app gets kicked off
>  * LLAPStatusDriver receives the old app ID, which is then cached in the Yarn 
> serviceClient object (no evicition)
>  * In the meantime if any new LLAP app gets kicked off, LLAPStatusDriver will 
> not see it, as it constantly retries fetching info about the wrong, old app 
> ID (this is because we don't create new serviceClient objects)
> {code:java}
> ERROR status.LlapStatusServiceDriver: FAILED: 20: Failed to get Yarn AppReport
> org.apache.hadoop.hive.llap.cli.status.LlapStatusCliException: 20: Failed to 
> get Yarn AppReport
>       at 
> org.apache.hadoop.hive.llap.cli.status.LlapStatusServiceDriver.getAppReport(LlapStatusServiceDriver.java:292)
>  [hive-llap-server-3.1.0.7.0.0.0-112.jar:3.1.0.7.0.0.0-134]
>       at 
> org.apache.hadoop.hive.llap.cli.status.LlapStatusServiceDriver.run(LlapStatusServiceDriver.java:209)
>  [hive-llap-server-3.1.0.7.0.0.0-112.jar:3.1.0.7.0.0.0-134]
>       at 
> org.apache.hadoop.hive.llap.cli.status.LlapStatusServiceDriver.main(LlapStatusServiceDriver.java:537)
>  [hive-llap-server-3.1.0.7.0.0.0-112.jar:3.1.0.7.0.0.0-134]....{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to