[ 
https://issues.apache.org/jira/browse/HIVE-24695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rajesh Balamohan reassigned HIVE-24695:
---------------------------------------

    Assignee: Rajesh Balamohan

> Clean up session resources, if TezSession is unable to start
> ------------------------------------------------------------
>
>                 Key: HIVE-24695
>                 URL: https://issues.apache.org/jira/browse/HIVE-24695
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> There are cases when TezSessionState would not be able to start. (e.g 
> resource constraints on YARN queues). 
> {noformat}
> Caused by: org.apache.hadoop.yarn.exceptions.YarnException: Failed to submit 
> application_1611791897439_0058 to YARN : 
> org.apache.hadoop.security.AccessControlException: Queue root.default already 
> has X applications from user hive cannot accept submission of application: 
> application_1611791897439_0058
>         at 
> org.apache.hadoop.yarn.client.api.impl.YarnClientImpl.submitApplication(YarnClientImpl.java:322)
>  ~[hadoop-yarn-client-3.x.jar:?]
>         at 
> org.apache.tez.client.TezYarnClient.submitApplication(TezYarnClient.java:77) 
> ~[tez-api-0.9.x.jar:0.9.x]
>         at org.apache.tez.client.TezClient.start(TezClient.java:405) 
> ~[tez-api-0.9.x.jar:0.9.x]
>         at 
> org.apache.hadoop.hive.ql.exec.tez.TezSessionState.startSessionAndContainers(TezSessionState.java:535)
>  ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hadoop.hive.ql.exec.tez.TezSessionState.openInternal(TezSessionState.java:373)
>  ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hadoop.hive.ql.exec.tez.TezSessionState.open(TezSessionState.java:298)
>  ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hadoop.hive.ql.exec.tez.TezSessionPoolSession.open(TezSessionPoolSession.java:106)
>  ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hadoop.hive.ql.exec.tez.TezTask.ensureSessionHasResources(TezTask.java:403)
>  ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hadoop.hive.ql.exec.tez.TezTask.execute(TezTask.java:209) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:213) 
> ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:105) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.Executor.launchTask(Executor.java:359) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.Executor.launchTasks(Executor.java:330) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.Executor.runTasks(Executor.java:246) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.Executor.execute(Executor.java:109) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:721) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.Driver.run(Driver.java:488) 
> ~[hive-exec-3.x.jar:3.x]
>         at org.apache.hadoop.hive.ql.Driver.run(Driver.java:482) 
> ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hadoop.hive.ql.reexec.ReExecDriver.run(ReExecDriver.java:166) 
> ~[hive-exec-3.x.jar:3.x]
>         at 
> org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:225)
>  ~[hive-service-3.x.jar:3.x]
> {noformat}
> However by this time, session directories & certain resources are localized. 
> (e.g, hive-exec jars are stored in  
> hdfs:///tmp/hive/hive/_tez_session_dir/*/hive-exec*.jar). When tezClient is 
> not started, it does not clear up the resources. This leaks ~100MB data in 
> HDFS per failure. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to