[ https://issues.apache.org/jira/browse/HIVE-27317?focusedWorklogId=860598&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-860598 ]
ASF GitHub Bot logged work on HIVE-27317: ----------------------------------------- Author: ASF GitHub Bot Created on: 04/May/23 17:21 Start Date: 04/May/23 17:21 Worklog Time Spent: 10m Work Description: sercanCyberVision opened a new pull request, #4293: URL: https://github.com/apache/hive/pull/4293 ### What changes were proposed in this pull request? When `ClearDanglingScratchDir` service identifies the dangling sessions to clean HDFS FS, we will be cleaning files/dirs in `HiveConf.ConfVars.LOCALSCRATCHDIR` (local FS) as well. ### Why are the changes needed? When Hive session is killed, no chance for shutdown hook to clean-up tmp files. This causes accumulation of tmp files/dirs in local FS as below; ``` > ll /tmp/user/97c4ef50-5e80-480e-a6f0-4f779050852b* drwx---- Issue Time Tracking ------------------- Worklog Id: (was: 860598) Remaining Estimate: 0h Time Spent: 10m > Temporary (local) session files cleanup improvements > ---------------------------------------------------- > > Key: HIVE-27317 > URL: https://issues.apache.org/jira/browse/HIVE-27317 > Project: Hive > Issue Type: Improvement > Reporter: Sercan Tekin > Assignee: Sercan Tekin > Priority: Major > Attachments: HIVE-27317.patch > > Time Spent: 10m > Remaining Estimate: 0h > > When Hive session is killed, no chance for shutdown hook to clean-up tmp > files. > There is a Hive service to clean residual files > https://issues.apache.org/jira/browse/HIVE-13429, and later on its execution > is scheduled inside HS2 https://issues.apache.org/jira/browse/HIVE-15068 to > make sure not to leave any temp file behind. But this service cleans up only > HDFS temp files, there are still residual files/dirs in > *HiveConf.ConfVars.LOCALSCRATCHDIR* location as follows; > {code:java} > > ll /tmp/user/97c4ef50-5e80-480e-a6f0-4f779050852b* > drwx------ 2 user user 4096 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b > -rw------- 1 user user 0 Oct 29 10:09 > 97c4ef50-5e80-480e-a6f0-4f779050852b10571819313894728966.pipeout > -rw------- 1 user user 0 Oct 29 10:09 > 97c4ef50-5e80-480e-a6f0-4f779050852b16013956055489853961.pipeout > -rw------- 1 user user 0 Oct 29 10:09 > 97c4ef50-5e80-480e-a6f0-4f779050852b4383913570068173450.pipeout > -rw------- 1 user user 0 Oct 29 10:09 > 97c4ef50-5e80-480e-a6f0-4f779050852b889740171428672108.pipeout {code} -- This message was sent by Atlassian Jira (v8.20.10#820010)