[ 
https://issues.apache.org/jira/browse/TEZ-3350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17041870#comment-17041870
 ] 

László Bodor commented on TEZ-3350:
-----------------------------------

it still applies, in recent logs:
{code}
2020-02-19 23:56:53,908 [INFO] [TezChild] |impl.PipelinedSorter|: Reducer 2: 
Spilling to 
/grid/05/hadoop/yarn/local/usercache/hive/appcache/application_1581931657676_3139/output/attempt_1581931657676_3139_1_00_000008_0_10002_0/file.out
{code}
https://github.com/apache/tez/blob/cf111b1c896db94d40ea92fc6da2a6fcce356114/tez-runtime-library/src/main/java/org/apache/tez/runtime/library/common/task/local/output/TezTaskOutputFiles.java#L78-L95

> Shuffle spills are not spilled to a container-specific directory
> ----------------------------------------------------------------
>
>                 Key: TEZ-3350
>                 URL: https://issues.apache.org/jira/browse/TEZ-3350
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.7.1
>            Reporter: Jason Darrell Lowe
>            Assignee: László Bodor
>            Priority: Major
>
> If a Tez task receives too much input data and needs to spill the inputs to 
> disk it ends up using a path that is not container-specific.  Therefore YARN 
> will not automatically cleanup these files when the container exits as it 
> should, and instead the files linger until the entire application completes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to