[ 
https://issues.apache.org/jira/browse/TEZ-3362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15533633#comment-15533633
 ] 

Jonathan Eagles commented on TEZ-3362:
--------------------------------------

[~kshukla], couple of minor things. In general, I think we can live with the 
current design for dag level delete and can do the future vertex level design 
in a follow up jira. Once these are fixed, I think this patch can go in.

{code:title=AMNodeImpl}
  // Access should be package
  public int shufflePort = ShuffleUtils.UNDEFINED_PORT;
{code}

{code:title=AMNodeTracker#nodeSeen}
  // This get is using the getName instead of getID().getId()
  Set<NodeId> nodeIds = perDagNodeMap.get(appContext.getCurrentDAG().getName());
{code}

{code:title=AMNodeTracker#dagDelete}
  // we should protect ourselves from null pointer
  for (NodeId nodeId : getPerDagNodeMap().get(dag.getID().getId())) {
{code}


> Delete intermediate data at DAG level for Shuffle Handler
> ---------------------------------------------------------
>
>                 Key: TEZ-3362
>                 URL: https://issues.apache.org/jira/browse/TEZ-3362
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Jonathan Eagles
>            Assignee: Kuhu Shukla
>         Attachments: TEZ-3362.001.patch, TEZ-3362.002.patch, 
> TEZ-3362.003.patch, TEZ-3362.004.patch
>
>
> Applications like hive that use tez in session mode need the ability to 
> delete intermediate data after a DAG completes and while the application 
> continues to run.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to