[ 
https://issues.apache.org/jira/browse/SPARK-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13972864#comment-13972864
 ] 

Mridul Muralidharan commented on SPARK-1524:
--------------------------------------------

The expectation is to fallback to a previous schedule type in case the higher 
level is not valid : though this is tricky in general case.
Will need to take a look at it - though given that I am tied up with other 
things, if someone else wants to take a crack, please feel free to do so !

Btw, use of IP's and multiple hostnames for a host is not supported in spark - 
so that is something that will need to be resolved at the deployment end.

> TaskSetManager'd better not schedule tasks which has no preferred executorId 
> using PROCESS_LOCAL in the first search process
> ----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-1524
>                 URL: https://issues.apache.org/jira/browse/SPARK-1524
>             Project: Spark
>          Issue Type: Improvement
>          Components: Spark Core
>            Reporter: YanTang Zhai
>            Priority: Minor
>
> ShuffleMapTask is constructed with TaskLocation which has only host not 
> (host, executorID) pair in DAGScheduler.
> When TaskSetManager schedules ShuffleMapTask which has no preferred 
> executorId using specific execId host and PROCESS_LOCAL locality level, no 
> tasks match the given locality constraint in the first search process.
> We also find that the host used by Scheduler is hostname while the host used 
> by TaskLocation is IP in our cluster. The tow hosts do not match, that makes 
> pendingTasksForHost HashMap empty and the finding task process against our 
> expectation.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to