gczsjdy commented on issue #24462: [SPARK-26268][CORE] Do not resubmit tasks 
when executors are lost
URL: https://github.com/apache/spark/pull/24462#issuecomment-519354047
 
 
   @squito Index and data files are both stored on DFS. The difference is in 
how they are read: data files are read directly from DFS, whereas a reducer 
fetches index files from the cache of the executor that wrote them. If a 
required index file is not in that cache, it is loaded from DFS. This approach 
mimics the external shuffle service's cache, except that the cache lives in 
the executor rather than in a separate Java process.
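
A minimal sketch of the lookup order described above: serve the index file from the executor's cache, and fall back to DFS on a miss. This is an illustration only, not the PR's code. `ExecutorIndexCache`, `load_from_dfs`, and the `(shuffle_id, map_id)` key are hypothetical names, and caching the file after a DFS load is an assumption the comment does not state.

```python
from typing import Callable, Dict, Tuple

# Hypothetical key for one map task's index file: (shuffle_id, map_id).
IndexKey = Tuple[int, int]


class ExecutorIndexCache:
    """Hypothetical in-executor cache of shuffle index files."""

    def __init__(self, load_from_dfs: Callable[[IndexKey], bytes]) -> None:
        # load_from_dfs stands in for whatever reads an index file from DFS.
        self._load_from_dfs = load_from_dfs
        self._cache: Dict[IndexKey, bytes] = {}
        self.dfs_loads = 0  # counts fallbacks to DFS, for illustration

    def put(self, key: IndexKey, index_bytes: bytes) -> None:
        # Called by the map side after it writes the index file.
        self._cache[key] = index_bytes

    def get(self, key: IndexKey) -> bytes:
        # Cache hit: no DFS access is needed.
        cached = self._cache.get(key)
        if cached is not None:
            return cached
        # Cache miss: load from DFS, then keep a copy (assumed behavior).
        self.dfs_loads += 1
        index_bytes = self._load_from_dfs(key)
        self._cache[key] = index_bytes
        return index_bytes
```

In this sketch a reducer would call `get` on the executor that wrote the file, so only the first miss for a given key pays the DFS read.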
   
   This approach needs a reasonable place to cache index files, and that 
place is the corresponding map executor. Returning a `None` location for the 
map task would mean:
   - The requirement not to resubmit tasks is satisfied
   - But the cache feature is not satisfied : (
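
The trade-off can be shown with a toy model. This is a simplified sketch, not Spark's scheduler logic: `MapOutput`, `maps_to_resubmit`, and `index_source` are hypothetical names, and the rule that a lost executor invalidates every map output located on it is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class MapOutput:
    map_id: int
    # Executor id that wrote the output, or None when no location is recorded.
    location: Optional[str]


def maps_to_resubmit(outputs: List[MapOutput], lost_executor: str) -> List[int]:
    # Assumed rule: an output is considered lost only if it is tied to the
    # lost executor. Outputs with a None location are never matched.
    return [o.map_id for o in outputs if o.location == lost_executor]


def index_source(output: MapOutput, live_executors: Set[str]) -> str:
    # With a known, live location the reducer can ask that executor's cache;
    # without one it can only go to DFS.
    if output.location is not None and output.location in live_executors:
        return "executor-cache:" + output.location
    return "dfs"
```

With a `None` location, `maps_to_resubmit` returns nothing when an executor is lost, but `index_source` always falls back to `"dfs"`, which is the lost cache benefit the two bullets describe.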
