[ 
https://issues.apache.org/jira/browse/YARN-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14030643#comment-14030643
 ] 

Robert Joseph Evans commented on YARN-2140:
-------------------------------------------

We are working on similar things for Storm.  I am very interested in your 
design, because for any streaming system to truly have a chance on YARN, soft 
guarantees on network I/O are critical.  There are several big problems with 
network I/O even if the user can effectively estimate what they will need.  The 
first is that the resource is not limited to a single node in the cluster.  The 
network has a topology, and a bottleneck can show up at any point in that 
topology.  So you may think you are fine because no node in a rack is 
scheduled to use the full bandwidth that its network card(s) can support, but 
you can easily have saturated the top-of-rack switch without knowing it.  
To solve this problem you effectively have to know the topology of the 
application itself, so that you can schedule the node-to-node network 
connections within that application.  But if users don't know how much network 
they are going to use at a high level, they will never have any idea at a low 
level.  Then you also have the big problem of batch being very bursty in its 
network usage.  Solving that is going to require network hardware support for 
prioritizing packets.
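
As a rough illustration of the top-of-rack point above, here is a minimal 
sketch in Python.  Everything in it is an assumption made for the example: the 
capacities, the node and rack names, the flow list, and the helper functions 
are hypothetical and are not part of YARN or Storm.  It also simplifies by 
counting traffic in both directions against a single capacity per link.

```python
from collections import defaultdict

# Hypothetical capacities in Gbit/s; these numbers are illustrative only.
NIC_CAPACITY = 10.0      # per-node network card
UPLINK_CAPACITY = 40.0   # per-rack top-of-rack switch uplink

# Hypothetical placement: node name -> rack name, 8 nodes in each of 2 racks.
rack_of = {f"r1-n{i}": "rack1" for i in range(8)}
rack_of.update({f"r2-n{i}": "rack2" for i in range(8)})

# Application topology: each rack1 node streams 6 Gbit/s to a peer in rack2.
flows = [(f"r1-n{i}", f"r2-n{i}", 6.0) for i in range(8)]


def link_loads(flows, rack_of):
    """Sum scheduled bandwidth per node NIC and per rack uplink."""
    nic = defaultdict(float)
    uplink = defaultdict(float)
    for src, dst, gbps in flows:
        nic[src] += gbps
        nic[dst] += gbps
        # Only traffic that leaves the rack crosses the top-of-rack uplink.
        if rack_of[src] != rack_of[dst]:
            uplink[rack_of[src]] += gbps
            uplink[rack_of[dst]] += gbps
    return dict(nic), dict(uplink)


def oversubscribed(loads, capacity):
    """Return the names of links whose scheduled load exceeds capacity."""
    return sorted(name for name, load in loads.items() if load > capacity)


nic, uplink = link_loads(flows, rack_of)
hot_nics = oversubscribed(nic, NIC_CAPACITY)
hot_uplinks = oversubscribed(uplink, UPLINK_CAPACITY)
print("oversubscribed NICs:", hot_nics)
print("oversubscribed uplinks:", hot_uplinks)
```

In this made-up placement every NIC is scheduled at 6 of 10 Gbit/s, so a 
per-node check passes, while each rack uplink is asked to carry 48 Gbit/s 
against a 40 Gbit/s capacity.  The check is only possible because the flow 
list, i.e. the application's own topology, is known to the scheduler.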

But I'll wait for your design before writing too much more.

> Add support for network IO isolation/scheduling for containers
> --------------------------------------------------------------
>
>                 Key: YARN-2140
>                 URL: https://issues.apache.org/jira/browse/YARN-2140
>             Project: Hadoop YARN
>          Issue Type: New Feature
>            Reporter: Wei Yan
>            Assignee: Wei Yan
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)
