On Mon, Sep 4, 2017 at 9:11 PM, Jay Pipes <[email protected]> wrote:
On 09/01/2017 04:42 AM, Rua, Philippe (Nokia - FI/Espoo) wrote:
> Will it be possible to include network bandwidth as a resource in Nova scheduling, for VM placement decision?

Yes.

See here for a related Neutron spec that mentions Placement:
https://review.openstack.org/#/c/396297/7/specs/pike/strict-minimum-bandwidth-support.rst

> Context: in telecommunication applications, the network traffic is an important dimension of resource usage. For example, it is often important to distribute "bandwidth-greedy" VMs to different compute nodes. There were some earlier discussions on this topic, but I could not find a concrete outcome. [1][2][3]
>
> After some reading, I wonder whether the Custom resource classes can provide a generic mechanism? [4][5][6]

No :) Custom resource classes are antithetical to generic/standard
mechanisms.

We want to add two *standard* resource classes, one called
NET_INGRESS_BYTES_SEC and another called NET_EGRESS_BYTES_SEC, which
would represent the total bandwidth in bytes per second for the
corresponding traffic directions.

While I agree that the end goal is to have standard resource classes for bandwidth, I think custom resource classes are generic enough to model bandwidth as a resource. If you want to experiment with bandwidth-based scheduling on Pike, custom resource classes are available as a tool for a proof of concept.
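For example, a PoC could register such a class directly against the
Placement REST API. A minimal sketch with python-requests, assuming a
hypothetical placement endpoint and admin token; the class name
CUSTOM_NET_BW is just an illustration:

    import requests

    PLACEMENT = 'http://placement.example.com/placement'  # hypothetical
    HEADERS = {
        'X-Auth-Token': '<admin-token>',
        # PUT /resource_classes/{name} needs microversion >= 1.7;
        # Pike supports up to 1.10.
        'OpenStack-API-Version': 'placement 1.10',
    }

    # Idempotently create the custom resource class; custom class
    # names must carry the CUSTOM_ prefix.
    resp = requests.put(PLACEMENT + '/resource_classes/CUSTOM_NET_BW',
                        headers=HEADERS)
    assert resp.status_code in (201, 204)  # created / already existed

A flavor could then request the resource with a
resources:CUSTOM_NET_BW=<amount> extra spec, in the spirit of the
original question quoted below.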



What would be the resource provider, though? There are at least two
potential answers here:

1) A network interface controller on the compute host

In this case, the NIC on the host would be a child provider of the
compute host resource provider. It would have an inventory record of
resource class NET_INGRESS_BYTES_SEC with a total value representing the
entire bandwidth of the host NIC. Instances would consume some amount of
NET_INGRESS_BYTES_SEC corresponding *either* to the Nova flavor (if the
resources:NET_INGRESS_BYTES_SEC extra-spec is set) *or* to the sum of
consumed bandwidth amounts from the port profiles of any ports specified
when launching the instance (and thus would be part of the PCI device
request collection attached to the build request).
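To make option 1 concrete, here is a sketch of how the NIC provider's
inventory could be written via the Placement API (endpoint, token and
the provider uuid are placeholders; the NIC is assumed to be 10 Gbit/s):

    import requests

    PLACEMENT = 'http://placement.example.com/placement'  # hypothetical
    HEADERS = {'X-Auth-Token': '<admin-token>',
               'OpenStack-API-Version': 'placement 1.10'}

    NIC_RP_UUID = '...'  # placeholder: uuid of the NIC child provider
    GBIT = 10**9 // 8    # 1 Gbit/s expressed in bytes/sec

    inventory = {
        # generation from a prior GET of the provider, used for
        # conflict detection on concurrent updates
        'resource_provider_generation': 1,
        'total': 10 * GBIT,      # the whole 10 Gbit/s NIC
        'reserved': 0,
        'min_unit': 1,
        'max_unit': 10 * GBIT,
        'step_size': 1,
        'allocation_ratio': 1.0,
    }
    resp = requests.put(
        PLACEMENT + '/resource_providers/%s/inventories/'
        'NET_INGRESS_BYTES_SEC' % NIC_RP_UUID,
        json=inventory, headers=HEADERS)
    resp.raise_for_status()

A flavor carrying resources:NET_INGRESS_BYTES_SEC=125000000 would then
consume 1 Gbit/s worth of this inventory.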

2) A "network slice" of a network interface controller on the compute host

In this case, assume that the NIC on the compute host has had its total
bandwidth constrained via traffic control so that 50% of its available
ingress bandwidth is allocated to network A and 50% is allocated to
network B.

There would be two resource providers, each with an inventory record of
resource class NET_INGRESS_BYTES_SEC with a total value of 1/2 the total
NIC bandwidth. Both of these resource providers would be child providers
of the compute host resource provider. One of these child resource
providers would be decorated with the trait "CUSTOM_NETWORK_A" and the
other with the trait "CUSTOM_NETWORK_B".
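In API terms, that decoration could look like the following sketch
(trait names from the example above; the provider uuid is a
placeholder). A custom trait has to be created before it can be put on
a provider:

    import requests

    PLACEMENT = 'http://placement.example.com/placement'  # hypothetical
    HEADERS = {'X-Auth-Token': '<admin-token>',
               'OpenStack-API-Version': 'placement 1.10'}

    # Custom traits (CUSTOM_ prefix required) are global; create them
    # once. PUT /traits/{name} needs microversion >= 1.6.
    for trait in ('CUSTOM_NETWORK_A', 'CUSTOM_NETWORK_B'):
        requests.put(PLACEMENT + '/traits/' + trait, headers=HEADERS)

    # Tag the network A slice provider with its trait; the generation
    # comes from a prior GET of the provider.
    SLICE_A_UUID = '...'  # placeholder uuid
    resp = requests.put(
        PLACEMENT + '/resource_providers/%s/traits' % SLICE_A_UUID,
        json={'resource_provider_generation': 1,
              'traits': ['CUSTOM_NETWORK_A']},
        headers=HEADERS)
    resp.raise_for_status()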

The scheduler would be able to determine which resource provider to
consume the NET_INGRESS_BYTES_SEC resources from by looking for a
resource provider that has both the required amount of
NET_INGRESS_BYTES_SEC as well as the trait required by the port profile.
If, say, the port profile specifies that the port is to go on a NIC with
access to network "A", then the build request would contain a request to
the scheduler for the CUSTOM_NETWORK_A trait...
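On the query side, Pike's new allocation candidates API can already
answer the resource half of that question. A sketch; note that trait
filtering on GET /allocation_candidates only arrived in later
microversions, so a Pike-based PoC would have to post-filter on traits
itself:

    import requests

    PLACEMENT = 'http://placement.example.com/placement'  # hypothetical
    HEADERS = {'X-Auth-Token': '<admin-token>',
               'OpenStack-API-Version': 'placement 1.10'}

    # Ask for providers able to satisfy 1 Gbit/s of ingress bandwidth
    # (125000000 bytes/sec); GET /allocation_candidates is new in 1.10.
    resp = requests.get(
        PLACEMENT + '/allocation_candidates',
        params={'resources': 'NET_INGRESS_BYTES_SEC:125000000'},
        headers=HEADERS)
    candidates = resp.json()

    # provider_summaries maps provider uuid -> capacity/usage; a PoC
    # would cross-check each uuid's traits (GET .../{uuid}/traits)
    # to keep only providers tagged CUSTOM_NETWORK_A.
    print(list(candidates['provider_summaries']))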

The above setup can be simulated with custom resource classes and a
separate resource provider per compute node, connected to the given
compute node's resource provider via an aggregate. You would most
probably need to simulate the above network traits with individual
custom resource classes in Pike.
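A sketch of that wiring, assuming the bandwidth provider and the
compute node provider already exist (all uuids are placeholders):

    import uuid
    import requests

    PLACEMENT = 'http://placement.example.com/placement'  # hypothetical
    HEADERS = {'X-Auth-Token': '<admin-token>',
               'OpenStack-API-Version': 'placement 1.10'}

    AGG_UUID = str(uuid.uuid4())  # one aggregate per compute node
    BW_RP_UUID = '...'            # placeholder: bandwidth provider
    COMPUTE_RP_UUID = '...'       # placeholder: compute node provider

    # Put both providers into the same placement aggregate. In this
    # microversion the PUT body is simply the list of aggregate uuids.
    for rp in (BW_RP_UUID, COMPUTE_RP_UUID):
        resp = requests.put(
            PLACEMENT + '/resource_providers/%s/aggregates' % rp,
            json=[AGG_UUID], headers=HEADERS)
        resp.raise_for_status()

    # A provider sharing inventory through an aggregate also needs the
    # MISC_SHARES_VIA_AGGREGATE trait for placement to consider it.
    requests.put(
        PLACEMENT + '/resource_providers/%s/traits' % BW_RP_UUID,
        json={'resource_provider_generation': 1,
              'traits': ['MISC_SHARES_VIA_AGGREGATE']},
        headers=HEADERS)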

I definitely don't think it is something I would do in production based on Pike, for two reasons: 1) there are bugs in the Pike GA that prevent Nova from handling some edge cases (especially in VM move scenarios); 2) I agree with Jay that nested providers and Neutron support will allow us to do something much cleaner in the future.

However, I think Pike is a good base on which to build a PoC and gather feedback. For example, I already foresee a need to model OVS packet processing limits and, in the long run, even to bring the capacity of the ToR switches into the picture.



If you're coming to Denver, I encourage you to get together with me, Sean Mooney, Moshe Levi and others who are interested in seeing this work move forward.

@Jay: sign me up for this list.

Cheers,
gibi


Best,
-jay

> Here is what I have in mind:
> - The VM need is specified in the flavor extra-specs, e.g.
> resources:CUSTOM_BANDWIDTH=123.
> - The compute node total capacity is specified in host aggregate
> metadata, e.g. CUSTOM_BANDWIDTH=999.
> - Nova then takes care of the rest: scheduling where the free capacity
> is sufficient, and performing simple resource usage accounting
> (updating the compute node free network bandwidth capacity as
> required).
>
> Is the outline above according to current plans?
> If not, what would be possible/needed in order to achieve the same result, i.e. consider the VM network traffic need during VM placement?
>
> BR,
> Philippe
>
> [1] https://blueprints.launchpad.net/nova/+spec/bandwidth-as-scheduler-metric
> [2] https://wiki.openstack.org/wiki/NetworkBandwidthEntitlement
> [3] https://openstack.nimeyo.com/80515/openstack-scheduling-bandwidth-resources-nic_bw_kb-resource
> [4] https://docs.openstack.org/nova/latest/user/placement.html
> [5] http://specs.openstack.org/openstack/nova-specs/priorities/pike-priorities.html#placement
> [6] https://review.openstack.org/#/c/473627/
>



__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: [email protected]?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
