Re: [Openstack-operators] [nova] Live migration performance tests on 100 compute nodes

2017-01-03 Thread Rui Chen
You can also use the nova option "live_migration_inbound_addr" [1] to avoid
the impact of live migration traffic altogether, but that requires an
additional network plane [2].
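
As a rough illustration of the option above, the sketch below renders a
[libvirt] section that sets live_migration_inbound_addr using Python's
configparser. The helper name and the address 192.0.2.10 (a documentation-range
placeholder) are assumptions made for this example; the real value would be an
address of the target compute node on the dedicated migration network.

```python
import configparser
import io


def render_libvirt_section(inbound_addr: str) -> str:
    """Return nova.conf text with a [libvirt] section (hypothetical helper)."""
    conf = configparser.ConfigParser()
    conf["libvirt"] = {"live_migration_inbound_addr": inbound_addr}
    buf = io.StringIO()
    conf.write(buf)
    return buf.getvalue()


# 192.0.2.10 is a placeholder, not an address taken from this thread.
snippet = render_libvirt_section("192.0.2.10")
print(snippet)
```

The option is set per compute node, so each node would carry its own address
on the migration network.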

[1]:
https://github.com/openstack/nova/blob/master/nova/conf/libvirt.py#L243-L258
[2]:
https://blueprints.launchpad.net/nova/+spec/split-network-plane-for-live-migration

2016-12-30 20:56 GMT+08:00 Koniszewski, Pawel :

> Matt,
>
>
>
> This was a bandwidth issue. Nova stayed connected to the broker, but RPC
> messages started to time out. For example, we lost some of the RPC messages
> that trigger the post-live-migration steps, which update the nova DB to
> reflect the new host of an instance.
>
>
>
> A good workaround for this issue is to slightly limit the bandwidth used for
> live migrations through the live_migration_bandwidth config option in the
> [libvirt] section of nova.conf [1]. By default it is set to 0, which
> effectively means unlimited. Also, please be aware that we changed the
> default live migration configuration in OpenStack Newton: tunneling is now
> off by default (live_migration_tunnelled in the [libvirt] section is set to
> False) because of its huge performance impact.
>
>
>
> [1] http://docs.openstack.org/newton/config-reference/compute/config-options.html
>
>
>
> Kind Regards,
>
> Pawel Koniszewski
>
>
>
> *From:* tadow...@gmail.com [mailto:tadow...@gmail.com] *On Behalf Of *Matt
> Fischer
> *Sent:* Friday, December 30, 2016 5:31 AM
> *To:* Koniszewski, Pawel 
> *Cc:* openstack-operators@lists.openstack.org
> *Subject:* Re: [Openstack-operators] [nova] Live migration performance
> tests on 100 compute nodes
>
>
>
> On Wed, Dec 28, 2016 at 6:11 AM, Koniszewski, Pawel <
> pawel.koniszew...@intel.com> wrote:
>
> Hello everyone,
>
> We did some research to see how live migration performance varies between
> different configurations; in particular, we aimed to compare tunneled and
> non-tunneled live migrations. To test live migration, we simulated 0-day
> patching of 100 compute nodes (including reboot) with workloads that are
> close to real-world workloads. All the results were published [1], along
> with the environment configuration and a description of how we built the
> test framework. I hope you find this useful.
>
> [1] https://01.org/openstack/blogs/pkoniszewski/2016/ossc-zero-day-patching
>
> Kind Regards,
> Pawel Koniszewski
>
>
>
> Thanks for the write-up. I'm curious about your RabbitMQ connection
> failures. Was it nova-compute failing to connect? Was it a bandwidth or
> heartbeat issue?
>
> ___
> OpenStack-operators mailing list
> OpenStack-operators@lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
>
>
___
OpenStack-operators mailing list
OpenStack-operators@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators


Re: [Openstack-operators] [nova] Live migration performance tests on 100 compute nodes

2016-12-30 Thread Koniszewski, Pawel
Matt,

This was a bandwidth issue. Nova stayed connected to the broker, but RPC
messages started to time out. For example, we lost some of the RPC messages
that trigger the post-live-migration steps, which update the nova DB to
reflect the new host of an instance.

A good workaround for this issue is to slightly limit the bandwidth used for
live migrations through the live_migration_bandwidth config option in the
[libvirt] section of nova.conf [1]. By default it is set to 0, which
effectively means unlimited. Also, please be aware that we changed the default
live migration configuration in OpenStack Newton: tunneling is now off by
default (live_migration_tunnelled in the [libvirt] section is set to False)
because of its huge performance impact.
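
To make the two [libvirt] options mentioned above concrete, here is a minimal
sketch that reads them from a nova.conf fragment with Python's configparser.
The helper name and the value 100 are assumptions made for this example, not
recommendations from the tests; as far as I know nova interprets
live_migration_bandwidth in MiB/s, so check the config reference for your
release before picking a number.

```python
import configparser

# Example nova.conf fragment. The value 100 is an arbitrary placeholder,
# not a figure taken from this thread.
NOVA_CONF = """
[libvirt]
live_migration_bandwidth = 100
live_migration_tunnelled = False
"""


def migration_settings(text: str) -> dict:
    """Read the two [libvirt] options discussed above (hypothetical helper)."""
    conf = configparser.ConfigParser()
    conf.read_string(text)
    return {
        # 0 is the default described above: no explicit cap is set in nova.conf.
        "bandwidth": conf.getint(
            "libvirt", "live_migration_bandwidth", fallback=0
        ),
        # False is the Newton default described above.
        "tunnelled": conf.getboolean(
            "libvirt", "live_migration_tunnelled", fallback=False
        ),
    }


settings = migration_settings(NOVA_CONF)
print(settings)
```

When an option or the whole section is missing, the helper falls back to the
defaults described in this message.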

[1] 
http://docs.openstack.org/newton/config-reference/compute/config-options.html

Kind Regards,
Pawel Koniszewski

From: tadow...@gmail.com [mailto:tadow...@gmail.com] On Behalf Of Matt Fischer
Sent: Friday, December 30, 2016 5:31 AM
To: Koniszewski, Pawel 
Cc: openstack-operators@lists.openstack.org
Subject: Re: [Openstack-operators] [nova] Live migration performance tests on 
100 compute nodes

On Wed, Dec 28, 2016 at 6:11 AM, Koniszewski, Pawel
<pawel.koniszew...@intel.com> wrote:
Hello everyone,

We did some research to see how live migration performance varies between
different configurations; in particular, we aimed to compare tunneled and
non-tunneled live migrations. To test live migration, we simulated 0-day
patching of 100 compute nodes (including reboot) with workloads that are close
to real-world workloads. All the results were published [1], along with the
environment configuration and a description of how we built the test
framework. I hope you find this useful.

[1] https://01.org/openstack/blogs/pkoniszewski/2016/ossc-zero-day-patching

Kind Regards,
Pawel Koniszewski

Thanks for the write-up. I'm curious about your RabbitMQ connection failures. 
Was it nova-compute failing to connect? Was it a bandwidth or heartbeat issue?


Re: [Openstack-operators] [nova] Live migration performance tests on 100 compute nodes

2016-12-29 Thread Matt Fischer
On Wed, Dec 28, 2016 at 6:11 AM, Koniszewski, Pawel <
pawel.koniszew...@intel.com> wrote:

> Hello everyone,
>
> We did some research to see how live migration performance varies between
> different configurations; in particular, we aimed to compare tunneled and
> non-tunneled live migrations. To test live migration, we simulated 0-day
> patching of 100 compute nodes (including reboot) with workloads that are
> close to real-world workloads. All the results were published [1], along
> with the environment configuration and a description of how we built the
> test framework. I hope you find this useful.
>
> [1] https://01.org/openstack/blogs/pkoniszewski/2016/ossc-zero-day-patching
>
> Kind Regards,
> Pawel Koniszewski
>

Thanks for the write-up. I'm curious about your RabbitMQ connection
failures. Was it nova-compute failing to connect? Was it a bandwidth or
heartbeat issue?


[Openstack-operators] [nova] Live migration performance tests on 100 compute nodes

2016-12-28 Thread Koniszewski, Pawel
Hello everyone,

We did some research to see how live migration performance varies between
different configurations; in particular, we aimed to compare tunneled and
non-tunneled live migrations. To test live migration, we simulated 0-day
patching of 100 compute nodes (including reboot) with workloads that are close
to real-world workloads. All the results were published [1], along with the
environment configuration and a description of how we built the test
framework. I hope you find this useful.

[1] https://01.org/openstack/blogs/pkoniszewski/2016/ossc-zero-day-patching

Kind Regards,
Pawel Koniszewski
