Re: [openstack-dev] [nova] How to debug no valid host failures with placement

Ben Nemec Wed, 01 Aug 2018 10:19:33 -0700


On 08/01/2018 11:23 AM, Chris Friesen wrote:

On 08/01/2018 09:58 AM, Andrey Volkov wrote:
Hi,
It seems you first need to check what placement knows about the resources of your cloud.
This can be done either with the REST API [1] or with osc-placement [2].
For osc-placement you could use:

pip install osc-placement
openstack allocation candidate list --resource DISK_GB=20 \
    --resource MEMORY_MB=2048 --resource VCPU=1 \
    --os-placement-api-version 1.10
And you can explore placement state with other commands like openstack resource provider list, resource provider inventory list, and resource provider usage show.
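
As a rough sketch of that kind of exploration, the function below walks every resource provider and prints its inventory and usage. The name dump_placement_state is hypothetical (it is not part of osc-placement), and the sketch assumes osc-placement is installed and cloud credentials are already loaded in the environment. It only shows the current state of placement.

```shell
# Hypothetical helper, not part of osc-placement: print inventory and usage
# for every resource provider that placement currently knows about.
dump_placement_state() {
    # "-f value -c uuid" prints one bare UUID per line.
    openstack resource provider list -f value -c uuid |
    while read -r uuid; do
        echo "== provider ${uuid} =="
        # Redirect stdin so the inner commands cannot consume the UUID list.
        openstack resource provider inventory list "${uuid}" < /dev/null
        openstack resource provider usage show "${uuid}" < /dev/null
    done
}
```

Calling dump_placement_state with no arguments prints one block per provider, which can be redirected to a file for later comparison.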
Unfortunately this doesn't help figure out what the missing resources were *at the time of the failure*.
The fact that there is no real way to get the equivalent of the old detailed scheduler logs is a known shortcoming in placement, and will become more of a problem if/when we move more complicated things like CPU pinning, hugepages, and NUMA-awareness into placement.
The problem is that getting useful logs out of placement would require significant development work.

Yeah, in my case I only had one compute node so it was obvious what the problem was, but if I had a scheduling failure on a busy cloud with hundreds of nodes I don't see how you would ever track it down. Maybe we need to have a discussion with operators about how often they do post-mortem debugging of this sort of thing?
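
For the current state at least, one way to narrow down many nodes by hand is to compute free capacity per provider from inventory and usage figures. Placement treats capacity as roughly (total - reserved) * allocation_ratio. The sketch below assumes a hypothetical whitespace-separated input (provider, resource class, total, reserved, allocation ratio, used) that the operator assembles themselves; the function name short_providers is also hypothetical. It says nothing about what placement saw at the time of a failure.

```shell
# Hypothetical helper: list providers that cannot fit AMOUNT of RESOURCE_CLASS.
# Usage: short_providers RESOURCE_CLASS AMOUNT < figures.txt
# Input lines: provider resource_class total reserved allocation_ratio used
short_providers() {
    awk -v rc="$1" -v need="$2" '
        $2 == rc {
            # Free capacity: (total - reserved) * allocation_ratio - used
            free = ($3 - $4) * $5 - $6
            if (free < need) print $1, free
        }'
}
```

Each output line names a provider that falls short, followed by the free amount it has left for that resource class.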


__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: [email protected]?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
