The full etherpad for cells discussions at the PTG is here [1].

We mostly talked about the limitations with multiple cells that were identified in Pike [2], and about priorities for Queens.

Top priorities for cells in Queens
----------------------------------

* Alternate hosts: with multiple cells in a tiered (super) conductor mode, we don't have reschedules happening when a server build fails on a compute. Ed Leafe has already started working on the code to build an object to pass from the scheduler to the super conductor. We'll then send that from the super conductor down to the compute service in the cell, and reschedules can happen within a cell using that provided list of alternate hosts (and the pre-determined allocation requests for Placement provided by the scheduler). We agreed that we should get this done early in Queens so that we have ample time to flush out and fix bugs. There is a rough sketch of the retry flow at the end of this list.

* Instance listing across multiple cells: this is going to involve merge-sorting the instance lists we get back from multiple cells, which today are filtered/sorted within each cell and then returned out of the API in a "barber pole" pattern. We are not going to use Searchlight for this, but will instead do it with more efficient cross-cell DB queries. Dan Smith is going to work on this. The merge idea is also sketched at the end of this list.
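
For the alternate hosts work, here is a very rough sketch of the retry flow (made-up names and interfaces, not Ed's actual object model or RPC API): the cell conductor gets an ordered list of candidate hosts, each paired with the allocation request the scheduler already computed for Placement, and walks it on a reschedule without calling back up to the scheduler.

    # Hypothetical sketch only; names and interfaces are made up.
    class NoValidHost(Exception):
        pass

    def build_with_alternates(instance, selections, placement, compute_rpc):
        # selections: ordered [(host, allocation_request), ...] handed down
        # from the scheduler via the super conductor.
        for host, alloc_req in selections:
            # Claim the pre-computed allocation in Placement for this host.
            if not placement.claim(instance.uuid, alloc_req):
                continue  # host no longer has capacity, try the next one
            try:
                compute_rpc.build_and_run_instance(host, instance)
                return host
            except Exception:
                # The build failed on this host: free the claim and retry
                # within the cell using the next alternate.
                placement.delete_allocation(instance.uuid)
        raise NoValidHost()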

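For the cross-cell listing, the rough idea (again just a sketch, not Dan's implementation) is that each cell returns results already sorted by the requested keys and the API layer merges them into a single sorted page, instead of concatenating them cell by cell:

    # Illustrative only: merge pre-sorted per-cell results into one page.
    import heapq
    import itertools
    from operator import itemgetter

    def merge_cell_results(per_cell_results, sort_key='created_at', limit=1000):
        # per_cell_results: a list of instance lists, one per cell, each
        # already sorted by sort_key in its own cell database.
        merged = heapq.merge(*per_cell_results, key=itemgetter(sort_key))
        return list(itertools.islice(merged, limit))
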
Dealing with up-calls
---------------------

In a multi-cell or tiered (super) conductor mode deployment, the cell conductor and compute services cannot reach the top-level (API) database or message queue. This breaks a few things that work today.

* Instance affinity reporting from the computes to the scheduler won't work without the MQ up-call. There is also a check that happens late in the build process on the compute which verifies that server group affinity/anti-affinity policies are maintained, and that is an up-call to the API database. Both of these will be solved long-term when we model distance in Placement, but we are deferring that from Queens. The late affinity check in the compute is not an issue if you're running a single cell (not using a tiered super conductor mode deployment). If you're running multiple cells, you can configure the cell conductors to have access to the API database as a workaround; an example is sketched at the end of this list. We wouldn't test with this workaround in CI, but it's an option for people that need it.

* There is a host aggregate up-call when performing live migration with the xen driver and letting the driver determine whether block migration should be used. We decided to just put a note in the code that this doesn't work and leave it as a limitation for that driver and scenario. Xen driver maintainers or users can fix it if they want, but we aren't going to make it a priority.

* There is a host aggregate up-call when doing boot from volume where the compute service creates the volume: it checks to see whether the instance AZ and the volume AZ match when [cinder]/cross_az_attach is False (not the default). Checking the AZ for the instance involves getting the host aggregates that the instance is in, and those are in the API database; a sketch of that check is at the end of this list. We agreed that for now, people running multiple cells and using this cross_az_attach=False setting can configure the cell conductor to reach the API database, like the late affinity check described above. Sylvain Bauza is also looking at reasons why we even do this check if the user did not request a specific AZ, so there could be other general changes in the design of this cross_az_attach check later. That is being discussed here [3].
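
The API database workaround mentioned a couple of times above amounts to giving the cell conductor a connection to the API database in its nova.conf, along these lines (the connection URL here is just an example):

    # Cell conductor nova.conf (workaround only; not something we test in CI).
    [api_database]
    connection = mysql+pymysql://nova:SECRET@api-db-host/nova_api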

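And to show why the cross_az_attach check is an up-call, here is a rough sketch of the logic (hypothetical helper names, not the actual Nova code); resolving the instance's AZ means reading the host's aggregates, which live in the API database:

    # Hypothetical sketch only.
    def check_instance_volume_az(host, volume_az, api_db, cross_az_attach=False):
        if cross_az_attach:
            return  # [cinder]/cross_az_attach=True (the default): no check
        # Up-call: host aggregates (and their AZ metadata) are in the API DB.
        aggregates = api_db.get_aggregates_for_host(host)
        instance_az = next(
            (agg.metadata['availability_zone'] for agg in aggregates
             if 'availability_zone' in agg.metadata),
            'nova')  # default AZ when the host is not in an AZ aggregate
        if instance_az != volume_az:
            raise ValueError('instance AZ %s does not match volume AZ %s'
                             % (instance_az, volume_az))
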
Other discussion
----------------

* We have a utility to concurrently run database queries against multiple cells. We are going to look at whether we can retrofit some code paths that currently query the cells serially with this utility to improve performance; the idea is sketched below this list.

* Making the consoleauth service run per-cell is going to be low priority until some large cells v2 deployments start showing up and saying that a global consoleauth service is not scaling and it needs to be fixed.

* We talked about using the "GET /usages" Placement API for counting quotas rather than gathering that information from each cell, but there are quite a few open questions about the design and about edge cases like move operations and Ironic with custom resource classes. So while this is something that should make counting quotas perform better, it's complicated and not a priority for Queens. The call in question is sketched below this list.

* Finally, we also talked about the future of cells v1 and when we can officially deprecate and remove it. We've already been putting warnings in the code, docs and config options for a long time about not using cells v1 since it is being replaced with cells v2. *We agreed that if we can get efficient multi-cell instance listing fixed in Queens, we'll remove both cells v1 and nova-network in Rocky.* At least since the Boston Pike summit, we've been asking large cells v1 deployments to start checking out cells v2 and report what issues they run into with the transition, and so far we haven't gotten any feedback, so we're hoping this timeline will spur some movement on that front. Dan Smith also called dibs on the code removal.
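
The multi-cell query utility mentioned above is basically a scatter/gather pattern, roughly like this (just a sketch of the idea, not the actual utility):

    # Sketch: run the same DB query against every cell concurrently instead
    # of looping over the cells one at a time.
    from concurrent.futures import ThreadPoolExecutor

    def scatter_gather_cells(cell_contexts, query_fn):
        with ThreadPoolExecutor(max_workers=len(cell_contexts)) as pool:
            futures = {cell: pool.submit(query_fn, cell)
                       for cell in cell_contexts}
            return {cell: future.result()
                    for cell, future in futures.items()}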

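And for reference, the Placement call being discussed for quotas is roughly shaped like this (the values are made up and the exact response depends on the microversion you request):

    GET /usages?project_id=<project uuid>

    {
        "usages": {
            "VCPU": 4,
            "MEMORY_MB": 8192,
            "DISK_GB": 80
        }
    }
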
[1] https://etherpad.openstack.org/p/nova-ptg-queens-cells
[2] https://docs.openstack.org/nova/latest/user/cellsv2_layout.html#caveats-of-a-multi-cell-deployment
[3] http://lists.openstack.org/pipermail/openstack-operators/2017-September/014200.html

--

Thanks,

Matt
