Re: [openstack-dev] [oslo] Can we stop global requirements update?

Mike Bayer Fri, 19 May 2017 14:10:45 -0700


On 05/19/2017 04:23 AM, Mehdi Abaakouk wrote:

And some applications rely

on implicit internal contract/behavior/assumption.

IMO that's a bug for them. I'm inspired to see that Keystone, Novaetc. are able to move between and eventlet backend and a mod_wsgibackend. IMO eventlet is really not needed for those services thatpresent a REST interface. Although for a message queue with lots oflong-running connections that receive events, that's a place where I*would* want to use a polling / non-blocking model. But I'd use itexplicitly, not with monkeypatching.


Since a new API is needed, why not writing a new lib. Anyways when you
get rid of eventlet you have so many thing to change to ensure your

performance will not drop.

While I don't know the specifics for your project(s), I don't buy thatin general because IMO eventlet is not giving us any performance boostin the majority of cases. most of our IO is blocking on the databaseand all the applications have DB connections throttled at about 50 perprocess at the max, and that's only recently, it used to be just 15.




Changing from oslo.service to cotyledon is

really easy on the side.

I'd ask why not oslo.cotyledon but it seems there's a faction here thatis overall moving out of the Openstack umbrella in any case.

Docs state: "oslo.service being impossible to fix and bringing anheavy dependency on eventlet, " is there a discussion thread on that?
Not really, I just put some comments on reviews and discus this on IRC.
Since nobody except Telemetry have expressed/try to get rid of eventlet.

Many (most?) of the web services can run under mod_wsgi with threads,Keystone seems to be standard on this now and I get the impression Novais going in that direction too. (anyone correct me if I'm wrong onany of that, I looked to ask around on IRC but it's too late in the day).


For the story we first get rid of eventlet in Telemetry, fixes couple of
performance issue due to using threading/process instead
greenlet/greenthread.

Then we fall into some weird issue due to oslo.service internal
implementation. Process not exiting properly, signals not received,
deadlock when signal are received, unkillable process,
tooz/oslo.messaging heartbeat not scheduled correctly, worker not
restarted when they are dead. All of what we expect from oslo.service
was not working correctly anymore because we remove the line
'eventlet.monkeypatch()'.

So, I've used gevent more than eventlet in my own upstream non-blockingwork, and while this opinion is like spilling water in the ocean, Ithink applications should never use monkeypatching. They should callinto the eventlet/gevent greenlet API directly if that's what they wantto do.

Of course this means that database logic has to move out of greenletsentirely since none of the DBAPIs use non-blocking IO. That's fine.Database-oriented business logic should not be in greenlets. I'vewritten about this as well. If one is familiar enough with greenletsand threads you can write an application that makes explicit use ofboth. However, that's application level stuff. Web service apps likeNova conductor / Neutron / Keystone should not be aware of any of that.They should be coded to assume nothing about context switching. IMOthe threading model is "safer" to code towards since you have to handlelocking and concurrency contingencies explicitly without hardwiring thatto your assumptions about when context switching is to take place andwhen it's not.


For example, when oslo.service receive a signal, it can arrive on any
thread, this thread is paused, the callback is run in this thread
context, but if the callback try to discus to your code in this thread,
the process lockup, because your code is paused. Python
offers tool to avoid that (signal.set_wakeup_fd), but oslo.service don't
use it. I have tried to run callbacks only on the main thread with
set_wakeup_fd, to avoid this kind of issue but I fail. The whole
oslo.service code is clearly not designed to be threadsafe/signalsafe.
Well, it works for eventlet because you have only one real thread.

And this is just one example on complicated thing I have tried to fix,
before starting cotyledon.

I've no doubt oslo.service has major eventlet problems baked in, I'velooked at it a little bit but didn't go too far with it. That stilldoesn't mean there shouldn't be an "oslo.service2" that can effectivelyproduce a concurrency-agnostic platform. It of course would have thegoal in mind of moving projects off eventlet since as I mentioned,eventlet monkeypatching should not be used which means our servicesshould do most of their "implicitly concurrent" work within threads.

Basically I think openstack should be getting off eventlet in a big wayso I guess my sentiment here is that the Gnocchi / Cotyledon /etc.faction is just splitting off rather than serving as any kind ofdirection for the rest of Openstack to start looking. But that's onlyan impression, maybe projects will use Cotyledon anyway. If everyproject goes off and uses something completely different though, then Ithink we're losing. The point of oslo was to prevent that.

I'm finding it hard to believe that only a few years ago, everyone sawthe wisdom of not re-implementing everything in their own projects andusing a common layer like oslo, and already that whole situation isbecoming forgotten - not just for consistency, but also when a bug isfound, if fixed in oslo it gets fixed for everyone.
Because the internal of cotyledon and oslo.service are so different.
Having the code in oslo or not doesn't help for maintenance anymore.
Cotyledon is a lib, code and bugs :) can already be shared between
projects that doesn't want eventlet.
An increase in the scope of oslo is essential to dealing with theissue of "complexity" in openstack.
Increasing the scope of oslo works only if libs have maintainers. But
most of them lack of people today. Most of oslo libs are in maintenance
mode. But that another subject.
The state of openstack as dozens of individual software projects eachwith their own idiosyncratic quirks, CLIs, process and deploymentmodels, and everything else that is visible to operators is groundzero for perceived operator complexity.
Cotyledon have been written to be Openstack agnostic. But I have also
write an optional module within the library to glue oslo.config and
cotyledon. Mainly to mimic the oslo.config options/reload of
oslo.service and make operators experience unchanged for Openstack
people.


OK, so that would be your oslo.cotyledon.   That works.


--
Mehdi Abaakouk
mail: sil...@sileht.net
irc: sileht


__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev


__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Re: [openstack-dev] [oslo] Can we stop global requirements update?

Reply via email to