Re: [modwsgi] Unexplained lag during startup

Jamie Biggar Fri, 23 Oct 2020 04:51:48 -0700

Thank you for the extremely quick turn-around and suggestions, Graham! Per your advice, I removed `WSGIImportScript` and added`WSGIRestrictEmbedded On`.

After some more research, setting up a dummy hello world Django app, Iconfirmed the startup delay wasn't with mod_wsgi after all. It ended upbeing a large number of static files getting scanned at startup byWhitenoise (https://github.com/evansd/whitenoise/).

Thanks again for the quick response and apologies for the noise. I hadbeen spinning my wheels for longer than I care to admit and appreciateyour help ruling out mod_wsgi as the cause.


Best,
Jamie

On 10/22/20 6:40 PM, Graham Dumpleton wrote:

Remove:

    WSGIImportScript /path/to/django-wsgi.py \
      process-group=eslive application-group=%{GLOBAL}
Setting both process-group and application-group on WSGIScriptAliashas the same effect of preloading the WSGI script file usingWSGIScriptAlias. I am not sure what will happen if both ways offorcing preloading are set.
Also a memory corruption bug was also recently reported to me alongwith a fix. This has been an outstanding issue for many years butwhich so rarely occurred on full Linux and macOS platforms (AlpineLinux would crash all the time though), that never been able to trackit down. This bug relates to the preloading of the WSGI script file,so there is an outside chance it is related.
Disabling the preloading may not be desirable though because lazyloading has greater risk of delaying first requests longer as canqueue up on process which is still loading the application. That said,it may not be noticeable since only one thread per process. Thus worthtrying:
    WSGIProcessGroup eslive
    WSGIScriptAlias / /path/to/django-wsgi.py application-group=%{GLOBAL}
which because no WSGIImportScript, but both process-group andapplication-group aren't said, means no pre-loading.
BTW, if you don't already have it said, ensure you are setting:

    WSGIRestrictEmbedded On
if only using daemon mode. Not related, but good practice and cutsdown on memory usage and startup load on Apache child worker processes.
So first up try that. The bug fix I mention hasn't actually beenreleased yet as had some other unfinished stuff in code which wasn'tsure if I wanted to change. If you wanted to be brave though, youcould try the 'develop' branch of mod_wsgi on GitHub. If can replicatein testing system, could perhaps try it there.
The only other thing can think of is if there is a cross processconflict with initialisation done by your app in relation to adatabase or backend service, when multiple processes are starting upat the same time.
Finally, not sure whether might be adapted, but as very first thing inWSGI script file you could start a background thread which watches foran event set at end of WSGI script file import, and if takes more thancertain time to see that event, indicating slow WSGI script file load,dump out Python stack traces. Code related to this is found at:
https://modwsgi.readthedocs.io/en/master/user-guides/debugging-techniques.html#extracting-python-stack-traces
It will need to be updated to Python 3 as probably still Python 2, andthen adapt it as mentioned.
Graham
On 23 Oct 2020, at 8:52 am, Jamie Biggar <[email protected]<mailto:[email protected]>> wrote:
Hi all,
I've been a mod_wsgi user for many years (Graham, thank you for yourfantastic community support!), but this week ran into a mystery Ihaven't been able to solve on my own.
We've been running a fairly hefty Django app in production withmod_wsgi for years without much issue. In August, with no obviouslycorrelated change in code or server architecture, we started havingissues where a restart (usually triggered by `touch`ing the WSGIscript via `WSGIScriptReloading On`, though sometimes also by`systemctl restart httpd.service`) would occasionally lead to anunending stream of 504 timeouts (and sometimes some 503s as well)lasting indefinitely. Another restart would sometimes fix it, butnot always. The issue seems to be load related -- the busier theserver is, the more likely it is to get stuck in the 504 loop. Mostrestarts would work fine and yield a normally-running site after abrief pause as the app was loaded into memory.
While troubleshooting today (not under production load), I noticedsomething that I think is likely exacerbating load-related restarttimeout issues: it seems that after a flurry of activity on initialserver (re)start which clearly includes loading our WSGI script (as Isee entries in the Apache error log related to Python packages itimports), there's a period of roughly 45 seconds when the CPU is idleand no requests are served via mod_wsgi before it wakes up andfinally emits `Started thread 0 in daemon process ...` log messages,then a few seconds later it's able to reply to HTTP requests.
*Any idea what could cause that ~45 second idle period duringstartup?* I've tried tuning the *-timeout options forWSGIDaemonProcess, with no apparent effect on the idle time. I alsotried disabling our NewRelic APM code to rule out a network APIbottleneck.
Software versions:

* Amazon Linux 2
* Python 3.6 (via IUS: https://ius.io/ )
* mod_wsgi/4.6.2 (also via IUS, compiled against Python 3.6)
* Apache/2.4.46
* Django 2.2

Apache config:

WSGIDaemonProcess eslive display-name='(wsgi:es-site)' \
  processes=6 threads=1 \
  user=apache group=apache \
  python-home=/path/to/virtualenv \
  python-path=/path/to/code/root \
  python-eggs=/var/www/.python-eggs \
  lang='en_US.UTF-8' locale='en_US.UTF-8' \
  queue-timeout=45 \
  socket-timeout=60 \
  connect-timeout=15 \
  request-timeout=120 \
  startup-timeout=30 \
  deadlock-timeout=60 \
  eviction-timeout=0 \
  shutdown-timeout=5 \
  graceful-timeout=15 \
  restart-interval=0 \
  inactivity-timeout=0 \
  maximum-requests=0
WSGIImportScript /path/to/django-wsgi.py \
  process-group=eslive application-group=%{GLOBAL}
WSGISocketPrefix run/httpd-wsgi
<VirtualHost ...>
WSGIScriptAlias / /path/to/django-wsgi.py \
  process-group=eslive application-group=%{GLOBAL}
 WSGIPassAuthorization On
</VirtualHost>

Thanks in advance for any recommendations!

-Jamie


--
You received this message because you are subscribed to the GoogleGroups "modwsgi" group.To unsubscribe from this group and stop receiving emails from it,send an email to [email protected]<mailto:[email protected]>.To view this discussion on the web visithttps://groups.google.com/d/msgid/modwsgi/bcb386ac-7c83-459d-bced-792d535a09d0n%40googlegroups.com<https://groups.google.com/d/msgid/modwsgi/bcb386ac-7c83-459d-bced-792d535a09d0n%40googlegroups.com?utm_medium=email&utm_source=footer>.
--
You received this message because you are subscribed to a topic in theGoogle Groups "modwsgi" group.To unsubscribe from this topic, visithttps://groups.google.com/d/topic/modwsgi/EYQ6O5NLC3k/unsubscribe.To unsubscribe from this group and all its topics, send an email to[email protected]<mailto:[email protected]>.To view this discussion on the web visithttps://groups.google.com/d/msgid/modwsgi/4D0C89F5-4F66-478A-B61D-049C3C8622AD%40gmail.com<https://groups.google.com/d/msgid/modwsgi/4D0C89F5-4F66-478A-B61D-049C3C8622AD%40gmail.com?utm_medium=email&utm_source=footer>.


--

*Jamie Biggar*
VP Engineering & CTO, EnergySage <https://www.energysage.com/>
617.396.7215 | [email protected] <mailto:[email protected]>

Get an _instant estimate_<https://www.energysage.com/solar/calculator/> to see your solar savings!


--
You received this message because you are subscribed to the Google Groups 
"modwsgi" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/modwsgi/43736233-355c-9205-71d1-0286aad37d34%40energysage.com.

Re: [modwsgi] Unexplained lag during startup

Reply via email to