Re: [DNG] Detailed technical treatise of systemd

Arnt Gulbrandsen Sat, 07 Nov 2015 07:05:00 -0800

Steve Litt writes:

I'd like to discuss this. Now, after a year of thought, I still see no
benefit to "starting servers in parallel" except for boot time.


Because you're thinking of the happy path.

Suppose you have a few dozen servers on three continents, providing auser-facing service, using something like zk or etcd to coordinate theservers.

Suppose further that something on the servers does five DNS lookups atstartup. On the happy path that takes 5*0.008=0.04 seconds and who cares,but the worst case is in minutes. Say five 90-second timeouts. If thingsstart up serially, zk or etcd will begin to initialise about eight minutesafter the server started booting. The cluster can be without a quorum foreight minutes, and if you're lucky that's just a horrible backlog of failedor blocking transactions. If you're unlucky the node has been declaredunhealthy and the cluster has started copying terabytes of data in order torestore redundancy.


For want of an X, Y. In real life ;)

BTW, systemd's approach to parallelism isn't particularly good for thissort of service. Parallelism is good, but not just any kind. Systemd thinksit can start services according to a DAG, but in reality that DAG is notknowable on any single host. For example: Service X on nodes 1-A8 needsservice Y, which runs on nodes 3-5 and 12-15 today. The only sensibleapproach is to start everything and require that all services behaverobustly when a dependency isn't ready.


Arnt

_______________________________________________
Dng mailing list
Dng@lists.dyne.org
https://mailinglists.dyne.org/cgi-bin/mailman/listinfo/dng

Re: [DNG] Detailed technical treatise of systemd

Reply via email to