I am wondering whether we have downgraded the machines to 5.10.0-20 kernel to 
get rid of the kernel bug which is known to cause issue in user processes at 
random - described in the cover letter here: 
https://lists.debian.org/debian-s390/2023/02/msg00019.html

The following patch fixes this issue:
https://lists.debian.org/debian-s390/2023/02/msg00019.html
The patch fixes the problem introduced in commit 75309018a24d ("s390: add 
support for TIF_NOTIFY_SIGNAL")

Can we check if the patch solves any of the CI issues?

-Dipak


On 16.02.23 17:49, Paul Gevers wrote:
> As you can see e.g. here [1,2] it comes and goes (albeit sometimes the
> queue was empty). I don't think its very different, I just never got out
> of the s390x host what I was expecting. Long time I blamed it on the
> "stealing" that happens on a shared host, but I think there's more.
>
> https://ci.debian.net/munin/ci-worker-s390x-01/ci-worker-s390x-01/debci_packages_processed.html

So a pet peeve of mine are unitless graphs. Can we please annotate what
the unit is? If we're looking at "Packages processed by architecture"[1]
(which again, isn't a workable unit), then we see that s390x does not
have a bad average, nor an overly bad max - given that it's with one
worker instead of like 14 for amd64 and 10 for arm64?

The average/max for the week is double for amd64 vs. s390x, so what does
the queue size mean? Is there still obsolete work in the queue as well
or does every item have the same value? (There's no way right now it can
catch up with that many items in the queue. Although that again depends
on the unit...)

Kind regards
Philipp Kern

[1]
https://ci.debian.net/munin/debian.net/ci-master.debian.net/debci_total_packages_processed.html

Reply via email to