Hi, > Le 2 juin 2023 à 07:43, Petr Štetiar <yn...@true.cz> a écrit : > > Thibaut <ha...@slashdirt.org> [2023-06-01 18:21:22]: > > Hi, > >>> There has been many timeouts of "3600 seconds without output" in master, >> >> These look like connectivity issues. > > I'm not sure, as there is a keep alive going on between master/worker so > master would remove the worker quite sooner due to keep alive response > timeout, wouldn't it? Putting asside some buildbot bugs of course.
You are correct, I was talking out of my rear end ;) This is not a connectivity issue, the build is actually hung. dmesg might have more info. > Workers osuosl-dock-09,10,11,12 are on one build host and > osuosl-dock-05,06,07,08 are on the second build host, wouldn't they have same > connectivity issues at the same time? Correct. I noticed you adjusted cpu affinity, on NUMA nodes it helps performance. On my buildbot setup I used cgroups and assigned each buildworker to a specific cgroup, which enables me to adjust CPU affinity *and* memory affinity, which you also want. It also enables you to easily set memory limits for each build worker. I’m happy to share my config if interested, it’s not a very complex setup. > I'm not saying it's not possible, there has been similar network issues in the > past, so it might be it. > >>> and quite too many "out of space" errors in the 23.05 packages buildbot. >> >> 23.05 package builders are nearly all out of space, possibly due to >> accumulated cruft in dl dir. > > from the quick look it seems like Rust has increased the disk space > requirements in shared work directory. I’m confused with that sentence: the du step shows 36G used, but df says all 60G are full; which suggests something *outside* of the build directory is eating space? Cheers, Thibaut _______________________________________________ openwrt-devel mailing list openwrt-devel@lists.openwrt.org https://lists.openwrt.org/mailman/listinfo/openwrt-devel