I've made some modifications to the script (see attached), the changes
include:

1. kill with ALRM first, then kill with KILL if this does not work after a 
small grace period. Also report on unkillable stressors
2. bump up async I/O threshold for machines with lots of CPUs
3. force hdd to do sync writes, that way we don't backlog with gazillions of 
pending I/Os on machines with a lot of memory and many CPUs
4. limit readahead file size so that this stressor does not spend most of it's 
time generating a test file before it can start testing readaheads

I've run this through several times with the latest stress-ng and it
runs through to completion.

So I think we were suffering from issues where loads of pending I/Os
from stressors plus bad cleanup on nuked stressors were causing massive
I/O backlogs which caused the system to clag up.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1640547

Title:
  stress-ng based disk tests failing

To manage notifications about this bug go to:
https://bugs.launchpad.net/stress-ng/+bug/1640547/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to