1] Support per-test execution timeout factor)

Jacob Bachmeyer Wed, 03 Jan 2024 19:19:03 -0800

Maciej W. Rozycki wrote:

On Wed, 3 Jan 2024, Hans-Peter Nilsson wrote:
The test execution timeout is different from the tool execution timeoutwhere it is GCC execution that is being guarded against taking excessiveamount of time on the test host rather than the resulting test caseexecutable run on the target afterwards, as concerned here. GCC alreadyhas a `dg-timeout-factor' setting for the tool execution timeout, but hasno means to increase the test execution timeout. The GCC side of thesechanges adds a corresponding `dg-test-timeout-factor' setting.
Hmm. I think it would be more correct to emphasize that theexisting dg-timeout-factor affects both the tool execution *and*the test execution, whereas your new dg-test-timeout-factor onlyaffects the test execution. (And still measured on the host.)
Not really, `dg-timeout-factor' is only applied to tool execution and itdoesn't affect test execution. Timeout value reporting used to be limitedin DejaGNU, but you can enable it easily now by adding the DejaGNU patchseries referred in the cover letter and see that `dg-timeout-factor' isignored for test execution.

Then we need a better name for this new feature that more clearlyindicates that it applies to running executables compiled as part of atest. Also, 'test_timeout' is documented as a knob for siteconfiguration to twiddle, not for testsuites to adjust. I supportadding scale factors for testsuites to indicate "this test takes longerthan usual" but these will need to be thought through. This quick hackwill cause future maintenance problems.

Usually the compilation time is close to 0, so is this based onan actual need more than an itchy "wart"?
Or did I miss something?
Compilation is usually quite fast, but this is not always the case. Ifyou look at the tests that do use `dg-timeout-factor' in GCC, and somecommits that added the setting, then you ought to find actual use cases.I saw at least one such a test that takes an awful lot of time here on areasonably fast host machine and still passes where GCC has been builtwith optimisation enabled, but does time out in the compilation phase ifthe compiler has been built at -O0 for debugging purposes. I'd have tochase it though if you couldn't find it as I haven't written the namedown.
So yes, `dg-timeout-factor' does have its use, but it is different fromthat of `dg-test-timeout-factor', hence the need for a separate setting.

This name has already caused confusion and the patch has not even beenaccepted yet. The feature is desirable but this implementation is notacceptable.


At the moment, there are two blocking issues with this patch:

1. The global variable name 'test_timeout_factor' is not acceptablebecause it has already caused confusion, apparently among GCC developerswho should be familiar with the GCC testsuite. If it already confusesGCC testsuite domain experts, its meaning is too unclear for generaluse. While looking for alternative names, I found the fundamentalproblem with this proposed implementation: test phases (such as runninga test program versus running the tool itself) are defined by thetestsuite, not by the framework. DejaGnu therefore cannot explicitlysupport this as offered because the proposal violates encapsulation bothways.

2. New code in DejaGnu using expr(n) is to have the expression bracedas recommended in the expr(n) manpage, unless it actually uses thesemantics provided by unbraced expr expressions, in which case it*needs* a comment explaining and justifying that.


The second issue is trivially fixable, but the first appears fatal.

There is a new "testcase" mulitplex command in Git master, which will beincluded in the next release, that is intended for testsuites to expressdynamic state. The original planned use was to support hierarchicaltest groups, for which a "testcase group" command is currently defined.In the future, dg.exp will be extended to use "testcase group" todelimit each testcase that it processes, and the framework will itselfexplicitly track each test script as a group. (DejaGnu's currentsemantics implicitly group tests by test scripts, but only by (*.exp)scripts.) Could this multiplex be a suitable place to put this API feature?

Using a command also has the advantage that it will cause a hard failureif the framework does not implement it, unlike a variable that a testscript can set for the framework to silently ignore, leading tohard-to-reproduce test (timeout) failures if an older framework is usedwith a testsuite expecting this feature. The semantics of "testcasepatience" or similar would be defined to extend to the end of the group(or test script in versions of DejaGnu that do not fully implementgroups) in which it is executed. This limited scope is needed becauseallowing timeout scale factors to "bleed over" to the next test scriptwould play havoc with the planned native parallel testing support, wherethe "next" script could have already started in another process.


I suggest a few possible commands off the top of my head:
   testcase ask patience WHAT FACTOR
   testcase declare patience WHAT FACTOR
   testcase patience WHAT FACTOR

The FACTOR is a scale factor, similar to the proposed'test_timeout_factor' or possibly the keyword "reset" (or special value0?) to clear a previous factor before leaving a group. Multipleinvocations stack: the effective scale factor is the product of allapplicable scale factors. (This will have straightforward interactionswith groups: leaving a group will restore the scale factor in effectwhen the group was entered. The initial scale factor at top-level is 1,for any WHAT.)

The WHAT is a keyword from a to-be-determined set. There is apossibility that parts of the framework might eventually respond tocertain WHAT values, but for now, would "dg-run" be suitable to expressa timeout for running a test program and "dg-compile" for the timeout onrunning GCC itself? This could lead to reserving dg-* WHAT values fordg.exp based testsuites to define, with a convention that dg-WHAT scalesthe timeout for "dg-do WHAT".

Leaving the definition of WHAT to the testsuite is not an insurmountablebarrier, as providing an inquiry command for the testsuite to use wouldnot be difficult. This seems to lead towards a "testcase declarepatience WHAT FACTOR" and "testcase inquire patience WHAT" pair. Theformer multiplies the current WHAT scale factor by FACTOR, while thelatter returns the appropriate running product.

All this provides a nice way to add upstream support for dg-patience ("{dg-patience dg-run 3 }" or "{ dg-patience dg-compile 2 }") or a similartag to dg.exp, but still leaves the issue of communicating /which/ scalefactor to use to the various command execution procedures. Here we comeback to the same problem, since the current API shape (not changinganytime soon) does not provide a way to pass a timeout value or scalefactor, other than using a "magic" variable. So we are back to'timeout_scale_factor', but documented in the procedure documentationfor the remote_* procedures. In this case, the framework could useuplevel to read the variable as a local variable in the caller's frame,so the gcc-dg-test procedure would only need to do {settimeout_scale_factor [testcase inquire patience dg-run]} before usingremote_load to run the test program. (Expect does similar things,according to its manpage.)

The *_load procedures in the config/*.exp are not documented andconfig/README specifically says that they are to be called using theremote_* procedures. While using a "magic" variable would require someneat tricks with uplevel/upvar, it should work as long as testsuites usethe documented entrypoints. (The *_load procedures from config/*.expare likely to disappear into Tcl namespaces and/or parent interpretersin the future anyway.)


Comments before I start on an implementation?


-- Jacob

Generalizing DejaGnu timeout scaling (was: Re: [PATCH DejaGNU/GCC 0/1] Support per-test execution timeout factor)

Reply via email to