We recently created 16 compute instances for the Jenkins. Each one of them has 16 CPUs, means they consume 256 CPU in total. I guess that is why the CPU usage in us-central1 remains high. We're working on the migrating the rest of old Jenkins agents, and the old instances will be removed once finish. That should relieve the pain of quota.
Yifan On Tue, Apr 16, 2019 at 1:58 PM Valentyn Tymofieiev <valen...@google.com> wrote: > FYI, I have recently observed a large amount of test failures in Beam test > suites where Dataflow Jobs failed due to a lack of CPU quota in > apache-beam-testing project. > > We have been adding new suites for Python 3.x versions, which may have > contributed to this. problem. > > I have not investigated yet what consumes the quota yet, but the usage > remains high. > > Possible mitigation options: > - Increase quota. > - Decrease per-suite parallelism [1]. Currently we may run 1-8 tests from > the same suite concurrently. > - Audit usage, perhaps kill stale jobs or VMs. > > Ideas/opinions welcome. > > I opened https://issues.apache.org/jira/browse/BEAM-7085 to track this. > > [1] > https://github.com/apache/beam/search?q=%22--processes%3D%22&unscoped_q=%22--processes%3D%22 >