Hi Arnaud

Surem feel free to tweak the node configs directly, these configs are not
in version control or puppet.

On Wed, Jul 21, 2021 at 6:38 PM Arnaud Héritier <aherit...@gmail.com> wrote:

> I am looking at our builds and I try to understand why our agents are
> often disconnected during the builds.
> We have in general a stacktrace like
>
> maven6 was marked offline: Connection was broken: java.io.IOException: Pipe 
> closed after 0 cycles
>       at 
> org.apache.sshd.common.channel.ChannelPipedInputStream.read(ChannelPipedInputStream.java:118)
>       at 
> org.apache.sshd.common.channel.ChannelPipedInputStream.read(ChannelPipedInputStream.java:101)
>       at 
> hudson.remoting.FlightRecorderInputStream.read(FlightRecorderInputStream.java:92)
>       at 
> hudson.remoting.ChunkedInputStream.readHeader(ChunkedInputStream.java:73)
>       at 
> hudson.remoting.ChunkedInputStream.readUntilBreak(ChunkedInputStream.java:103)
>       at 
> hudson.remoting.ChunkedCommandTransport.readBlock(ChunkedCommandTransport.java:39)
>       at 
> hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
>       at 
> hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:63)
>
>
>
> As far I can see we are using 16Gb "hosts" for linux agents
>
> Something very strange is that the jenkins agent (this small component
> doing the link between the build host and the controller) is configured
> with `-Xms8g -Xmx8g` thus we are reserving for it 50% of the server mem
> (even more because of the non-heap)
> This one in general should require in general really less. 1Gb is already
> a lot from my exp.
> Due to this, the OS can see it has the biggest process on the host and
> decide to kill it when the rest of the memory is used by the build.
> I think we should decrease this value.
> (I can do it but I don't know how was configured the ci.apache.org agents
> and I would like to not add more issue if this setting was here in the past
>
> I don't think it is the root cause of our instabilities (at least all) and
> there is something else I have to find but it's a cheap fix to try
>
> FYI our agents VMs are ~like this today:
>
> - Java
> + Home: `/usr/local/asfpackages/java/oraclejdk-1.8.0-291/jre`
> + Vendor: Oracle Corporation
> + Version: 1.8.0&#95;291
> + Maximum memory: 7.67 GB (8232370176)
> + Allocated memory: 7.67 GB (8232370176)
> + Free memory: 6.03 GB (6470953760)
> + In-use memory: 1.64 GB (1761416416)
> + GC strategy: ParallelGC
> + Available CPUs: 4
>
> 8Gb is reserved, 1Gb is used (because the GC does nothing as the Free mem
> is high)
>
> I would be in favor to try to launch them with -Xms128m
> -Xmx1g -XX:+UseG1GC -XX:+UseStringDeduplication
>
> I think it's enough customization to start with
>
> Cheers
>
> On Wed, Jul 21, 2021 at 1:28 PM Arnaud Héritier <aherit...@gmail.com>
> wrote:
>
>> I am not sure about the setup
>> AFAICS we don't use any JDK installer (
>> https://ci-maven.apache.org/configureTools/ ) thus I suppose that the
>> different JDKs are supposed to be installed directly on the agent ?
>> I am not sure how it was done on the previous environment
>>
>> On Sun, Jul 18, 2021 at 5:30 PM Tibor Digana <tibordig...@apache.org>
>> wrote:
>>
>>> The new CI  system has the following issue:
>>>
>>> /home/jenkins/tools/java/latest1.7/bin/java: not found
>>>
>>>
>>> https://ci-maven.apache.org/job/Maven/job/maven-box/job/maven-surefire/job/master/104/execution/node/183/log/
>>>
>>>
>>>
>>> On Wed, Jun 30, 2021 at 8:03 PM Gavin McDonald <gmcdon...@apache.org>
>>> wrote:
>>>
>>> > Hi Maven folks.
>>> >
>>> > Infra has decided to separate off the Maven build jobs from
>>> > ci-builds.apache.org over to its very own Jenkins Controller and
>>> Agents.
>>> >
>>> > This means that Maven now has a dedicated Jenkins environment for
>>> itself.
>>> > It
>>> > also means that no other projects build jobs can build on the Maven
>>> nodes;
>>> > and
>>> > then Maven jobs will no longer  be able to build on the ci-builds jobs.
>>> >
>>> > Your new Controller is set up as https://ci-maven.apache.org and all
>>> Maven
>>> > Committers
>>> > can login via LDAP and create jobs.
>>> >
>>> > At the time of writing, there is one node/agent attached but I am
>>> building
>>> > 4 more  - all
>>> > Ubuntu 20.04 and based in our Azure account.
>>> >
>>> > We can automagically move all your jobs over from ci-builds to
>>> ci-maven - I
>>> > just need someone to tell me go ahead and do it.
>>> >
>>> > In the meantime, feel free to have a test. The remaining 4 agents will
>>> be
>>> > online
>>> > by tomorrow. We will review after a month if 5 is enough nodes.
>>> >
>>> > As with other projects having their own dedicated controller, who have
>>> > taken advantage
>>> > of this isolation by having some nodes/agents given to the project as a
>>> > 'targeted donation'
>>> > so someone here may know of a Company will to donate 5 - 10 or more
>>> nodes
>>> > specifically
>>> > for Maven Jenkins environment. Infra can afford to hand you over 5
>>> right
>>> > now.
>>> >
>>> > Let me know if you have any questions, otherwise let me know when I can
>>> > make the
>>> > transfer of your jobs.
>>> >
>>> > Thanks
>>> >
>>> > --
>>> >
>>> > *Gavin McDonald*
>>> > Systems Administrator
>>> > ASF Infrastructure Team
>>> >
>>>
>>
>>
>> --
>> Arnaud Héritier
>> Twitter/Skype : aheritier
>>
>
>
> --
> Arnaud Héritier
> Twitter/Skype : aheritier
>


-- 

*Gavin McDonald*
Systems Administrator
ASF Infrastructure Team

Reply via email to