OK Gavin,

Thanks for your feedback.
That's what I wanted to be sure of; I didn't want to break something you are
maintaining.
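
For anyone who wants to double-check the memory figures quoted further down, here is a minimal sketch of how they can be reproduced with the standard `Runtime` and `java.lang.management` APIs. The class name `AgentMemoryReport` and its two helpers are made up for illustration, and it reports on the JVM it runs in, not on the agent process itself, so treat it as a way to sanity-check a set of flags rather than a monitoring tool.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Jenkins: prints the same kind of figures
// as the node "system information" listing, for the JVM it runs in.
class AgentMemoryReport {

    // In-use heap = heap currently allocated by the JVM minus the free part of it.
    static long inUseBytes(long allocatedBytes, long freeBytes) {
        return allocatedBytes - freeBytes;
    }

    // Format a byte count in GB (1024^3 bytes) with two decimals.
    static String toGb(long bytes) {
        return String.format(Locale.ROOT, "%.2f GB", bytes / (1024.0 * 1024.0 * 1024.0));
    }

    public static void main(String[] args) {
        Runtime rt = Runtime.getRuntime();
        long max = rt.maxMemory();         // upper bound, driven by -Xmx
        long allocated = rt.totalMemory(); // heap currently reserved by the JVM
        long free = rt.freeMemory();       // unused part of the allocated heap

        System.out.println("Maximum memory:   " + toGb(max) + " (" + max + ")");
        System.out.println("Allocated memory: " + toGb(allocated) + " (" + allocated + ")");
        System.out.println("Free memory:      " + toGb(free) + " (" + free + ")");
        long inUse = inUseBytes(allocated, free);
        System.out.println("In-use memory:    " + toGb(inUse) + " (" + inUse + ")");

        // Names of the collectors actually active in this JVM.
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            System.out.println("GC: " + gc.getName());
        }

        // Flags the JVM was really started with (e.g. -Xms / -Xmx).
        List<String> jvmArgs = ManagementFactory.getRuntimeMXBean().getInputArguments();
        System.out.println("JVM arguments: " + jvmArgs);
    }
}
```

Running it with the flags I proposed, for example `java -Xms128m -Xmx1g -XX:+UseG1GC -XX:+UseStringDeduplication AgentMemoryReport`, should show whether the heap limit and the collector are what we expect before we change the node configs.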

On Wed, Jul 21, 2021 at 6:47 PM Gavin McDonald <gmcdon...@apache.org> wrote:

> Hi Arnaud
>
> Sure, feel free to tweak the node configs directly; these configs are not
> in version control or Puppet.
>
> On Wed, Jul 21, 2021 at 6:38 PM Arnaud Héritier <aherit...@gmail.com>
> wrote:
>
>> I am looking at our builds, trying to understand why our agents are
>> often disconnected during the builds.
>> In general we get a stack trace like this:
>>
>> maven6 was marked offline: Connection was broken: java.io.IOException: Pipe closed after 0 cycles
>>      at org.apache.sshd.common.channel.ChannelPipedInputStream.read(ChannelPipedInputStream.java:118)
>>      at org.apache.sshd.common.channel.ChannelPipedInputStream.read(ChannelPipedInputStream.java:101)
>>      at hudson.remoting.FlightRecorderInputStream.read(FlightRecorderInputStream.java:92)
>>      at hudson.remoting.ChunkedInputStream.readHeader(ChunkedInputStream.java:73)
>>      at hudson.remoting.ChunkedInputStream.readUntilBreak(ChunkedInputStream.java:103)
>>      at hudson.remoting.ChunkedCommandTransport.readBlock(ChunkedCommandTransport.java:39)
>>      at hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
>>      at hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:63)
>>
>>
>>
>> As far as I can see, we are using 16 GB "hosts" for Linux agents.
>>
>> Something very strange is that the Jenkins agent (the small component
>> linking the build host to the controller) is configured
>> with `-Xms8g -Xmx8g`, so we are reserving 50% of the server's memory for it
>> (even more once non-heap memory is counted).
>> In general it should need far less; 1 GB is already
>> a lot in my experience.
>> Because of this, the OS can see it as the biggest process on the host and
>> decide to kill it when the rest of the memory is used by the build.
>> I think we should decrease this value.
>> (I can do it, but I don't know how the ci.apache.org
>> agents were configured, and I would not like to add more issues if this
>> setting was already there in the past.)
>>
>> I don't think it is the root cause of our instabilities (at least not of
>> all of them), and there is something else I have to find, but it's a cheap
>> fix to try.
>>
>> FYI, our agent VMs look roughly like this today:
>>
>> - Java
>> + Home: `/usr/local/asfpackages/java/oraclejdk-1.8.0-291/jre`
>> + Vendor: Oracle Corporation
>> + Version: 1.8.0_291
>> + Maximum memory: 7.67 GB (8232370176)
>> + Allocated memory: 7.67 GB (8232370176)
>> + Free memory: 6.03 GB (6470953760)
>> + In-use memory: 1.64 GB (1761416416)
>> + GC strategy: ParallelGC
>> + Available CPUs: 4
>>
>> 8 GB is reserved, but only about 1.6 GB is in use (the GC does nothing
>> because free memory is high).
>>
>> I would be in favor of trying to launch them with -Xms128m
>> -Xmx1g -XX:+UseG1GC -XX:+UseStringDeduplication
>>
>> I think it's enough customization to start with
>>
>> Cheers
>>
>> On Wed, Jul 21, 2021 at 1:28 PM Arnaud Héritier <aherit...@gmail.com>
>> wrote:
>>
>>> I am not sure about the setup.
>>> AFAICS we don't use any JDK installer (
>>> https://ci-maven.apache.org/configureTools/ ), so I suppose that the
>>> different JDKs are supposed to be installed directly on the agent?
>>> I am not sure how it was done in the previous environment.
>>>
>>> On Sun, Jul 18, 2021 at 5:30 PM Tibor Digana <tibordig...@apache.org>
>>> wrote:
>>>
>>>> The new CI system has the following issue:
>>>>
>>>> /home/jenkins/tools/java/latest1.7/bin/java: not found
>>>>
>>>>
>>>> https://ci-maven.apache.org/job/Maven/job/maven-box/job/maven-surefire/job/master/104/execution/node/183/log/
>>>>
>>>>
>>>>
>>>> On Wed, Jun 30, 2021 at 8:03 PM Gavin McDonald <gmcdon...@apache.org>
>>>> wrote:
>>>>
>>>> > Hi Maven folks.
>>>> >
>>>> > Infra has decided to separate off the Maven build jobs from
>>>> > ci-builds.apache.org over to its very own Jenkins Controller and Agents.
>>>> >
>>>> > This means that Maven now has a dedicated Jenkins environment for itself.
>>>> > It also means that no other project's build jobs can build on the Maven
>>>> > nodes, and Maven jobs will no longer be able to build on the ci-builds
>>>> > nodes.
>>>> >
>>>> > Your new Controller is set up as https://ci-maven.apache.org and all Maven
>>>> > Committers can log in via LDAP and create jobs.
>>>> >
>>>> > At the time of writing, there is one node/agent attached, but I am building
>>>> > 4 more, all Ubuntu 20.04 and based in our Azure account.
>>>> >
>>>> > We can automagically move all your jobs over from ci-builds to ci-maven; I
>>>> > just need someone to tell me to go ahead and do it.
>>>> >
>>>> > In the meantime, feel free to have a test. The remaining 4 agents will be
>>>> > online by tomorrow. We will review after a month whether 5 nodes is enough.
>>>> >
>>>> > Other projects with their own dedicated controller have taken advantage
>>>> > of this isolation by having some nodes/agents given to the project as a
>>>> > 'targeted donation', so someone here may know of a company willing to
>>>> > donate 5 - 10 or more nodes specifically for the Maven Jenkins environment.
>>>> > Infra can afford to hand you over 5 right now.
>>>> >
>>>> > Let me know if you have any questions; otherwise let me know when I can
>>>> > make the transfer of your jobs.
>>>> >
>>>> > Thanks
>>>> >
>>>> > --
>>>> >
>>>> > *Gavin McDonald*
>>>> > Systems Administrator
>>>> > ASF Infrastructure Team
>>>> >
>>>>
>>>
>>>
>>> --
>>> Arnaud Héritier
>>> Twitter/Skype : aheritier
>>>
>>
>>
>> --
>> Arnaud Héritier
>> Twitter/Skype : aheritier
>>
>
>
> --
>
> *Gavin McDonald*
> Systems Administrator
> ASF Infrastructure Team
>


-- 
Arnaud Héritier
Twitter/Skype : aheritier
