Hi again,

Here are the thread dumps as promised:

1) Both TabletServers are very busy at compressing at close time. The
following stacks are dumped in ~5 secs interval:

"tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=68425.44ms
elapsed=75.42s tid=0x0000fffeac074800 nid=0x33077e runnable
 [0x0000fffe8f3fd000]
   java.lang.Thread.State: RUNNABLE
        at sun.security.provider.SHA5.implCompressCheck(java.base@11.0.11
/SHA5.java:232)
        at sun.security.provider.SHA5.implCompress(java.base@11.0.11
/SHA5.java:221)
        at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11
/DigestBase.java:124)
        at
java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11
/MessageDigest.java:623)
        at java.security.MessageDigest.update(java.base@11.0.11
/MessageDigest.java:345)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:421)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125)
        at
org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66)
        at
org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179)
        at
org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148)
        at
org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169)
        at
org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236)
        at
org.apache.accumulo.core.clientImpl.ClientContext.createScanner(ClientContext.java:635)
        at
org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.buildNonRoot(TabletsMetadata.java:177)
        at
org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.build(TabletsMetadata.java:125)
        at
org.apache.accumulo.core.metadata.schema.AmpleImpl.readTablet(AmpleImpl.java:46)
        at
org.apache.accumulo.core.metadata.schema.Ample.readTablet(Ample.java:141)
        at
org.apache.accumulo.tserver.tablet.Tablet.closeConsistencyCheck(Tablet.java:1379)
        at
org.apache.accumulo.tserver.tablet.Tablet.completeClose(Tablet.java:1331)
        - locked <0x00000000f1585830> (a
org.apache.accumulo.tserver.tablet.Tablet)
        at org.apache.accumulo.tserver.tablet.Tablet.close(Tablet.java:1221)
        at
org.apache.accumulo.tserver.UnloadTabletHandler.run(UnloadTabletHandler.java:92)
        at io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207)
        at
io.opentelemetry.context.Context$$Lambda$209/0x000000010035c840.run(Unknown
Source)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(java.base@11.0.11
/ThreadPoolExecutor.java:1128)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(java.base@11.0.11
/ThreadPoolExecutor.java:628)
        at io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207)
        at
io.opentelemetry.context.Context$$Lambda$209/0x000000010035c840.run(Unknown
Source)
        at java.lang.Thread.run(java.base@11.0.11/Thread.java:829)

"tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=72485.20ms
elapsed=79.71s tid=0x0000fffeac074800 nid=0x33077e runnable
 [0x0000fffe8f3fd000]
   java.lang.Thread.State: RUNNABLE
        at
sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11
/DigestBase.java:149)
        at
sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11
/DigestBase.java:144)
        at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11
/DigestBase.java:131)
        at
java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11
/MessageDigest.java:623)
        at java.security.MessageDigest.update(java.base@11.0.11
/MessageDigest.java:345)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:403)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125)
        at
org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66)
        at
org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179)
        at
org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148)
        at
org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169)
        ...

"tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=81174.59ms
elapsed=89.01s tid=0x0000fffeac074800 nid=0x33077e runnable
 [0x0000fffe8f3fd000]
   java.lang.Thread.State: RUNNABLE
        at sun.security.provider.ByteArrayAccess.l2bBig(java.base@11.0.11
/ByteArrayAccess.java:449)
        at sun.security.provider.SHA5.implDigest(java.base@11.0.11
/SHA5.java:131)
        at sun.security.provider.DigestBase.engineDigest(java.base@11.0.11
/DigestBase.java:210)
        at sun.security.provider.DigestBase.engineDigest(java.base@11.0.11
/DigestBase.java:189)
        at
java.security.MessageDigest$Delegate.engineDigest(java.base@11.0.11
/MessageDigest.java:639)
        at java.security.MessageDigest.digest(java.base@11.0.11
/MessageDigest.java:385)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:439)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125)
        at
org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66)
        at
org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179)
        at
org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148)
        at
org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169)
        ...

"tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=86499.01ms
elapsed=94.68s tid=0x0000fffeac074800 nid=0x33077e runnable
 [0x0000fffe8f3fd000]
   java.lang.Thread.State: RUNNABLE
        at
sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11
/DigestBase.java:149)
        at
sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11
/DigestBase.java:144)
        at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11
/DigestBase.java:131)
        at
java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11
/MessageDigest.java:623)
        at java.security.MessageDigest.update(java.base@11.0.11
/MessageDigest.java:345)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:403)
        at
org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78)
        at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120)
        at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125)
        at
org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66)
        at
org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179)
        at
org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148)
        at
org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169)
        ...

"tablet migration-Worker-1" #6107 daemon prio=5 os_prio=0 cpu=109551.37ms
elapsed=117.48s tid=0x0000fffeac01b000 nid=0x33174d runnable
 [0x0000fffe7bffd000]
14012    java.lang.Thread.State: RUNNABLE
14013   at
sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11
/DigestBase.java:149)
14014   at
sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11
/DigestBase.java:144)
14015   at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11
/DigestBase.java:131)
14016   at
java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11
/MessageDigest.java:623)
14017   at java.security.MessageDigest.update(java.base@11.0.11
/MessageDigest.java:345)
14018   at
org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:432)
14019   at
org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585)
14020   at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78)
14021   at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167)
14022   at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120)
14023   at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125)
14024   at
org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66)
14025   at
org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179)
14026   at
org.apache.accumulo.server.ServerInfo.getAuthenticationToken(ServerInfo.java:153)
14027   at
org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:168)
14028   at
org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236)

Notice that ClientContext.getProperties(ClientContext.java:236) most of the
times calls ServerInfo.getPrincipal(ServerInfo.java:148) but in the last
one it calls ServerInfo.getAuthenticationToken(ServerInfo.java:153).
And both lead to (a lot of ?!) compressing..

2) The "Manager" process writes ~200Mb of logs. Maybe the default log level
should not be DEBUG ?!

Most of its threads either wait for notifications from Zookeeper:

878647 "Manager-ClientPool-Worker-3" #61 daemon prio=5 os_prio=0
cpu=375.95ms elapsed=182.38s tid=0x0000fffee0007800 nid=0x32d943 in
Object.wait()  [0x0000fffebb7fc000]
 878648    java.lang.Thread.State: TIMED_WAITING (on object monitor)
 878649   at java.lang.Object.wait(java.base@11.0.11/Native Method)
 878650   - waiting on <no object reference available>
 878651   at
org.apache.accumulo.fate.ZooStore.waitForStatusChange(ZooStore.java:386)
 878652   - waiting to re-lock in wait() <0x00000000f1427458> (a
org.apache.accumulo.fate.ZooStore)
 878653   at
org.apache.accumulo.fate.AgeOffStore.waitForStatusChange(AgeOffStore.java:209)
 878654   at
org.apache.accumulo.core.logging.FateLogger$1.waitForStatusChange(FateLogger.java:75)
 878655   at org.apache.accumulo.fate.Fate.waitForCompletion(Fate.java:297)
 878656   at
org.apache.accumulo.manager.FateServiceHandler.waitForFateOperation(FateServiceHandler.java:659)
 878657   at
org.apache.accumulo.manager.ManagerClientServiceHandler.waitForFateOperation(ManagerClientServiceHandler.java:100)
...

or wait for data:
878781 "Repo Runner-Worker-1" #90 daemon prio=5 os_prio=0 cpu=7440.91ms
elapsed=179.99s tid=0x0000fffeb0002000 nid=0x32d99a in Object.wait()
 [0x0000fffebadfd000]
 878782    java.lang.Thread.State: WAITING (on object monitor)
 878783   at java.lang.Object.wait(java.base@11.0.11/Native Method)
 878784   - waiting on <no object reference available>
 878785   at java.lang.Object.wait(java.base@11.0.11/Object.java:328)
 878786   at
org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1529)
 878787   - waiting to re-lock in wait() <0x00000000f9bf42d8> (a
org.apache.zookeeper.ClientCnxn$Packet)
 878788   at
org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1512)
 878789   at org.apache.zookeeper.ZooKeeper.getChildren(ZooKeeper.java:2587)
 878790   at
org.apache.accumulo.fate.zookeeper.ZooReader.lambda$getChildren$5(ZooReader.java:87)
 878791   at
org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$182/0x0000000100324040.apply(Unknown
Source)
 878792   at
org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$184/0x0000000100323c40.apply(Unknown
Source)
 878793   at
org.apache.accumulo.fate.zookeeper.ZooReader.retryLoopMutator(ZooReader.java:165)
 878794   at
org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:144)
 878795   at
org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:131)
 878796   at
org.apache.accumulo.fate.zookeeper.ZooReader.getChildren(ZooReader.java:87)
 878797   at org.apache.accumulo.fate.ZooStore.reserve(ZooStore.java:141)
 878798   at
org.apache.accumulo.fate.AgeOffStore.reserve(AgeOffStore.java:155)
 878799   at
org.apache.accumulo.core.logging.FateLogger$1.reserve(FateLogger.java:50)
 878800   at
org.apache.accumulo.fate.Fate$TransactionRunner.run(Fate.java:72)
 878801   at
io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207)
 878802   at
io.opentelemetry.context.Context$$Lambda$209/0x0000000100353840.run(Unknown
Source)
 878803   at
java.util.concurrent.ThreadPoolExecutor.runWorker(java.base@11.0.11
/ThreadPoolExecutor.java:1128)
 878804   at
java.util.concurrent.ThreadPoolExecutor$Worker.run(java.base@11.0.11
/ThreadPoolExecutor.java:628)
 878805   at
io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207)
 878806   at
io.opentelemetry.context.Context$$Lambda$209/0x0000000100353840.run(Unknown
Source)
 878807   at java.lang.Thread.run(java.base@11.0.11/Thread.java:829)

908220 "Status Thread" #41 daemon prio=5 os_prio=0 cpu=1700.28ms
elapsed=187.25s tid=0x0000fffee41f9800 nid=0x32d920 in Object.wait()
 [0x0000ffff20f50000]
 908221    java.lang.Thread.State: WAITING (on object monitor)
 908222   at java.lang.Object.wait(java.base@11.0.11/Native Method)
 908223   - waiting on <no object reference available>
 908224   at java.lang.Object.wait(java.base@11.0.11/Object.java:328)
 908225   at
org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1529)
 908226   - waiting to re-lock in wait() <0x00000000fa781138> (a
org.apache.zookeeper.ClientCnxn$Packet)
 908227   at
org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1512)
 908228   at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:2129)
 908229   at
org.apache.accumulo.fate.zookeeper.ZooReader.lambda$getData$0(ZooReader.java:65)
 908230   at
org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$220/0x0000000100351440.apply(Unknown
Source)
 908231   at
org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$184/0x0000000100323c40.apply(Unknown
Source)
 908232   at
org.apache.accumulo.fate.zookeeper.ZooReader.retryLoopMutator(ZooReader.java:165)
 908233   at
org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:144)
 908234   at
org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:131)
 908235   at
org.apache.accumulo.fate.zookeeper.ZooReader.getData(ZooReader.java:65)
 908236   at
org.apache.accumulo.manager.Manager.getManagerGoalState(Manager.java:496)
 908237   at
org.apache.accumulo.manager.Manager$StatusThread.updateStatus(Manager.java:822)
 908238   at
org.apache.accumulo.manager.Manager$StatusThread.run(Manager.java:797)
 908239   at
io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207)
 908240   at
io.opentelemetry.context.Context$$Lambda$209/0x0000000100353840.run(Unknown
Source)
 908241   at java.lang.Thread.run(java.base@11.0.11/Thread.java:829)

3) SimpleGarbageCollector is also busy in getting credentials

 "gc" #31 prio=5 os_prio=0 cpu=15495.47ms elapsed=209.43s
tid=0x0000ffff28295800 nid=0x32dac5 runnable  [0x0000ffff3a5fb000]
2503    java.lang.Thread.State: RUNNABLE
2504   at
sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11
/DigestBase.java:149)
2505   at
sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11
/DigestBase.java:144)
2506   at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11
/DigestBase.java:131)
2507   at
java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11
/MessageDigest.java:623)
2508   at java.security.MessageDigest.update(java.base@11.0.11
/MessageDigest.java:345)
2509   at
org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:421)
2510   at
org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585)
2511   at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78)
2512   at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167)
2513   at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120)
2514   at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125)
2515   at
org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66)
2516   at
org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179)
2517   at
org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148)
2518   at
org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169)
2519   at
org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236)
2520   at
org.apache.accumulo.core.clientImpl.ClientContext.createScanner(ClientContext.java:635)
2521   at
org.apache.accumulo.server.metadata.ServerAmpleImpl.getGcCandidates(ServerAmpleImpl.java:180)
2522   at
org.apache.accumulo.gc.SimpleGarbageCollector$GCEnv.getCandidates(SimpleGarbageCollector.java:199)
2523   at
org.apache.accumulo.gc.GarbageCollectionAlgorithm.collect(GarbageCollectionAlgorithm.java:302)
2524   at
org.apache.accumulo.gc.SimpleGarbageCollector.run(SimpleGarbageCollector.java:502)
2525   at io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207)
2526   at
io.opentelemetry.context.Context$$Lambda$209/0x0000000100357840.run(Unknown
Source)
2527   at java.lang.Thread.run(java.base@11.0.11/Thread.java:829)


3151 "gc" #31 prio=5 os_prio=0 cpu=15982.95ms elapsed=218.59s
tid=0x0000ffff28295800 nid=0x32dac5 runnable  [0x0000ffff3a5fb000]
3152    java.lang.Thread.State: RUNNABLE
3153   at java.util.Arrays.hashCode(java.base@11.0.11/Arrays.java:4685)
3154   at java.util.Objects.hash(java.base@11.0.11/Objects.java:146)
3155   at java.security.Provider$ServiceKey.hashCode(java.base@11.0.11
/Provider.java:1107)
3156   at java.util.concurrent.ConcurrentHashMap.get(java.base@11.0.11
/ConcurrentHashMap.java:936)
3157   at java.security.Provider.getService(java.base@11.0.11
/Provider.java:1282)
3158   at sun.security.jca.ProviderList.getService(java.base@11.0.11
/ProviderList.java:380)
3159   at sun.security.jca.GetInstance.getInstance(java.base@11.0.11
/GetInstance.java:157)
3160   at java.security.Security.getImpl(java.base@11.0.11
/Security.java:700)
3161   at java.security.MessageDigest.getInstance(java.base@11.0.11
/MessageDigest.java:178)
3162   at
org.apache.commons.codec.digest.DigestUtils.getDigest(DigestUtils.java:170)
3163   at
org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:395)
3164   at
org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585)
3165   at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78)
3166   at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167)
3167   at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120)
3168   at
org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125)
3169   at
org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66)
3170   at
org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179)
3171   at
org.apache.accumulo.server.ServerInfo.getAuthenticationToken(ServerInfo.java:153)
3172   at
org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:168)
3173   at
org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236)
3174   at
org.apache.accumulo.core.clientImpl.ClientContext.createScanner(ClientContext.java:635)
3175   at
org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.buildNonRoot(TabletsMetadata.java:177)
3176   at
org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.build(TabletsMetadata.java:125)
3177   at
org.apache.accumulo.gc.SimpleGarbageCollector$GCEnv.getReferences(SimpleGarbageCollector.java:249)
3178   at
org.apache.accumulo.gc.GarbageCollectionAlgorithm.confirmDeletes(GarbageCollectionAlgorithm.java:169)
3179   at
org.apache.accumulo.gc.GarbageCollectionAlgorithm.confirmDeletesTrace(GarbageCollectionAlgorithm.java:276)
3180   at
org.apache.accumulo.gc.GarbageCollectionAlgorithm.deleteBatch(GarbageCollectionAlgorithm.java:330)
3181   at
org.apache.accumulo.gc.GarbageCollectionAlgorithm.collect(GarbageCollectionAlgorithm.java:315)
3182   at
org.apache.accumulo.gc.SimpleGarbageCollector.run(SimpleGarbageCollector.java:501)
3183   at io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207)
3184   at
io.opentelemetry.context.Context$$Lambda$209/0x0000000100357840.run(Unknown
Source)
3185   at java.lang.Thread.run(java.base@11.0.11/Thread.java:829)


4) Nothing interesting for Initialize, Main and ZooKeeperServerMain
processes


I'm not saying that the above are problematic. You know how Accumulo works.
It is up to you to decide whether something should be improved.

Regards,
Mark


On Wed, 1 Dec 2021 at 16:35, Mark Jens <mark.r.j...@gmail.com> wrote:

>
>
> On Tue, 30 Nov 2021 at 18:32, Christopher <ctubb...@apache.org> wrote:
>
>> It looks like the tests are timing out. This happens frequently when
>> running on resource-constrained systems. You can give the test more
>> time by increasing the timeout factor: `mvn clean verify
>> -Dcheckstyle.skip -Dspotbugs.skip -Dit.test=ConcurrentDeleteTableIT
>> -Dtimeout.factor=3`
>>
>> There's nothing we know of that would change the way our tests work
>> due to ARM64, but you may have issues because of limited RAM, slow CPU
>> speeds, slow disk I/O, busy background processes, or other
>> resource-related issues. I don't think most of the currently active
>> developers use ARM64, or have access to a test machine to reproduce or
>>
>
> In case anyone wants to test on Linux ARM64 you could easily use Oracle
> Cloud for free.
>
> https://martin-grigorov.medium.com/github-actions-arm64-runner-on-oracle-cloud-a77cdf7a325a
> explains how to create a VM and how to use this VM as a Github Actions
> runner.
> https://github.com/apache/accumulo/issues/1884#issuecomment-970267282
> mentions this article.
>
>
>> experiment with Accumulo there, so you may have to do some of your own
>> troubleshooting. If you can rule out resource-constraint issues, and
>> it isn't already a known flaky test (ConcurrentDeleteTableIT is known
>> flaky and sometimes times out on x86_64 as well), you could create a
>> bug ticket with more details at
>> https://github.com/apache/accumulo/issues ; there is an issue template
>> specifically for broken and/or flaky tests that you can select when
>> creating a new ticket.
>>
>> On Tue, Nov 30, 2021 at 9:34 AM Mark Jens <mark.r.j...@gmail.com> wrote:
>> >
>> > Hi dev1,
>> >
>> > On Tue, 30 Nov 2021 at 16:21, dev1 <d...@etcoleman.com> wrote:
>> >
>> > > Some of those tests are trying to stress conditions that require a
>> lot of
>> > > resources to replicate specific conditions. Have you tried to run
>> those
>> > > individual tests in isolation so that you are not competing for
>> resources?
>> > > Do they always fail, or are the failures transient?
>> > >
>> >
>> > Q: Have you tried to run those individual tests in isolation so that you
>> > are not competing for resources?
>> > A: This is what I mean with the following:
>> > ---------------------
>> > The tests fail even when executed separately, e.g.:
>> > mvn verify -Dit.test=ConcurrentDeleteTableIT -o -rf :accumulo-test
>> > ---------------------
>> >
>> > Q: Do they always fail, or are the failures transient?
>> > A: I also tried to explain that with "These tests fail consistently at
>> > every build attempt!"
>> >
>> > Mark
>> >
>> > >
>> > > -----Original Message-----
>> > > From: Mark Jens <mark.r.j...@gmail.com>
>> > > Sent: Tuesday, November 30, 2021 4:05 AM
>> > > To: dev@accumulo.apache.org
>> > > Subject: Consistent IT tests failures on Linux ARM64
>> > >
>> > > Hello Accumulo community,
>> > >
>> > > At my job we consider using Linux ARM64 servers and I've been tasked
>> to
>> > > test Accumulo.
>> > >
>> > > I face some timeout related issues with several IT tests:
>> > >
>> > >
>> > > [ERROR]
>> > >
>> > >
>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete
>> > >  Time elapsed: 420.122 s  <<< ERROR!
>> > > org.junit.runners.model.TestTimedOutException: test timed out after
>> 420
>> > > seconds at java.base@11.0.11/jdk.internal.misc.Unsafe.park(Native
>> Method)
>> > > at java.base@11.0.11
>> > > /java.util.concurrent.locks.LockSupport.park(LockSupport.java:194)
>> > > at java.base@11.0.11
>> > > /java.util.concurrent.FutureTask.awaitDone(FutureTask.java:447)
>> > > at java.base@11.0.11
>> > > /java.util.concurrent.FutureTask.get(FutureTask.java:190)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete(ConcurrentDeleteTableIT.java:213)
>> > > at java.base@11.0.11
>> > > /jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native
>> > > Method)
>> > > at java.base@11.0.11
>> > >
>> > >
>> /jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>> > > at java.base@11.0.11
>> > >
>> > >
>> /jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> > > at java.base@11.0.11/java.lang.reflect.Method.invoke(Method.java:566)
>> > > at
>> > >
>> > >
>> app//org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>> > > at
>> > >
>> > >
>> app//org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:299)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:293)
>> > > at java.base@11.0.11
>> > > /java.util.concurrent.FutureTask.run(FutureTask.java:264)
>> > > at java.base@11.0.11/java.lang.Thread.run(Thread.java:829)
>> > >
>> > > [ERROR]
>> > >
>> > >
>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete
>> > >  Time elapsed: 420.122 s  <<< ERROR!
>> > > java.lang.Exception: Appears to be stuck in thread Time-limited
>> > > test-SendThread(localhost:44251)
>> > > at java.base@11.0.11/sun.nio.ch.EPoll.wait(Native Method) at
>> > > java.base@11.0.11
>> > > /sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:120)
>> > > at java.base@11.0.11
>> > > /sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:124)
>> > > at java.base@11.0.11/sun.nio.ch
>> .SelectorImpl.select(SelectorImpl.java:136)
>> > > at
>> > >
>> > >
>> app//org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:347)
>> > > at
>> > >
>> app//org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1223)
>> > >
>> > > [ERROR]
>> > >
>> > >
>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentDeleteTablesOps
>> > >  Time elapsed: 420.011 s  <<< ERROR!
>> > > org.junit.runners.model.TestTimedOutException: test timed out after
>> 420
>> > > seconds at java.base@11.0.11/java.lang.Thread.sleep(Native Method) at
>> > >
>> > >
>> app//org.apache.accumulo.fate.zookeeper.ZooCache$ZooRunnable.retry(ZooCache.java:299)
>> > > at
>> app//org.apache.accumulo.fate.zookeeper.ZooCache.get(ZooCache.java:442)
>> > > at
>> app//org.apache.accumulo.fate.zookeeper.ZooCache.get(ZooCache.java:372)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.ClientContext.verifyInstanceId(ClientContext.java:467)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.ClientContext.getInstanceID(ClientContext.java:446)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.ClientContext.getManagerLocations(ClientContext.java:405)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.ManagerClient.getConnection(ManagerClient.java:59)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.ManagerClient.getConnectionWithRetry(ManagerClient.java:49)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.beginFateOperation(TableOperationsImpl.java:260)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.doFateOperation(TableOperationsImpl.java:369)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.doFateOperation(TableOperationsImpl.java:359)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.doTableFateOperation(TableOperationsImpl.java:1670)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.create(TableOperationsImpl.java:248)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentDeleteTablesOps(ConcurrentDeleteTableIT.java:76)
>> > > at java.base@11.0.11
>> > > /jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native
>> > > Method)
>> > > at java.base@11.0.11
>> > >
>> > >
>> /jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>> > > at java.base@11.0.11
>> > >
>> > >
>> /jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> > > at java.base@11.0.11/java.lang.reflect.Method.invoke(Method.java:566)
>> > > at
>> > >
>> > >
>> app//org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>> > > at
>> > >
>> > >
>> app//org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:299)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:293)
>> > > at java.base@11.0.11
>> > > /java.util.concurrent.FutureTask.run(FutureTask.java:264)
>> > > at java.base@11.0.11/java.lang.Thread.run(Thread.java:829)
>> > >
>> > > [INFO] Running org.apache.accumulo.test.functional.ScannerContextIT
>> > > [INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 102.909 s - in org.apache.accumulo.test.functional.ScannerContextIT
>> > > [INFO] Running org.apache.accumulo.test.functional.KerberosRenewalIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 504.472 s - in org.apache.accumulo.test.functional.KerberosRenewalIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BatchWriterFlushIT
>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 62.132 s - in org.apache.accumulo.test.functional.BatchWriterFlushIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BinaryIT
>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 65.034 s - in org.apache.accumulo.test.functional.BinaryIT
>> > > [INFO] Running org.apache.accumulo.test.functional.PermissionsIT
>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 59.25 s - in org.apache.accumulo.test.functional.PermissionsIT
>> > > [INFO] Running org.apache.accumulo.test.functional.ZookeeperRestartIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 37.37 s - in org.apache.accumulo.test.functional.ZookeeperRestartIT
>> > > [INFO] Running
>> org.apache.accumulo.test.functional.CreateManyScannersIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 23.046 s - in org.apache.accumulo.test.functional.CreateManyScannersIT
>> > > [INFO] Running
>> org.apache.accumulo.test.functional.CreateInitialSplitsIT
>> > > [INFO] Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 255.108 s - in
>> org.apache.accumulo.test.functional.CreateInitialSplitsIT
>> > > [INFO] Running org.apache.accumulo.test.functional.MonitorSslIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 25.304 s - in org.apache.accumulo.test.functional.MonitorSslIT
>> > > [INFO] Running org.apache.accumulo.test.functional.RestartStressIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 78.359 s - in org.apache.accumulo.test.functional.RestartStressIT
>> > > [INFO] Running
>> org.apache.accumulo.test.functional.BulkSplitOptimizationIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 59.289 s - in
>> org.apache.accumulo.test.functional.BulkSplitOptimizationIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BulkNewIT
>> > > [INFO] Tests run: 14, Failures: 0, Errors: 0, Skipped: 0, Time
>> elapsed:
>> > > 63.696 s - in org.apache.accumulo.test.functional.BulkNewIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BloomFilterIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 135.298 s - in org.apache.accumulo.test.functional.BloomFilterIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BulkIT
>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 122.959 s - in org.apache.accumulo.test.functional.BulkIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BinaryStressIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 38.626 s - in org.apache.accumulo.test.functional.BinaryStressIT
>> > > [INFO] Running org.apache.accumulo.test.functional.ClassLoaderIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 45.61 s - in org.apache.accumulo.test.functional.ClassLoaderIT
>> > > [INFO] Running org.apache.accumulo.test.functional.LogicalTimeIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 116.819 s - in org.apache.accumulo.test.functional.LogicalTimeIT
>> > > [INFO] Running org.apache.accumulo.test.functional.SplitRecoveryIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 25.421 s - in org.apache.accumulo.test.functional.SplitRecoveryIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BigRootTabletIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 96.86 s - in org.apache.accumulo.test.functional.BigRootTabletIT
>> > > [INFO] Running org.apache.accumulo.test.functional.GarbageCollectorIT
>> > > [INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 238.409 s - in org.apache.accumulo.test.functional.GarbageCollectorIT
>> > > [INFO] Running
>> > > org.apache.accumulo.test.functional.BalanceInPresenceOfOfflineTableIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 219.253 s - in
>> > > org.apache.accumulo.test.functional.BalanceInPresenceOfOfflineTableIT
>> > > [INFO] Running org.apache.accumulo.test.functional.VisibilityIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 38.015 s - in org.apache.accumulo.test.functional.VisibilityIT
>> > > [INFO] Running org.apache.accumulo.test.functional.SslWithClientAuthIT
>> > > [INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 489.863 s - in org.apache.accumulo.test.functional.SslWithClientAuthIT
>> > > [INFO] Running org.apache.accumulo.test.functional.SummaryIT
>> > > [INFO] Tests run: 10, Failures: 0, Errors: 0, Skipped: 0, Time
>> elapsed:
>> > > 111.552 s - in org.apache.accumulo.test.functional.SummaryIT
>> > > [INFO] Running org.apache.accumulo.test.functional.MaxOpenIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 30.061 s - in org.apache.accumulo.test.functional.MaxOpenIT
>> > > [INFO] Running org.apache.accumulo.test.functional.ManagerFailoverIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 47.089 s - in org.apache.accumulo.test.functional.ManagerFailoverIT
>> > > [INFO] Running org.apache.accumulo.test.functional.DeleteRowsIT
>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 229.586 s - in org.apache.accumulo.test.functional.DeleteRowsIT
>> > > [INFO] Running org.apache.accumulo.test.functional.BackupManagerIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 22.943 s - in org.apache.accumulo.test.functional.BackupManagerIT
>> > > [INFO] Running org.apache.accumulo.test.functional.TabletMetadataIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 46.728 s - in org.apache.accumulo.test.functional.TabletMetadataIT
>> > > [INFO] Running org.apache.accumulo.test.functional.LateLastContactIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 46.648 s - in org.apache.accumulo.test.functional.LateLastContactIT
>> > > [INFO] Running
>> org.apache.accumulo.test.functional.SimpleBalancerFairnessIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 71.934 s - in
>> org.apache.accumulo.test.functional.SimpleBalancerFairnessIT
>> > > [INFO] Running org.apache.accumulo.test.functional.HalfDeadTServerIT
>> > > [ERROR] Tests run: 3, Failures: 0, Errors: 2, Skipped: 0, Time
>> elapsed:
>> > > 307.904 s <<< FAILURE! - in
>> > > org.apache.accumulo.test.functional.HalfDeadTServerIT
>> > > [ERROR]
>> org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover
>> > >  Time elapsed: 240.011 s  <<< ERROR!
>> > > org.junit.runners.model.TestTimedOutException: test timed out after
>> 240
>> > > seconds at java.base@11.0.11/java.lang.Object.wait(Native Method) at
>> > > java.base@11.0.11/java.lang.Object.wait(Object.java:328)
>> > > at java.base@11.0.11
>> /java.lang.ProcessImpl.waitFor(ProcessImpl.java:495)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.test.functional.HalfDeadTServerIT.test(HalfDeadTServerIT.java:217)
>> > > at
>> > >
>> > >
>> app//org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover(HalfDeadTServerIT.java:142)
>> > > at java.base@11.0.11
>> > > /jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native
>> > > Method)
>> > > at java.base@11.0.11
>> > >
>> > >
>> /jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>> > > at java.base@11.0.11
>> > >
>> > >
>> /jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>> > > at java.base@11.0.11/java.lang.reflect.Method.invoke(Method.java:566)
>> > > at
>> > >
>> > >
>> app//org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>> > > at
>> > >
>> > >
>> app//org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:299)
>> > > at
>> > >
>> > >
>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:293)
>> > > at java.base@11.0.11
>> > > /java.util.concurrent.FutureTask.run(FutureTask.java:264)
>> > > at java.base@11.0.11/java.lang.Thread.run(Thread.java:829)
>> > >
>> > > [ERROR]
>> org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover
>> > >  Time elapsed: 240.012 s  <<< ERROR!
>> > > java.lang.Exception: Appears to be stuck in thread Time-limited
>> > > test-SendThread(localhost:39285)
>> > > at java.base@11.0.11/sun.nio.ch.EPoll.wait(Native Method) at
>> > > java.base@11.0.11
>> > > /sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:120)
>> > > at java.base@11.0.11
>> > > /sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:124)
>> > > at java.base@11.0.11/sun.nio.ch
>> .SelectorImpl.select(SelectorImpl.java:136)
>> > > at
>> > >
>> > >
>> app//org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:347)
>> > > at
>> > >
>> app//org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1223)
>> > >
>> > > [INFO] Running org.apache.accumulo.test.functional.MetadataIT
>> > > [INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 97.987 s - in org.apache.accumulo.test.functional.MetadataIT
>> > > [INFO] Running
>> org.apache.accumulo.test.functional.ScanSessionTimeOutIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 43.91 s - in org.apache.accumulo.test.functional.ScanSessionTimeOutIT
>> > > [INFO] Running org.apache.accumulo.test.functional.ZooCacheIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 33.986 s - in org.apache.accumulo.test.functional.ZooCacheIT
>> > > [INFO] Running org.apache.accumulo.test.functional.DeleteRowsSplitIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 113.928 s - in org.apache.accumulo.test.functional.DeleteRowsSplitIT
>> > > [INFO] Running org.apache.accumulo.test.ScanFlushWithTimeIT
>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 36.854 s - in org.apache.accumulo.test.ScanFlushWithTimeIT
>> > > [INFO] Running org.apache.accumulo.test.AuditMessageIT
>> > > [INFO] Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> > > 165.169 s - in org.apache.accumulo.test.AuditMessageIT
>> > > [INFO] Running
>> > > org.apache.accumulo.test.gc.replication.CloseWriteAheadLogReferencesIT
>> > > [WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time
>> elapsed:
>> > > 0.039 s - in
>> > > org.apache.accumulo.test.gc.replication.CloseWriteAheadLogReferencesIT
>> > > [INFO]
>> > > [INFO] Results:
>> > > [INFO]
>> > > [ERROR] Errors:
>> > > [ERROR]
>> > >
>> > >
>> org.apache.accumulo.test.compaction.ExternalCompaction_3_IT.testCoordinatorRestartsDuringCompaction
>> > > [ERROR]   Run 1:
>> > > ExternalCompaction_3_IT.testCoordinatorRestartsDuringCompaction:178 »
>> > > TestTimedOut
>> > > [ERROR]   Run 2:
>> > > ExternalCompaction_3_IT.testCoordinatorRestartsDuringCompaction »
>> Appears
>> > > to ...
>> > > [INFO]
>> > > [ERROR]   ConcurrentDeleteTableIT.testConcurrentDeleteTablesOps:76 »
>> > > TestTimedOut test t...
>> > > [ERROR]
>> > >
>> > >
>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete
>> > > [ERROR]   Run 1:
>> > > ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete:213 »
>> TestTimedOut
>> > > tes...
>> > > [ERROR]   Run 2:
>> ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete »
>> > >  Appears to be stuck...
>> > > [INFO]
>> > > [ERROR]
>> org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover
>> > > [ERROR]   Run 1:
>> > >
>> > >
>> HalfDeadTServerIT.testRecover:142->test:217->Object.wait:328->Object.wait:-2
>> > > » TestTimedOut
>> > > [ERROR]   Run 2: HalfDeadTServerIT.testRecover »  Appears to be stuck
>> in
>> > > thread Time-limited te...
>> > > [INFO]
>> > > [ERROR] org.apache.accumulo.test.functional.SslIT.adminStop
>> > > [ERROR]   Run 1: SslIT.adminStop:68->Object.wait:328->Object.wait:-2 »
>> > > TestTimedOut test timed ...
>> > > [ERROR]   Run 2: SslIT.adminStop »  Appears to be stuck in thread
>> > > Time-limited test-SendThread(...
>> > >
>> > > These tests fail consistently at every build attempt!
>> > >
>> > > The tests fail even when executed separately, e.g.:
>> > > mvn verify -Dit.test=ConcurrentDeleteTableIT -o -rf :accumulo-test
>> > >
>> > >
>> > > I am using the current 'main' branch of Accumulo.
>> > > JDK 11.0.11
>> > > Maven: 3.8.2
>> > > OS: Ubuntu 20.04.3 ARM64
>> > >
>> > > Is there anything that could be done to fix these problems ?
>> > > For example some config settings ?!
>> > >
>> > > P.S. At https://github.com/apache/accumulo/issues/1884 I read that
>> Linux
>> > > ARM64 is a supported platform since the JVM supports it.
>> > >
>> > > Thanks!
>> > >
>> > > Mark
>> > >
>>
>

Reply via email to