Please review https://github.com/apache/accumulo/pull/2374 By caching the ServerInfo's Credentials ConcurrentDeleteTableIT passes almost 6 times faster now! I am running the whole test suite now to see whether it doesn't break something else.
On Thu, 2 Dec 2021 at 13:49, Mark Jens <mark.r.j...@gmail.com> wrote: > Reducing the log output did not reduce the test run time: > > diff --git test/src/main/resources/log4j2-test.properties > test/src/main/resources/log4j2-test.properties > index 9124914f7a..810c7bf06f 100644 > --- test/src/main/resources/log4j2-test.properties > +++ test/src/main/resources/log4j2-test.properties > @@ -28,7 +28,7 @@ appender.console.layout.type = PatternLayout > appender.console.layout.pattern = %d{ISO8601} [%c{2}] %-5p: %m%n > > logger.01.name = org.apache.accumulo.core > -logger.01.level = debug > +logger.01.level = info > > logger.02.name = org.apache.accumulo.core.clientImpl.ManagerClient > logger.02.level = info > @@ -106,7 +106,7 @@ logger.25.name = org.apache.hadoop.security > logger.25.level = info > > logger.26.name = org.apache.hadoop.minikdc > -logger.26.level = debug > +logger.26.level = info > > > @@ -169,6 +169,6 @@ logger.metrics.level = info > logger.metrics.additivity = false > logger.metrics.appenderRef.metrics.ref = LoggingMetricsOutput > > -rootLogger.level = debug > +rootLogger.level = info > rootLogger.appenderRef.console.ref = STDOUT > > INFO] Running org.apache.accumulo.test.functional.ConcurrentDeleteTableIT > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: > 785.503 s - in org.apache.accumulo.test.functional.ConcurrentDeleteTableIT > > > On Thu, 2 Dec 2021 at 12:10, Mark Jens <mark.r.j...@gmail.com> wrote: > >> Hi again, >> >> Here are the thread dumps as promised: >> >> 1) Both TabletServers are very busy at compressing at close time. The >> following stacks are dumped in ~5 secs interval: >> >> "tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=68425.44ms >> elapsed=75.42s tid=0x0000fffeac074800 nid=0x33077e runnable >> [0x0000fffe8f3fd000] >> java.lang.Thread.State: RUNNABLE >> at sun.security.provider.SHA5.implCompressCheck(java.base@11.0.11 >> /SHA5.java:232) >> at sun.security.provider.SHA5.implCompress(java.base@11.0.11 >> /SHA5.java:221) >> at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11 >> /DigestBase.java:124) >> at >> java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11 >> /MessageDigest.java:623) >> at java.security.MessageDigest.update(java.base@11.0.11 >> /MessageDigest.java:345) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:421) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125) >> at >> org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66) >> at >> org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179) >> at >> org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148) >> at >> org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169) >> at >> org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236) >> at >> org.apache.accumulo.core.clientImpl.ClientContext.createScanner(ClientContext.java:635) >> at >> org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.buildNonRoot(TabletsMetadata.java:177) >> at >> org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.build(TabletsMetadata.java:125) >> at >> org.apache.accumulo.core.metadata.schema.AmpleImpl.readTablet(AmpleImpl.java:46) >> at >> org.apache.accumulo.core.metadata.schema.Ample.readTablet(Ample.java:141) >> at >> org.apache.accumulo.tserver.tablet.Tablet.closeConsistencyCheck(Tablet.java:1379) >> at >> org.apache.accumulo.tserver.tablet.Tablet.completeClose(Tablet.java:1331) >> - locked <0x00000000f1585830> (a >> org.apache.accumulo.tserver.tablet.Tablet) >> at >> org.apache.accumulo.tserver.tablet.Tablet.close(Tablet.java:1221) >> at >> org.apache.accumulo.tserver.UnloadTabletHandler.run(UnloadTabletHandler.java:92) >> at >> io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207) >> at >> io.opentelemetry.context.Context$$Lambda$209/0x000000010035c840.run(Unknown >> Source) >> at >> java.util.concurrent.ThreadPoolExecutor.runWorker(java.base@11.0.11 >> /ThreadPoolExecutor.java:1128) >> at >> java.util.concurrent.ThreadPoolExecutor$Worker.run(java.base@11.0.11 >> /ThreadPoolExecutor.java:628) >> at >> io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207) >> at >> io.opentelemetry.context.Context$$Lambda$209/0x000000010035c840.run(Unknown >> Source) >> at java.lang.Thread.run(java.base@11.0.11/Thread.java:829) >> >> "tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=72485.20ms >> elapsed=79.71s tid=0x0000fffeac074800 nid=0x33077e runnable >> [0x0000fffe8f3fd000] >> java.lang.Thread.State: RUNNABLE >> at >> sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11 >> /DigestBase.java:149) >> at >> sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11 >> /DigestBase.java:144) >> at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11 >> /DigestBase.java:131) >> at >> java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11 >> /MessageDigest.java:623) >> at java.security.MessageDigest.update(java.base@11.0.11 >> /MessageDigest.java:345) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:403) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125) >> at >> org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66) >> at >> org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179) >> at >> org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148) >> at >> org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169) >> ... >> >> "tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=81174.59ms >> elapsed=89.01s tid=0x0000fffeac074800 nid=0x33077e runnable >> [0x0000fffe8f3fd000] >> java.lang.Thread.State: RUNNABLE >> at sun.security.provider.ByteArrayAccess.l2bBig(java.base@11.0.11 >> /ByteArrayAccess.java:449) >> at sun.security.provider.SHA5.implDigest(java.base@11.0.11 >> /SHA5.java:131) >> at sun.security.provider.DigestBase.engineDigest(java.base@11.0.11 >> /DigestBase.java:210) >> at sun.security.provider.DigestBase.engineDigest(java.base@11.0.11 >> /DigestBase.java:189) >> at >> java.security.MessageDigest$Delegate.engineDigest(java.base@11.0.11 >> /MessageDigest.java:639) >> at java.security.MessageDigest.digest(java.base@11.0.11 >> /MessageDigest.java:385) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:439) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125) >> at >> org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66) >> at >> org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179) >> at >> org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148) >> at >> org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169) >> ... >> >> "tablet migration-Worker-1" #4380 daemon prio=5 os_prio=0 cpu=86499.01ms >> elapsed=94.68s tid=0x0000fffeac074800 nid=0x33077e runnable >> [0x0000fffe8f3fd000] >> java.lang.Thread.State: RUNNABLE >> at >> sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11 >> /DigestBase.java:149) >> at >> sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11 >> /DigestBase.java:144) >> at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11 >> /DigestBase.java:131) >> at >> java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11 >> /MessageDigest.java:623) >> at java.security.MessageDigest.update(java.base@11.0.11 >> /MessageDigest.java:345) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:403) >> at >> org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78) >> at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120) >> at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125) >> at >> org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66) >> at >> org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179) >> at >> org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148) >> at >> org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169) >> ... >> >> "tablet migration-Worker-1" #6107 daemon prio=5 os_prio=0 cpu=109551.37ms >> elapsed=117.48s tid=0x0000fffeac01b000 nid=0x33174d runnable >> [0x0000fffe7bffd000] >> 14012 java.lang.Thread.State: RUNNABLE >> 14013 at >> sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11 >> /DigestBase.java:149) >> 14014 at >> sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11 >> /DigestBase.java:144) >> 14015 at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11 >> /DigestBase.java:131) >> 14016 at >> java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11 >> /MessageDigest.java:623) >> 14017 at java.security.MessageDigest.update(java.base@11.0.11 >> /MessageDigest.java:345) >> 14018 at >> org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:432) >> 14019 at >> org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585) >> 14020 at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78) >> 14021 at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167) >> 14022 at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120) >> 14023 at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125) >> 14024 at >> org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66) >> 14025 at >> org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179) >> 14026 at >> org.apache.accumulo.server.ServerInfo.getAuthenticationToken(ServerInfo.java:153) >> 14027 at >> org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:168) >> 14028 at >> org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236) >> >> Notice that ClientContext.getProperties(ClientContext.java:236) most of >> the times calls ServerInfo.getPrincipal(ServerInfo.java:148) but in the >> last one it calls ServerInfo.getAuthenticationToken(ServerInfo.java:153). >> And both lead to (a lot of ?!) compressing.. >> >> 2) The "Manager" process writes ~200Mb of logs. Maybe the default log >> level should not be DEBUG ?! >> >> Most of its threads either wait for notifications from Zookeeper: >> >> 878647 "Manager-ClientPool-Worker-3" #61 daemon prio=5 os_prio=0 >> cpu=375.95ms elapsed=182.38s tid=0x0000fffee0007800 nid=0x32d943 in >> Object.wait() [0x0000fffebb7fc000] >> 878648 java.lang.Thread.State: TIMED_WAITING (on object monitor) >> 878649 at java.lang.Object.wait(java.base@11.0.11/Native Method) >> 878650 - waiting on <no object reference available> >> 878651 at >> org.apache.accumulo.fate.ZooStore.waitForStatusChange(ZooStore.java:386) >> 878652 - waiting to re-lock in wait() <0x00000000f1427458> (a >> org.apache.accumulo.fate.ZooStore) >> 878653 at >> org.apache.accumulo.fate.AgeOffStore.waitForStatusChange(AgeOffStore.java:209) >> 878654 at >> org.apache.accumulo.core.logging.FateLogger$1.waitForStatusChange(FateLogger.java:75) >> 878655 at >> org.apache.accumulo.fate.Fate.waitForCompletion(Fate.java:297) >> 878656 at >> org.apache.accumulo.manager.FateServiceHandler.waitForFateOperation(FateServiceHandler.java:659) >> 878657 at >> org.apache.accumulo.manager.ManagerClientServiceHandler.waitForFateOperation(ManagerClientServiceHandler.java:100) >> ... >> >> or wait for data: >> 878781 "Repo Runner-Worker-1" #90 daemon prio=5 os_prio=0 cpu=7440.91ms >> elapsed=179.99s tid=0x0000fffeb0002000 nid=0x32d99a in Object.wait() >> [0x0000fffebadfd000] >> 878782 java.lang.Thread.State: WAITING (on object monitor) >> 878783 at java.lang.Object.wait(java.base@11.0.11/Native Method) >> 878784 - waiting on <no object reference available> >> 878785 at java.lang.Object.wait(java.base@11.0.11/Object.java:328) >> 878786 at >> org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1529) >> 878787 - waiting to re-lock in wait() <0x00000000f9bf42d8> (a >> org.apache.zookeeper.ClientCnxn$Packet) >> 878788 at >> org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1512) >> 878789 at >> org.apache.zookeeper.ZooKeeper.getChildren(ZooKeeper.java:2587) >> 878790 at >> org.apache.accumulo.fate.zookeeper.ZooReader.lambda$getChildren$5(ZooReader.java:87) >> 878791 at >> org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$182/0x0000000100324040.apply(Unknown >> Source) >> 878792 at >> org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$184/0x0000000100323c40.apply(Unknown >> Source) >> 878793 at >> org.apache.accumulo.fate.zookeeper.ZooReader.retryLoopMutator(ZooReader.java:165) >> 878794 at >> org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:144) >> 878795 at >> org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:131) >> 878796 at >> org.apache.accumulo.fate.zookeeper.ZooReader.getChildren(ZooReader.java:87) >> 878797 at org.apache.accumulo.fate.ZooStore.reserve(ZooStore.java:141) >> 878798 at >> org.apache.accumulo.fate.AgeOffStore.reserve(AgeOffStore.java:155) >> 878799 at >> org.apache.accumulo.core.logging.FateLogger$1.reserve(FateLogger.java:50) >> 878800 at >> org.apache.accumulo.fate.Fate$TransactionRunner.run(Fate.java:72) >> 878801 at >> io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207) >> 878802 at >> io.opentelemetry.context.Context$$Lambda$209/0x0000000100353840.run(Unknown >> Source) >> 878803 at >> java.util.concurrent.ThreadPoolExecutor.runWorker(java.base@11.0.11 >> /ThreadPoolExecutor.java:1128) >> 878804 at >> java.util.concurrent.ThreadPoolExecutor$Worker.run(java.base@11.0.11 >> /ThreadPoolExecutor.java:628) >> 878805 at >> io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207) >> 878806 at >> io.opentelemetry.context.Context$$Lambda$209/0x0000000100353840.run(Unknown >> Source) >> 878807 at java.lang.Thread.run(java.base@11.0.11/Thread.java:829) >> >> 908220 "Status Thread" #41 daemon prio=5 os_prio=0 cpu=1700.28ms >> elapsed=187.25s tid=0x0000fffee41f9800 nid=0x32d920 in Object.wait() >> [0x0000ffff20f50000] >> 908221 java.lang.Thread.State: WAITING (on object monitor) >> 908222 at java.lang.Object.wait(java.base@11.0.11/Native Method) >> 908223 - waiting on <no object reference available> >> 908224 at java.lang.Object.wait(java.base@11.0.11/Object.java:328) >> 908225 at >> org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1529) >> 908226 - waiting to re-lock in wait() <0x00000000fa781138> (a >> org.apache.zookeeper.ClientCnxn$Packet) >> 908227 at >> org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1512) >> 908228 at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:2129) >> 908229 at >> org.apache.accumulo.fate.zookeeper.ZooReader.lambda$getData$0(ZooReader.java:65) >> 908230 at >> org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$220/0x0000000100351440.apply(Unknown >> Source) >> 908231 at >> org.apache.accumulo.fate.zookeeper.ZooReader$$Lambda$184/0x0000000100323c40.apply(Unknown >> Source) >> 908232 at >> org.apache.accumulo.fate.zookeeper.ZooReader.retryLoopMutator(ZooReader.java:165) >> 908233 at >> org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:144) >> 908234 at >> org.apache.accumulo.fate.zookeeper.ZooReader.retryLoop(ZooReader.java:131) >> 908235 at >> org.apache.accumulo.fate.zookeeper.ZooReader.getData(ZooReader.java:65) >> 908236 at >> org.apache.accumulo.manager.Manager.getManagerGoalState(Manager.java:496) >> 908237 at >> org.apache.accumulo.manager.Manager$StatusThread.updateStatus(Manager.java:822) >> 908238 at >> org.apache.accumulo.manager.Manager$StatusThread.run(Manager.java:797) >> 908239 at >> io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207) >> 908240 at >> io.opentelemetry.context.Context$$Lambda$209/0x0000000100353840.run(Unknown >> Source) >> 908241 at java.lang.Thread.run(java.base@11.0.11/Thread.java:829) >> >> 3) SimpleGarbageCollector is also busy in getting credentials >> >> "gc" #31 prio=5 os_prio=0 cpu=15495.47ms elapsed=209.43s >> tid=0x0000ffff28295800 nid=0x32dac5 runnable [0x0000ffff3a5fb000] >> 2503 java.lang.Thread.State: RUNNABLE >> 2504 at >> sun.security.provider.DigestBase.implCompressMultiBlock0(java.base@11.0.11 >> /DigestBase.java:149) >> 2505 at >> sun.security.provider.DigestBase.implCompressMultiBlock(java.base@11.0.11 >> /DigestBase.java:144) >> 2506 at sun.security.provider.DigestBase.engineUpdate(java.base@11.0.11 >> /DigestBase.java:131) >> 2507 at >> java.security.MessageDigest$Delegate.engineUpdate(java.base@11.0.11 >> /MessageDigest.java:623) >> 2508 at java.security.MessageDigest.update(java.base@11.0.11 >> /MessageDigest.java:345) >> 2509 at >> org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:421) >> 2510 at >> org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585) >> 2511 at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78) >> 2512 at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167) >> 2513 at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120) >> 2514 at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125) >> 2515 at >> org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66) >> 2516 at >> org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179) >> 2517 at >> org.apache.accumulo.server.ServerInfo.getPrincipal(ServerInfo.java:148) >> 2518 at >> org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:169) >> 2519 at >> org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236) >> 2520 at >> org.apache.accumulo.core.clientImpl.ClientContext.createScanner(ClientContext.java:635) >> 2521 at >> org.apache.accumulo.server.metadata.ServerAmpleImpl.getGcCandidates(ServerAmpleImpl.java:180) >> 2522 at >> org.apache.accumulo.gc.SimpleGarbageCollector$GCEnv.getCandidates(SimpleGarbageCollector.java:199) >> 2523 at >> org.apache.accumulo.gc.GarbageCollectionAlgorithm.collect(GarbageCollectionAlgorithm.java:302) >> 2524 at >> org.apache.accumulo.gc.SimpleGarbageCollector.run(SimpleGarbageCollector.java:502) >> 2525 at io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207) >> 2526 at >> io.opentelemetry.context.Context$$Lambda$209/0x0000000100357840.run(Unknown >> Source) >> 2527 at java.lang.Thread.run(java.base@11.0.11/Thread.java:829) >> >> >> 3151 "gc" #31 prio=5 os_prio=0 cpu=15982.95ms elapsed=218.59s >> tid=0x0000ffff28295800 nid=0x32dac5 runnable [0x0000ffff3a5fb000] >> 3152 java.lang.Thread.State: RUNNABLE >> 3153 at java.util.Arrays.hashCode(java.base@11.0.11/Arrays.java:4685) >> 3154 at java.util.Objects.hash(java.base@11.0.11/Objects.java:146) >> 3155 at java.security.Provider$ServiceKey.hashCode(java.base@11.0.11 >> /Provider.java:1107) >> 3156 at java.util.concurrent.ConcurrentHashMap.get(java.base@11.0.11 >> /ConcurrentHashMap.java:936) >> 3157 at java.security.Provider.getService(java.base@11.0.11 >> /Provider.java:1282) >> 3158 at sun.security.jca.ProviderList.getService(java.base@11.0.11 >> /ProviderList.java:380) >> 3159 at sun.security.jca.GetInstance.getInstance(java.base@11.0.11 >> /GetInstance.java:157) >> 3160 at java.security.Security.getImpl(java.base@11.0.11 >> /Security.java:700) >> 3161 at java.security.MessageDigest.getInstance(java.base@11.0.11 >> /MessageDigest.java:178) >> 3162 at >> org.apache.commons.codec.digest.DigestUtils.getDigest(DigestUtils.java:170) >> 3163 at >> org.apache.commons.codec.digest.Sha2Crypt.sha2Crypt(Sha2Crypt.java:395) >> 3164 at >> org.apache.commons.codec.digest.Sha2Crypt.sha512Crypt(Sha2Crypt.java:585) >> 3165 at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:78) >> 3166 at org.apache.commons.codec.digest.Crypt.crypt(Crypt.java:167) >> 3167 at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.hashInstanceConfigs(SystemCredentials.java:120) >> 3168 at >> org.apache.accumulo.server.security.SystemCredentials$SystemToken.generate(SystemCredentials.java:125) >> 3169 at >> org.apache.accumulo.server.security.SystemCredentials.get(SystemCredentials.java:66) >> 3170 at >> org.apache.accumulo.server.ServerInfo.getCredentials(ServerInfo.java:179) >> 3171 at >> org.apache.accumulo.server.ServerInfo.getAuthenticationToken(ServerInfo.java:153) >> 3172 at >> org.apache.accumulo.server.ServerInfo.getProperties(ServerInfo.java:168) >> 3173 at >> org.apache.accumulo.core.clientImpl.ClientContext.getProperties(ClientContext.java:236) >> 3174 at >> org.apache.accumulo.core.clientImpl.ClientContext.createScanner(ClientContext.java:635) >> 3175 at >> org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.buildNonRoot(TabletsMetadata.java:177) >> 3176 at >> org.apache.accumulo.core.metadata.schema.TabletsMetadata$Builder.build(TabletsMetadata.java:125) >> 3177 at >> org.apache.accumulo.gc.SimpleGarbageCollector$GCEnv.getReferences(SimpleGarbageCollector.java:249) >> 3178 at >> org.apache.accumulo.gc.GarbageCollectionAlgorithm.confirmDeletes(GarbageCollectionAlgorithm.java:169) >> 3179 at >> org.apache.accumulo.gc.GarbageCollectionAlgorithm.confirmDeletesTrace(GarbageCollectionAlgorithm.java:276) >> 3180 at >> org.apache.accumulo.gc.GarbageCollectionAlgorithm.deleteBatch(GarbageCollectionAlgorithm.java:330) >> 3181 at >> org.apache.accumulo.gc.GarbageCollectionAlgorithm.collect(GarbageCollectionAlgorithm.java:315) >> 3182 at >> org.apache.accumulo.gc.SimpleGarbageCollector.run(SimpleGarbageCollector.java:501) >> 3183 at io.opentelemetry.context.Context.lambda$wrap$1(Context.java:207) >> 3184 at >> io.opentelemetry.context.Context$$Lambda$209/0x0000000100357840.run(Unknown >> Source) >> 3185 at java.lang.Thread.run(java.base@11.0.11/Thread.java:829) >> >> >> 4) Nothing interesting for Initialize, Main and ZooKeeperServerMain >> processes >> >> >> I'm not saying that the above are problematic. You know how Accumulo >> works. It is up to you to decide whether something should be improved. >> >> Regards, >> Mark >> >> >> On Wed, 1 Dec 2021 at 16:35, Mark Jens <mark.r.j...@gmail.com> wrote: >> >>> >>> >>> On Tue, 30 Nov 2021 at 18:32, Christopher <ctubb...@apache.org> wrote: >>> >>>> It looks like the tests are timing out. This happens frequently when >>>> running on resource-constrained systems. You can give the test more >>>> time by increasing the timeout factor: `mvn clean verify >>>> -Dcheckstyle.skip -Dspotbugs.skip -Dit.test=ConcurrentDeleteTableIT >>>> -Dtimeout.factor=3` >>>> >>>> There's nothing we know of that would change the way our tests work >>>> due to ARM64, but you may have issues because of limited RAM, slow CPU >>>> speeds, slow disk I/O, busy background processes, or other >>>> resource-related issues. I don't think most of the currently active >>>> developers use ARM64, or have access to a test machine to reproduce or >>>> >>> >>> In case anyone wants to test on Linux ARM64 you could easily use Oracle >>> Cloud for free. >>> >>> https://martin-grigorov.medium.com/github-actions-arm64-runner-on-oracle-cloud-a77cdf7a325a >>> explains how to create a VM and how to use this VM as a Github Actions >>> runner. >>> https://github.com/apache/accumulo/issues/1884#issuecomment-970267282 >>> mentions this article. >>> >>> >>>> experiment with Accumulo there, so you may have to do some of your own >>>> troubleshooting. If you can rule out resource-constraint issues, and >>>> it isn't already a known flaky test (ConcurrentDeleteTableIT is known >>>> flaky and sometimes times out on x86_64 as well), you could create a >>>> bug ticket with more details at >>>> https://github.com/apache/accumulo/issues ; there is an issue template >>>> specifically for broken and/or flaky tests that you can select when >>>> creating a new ticket. >>>> >>>> On Tue, Nov 30, 2021 at 9:34 AM Mark Jens <mark.r.j...@gmail.com> >>>> wrote: >>>> > >>>> > Hi dev1, >>>> > >>>> > On Tue, 30 Nov 2021 at 16:21, dev1 <d...@etcoleman.com> wrote: >>>> > >>>> > > Some of those tests are trying to stress conditions that require a >>>> lot of >>>> > > resources to replicate specific conditions. Have you tried to run >>>> those >>>> > > individual tests in isolation so that you are not competing for >>>> resources? >>>> > > Do they always fail, or are the failures transient? >>>> > > >>>> > >>>> > Q: Have you tried to run those individual tests in isolation so that >>>> you >>>> > are not competing for resources? >>>> > A: This is what I mean with the following: >>>> > --------------------- >>>> > The tests fail even when executed separately, e.g.: >>>> > mvn verify -Dit.test=ConcurrentDeleteTableIT -o -rf :accumulo-test >>>> > --------------------- >>>> > >>>> > Q: Do they always fail, or are the failures transient? >>>> > A: I also tried to explain that with "These tests fail consistently at >>>> > every build attempt!" >>>> > >>>> > Mark >>>> > >>>> > > >>>> > > -----Original Message----- >>>> > > From: Mark Jens <mark.r.j...@gmail.com> >>>> > > Sent: Tuesday, November 30, 2021 4:05 AM >>>> > > To: dev@accumulo.apache.org >>>> > > Subject: Consistent IT tests failures on Linux ARM64 >>>> > > >>>> > > Hello Accumulo community, >>>> > > >>>> > > At my job we consider using Linux ARM64 servers and I've been >>>> tasked to >>>> > > test Accumulo. >>>> > > >>>> > > I face some timeout related issues with several IT tests: >>>> > > >>>> > > >>>> > > [ERROR] >>>> > > >>>> > > >>>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete >>>> > > Time elapsed: 420.122 s <<< ERROR! >>>> > > org.junit.runners.model.TestTimedOutException: test timed out after >>>> 420 >>>> > > seconds at java.base@11.0.11/jdk.internal.misc.Unsafe.park(Native >>>> Method) >>>> > > at java.base@11.0.11 >>>> > > /java.util.concurrent.locks.LockSupport.park(LockSupport.java:194) >>>> > > at java.base@11.0.11 >>>> > > /java.util.concurrent.FutureTask.awaitDone(FutureTask.java:447) >>>> > > at java.base@11.0.11 >>>> > > /java.util.concurrent.FutureTask.get(FutureTask.java:190) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete(ConcurrentDeleteTableIT.java:213) >>>> > > at java.base@11.0.11 >>>> > > /jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native >>>> > > Method) >>>> > > at java.base@11.0.11 >>>> > > >>>> > > >>>> /jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) >>>> > > at java.base@11.0.11 >>>> > > >>>> > > >>>> /jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >>>> > > at java.base@11.0.11 >>>> /java.lang.reflect.Method.invoke(Method.java:566) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:299) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:293) >>>> > > at java.base@11.0.11 >>>> > > /java.util.concurrent.FutureTask.run(FutureTask.java:264) >>>> > > at java.base@11.0.11/java.lang.Thread.run(Thread.java:829) >>>> > > >>>> > > [ERROR] >>>> > > >>>> > > >>>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete >>>> > > Time elapsed: 420.122 s <<< ERROR! >>>> > > java.lang.Exception: Appears to be stuck in thread Time-limited >>>> > > test-SendThread(localhost:44251) >>>> > > at java.base@11.0.11/sun.nio.ch.EPoll.wait(Native Method) at >>>> > > java.base@11.0.11 >>>> > > /sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:120) >>>> > > at java.base@11.0.11 >>>> > > /sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:124) >>>> > > at java.base@11.0.11/sun.nio.ch >>>> .SelectorImpl.select(SelectorImpl.java:136) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:347) >>>> > > at >>>> > > >>>> app//org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1223) >>>> > > >>>> > > [ERROR] >>>> > > >>>> > > >>>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentDeleteTablesOps >>>> > > Time elapsed: 420.011 s <<< ERROR! >>>> > > org.junit.runners.model.TestTimedOutException: test timed out after >>>> 420 >>>> > > seconds at java.base@11.0.11/java.lang.Thread.sleep(Native Method) >>>> at >>>> > > >>>> > > >>>> app//org.apache.accumulo.fate.zookeeper.ZooCache$ZooRunnable.retry(ZooCache.java:299) >>>> > > at >>>> app//org.apache.accumulo.fate.zookeeper.ZooCache.get(ZooCache.java:442) >>>> > > at >>>> app//org.apache.accumulo.fate.zookeeper.ZooCache.get(ZooCache.java:372) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.ClientContext.verifyInstanceId(ClientContext.java:467) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.ClientContext.getInstanceID(ClientContext.java:446) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.ClientContext.getManagerLocations(ClientContext.java:405) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.ManagerClient.getConnection(ManagerClient.java:59) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.ManagerClient.getConnectionWithRetry(ManagerClient.java:49) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.beginFateOperation(TableOperationsImpl.java:260) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.doFateOperation(TableOperationsImpl.java:369) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.doFateOperation(TableOperationsImpl.java:359) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.doTableFateOperation(TableOperationsImpl.java:1670) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.core.clientImpl.TableOperationsImpl.create(TableOperationsImpl.java:248) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentDeleteTablesOps(ConcurrentDeleteTableIT.java:76) >>>> > > at java.base@11.0.11 >>>> > > /jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native >>>> > > Method) >>>> > > at java.base@11.0.11 >>>> > > >>>> > > >>>> /jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) >>>> > > at java.base@11.0.11 >>>> > > >>>> > > >>>> /jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >>>> > > at java.base@11.0.11 >>>> /java.lang.reflect.Method.invoke(Method.java:566) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:299) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:293) >>>> > > at java.base@11.0.11 >>>> > > /java.util.concurrent.FutureTask.run(FutureTask.java:264) >>>> > > at java.base@11.0.11/java.lang.Thread.run(Thread.java:829) >>>> > > >>>> > > [INFO] Running org.apache.accumulo.test.functional.ScannerContextIT >>>> > > [INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 102.909 s - in org.apache.accumulo.test.functional.ScannerContextIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.KerberosRenewalIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 504.472 s - in org.apache.accumulo.test.functional.KerberosRenewalIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.BatchWriterFlushIT >>>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 62.132 s - in org.apache.accumulo.test.functional.BatchWriterFlushIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.BinaryIT >>>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 65.034 s - in org.apache.accumulo.test.functional.BinaryIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.PermissionsIT >>>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 59.25 s - in org.apache.accumulo.test.functional.PermissionsIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.ZookeeperRestartIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 37.37 s - in org.apache.accumulo.test.functional.ZookeeperRestartIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.CreateManyScannersIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 23.046 s - in >>>> org.apache.accumulo.test.functional.CreateManyScannersIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.CreateInitialSplitsIT >>>> > > [INFO] Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 255.108 s - in >>>> org.apache.accumulo.test.functional.CreateInitialSplitsIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.MonitorSslIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 25.304 s - in org.apache.accumulo.test.functional.MonitorSslIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.RestartStressIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 78.359 s - in org.apache.accumulo.test.functional.RestartStressIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.BulkSplitOptimizationIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 59.289 s - in >>>> org.apache.accumulo.test.functional.BulkSplitOptimizationIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.BulkNewIT >>>> > > [INFO] Tests run: 14, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 63.696 s - in org.apache.accumulo.test.functional.BulkNewIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.BloomFilterIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 135.298 s - in org.apache.accumulo.test.functional.BloomFilterIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.BulkIT >>>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 122.959 s - in org.apache.accumulo.test.functional.BulkIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.BinaryStressIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 38.626 s - in org.apache.accumulo.test.functional.BinaryStressIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.ClassLoaderIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 45.61 s - in org.apache.accumulo.test.functional.ClassLoaderIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.LogicalTimeIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 116.819 s - in org.apache.accumulo.test.functional.LogicalTimeIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.SplitRecoveryIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 25.421 s - in org.apache.accumulo.test.functional.SplitRecoveryIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.BigRootTabletIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 96.86 s - in org.apache.accumulo.test.functional.BigRootTabletIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.GarbageCollectorIT >>>> > > [INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 238.409 s - in >>>> org.apache.accumulo.test.functional.GarbageCollectorIT >>>> > > [INFO] Running >>>> > > >>>> org.apache.accumulo.test.functional.BalanceInPresenceOfOfflineTableIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 219.253 s - in >>>> > > >>>> org.apache.accumulo.test.functional.BalanceInPresenceOfOfflineTableIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.VisibilityIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 38.015 s - in org.apache.accumulo.test.functional.VisibilityIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.SslWithClientAuthIT >>>> > > [INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 489.863 s - in >>>> org.apache.accumulo.test.functional.SslWithClientAuthIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.SummaryIT >>>> > > [INFO] Tests run: 10, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 111.552 s - in org.apache.accumulo.test.functional.SummaryIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.MaxOpenIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 30.061 s - in org.apache.accumulo.test.functional.MaxOpenIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.ManagerFailoverIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 47.089 s - in org.apache.accumulo.test.functional.ManagerFailoverIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.DeleteRowsIT >>>> > > [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 229.586 s - in org.apache.accumulo.test.functional.DeleteRowsIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.BackupManagerIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 22.943 s - in org.apache.accumulo.test.functional.BackupManagerIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.TabletMetadataIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 46.728 s - in org.apache.accumulo.test.functional.TabletMetadataIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.LateLastContactIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 46.648 s - in org.apache.accumulo.test.functional.LateLastContactIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.SimpleBalancerFairnessIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 71.934 s - in >>>> org.apache.accumulo.test.functional.SimpleBalancerFairnessIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.HalfDeadTServerIT >>>> > > [ERROR] Tests run: 3, Failures: 0, Errors: 2, Skipped: 0, Time >>>> elapsed: >>>> > > 307.904 s <<< FAILURE! - in >>>> > > org.apache.accumulo.test.functional.HalfDeadTServerIT >>>> > > [ERROR] >>>> org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover >>>> > > Time elapsed: 240.011 s <<< ERROR! >>>> > > org.junit.runners.model.TestTimedOutException: test timed out after >>>> 240 >>>> > > seconds at java.base@11.0.11/java.lang.Object.wait(Native Method) >>>> at >>>> > > java.base@11.0.11/java.lang.Object.wait(Object.java:328) >>>> > > at java.base@11.0.11 >>>> /java.lang.ProcessImpl.waitFor(ProcessImpl.java:495) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.test.functional.HalfDeadTServerIT.test(HalfDeadTServerIT.java:217) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover(HalfDeadTServerIT.java:142) >>>> > > at java.base@11.0.11 >>>> > > /jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native >>>> > > Method) >>>> > > at java.base@11.0.11 >>>> > > >>>> > > >>>> /jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) >>>> > > at java.base@11.0.11 >>>> > > >>>> > > >>>> /jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >>>> > > at java.base@11.0.11 >>>> /java.lang.reflect.Method.invoke(Method.java:566) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:299) >>>> > > at >>>> > > >>>> > > >>>> app//org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:293) >>>> > > at java.base@11.0.11 >>>> > > /java.util.concurrent.FutureTask.run(FutureTask.java:264) >>>> > > at java.base@11.0.11/java.lang.Thread.run(Thread.java:829) >>>> > > >>>> > > [ERROR] >>>> org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover >>>> > > Time elapsed: 240.012 s <<< ERROR! >>>> > > java.lang.Exception: Appears to be stuck in thread Time-limited >>>> > > test-SendThread(localhost:39285) >>>> > > at java.base@11.0.11/sun.nio.ch.EPoll.wait(Native Method) at >>>> > > java.base@11.0.11 >>>> > > /sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:120) >>>> > > at java.base@11.0.11 >>>> > > /sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:124) >>>> > > at java.base@11.0.11/sun.nio.ch >>>> .SelectorImpl.select(SelectorImpl.java:136) >>>> > > at >>>> > > >>>> > > >>>> app//org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:347) >>>> > > at >>>> > > >>>> app//org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1223) >>>> > > >>>> > > [INFO] Running org.apache.accumulo.test.functional.MetadataIT >>>> > > [INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 97.987 s - in org.apache.accumulo.test.functional.MetadataIT >>>> > > [INFO] Running >>>> org.apache.accumulo.test.functional.ScanSessionTimeOutIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 43.91 s - in >>>> org.apache.accumulo.test.functional.ScanSessionTimeOutIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.ZooCacheIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 33.986 s - in org.apache.accumulo.test.functional.ZooCacheIT >>>> > > [INFO] Running org.apache.accumulo.test.functional.DeleteRowsSplitIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 113.928 s - in org.apache.accumulo.test.functional.DeleteRowsSplitIT >>>> > > [INFO] Running org.apache.accumulo.test.ScanFlushWithTimeIT >>>> > > [INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 36.854 s - in org.apache.accumulo.test.ScanFlushWithTimeIT >>>> > > [INFO] Running org.apache.accumulo.test.AuditMessageIT >>>> > > [INFO] Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time >>>> elapsed: >>>> > > 165.169 s - in org.apache.accumulo.test.AuditMessageIT >>>> > > [INFO] Running >>>> > > >>>> org.apache.accumulo.test.gc.replication.CloseWriteAheadLogReferencesIT >>>> > > [WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time >>>> elapsed: >>>> > > 0.039 s - in >>>> > > >>>> org.apache.accumulo.test.gc.replication.CloseWriteAheadLogReferencesIT >>>> > > [INFO] >>>> > > [INFO] Results: >>>> > > [INFO] >>>> > > [ERROR] Errors: >>>> > > [ERROR] >>>> > > >>>> > > >>>> org.apache.accumulo.test.compaction.ExternalCompaction_3_IT.testCoordinatorRestartsDuringCompaction >>>> > > [ERROR] Run 1: >>>> > > ExternalCompaction_3_IT.testCoordinatorRestartsDuringCompaction:178 >>>> » >>>> > > TestTimedOut >>>> > > [ERROR] Run 2: >>>> > > ExternalCompaction_3_IT.testCoordinatorRestartsDuringCompaction » >>>> Appears >>>> > > to ... >>>> > > [INFO] >>>> > > [ERROR] ConcurrentDeleteTableIT.testConcurrentDeleteTablesOps:76 » >>>> > > TestTimedOut test t... >>>> > > [ERROR] >>>> > > >>>> > > >>>> org.apache.accumulo.test.functional.ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete >>>> > > [ERROR] Run 1: >>>> > > ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete:213 » >>>> TestTimedOut >>>> > > tes... >>>> > > [ERROR] Run 2: >>>> ConcurrentDeleteTableIT.testConcurrentFateOpsWithDelete » >>>> > > Appears to be stuck... >>>> > > [INFO] >>>> > > [ERROR] >>>> org.apache.accumulo.test.functional.HalfDeadTServerIT.testRecover >>>> > > [ERROR] Run 1: >>>> > > >>>> > > >>>> HalfDeadTServerIT.testRecover:142->test:217->Object.wait:328->Object.wait:-2 >>>> > > » TestTimedOut >>>> > > [ERROR] Run 2: HalfDeadTServerIT.testRecover » Appears to be >>>> stuck in >>>> > > thread Time-limited te... >>>> > > [INFO] >>>> > > [ERROR] org.apache.accumulo.test.functional.SslIT.adminStop >>>> > > [ERROR] Run 1: >>>> SslIT.adminStop:68->Object.wait:328->Object.wait:-2 » >>>> > > TestTimedOut test timed ... >>>> > > [ERROR] Run 2: SslIT.adminStop » Appears to be stuck in thread >>>> > > Time-limited test-SendThread(... >>>> > > >>>> > > These tests fail consistently at every build attempt! >>>> > > >>>> > > The tests fail even when executed separately, e.g.: >>>> > > mvn verify -Dit.test=ConcurrentDeleteTableIT -o -rf :accumulo-test >>>> > > >>>> > > >>>> > > I am using the current 'main' branch of Accumulo. >>>> > > JDK 11.0.11 >>>> > > Maven: 3.8.2 >>>> > > OS: Ubuntu 20.04.3 ARM64 >>>> > > >>>> > > Is there anything that could be done to fix these problems ? >>>> > > For example some config settings ?! >>>> > > >>>> > > P.S. At https://github.com/apache/accumulo/issues/1884 I read that >>>> Linux >>>> > > ARM64 is a supported platform since the JVM supports it. >>>> > > >>>> > > Thanks! >>>> > > >>>> > > Mark >>>> > > >>>> >>>