Hi Alex, Here is the log from the Ignite startup. It's fairly short but shows everything I think:
2023-07-17 22:38:55,061 [1] DBG [ImmutableCacheComputeServer] Starting Ignite.NET 2.15.0.23172 2023-07-17 22:38:55,065 [1] DBG [ImmutableCacheComputeServer] 2023-07-17 22:38:55,068 [1] DBG [ImmutableCacheComputeServer] 2023-07-17 22:38:55,070 [1] DBG [ImmutableCacheComputeServer] 2023-07-17 22:38:55,070 [1] DBG [ImmutableCacheComputeServer] 2023-07-17 22:38:55,073 [1] DBG [ImmutableCacheComputeServer] 2023-07-17 22:38:55,471 [1] DBG [ImmutableCacheComputeServer] JVM started. 2023-07-17 22:38:56,340 [1] WRN [ImmutableCacheComputeServer] Consistent ID is not set, it is recommended to set consistent ID for production clusters (use IgniteConfiguration.setConsistentId property) 2023-07-17 22:38:56,382 [1] INF [ImmutableCacheComputeServer] >>> __________ ________________ >>> / _/ ___/ |/ / _/_ __/ __/ >>> _/ // (7 7 // / / / / _/ >>> /___/\___/_/|_/___/ /_/ /___/ >>> >>> ver. 2.15.0#20230425-sha1:f98f7f35 >>> 2023 Copyright(C) Apache Software Foundation >>> >>> Ignite documentation: https://ignite.apache.org 2023-07-17 22:38:56,383 [1] INF [ImmutableCacheComputeServer] Config URL: n/a 2023-07-17 22:38:56,414 [1] INF [ImmutableCacheComputeServer] IgniteConfiguration [igniteInstanceName=TRex-Immutable, pubPoolSize=250, svcPoolSize=8, callbackPoolSize=8, stripedPoolSize=8, sysPoolSize=250, mgmtPoolSize=4, dataStreamerPoolSize=8, utilityCachePoolSize=8, utilityCacheKeepAliveTime=60000, p2pPoolSize=2, qryPoolSize=8, buildIdxPoolSize=1, igniteHome=/trex/, igniteWorkDir=/persist/Immutable, mbeanSrv=com.sun.jmx.mbeanserver.JmxMBeanServer@6e46d9f4, nodeId=4e70ba5e-5829-4b2d-b349-6539918990b5, marsh=BinaryMarshaller [], marshLocJobs=false, p2pEnabled=false, netTimeout=5000, netCompressionLevel=1, sndRetryDelay=1000, sndRetryCnt=3, metricsHistSize=10000, metricsUpdateFreq=2000, metricsExpTime=9223372036854775807, discoSpi=TcpDiscoverySpi [addrRslvr=null, addressFilter=null, sockTimeout=0, ackTimeout=0, marsh=null, reconCnt=10, reconDelay=2000, maxAckTimeout=600000, soLinger=0, forceSrvMode=false, clientReconnectDisabled=false, internalLsnr=null, skipAddrsRandomization=false], segPlc=USE_FAILURE_HANDLER, segResolveAttempts=2, waitForSegOnStart=true, allResolversPassReq=true, segChkFreq=10000, commSpi=TcpCommunicationSpi [connectGate=org.apache.ignite.spi.communication.tcp.internal.ConnectGateway@5bb3d42d, ctxInitLatch=java.util.concurrent.CountDownLatch@5bf61e67[Count = 1], stopping=false, clientPool=null, nioSrvWrapper=null, stateProvider=null], evtSpi=org.apache.ignite.spi.eventstorage.NoopEventStorageSpi@2c1dc8e, colSpi=NoopCollisionSpi [], deploySpi=LocalDeploymentSpi [], indexingSpi=org.apache.ignite.spi.indexing.noop.NoopIndexingSpi@61019f59, addrRslvr=null, encryptionSpi=org.apache.ignite.spi.encryption.noop.NoopEncryptionSpi@62e8f862, tracingSpi=org.apache.ignite.spi.tracing.NoopTracingSpi@26f3d90c, clientMode=false, rebalanceThreadPoolSize=1, rebalanceTimeout=10000, rebalanceBatchesPrefetchCnt=3, rebalanceThrottle=0, rebalanceBatchSize=524288, txCfg=TransactionConfiguration [txSerEnabled=false, dfltIsolation=REPEATABLE_READ, dfltConcurrency=PESSIMISTIC, dfltTxTimeout=0, txTimeoutOnPartitionMapExchange=0, deadlockTimeout=10000, pessimisticTxLogSize=0, pessimisticTxLogLinger=10000, tmLookupClsName=null, txManagerFactory=null, useJtaSync=false], cacheSanityCheckEnabled=true, discoStartupDelay=60000, deployMode=SHARED, p2pMissedCacheSize=100, locHost=null, timeSrvPortBase=31100, timeSrvPortRange=100, failureDetectionTimeout=60000, sysWorkerBlockedTimeout=null, clientFailureDetectionTimeout=60000, metricsLogFreq=30000, connectorCfg=ConnectorConfiguration [jettyPath=null, host=null, port=11212, noDelay=true, directBuf=false, sndBufSize=32768, rcvBufSize=32768, idleQryCurTimeout=600000, idleQryCurCheckFreq=60000, sndQueueLimit=0, selectorCnt=2, idleTimeout=7000, sslEnabled=false, sslClientAuth=false, sslCtxFactory=null, sslFactory=null, portRange=100, threadPoolSize=8, msgInterceptor=null], odbcCfg=null, warmupClos=null, atomicCfg=AtomicConfiguration [seqReserveSize=1000, cacheMode=PARTITIONED, backups=1, aff=null, grpName=null], classLdr=null, sslCtxFactory=null, platformCfg=PlatformDotNetConfiguration [binaryCfg=null], binaryCfg=BinaryConfiguration [idMapper=null, nameMapper=null, serializer=null, compactFooter=true], memCfg=null, pstCfg=null, dsCfg=DataStorageConfiguration [pageSize=4096, concLvl=2, sysDataRegConf=org.apache.ignite.configuration.SystemDataRegionConfiguration@55a8dc49, dfltDataRegConf=DataRegionConfiguration [name=Default-Immutable, maxSize=8589934592, initSize=8589934592, swapPath=null, pageEvictionMode=DISABLED, pageReplacementMode=CLOCK, evictionThreshold=0.9, emptyPagesPoolSize=100, metricsEnabled=false, metricsSubIntervalCount=5, metricsRateTimeInterval=60000, persistenceEnabled=true, checkpointPageBufSize=0, lazyMemoryAllocation=true, warmUpCfg=null, memoryAllocator=null, cdcEnabled=false], dataRegions=null, storagePath=/persist/Immutable/Persistence, checkpointFreq=30000, lockWaitTime=10000, checkpointThreads=4, checkpointWriteOrder=SEQUENTIAL, walHistSize=20, maxWalArchiveSize=5368709120, walSegments=10, walSegmentSize=536870912, walPath=/persist/Immutable/WalStore, walArchivePath=/persist/Immutable/WalArchive, cdcWalPath=db/wal/cdc, cdcWalDirMaxSize=0, metricsEnabled=false, walMode=FSYNC, walTlbSize=131072, walBuffSize=0, walFlushFreq=2000, walFsyncDelay=1000, walRecordIterBuffSize=67108864, alwaysWriteFullPages=false, fileIOFactory=org.apache.ignite.internal.processors.cache.persistence.file.AsyncFileIOFactory@545f80bf, metricsSubIntervalCnt=5, metricsRateTimeInterval=60000, walAutoArchiveAfterInactivity=-1, walForceArchiveTimeout=-1, writeThrottlingEnabled=false, walCompactionEnabled=false, walCompactionLevel=1, checkpointReadLockTimeout=null, walPageCompression=DISABLED, walPageCompressionLevel=null, dfltWarmUpCfg=null, encCfg=org.apache.ignite.configuration.EncryptionConfiguration@22fa55b2, defragmentationThreadPoolSize=4, minWalArchiveSize=-1, memoryAllocator=null], snapshotPath=snapshots, snapshotThreadPoolSize=4, activeOnStart=true, activeOnStartPropSetFlag=false, autoActivation=true, autoActivationPropSetFlag=false, clusterStateOnStart=null, sqlConnCfg=null, cliConnCfg=ClientConnectorConfiguration [host=null, port=10800, portRange=100, sockSndBufSize=0, sockRcvBufSize=0, tcpNoDelay=true, maxOpenCursorsPerConn=128, threadPoolSize=8, selectorCnt=4, idleTimeout=0, handshakeTimeout=10000, jdbcEnabled=true, odbcEnabled=true, thinCliEnabled=true, sslEnabled=false, useIgniteSslCtxFactory=true, sslClientAuth=false, sslCtxFactory=null, thinCliCfg=ThinClientConfiguration [maxActiveTxPerConn=100, maxActiveComputeTasksPerConn=0, sendServerExcStackTraceToClient=false]], mvccVacuumThreadCnt=2, mvccVacuumFreq=5000, authEnabled=false, failureHnd=null, commFailureRslvr=null, sqlCfg=SqlConfiguration [longQryWarnTimeout=3000, dfltQryTimeout=0, sqlQryHistSize=1000, validationEnabled=false], asyncContinuationExecutor=null] 2023-07-17 22:38:56,414 [1] INF [ImmutableCacheComputeServer] OS: Linux 5.15.0-1041-azure amd64 2023-07-17 22:38:56,415 [1] INF [ImmutableCacheComputeServer] OS user: root 2023-07-17 22:38:56,419 [1] INF [ImmutableCacheComputeServer] PID: 1 2023-07-17 22:38:56,420 [1] INF [ImmutableCacheComputeServer] Language runtime: Java Platform API Specification ver. 11 2023-07-17 22:38:56,420 [1] INF [ImmutableCacheComputeServer] VM information: OpenJDK Runtime Environment 11.0.19+7-LTS Amazon.com Inc. OpenJDK 64-Bit Server VM 11.0.19+7-LTS 2023-07-17 22:38:56,420 [1] INF [ImmutableCacheComputeServer] VM total memory: 1.0GB 2023-07-17 22:38:56,421 [1] INF [ImmutableCacheComputeServer] Remote Management [restart: off, REST: on, JMX (remote: off)] 2023-07-17 22:38:56,421 [1] INF [ImmutableCacheComputeServer] Logger: PlatformLogger [traceEnabled=false, debugEnabled=false, infoEnabled=true, isQuiet=false] 2023-07-17 22:38:56,421 [1] INF [ImmutableCacheComputeServer] IGNITE_HOME=/trex/ 2023-07-17 22:38:56,422 [1] INF [ImmutableCacheComputeServer] VM arguments: [-DIGNITE_QUIET=false, -Djava.net.preferIPv4Stack=true, -XX:+UseG1GC, -Djdk.tls.server.protocols="TLSv1.2", -Djdk.tls.client.protocols="TLSv1.2", --add-exports=java.base/jdk.internal.misc=ALL-UNNAMED, --add-exports=java.base/sun.nio.ch=ALL-UNNAMED, --add-exports=java.management/com.sun.jmx.mbeanserver=ALL-UNNAMED, --add-exports=jdk.internal.jvmstat/sun.jvmstat.monitor=ALL-UNNAMED, --add-exports=java.base/sun.reflect.generics.reflectiveObjects=ALL-UNNAMED, --illegal-access=permit, -javaagent:./libs/jmx_prometheus_javaagent-0.18.0.jar=8088:prometheusConfig.yaml, -Xms1024m, -Xmx1024m, -Dfile.encoding=UTF-8, -Djava.util.logging.config.file=/trex/config/java.util.logging.properties, --add-exports=java.base/jdk.internal.misc=ALL-UNNAMED, --add-exports=java.base/sun.nio.ch=ALL-UNNAMED, --add-exports=java.management/com.sun.jmx.mbeanserver=ALL-UNNAMED, --add-exports=jdk.internal.jvmstat/sun.jvmstat.monitor=ALL-UNNAMED, --add-opens=jdk.management/com.sun.management.internal=ALL-UNNAMED, --illegal-access=permit, --add-opens=java.base/jdk.internal.misc=ALL-UNNAMED, --add-opens=java.base/ sun.nio.ch=ALL-UNNAMED, --add-opens=java.management/com.sun.jmx.mbeanserver=ALL-UNNAMED, --add-opens=jdk.internal.jvmstat/sun.jvmstat.monitor=ALL-UNNAMED, --add-opens=java.base/sun.reflect.generics.reflectiveObjects=ALL-UNNAMED, --add-opens=java.base/java.io=ALL-UNNAMED, --add-opens=java.base/java.nio=ALL-UNNAMED, --add-opens=java.base/java.util=ALL-UNNAMED, --add-opens=java.base/java.util.concurrent=ALL-UNNAMED, --add-opens=java.base/java.util.concurrent.locks=ALL-UNNAMED, --add-opens=java.base/java.util.concurrent.atomic=ALL-UNNAMED, --add-opens=java.base/java.lang=ALL-UNNAMED, --add-opens=java.base/java.lang.invoke=ALL-UNNAMED, --add-opens=java.base/java.math=ALL-UNNAMED, --add-opens=java.sql/java.sql=ALL-UNNAMED] 2023-07-17 22:38:56,422 [1] INF [ImmutableCacheComputeServer] System cache's DataRegion size is configured to 40 MB. Use DataStorageConfiguration.systemRegionInitialSize property to change the setting. 2023-07-17 22:38:56,422 [1] INF [ImmutableCacheComputeServer] Configured caches [in 'sysMemPlc' dataRegion: ['ignite-sys-cache']] 2023-07-17 22:38:56,487 [1] INF [ImmutableCacheComputeServer] Configured plugins: 2023-07-17 22:38:56,488 [1] INF [ImmutableCacheComputeServer] ^-- None 2023-07-17 22:38:56,488 [1] INF [ImmutableCacheComputeServer] 2023-07-17 22:38:56,491 [1] INF [ImmutableCacheComputeServer] Configured failure handler: [hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0, super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet [SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]]] 2023-07-17 22:38:56,747 [1] INF [ImmutableCacheComputeServer] Successfully bound communication NIO server to TCP port [port=47100, locHost=0.0.0.0/0.0.0.0, selectorsCnt=2, selectorSpins=0, pairedConn=false] 2023-07-17 22:38:56,749 [1] WRN [ImmutableCacheComputeServer] Failure detection timeout will be ignored (one of SPI parameters has been set explicitly) 2023-07-17 22:38:56,769 [1] INF [ImmutableCacheComputeServer] Collision resolution is disabled (all jobs will be activated upon arrival). 2023-07-17 22:38:56,836 [1] INF [ImmutableCacheComputeServer] Successfully bound to TCP port [port=47500, localHost=0.0.0.0/0.0.0.0, locNodeId=4e70ba5e-5829-4b2d-b349-6539918990b5] 2023-07-17 22:38:56,869 [1] INF [ImmutableCacheComputeServer] Successfully locked persistence storage folder [/persist/Immutable/Persistence/node00-03094411-a868-4d96-8ea3-39df7f6f2262] 2023-07-17 22:38:56,870 [1] INF [ImmutableCacheComputeServer] Consistent ID used for local node is [03094411-a868-4d96-8ea3-39df7f6f2262] according to persistence data storage folders 2023-07-17 22:38:56,870 [1] INF [ImmutableCacheComputeServer] Resolved store directory for node persistent data: /persist/Immutable/Persistence/node00-03094411-a868-4d96-8ea3-39df7f6f2262 2023-07-17 22:39:00,359 [1] INF [ImmutableCacheComputeServer] Resolved directory for serialized binary metadata: /persist/Immutable/db/binary_meta/node00-03094411-a868-4d96-8ea3-39df7f6f2262 2023-07-17 22:39:02,289 [1] INF [ImmutableCacheComputeServer] Resolved page store work directory: /persist/Immutable/Persistence/node00-03094411-a868-4d96-8ea3-39df7f6f2262 2023-07-17 22:39:02,325 [1] INF [ImmutableCacheComputeServer] Resolved page store work directory: /persist/Immutable/Persistence/node00-03094411-a868-4d96-8ea3-39df7f6f2262 2023-07-17 22:39:02,412 [1] INF [ImmutableCacheComputeServer] Resolved write ahead log work directory: /persist/Immutable/WalStore/node00-03094411-a868-4d96-8ea3-39df7f6f2262 2023-07-17 22:39:02,417 [1] INF [ImmutableCacheComputeServer] Resolved write ahead log archive directory: /persist/Immutable/WalArchive/node00-03094411-a868-4d96-8ea3-39df7f6f2262 2023-07-17 22:39:02,568 [1] INF [ImmutableCacheComputeServer] Configured data regions initialized successfully [total=5] 2023-07-17 22:39:02,617 [1] INF [ImmutableCacheComputeServer] Resolved snapshot work directory: /persist/Immutable/snapshots 2023-07-17 22:39:02,617 [1] INF [ImmutableCacheComputeServer] Resolved temp directory for snapshot creation: /persist/Immutable/Persistence/node00-03094411-a868-4d96-8ea3-39df7f6f2262/snp 2023-07-17 22:39:02,670 [1] WRN [ImmutableCacheComputeServer] Serialization of Java objects in H2 was enabled. 2023-07-17 22:39:02,798 [1] INF [ImmutableCacheComputeServer] Client connector processor has started on TCP port 10800 2023-07-17 22:39:02,845 [1] INF [ImmutableCacheComputeServer] Command protocol successfully started [name=TCP binary, host=0.0.0.0/0.0.0.0, port=11212] 2023-07-17 22:39:02,868 [1] WRN [ImmutableCacheComputeServer] Marshaller is automatically set to o.a.i.i.binary.BinaryMarshaller (other nodes must have the same marshaller type). 2023-07-17 22:39:02,885 [1] INF [ImmutableCacheComputeServer] Configured .NET plugins: 2023-07-17 22:39:02,885 [1] INF [ImmutableCacheComputeServer] ^-- None 2023-07-17 22:39:02,897 [1] INF [ImmutableCacheComputeServer] Non-loopback local IPs: 10.215.104.141 2023-07-17 22:39:02,898 [1] INF [ImmutableCacheComputeServer] Enabled local MACs: E636C15B5514 2023-07-17 22:39:02,948 [1] INF [ImmutableCacheComputeServer] Read checkpoint status [startMarker=/persist/Immutable/Persistence/node00-03094411-a868-4d96-8ea3-39df7f6f2262/cp/1689633469988-4581c48e-4906-41c4-9c63-c798bfb50fc6-START.bin, endMarker=/persist/Immutable/Persistence/node00-03094411-a868-4d96-8ea3-39df7f6f2262/cp/1689632606392-6cda821a-488a-40c3-9d24-20d100190181-END.bin] 2023-07-17 22:39:02,953 [1] INF [ImmutableCacheComputeServer] Started page memory [memoryAllocated=100.0 MiB, pages=24814, tableSize=1.9 MiB, replacementSize=3.0 KiB, checkpointBuffer=100.0 MiB] 2023-07-17 22:39:02,954 [1] INF [ImmutableCacheComputeServer] Checking memory state [lastValidPos=WALPointer [idx=2431, fileOff=201666668, len=60379], lastMarked=WALPointer [idx=2431, fileOff=215985766, len=60379], lastCheckpointId=4581c48e-4906-41c4-9c63-c798bfb50fc6] 2023-07-17 22:39:03,260 [1] WRN [ImmutableCacheComputeServer] WAL segment tail reached. [idx=2431, isWorkDir=true, serVer=org.apache.ignite.internal.processors.cache.persistence.wal.serializer.RecordV2Serializer@1b90fee4, actualFilePtr=WALPointer [idx=2431, fileOff=216046145, len=0]] 2023-07-17 22:39:03,265 [1] INF [ImmutableCacheComputeServer] Restoring checkpoint after logical recovery, will start physical recovery from back pointer: WALPointer [idx=2431, fileOff=209031823, len=29] 2023-07-17 22:39:03,352 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000057, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209169155, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,355 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000054, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209173350, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,356 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000048, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209177545, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,356 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000047, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209181740, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,357 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000046, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209185935, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,357 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000045, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209190130, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,358 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000044, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209194325, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,360 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000043, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209198520, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,360 [30] ERR [ImmutableCacheComputeServer] Failed to apply page delta. rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000042, pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, chainSize=0, pos=WALPointer [idx=2431, fileOff=209202715, len=41], type=PAGES_LIST_REMOVE_PAGE]]]] 2023-07-17 22:39:03,362 [1] INF [ImmutableCacheComputeServer] Cleanup cache stores [total=0, left=0, cleanFiles=false] 2023-07-17 22:39:03,363 [1] ERR [ImmutableCacheComputeServer] Exception during start processors, node will be stopped and close connections 2023-07-17 22:39:03,364 [1] ERR [ImmutableCacheComputeServer] Got exception while starting (will rollback startup routine). 2023-07-17 22:39:03,365 [1] WRN [ImmutableCacheComputeServer] Attempt to stop starting grid. This operation cannot be guaranteed to be successful. 2023-07-17 22:39:03,369 [1] INF [ImmutableCacheComputeServer] Command protocol successfully stopped: TCP binary 2023-07-17 22:39:03,458 [1] INF [VSS.TRex.GridFabric.Servers.Compute.ImmutableCacheComputeServer] Completed creation of new Ignite node: Exists = False, Factory available = True 2023-07-17 22:39:03,458 [1] WRN [VSS.TRex.GridFabric.Servers.Compute.ImmutableCacheComputeServer] Unable to obtain instance of TRex-Immutable at attempt:1 Unhandled exception: Apache.Ignite.Core.Common.IgniteException: Failed to apply page delta ---> Apache.Ignite.Core.Common.JavaException: class org.apache.ignite.IgniteException: Failed to apply page delta at org.apache.ignite.internal.util.IgniteUtils.convertException(IgniteUtils.java:1150) at org.apache.ignite.internal.processors.platform.PlatformAbstractBootstrap.start(PlatformAbstractBootstrap.java:48) at org.apache.ignite.internal.processors.platform.PlatformIgnition.start(PlatformIgnition.java:74) Caused by: class org.apache.ignite.IgniteCheckedException: Failed to apply page delta at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.lambda$performBinaryMemoryRestore$26(GridCacheDatabaseSharedManager.java:2289) at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.lambda$stripedApplyPage$27(GridCacheDatabaseSharedManager.java:2346) at org.apache.ignite.internal.processors.cache.persistence.CacheStripedExecutor.lambda$submit$0(CacheStripedExecutor.java:75) at org.apache.ignite.internal.util.StripedExecutor$Stripe.body(StripedExecutor.java:637) at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:125) at java.base/java.lang.Thread.run(Thread.java:829) Caused by: java.lang.IllegalStateException: Failed to get page IO instance (page content is corrupted) at org.apache.ignite.internal.processors.cache.persistence.tree.io.IOVersions.forVersion(IOVersions.java:85) at org.apache.ignite.internal.processors.cache.persistence.tree.io.IOVersions.forPage(IOVersions.java:97) at org.apache.ignite.internal.pagemem.wal.record.delta.PagesListRemovePageRecord.applyDelta(PagesListRemovePageRecord.java:55) at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.applyPageDelta(GridCacheDatabaseSharedManager.java:2401) at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager.lambda$performBinaryMemoryRestore$26(GridCacheDatabaseSharedManager.java:2282) ... 5 more at Apache.Ignite.Core.Impl.Unmanaged.Jni.Env.ExceptionCheck() at Apache.Ignite.Core.Impl.Unmanaged.Jni.Env.CallStaticVoidMethod(GlobalRef cls, IntPtr methodId, Int64* argsPtr) at Apache.Ignite.Core.Impl.Unmanaged.UnmanagedUtils.IgnitionStart(Env env, String cfgPath, String gridName, Boolean clientMode, Boolean userLogger, Int64 igniteId, Boolean redirectConsole) at Apache.Ignite.Core.Ignition.Start(IgniteConfiguration cfg) --- End of inner exception stack trace --- at Apache.Ignite.Core.Ignition.Start(IgniteConfiguration cfg) Thanks, Raymond. On Wed, Jul 19, 2023 at 5:43 AM Raymond Wilson <raymond_wil...@trimble.com> wrote: > Hi Alex, > > We are using Ignite v2.15. > > I will track down the additional log information and reply on this thread. > > Raymond. > > > On Wed, Jul 19, 2023 at 2:55 AM Alex Plehanov <plehanov.a...@gmail.com> > wrote: > >> Hello, >> >> Which Ignite version do you use? >> Please share exception details after "Exception during start processors, >> node will be stopped and close connections" (there should be a reason in >> the log, why the page delta can't be applied). >> >> вт, 18 июл. 2023 г. в 05:05, Raymond Wilson <raymond_wil...@trimble.com>: >> >>> Hi, >>> >>> We run a dev/alpha stack of our application in Azure Kubernetes. >>> Persistent storage is contained in Azure Files NAS storage volumes, one per >>> server node. >>> >>> We ran an upgrade of Kubernetes today (from 1.24.9 to 1.26.3). During >>> the update various pods were stopped and restarted as is normal for an >>> update. This included nodes running the dev/alpha stack. >>> >>> At least one node (of a cluster of four server nodes in the cluster) >>> failed to restart after the update, with the following logging: >>> >>> 2023-07-18 01:23:55.171 [1] INF Restoring checkpoint after logical >>> recovery, will start physical recovery from back pointer: WALPointer >>> [idx=2431, fileOff=209031823, len=29] >>> 2023-07-18 01:23:55.205 [28] ERR Failed to apply page delta. >>> rec=[PagesListRemovePageRecord [rmvdPageId=0101000100000057, >>> pageId=0101000100000004, grpId=-1476359018, super=PageDeltaRecord >>> [grpId=-1476359018, pageId=0101000100000004, super=WALRecord [size=41, >>> chainSize=0, pos=WALPointer [idx=2431, fileOff=209169155, len=41], >>> type=PAGES_LIST_REMOVE_PAGE]]]] >>> 2023-07-18 01:23:55.217 [1] INF Cleanup cache stores [total=0, >>> left=0, cleanFiles=false] >>> 2023-07-18 01:23:55.218 [1] ERR Got exception while starting (will >>> rollback startup routine). >>> 2023-07-18 01:23:55.218 [1] ERR Exception during start processors, >>> node will be stopped and close connections >>> >>> I know Apache Ignite is very good at surviving 'Big Red Switch' >>> scenarios, and we have our data regions configured with the strictest >>> update protocol (full sync after each write), however it's possible the NAS >>> implementation does something different! >>> >>> I think if we delete the WAL files from the nodes that won't restart >>> then the node may be happy, though we will lose any updates since the last >>> checkpoint (but then, it has low use and checkpoints are every 30-45 >>> seconds or so, so this won't be significant). >>> >>> Is this an error anyone else has noticed? >>> Has anyone else had similar issues with Azure Files when using strict >>> update/sync semantics? >>> >>> Thanks, >>> Raymond. >>> >>> -- >>> <http://www.trimble.com/> >>> Raymond Wilson >>> Trimble Distinguished Engineer, Civil Construction Software (CCS) >>> 11 Birmingham Drive | Christchurch, New Zealand >>> raymond_wil...@trimble.com >>> >>> >>> <https://worksos.trimble.com/?utm_source=Trimble&utm_medium=emailsign&utm_campaign=Launch> >>> >> > > -- > <http://www.trimble.com/> > Raymond Wilson > Trimble Distinguished Engineer, Civil Construction Software (CCS) > 11 Birmingham Drive | Christchurch, New Zealand > raymond_wil...@trimble.com > > > <https://worksos.trimble.com/?utm_source=Trimble&utm_medium=emailsign&utm_campaign=Launch> > -- <http://www.trimble.com/> Raymond Wilson Trimble Distinguished Engineer, Civil Construction Software (CCS) 11 Birmingham Drive | Christchurch, New Zealand raymond_wil...@trimble.com <https://worksos.trimble.com/?utm_source=Trimble&utm_medium=emailsign&utm_campaign=Launch>