Hi, In our office, we have client nodes started on windows and Linux machines. we see that client nodes on linux always happen to connect to Ignite cluster properly However, once in a while *ALL* windows machines fail to connect
Below is the log on of the windows machines. Any clue ? [15:05:34,795][INFO][main][IgniteKernal] Config URL: n/a [15:05:34,813][INFO][main][IgniteKernal] IgniteConfiguration [igniteInstanceName=null, pubPoolSize=8, svcPoolSize=7, callbackPoolSize=8, stripedPoolSize=8, sysPoolSize=8, mgmtPoolSize=4, igfsPoolSize=8, dataStreamerPoolSize=8, utilityCachePoolSize=8, utilityCacheKeepAliveTime=60000, p2pPoolSize=2, qryPoolSize=8, igniteHome=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin, igniteWorkDir=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin\work, mbeanSrv=com.sun.jmx.mbeanserver.JmxMBeanServer@769e7ee8, nodeId=7d9f28d2-69db-49b5-9c20-219523108114, marsh=BinaryMarshaller [], marshLocJobs=false, daemon=false, p2pEnabled=true, netTimeout=10000, sndRetryDelay=1000, sndRetryCnt=3, metricsHistSize=10000, metricsUpdateFreq=2000, metricsExpTime=9223372036854775807, discoSpi=TcpDiscoverySpi [addrRslvr=null, sockTimeout=50000, ackTimeout=0, marsh=null, reconCnt=10, reconDelay=2000, maxAckTimeout=600000, forceSrvMode=false, clientReconnectDisabled=false, internalLsnr=null], segPlc=STOP, segResolveAttempts=2, waitForSegOnStart=true, allResolversPassReq=true, segChkFreq=10000, commSpi=TcpCommunicationSpi [connectGate=null, connPlc=org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi$FirstConnectionPolicy@1fa121e2, enableForcibleNodeKill=false, enableTroubleshootingLog=false, locAddr=192.168.1.156, locHost=null, locPort=47100, locPortRange=100, shmemPort=-1, directBuf=true, directSndBuf=false, idleConnTimeout=600000, connTimeout=5000, maxConnTimeout=600000, reconCnt=10, sockSndBuf=32768, sockRcvBuf=32768, msgQueueLimit=0, slowClientQueueLimit=0, nioSrvr=null, shmemSrv=null, usePairedConnections=false, connectionsPerNode=1, tcpNoDelay=true, filterReachableAddresses=false, ackSndThreshold=32, unackedMsgsBufSize=0, sockWriteTimeout=40000, boundTcpPort=-1, boundTcpShmemPort=-1, selectorsCnt=1, selectorSpins=0, addrRslvr=null, ctxInitLatch=java.util.concurrent.CountDownLatch@7eac9008[Count = 1], stopping=false], evtSpi=org.apache.ignite.spi.eventstorage.NoopEventStorageSpi@4116aac9, colSpi=NoopCollisionSpi [], deploySpi=LocalDeploymentSpi [], indexingSpi=org.apache.ignite.spi.indexing.noop.NoopIndexingSpi@15de0b3c, addrRslvr=null, encryptionSpi=org.apache.ignite.spi.encryption.noop.NoopEncryptionSpi@489115ef, clientMode=true, rebalanceThreadPoolSize=1, txCfg=TransactionConfiguration [txSerEnabled=false, dfltIsolation=REPEATABLE_READ, dfltConcurrency=PESSIMISTIC, dfltTxTimeout=0, txTimeoutOnPartitionMapExchange=0, pessimisticTxLogSize=0, pessimisticTxLogLinger=10000, tmLookupClsName=null, txManagerFactory=null, useJtaSync=false], cacheSanityCheckEnabled=true, discoStartupDelay=60000, deployMode=SHARED, p2pMissedCacheSize=100, locHost=null, timeSrvPortBase=31100, timeSrvPortRange=100, failureDetectionTimeout=10000, sysWorkerBlockedTimeout=3600000, clientFailureDetectionTimeout=30000, metricsLogFreq=60000, hadoopCfg=null, connectorCfg=ConnectorConfiguration [jettyPath=null, host=null, port=11211, noDelay=true, directBuf=false, sndBufSize=32768, rcvBufSize=32768, idleQryCurTimeout=600000, idleQryCurCheckFreq=60000, sndQueueLimit=0, selectorCnt=4, idleTimeout=7000, sslEnabled=false, sslClientAuth=false, sslCtxFactory=null, sslFactory=null, portRange=100, threadPoolSize=8, msgInterceptor=null], odbcCfg=null, warmupClos=null, atomicCfg=AtomicConfiguration [seqReserveSize=1000, cacheMode=PARTITIONED, backups=1, aff=null, grpName=null], classLdr=null, sslCtxFactory=null, platformCfg=null, binaryCfg=null, memCfg=null, pstCfg=null, dsCfg=null, activeOnStart=true, autoActivation=true, longQryWarnTimeout=3000, sqlConnCfg=null, cliConnCfg=ClientConnectorConfiguration [host=null, port=10800, portRange=100, sockSndBufSize=0, sockRcvBufSize=0, tcpNoDelay=true, maxOpenCursorsPerConn=128, threadPoolSize=8, idleTimeout=0, jdbcEnabled=true, odbcEnabled=true, thinCliEnabled=true, sslEnabled=false, useIgniteSslCtxFactory=true, sslClientAuth=false, sslCtxFactory=null], mvccVacuumThreadCnt=2, mvccVacuumFreq=5000, authEnabled=false, failureHnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0, super=AbstractFailureHandler [ignoredFailureTypes=[]]], commFailureRslvr=null] [15:05:34,819][INFO][main][IgniteKernal] Daemon mode: off [15:05:34,819][INFO][main][IgniteKernal] OS: Windows 10 10.0 amd64 [15:05:34,819][INFO][main][IgniteKernal] OS user: mahes [15:05:34,826][INFO][main][IgniteKernal] PID: 10608 [15:05:34,826][INFO][main][IgniteKernal] Language runtime: Java Platform API Specification ver. 1.8 [15:05:34,826][INFO][main][IgniteKernal] VM information: Java(TM) SE Runtime Environment 1.8.0_191-b12 Oracle Corporation Java HotSpot(TM) 64-Bit Server VM 25.191-b12 [15:05:34,827][INFO][main][IgniteKernal] VM total memory: 3.5GB [15:05:34,827][INFO][main][IgniteKernal] Remote Management [restart: off, REST: off, JMX (remote: off)] [15:05:34,828][INFO][main][IgniteKernal] Logger: JavaLogger [quiet=false, config=null] [15:05:34,828][INFO][main][IgniteKernal] IGNITE_HOME=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin [15:05:34,828][INFO][main][IgniteKernal] VM arguments: [-DIGNITE_QUIET=false, -Djava.net.preferIPv4Stack=true, -Xms1g, -Xms2g, -DIGNITE_HOME=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin, -javaagent:C:\Users\mahes\AppData\Roaming\JetBrains\IntelliJ IDEA 2018.2.5\lib\idea_rt.jar=61613:C:\Users\mahes\AppData\Roaming\JetBrains\IntelliJ IDEA 2018.2.5\bin, -Dfile.encoding=UTF-8] [15:05:34,840][INFO][main][IgniteKernal] Configured caches [in 'sysMemPlc' dataRegion: ['ignite-sys-cache']] [15:05:34,840][WARNING][main][IgniteKernal] Peer class loading is enabled (disable it in production for performance and deployment consistency reasons) [15:05:34,843][WARNING][pub-#19][GridDiagnostic] This operating system has been tested less rigorously: Windows 10 10.0 amd64. Our team will appreciate the feedback if you experience any problems running ignite in this environment. [15:05:34,845][INFO][main][IgniteKernal] 3-rd party licenses can be found at: C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin\libs\licenses [15:05:34,943][INFO][main][IgnitePluginProcessor] Configured plugins: [15:05:34,943][INFO][main][IgnitePluginProcessor] ^-- None [15:05:34,943][INFO][main][IgnitePluginProcessor] [15:05:34,944][INFO][main][FailureProcessor] Configured failure handler: [hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0, super=AbstractFailureHandler [ignoredFailureTypes=[]]]] [15:05:35,003][INFO][main][TcpCommunicationSpi] Successfully bound communication NIO server to TCP port [port=47100, locHost=/192.168.1.156, selectorsCnt=1, selectorSpins=0, pairedConn=false] [15:05:35,005][WARNING][main][TcpCommunicationSpi] Message queue limit is set to 0 which may lead to potential OOMEs when running cache operations in FULL_ASYNC or PRIMARY_SYNC modes due to message queues growth on sender and receiver sides. [15:05:35,037][WARNING][main][NoopCheckpointSpi] Checkpoints are disabled (to enable configure any GridCheckpointSpi implementation) [15:05:35,087][WARNING][main][GridCollisionManager] Collision resolution is disabled (all jobs will be activated upon arrival). [15:05:35,100][INFO][main][IgniteKernal] Security status [authentication=off, tls/ssl=off] [15:05:35,444][INFO][main][PartitionsEvictManager] Evict partition permits=2 [15:05:35,680][INFO][main][ClientListenerProcessor] Client connector processor has started on TCP port 10800 [15:05:35,700][INFO][main][GridRestProcessor] REST protocols do not start on client node. To start the protocols on client node set '-DIGNITE_REST_START_ON_CLIENT=true' system property. [15:05:35,857][INFO][main][IgniteKernal] Non-loopback local IPs: 192.168.1.156 [15:05:35,857][INFO][main][IgniteKernal] Enabled local MACs: 54BF64129A29, 7440BB948987, 7440BB948988, 7640BB948987, 8640BB948987 [15:05:35,865][WARNING][main][TcpDiscoverySpi] Failure detection timeout will be ignored (one of SPI parameters has been set explicitly) [15:05:51,478][INFO][grid-nio-worker-tcp-comm-0-#24][TcpCommunicationSpi] Accepted incoming communication connection [locAddr=/192.168.1.156:47100, rmtAddr=/192.168.1.6:56514] [15:06:45,126][SEVERE][tcp-client-disco-msg-worker-#4][] Critical system error detected. Will be handled accordingly to configured handler [hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0, super=AbstractFailureHandler [ignoredFailureTypes=[]]], failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, err=class o.a.i.IgniteException: GridWorker [name=tcp-client-disco-msg-worker, igniteInstanceName=null, finished=true, heartbeatTs=1560505005116]]] class org.apache.ignite.IgniteException: GridWorker [name=tcp-client-disco-msg-worker, igniteInstanceName=null, finished=true, heartbeatTs=1560505005116] at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1831) -- Sent from: http://apache-ignite-users.70518.x6.nabble.com/