Hi, 

In our office, we have client nodes started on windows and Linux machines. 
we see that client nodes on linux always happen to connect to Ignite cluster
properly
However, once in a while *ALL* windows machines fail to connect 

Below is the log on of the windows machines. Any clue ?


[15:05:34,795][INFO][main][IgniteKernal] Config URL: n/a
[15:05:34,813][INFO][main][IgniteKernal] IgniteConfiguration
[igniteInstanceName=null, pubPoolSize=8, svcPoolSize=7, callbackPoolSize=8,
stripedPoolSize=8, sysPoolSize=8, mgmtPoolSize=4, igfsPoolSize=8,
dataStreamerPoolSize=8, utilityCachePoolSize=8,
utilityCacheKeepAliveTime=60000, p2pPoolSize=2, qryPoolSize=8,
igniteHome=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin,
igniteWorkDir=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin\work,
mbeanSrv=com.sun.jmx.mbeanserver.JmxMBeanServer@769e7ee8,
nodeId=7d9f28d2-69db-49b5-9c20-219523108114, marsh=BinaryMarshaller [],
marshLocJobs=false, daemon=false, p2pEnabled=true, netTimeout=10000,
sndRetryDelay=1000, sndRetryCnt=3, metricsHistSize=10000,
metricsUpdateFreq=2000, metricsExpTime=9223372036854775807,
discoSpi=TcpDiscoverySpi [addrRslvr=null, sockTimeout=50000, ackTimeout=0,
marsh=null, reconCnt=10, reconDelay=2000, maxAckTimeout=600000,
forceSrvMode=false, clientReconnectDisabled=false, internalLsnr=null],
segPlc=STOP, segResolveAttempts=2, waitForSegOnStart=true,
allResolversPassReq=true, segChkFreq=10000, commSpi=TcpCommunicationSpi
[connectGate=null,
connPlc=org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi$FirstConnectionPolicy@1fa121e2,
enableForcibleNodeKill=false, enableTroubleshootingLog=false,
locAddr=192.168.1.156, locHost=null, locPort=47100, locPortRange=100,
shmemPort=-1, directBuf=true, directSndBuf=false, idleConnTimeout=600000,
connTimeout=5000, maxConnTimeout=600000, reconCnt=10, sockSndBuf=32768,
sockRcvBuf=32768, msgQueueLimit=0, slowClientQueueLimit=0, nioSrvr=null,
shmemSrv=null, usePairedConnections=false, connectionsPerNode=1,
tcpNoDelay=true, filterReachableAddresses=false, ackSndThreshold=32,
unackedMsgsBufSize=0, sockWriteTimeout=40000, boundTcpPort=-1,
boundTcpShmemPort=-1, selectorsCnt=1, selectorSpins=0, addrRslvr=null,
ctxInitLatch=java.util.concurrent.CountDownLatch@7eac9008[Count = 1],
stopping=false],
evtSpi=org.apache.ignite.spi.eventstorage.NoopEventStorageSpi@4116aac9,
colSpi=NoopCollisionSpi [], deploySpi=LocalDeploymentSpi [],
indexingSpi=org.apache.ignite.spi.indexing.noop.NoopIndexingSpi@15de0b3c,
addrRslvr=null,
encryptionSpi=org.apache.ignite.spi.encryption.noop.NoopEncryptionSpi@489115ef,
clientMode=true, rebalanceThreadPoolSize=1, txCfg=TransactionConfiguration
[txSerEnabled=false, dfltIsolation=REPEATABLE_READ,
dfltConcurrency=PESSIMISTIC, dfltTxTimeout=0,
txTimeoutOnPartitionMapExchange=0, pessimisticTxLogSize=0,
pessimisticTxLogLinger=10000, tmLookupClsName=null, txManagerFactory=null,
useJtaSync=false], cacheSanityCheckEnabled=true, discoStartupDelay=60000,
deployMode=SHARED, p2pMissedCacheSize=100, locHost=null,
timeSrvPortBase=31100, timeSrvPortRange=100, failureDetectionTimeout=10000,
sysWorkerBlockedTimeout=3600000, clientFailureDetectionTimeout=30000,
metricsLogFreq=60000, hadoopCfg=null, connectorCfg=ConnectorConfiguration
[jettyPath=null, host=null, port=11211, noDelay=true, directBuf=false,
sndBufSize=32768, rcvBufSize=32768, idleQryCurTimeout=600000,
idleQryCurCheckFreq=60000, sndQueueLimit=0, selectorCnt=4, idleTimeout=7000,
sslEnabled=false, sslClientAuth=false, sslCtxFactory=null, sslFactory=null,
portRange=100, threadPoolSize=8, msgInterceptor=null], odbcCfg=null,
warmupClos=null, atomicCfg=AtomicConfiguration [seqReserveSize=1000,
cacheMode=PARTITIONED, backups=1, aff=null, grpName=null], classLdr=null,
sslCtxFactory=null, platformCfg=null, binaryCfg=null, memCfg=null,
pstCfg=null, dsCfg=null, activeOnStart=true, autoActivation=true,
longQryWarnTimeout=3000, sqlConnCfg=null,
cliConnCfg=ClientConnectorConfiguration [host=null, port=10800,
portRange=100, sockSndBufSize=0, sockRcvBufSize=0, tcpNoDelay=true,
maxOpenCursorsPerConn=128, threadPoolSize=8, idleTimeout=0,
jdbcEnabled=true, odbcEnabled=true, thinCliEnabled=true, sslEnabled=false,
useIgniteSslCtxFactory=true, sslClientAuth=false, sslCtxFactory=null],
mvccVacuumThreadCnt=2, mvccVacuumFreq=5000, authEnabled=false,
failureHnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0,
super=AbstractFailureHandler [ignoredFailureTypes=[]]],
commFailureRslvr=null]
[15:05:34,819][INFO][main][IgniteKernal] Daemon mode: off
[15:05:34,819][INFO][main][IgniteKernal] OS: Windows 10 10.0 amd64
[15:05:34,819][INFO][main][IgniteKernal] OS user: mahes
[15:05:34,826][INFO][main][IgniteKernal] PID: 10608
[15:05:34,826][INFO][main][IgniteKernal] Language runtime: Java Platform API
Specification ver. 1.8
[15:05:34,826][INFO][main][IgniteKernal] VM information: Java(TM) SE Runtime
Environment 1.8.0_191-b12 Oracle Corporation Java HotSpot(TM) 64-Bit Server
VM 25.191-b12
[15:05:34,827][INFO][main][IgniteKernal] VM total memory: 3.5GB
[15:05:34,827][INFO][main][IgniteKernal] Remote Management [restart: off,
REST: off, JMX (remote: off)]
[15:05:34,828][INFO][main][IgniteKernal] Logger: JavaLogger [quiet=false,
config=null]
[15:05:34,828][INFO][main][IgniteKernal]
IGNITE_HOME=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin
[15:05:34,828][INFO][main][IgniteKernal] VM arguments:
[-DIGNITE_QUIET=false, -Djava.net.preferIPv4Stack=true, -Xms1g, -Xms2g,
-DIGNITE_HOME=C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin,
-javaagent:C:\Users\mahes\AppData\Roaming\JetBrains\IntelliJ IDEA
2018.2.5\lib\idea_rt.jar=61613:C:\Users\mahes\AppData\Roaming\JetBrains\IntelliJ
IDEA 2018.2.5\bin, -Dfile.encoding=UTF-8]
[15:05:34,840][INFO][main][IgniteKernal] Configured caches [in 'sysMemPlc'
dataRegion: ['ignite-sys-cache']]
[15:05:34,840][WARNING][main][IgniteKernal] Peer class loading is enabled
(disable it in production for performance and deployment consistency
reasons)
[15:05:34,843][WARNING][pub-#19][GridDiagnostic] This operating system has
been tested less rigorously: Windows 10 10.0 amd64. Our team will appreciate
the feedback if you experience any problems running ignite in this
environment.
[15:05:34,845][INFO][main][IgniteKernal] 3-rd party licenses can be found
at:
C:\Users\mahes\Desktop\apache\apache-ignite-2.7.0-bin\apache-ignite-2.7.0-bin\libs\licenses
[15:05:34,943][INFO][main][IgnitePluginProcessor] Configured plugins:
[15:05:34,943][INFO][main][IgnitePluginProcessor]   ^-- None
[15:05:34,943][INFO][main][IgnitePluginProcessor] 
[15:05:34,944][INFO][main][FailureProcessor] Configured failure handler:
[hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0,
super=AbstractFailureHandler [ignoredFailureTypes=[]]]]
[15:05:35,003][INFO][main][TcpCommunicationSpi] Successfully bound
communication NIO server to TCP port [port=47100, locHost=/192.168.1.156,
selectorsCnt=1, selectorSpins=0, pairedConn=false]
[15:05:35,005][WARNING][main][TcpCommunicationSpi] Message queue limit is
set to 0 which may lead to potential OOMEs when running cache operations in
FULL_ASYNC or PRIMARY_SYNC modes due to message queues growth on sender and
receiver sides.
[15:05:35,037][WARNING][main][NoopCheckpointSpi] Checkpoints are disabled
(to enable configure any GridCheckpointSpi implementation)
[15:05:35,087][WARNING][main][GridCollisionManager] Collision resolution is
disabled (all jobs will be activated upon arrival).
[15:05:35,100][INFO][main][IgniteKernal] Security status
[authentication=off, tls/ssl=off]
[15:05:35,444][INFO][main][PartitionsEvictManager] Evict partition permits=2
[15:05:35,680][INFO][main][ClientListenerProcessor] Client connector
processor has started on TCP port 10800
[15:05:35,700][INFO][main][GridRestProcessor] REST protocols do not start on
client node. To start the protocols on client node set
'-DIGNITE_REST_START_ON_CLIENT=true' system property.
[15:05:35,857][INFO][main][IgniteKernal] Non-loopback local IPs:
192.168.1.156
[15:05:35,857][INFO][main][IgniteKernal] Enabled local MACs: 54BF64129A29,
7440BB948987, 7440BB948988, 7640BB948987, 8640BB948987
[15:05:35,865][WARNING][main][TcpDiscoverySpi] Failure detection timeout
will be ignored (one of SPI parameters has been set explicitly)
[15:05:51,478][INFO][grid-nio-worker-tcp-comm-0-#24][TcpCommunicationSpi]
Accepted incoming communication connection [locAddr=/192.168.1.156:47100,
rmtAddr=/192.168.1.6:56514]
[15:06:45,126][SEVERE][tcp-client-disco-msg-worker-#4][] Critical system
error detected. Will be handled accordingly to configured handler
[hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0,
super=AbstractFailureHandler [ignoredFailureTypes=[]]],
failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, err=class
o.a.i.IgniteException: GridWorker [name=tcp-client-disco-msg-worker,
igniteInstanceName=null, finished=true, heartbeatTs=1560505005116]]]
class org.apache.ignite.IgniteException: GridWorker
[name=tcp-client-disco-msg-worker, igniteInstanceName=null, finished=true,
heartbeatTs=1560505005116]
        at
org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1831)



--
Sent from: http://apache-ignite-users.70518.x6.nabble.com/

Reply via email to