Correction of the problem interpretation, it's not discovery that
fails now but the (system?) cache bootstrapping. How to use the
AddressResolver then?

Thanks

On Wed, Oct 5, 2016 at 2:27 AM, yazuna <yaz...@gmail.com> wrote:
> On the 1.8 problem stays the same (I switched off by mistake 48500
> before - I switched it on again now): node connects, is listed as part
> of the cluster, discovery sends list of nodes with internal addresses
> and they are not translated on the local node, so even the node is
> listed as part of the cluster, exceptions appear that the client
> cannot connect to any of the nodes from the send list...
>
> Now it fails on
>
>    ctx.io().send(node, new
> GridDhtAffinityAssignmentRequest(key.get1(), key.get2()),
> at GridDhtAssignmentFetchFuture:185
> trying to connect to port 48500 of the first node from the
> availableNodes list. Why is it connecting to discovery port, even it
> had fetched the cluster info already?
>
> <log>
>
> [02:25:57,195][INFO ][main][IgniteKernal] Non-loopback local IPs:
> 192.168.0.102, fe80:0:0:0:6e40:8ff:fe91:c0a4%en0,
> fe80:0:0:0:9485:8fff:fec1:9de8%awdl0
>
> [02:25:57,195][INFO ][main][IgniteKernal] Enabled local MACs:
> 6C400891C0A4, 96858FC19DE8
>
> [02:25:57,213][INFO ][main][IgnitePluginProcessor] Configured plugins:
>
> [02:25:57,213][INFO ][main][IgnitePluginProcessor]   ^-- None
>
> [02:25:57,213][INFO ][main][IgnitePluginProcessor]
>
> [02:25:59,804][INFO ][main][TcpCommunicationSpi] Successfully bound to
> TCP port [port=47099, locHost=localhost/127.0.0.1]
>
> [02:26:00,451][WARN ][main][NoopCheckpointSpi] Checkpoints are
> disabled (to enable configure any GridCheckpointSpi implementation)
>
> [02:26:00,486][WARN ][main][GridCollisionManager] Collision resolution
> is disabled (all jobs will be activated upon arrival).
>
> [02:26:00,489][WARN ][main][NoopSwapSpaceSpi] Swap space is disabled.
> To enable use FileSwapSpaceSpi.
>
> [02:26:00,491][INFO ][main][IgniteKernal] Security status
> [authentication=off, tls/ssl=off]
>
> [02:26:00,776][INFO ][main][GridTcpRestProtocol] Command protocol
> successfully started [name=TCP binary, host=0.0.0.0/0.0.0.0,
> port=11211]
>
> [02:26:06,180][INFO ][main][GridCacheProcessor] Started cache
> [name=ignite-sys-cache, mode=REPLICATED]
>
> [02:26:06,194][INFO ][main][GridCacheProcessor] Started cache
> [name=ignite-atomics-sys-cache, mode=PARTITIONED]
>
> [02:26:06,214][INFO ][main][GridCacheProcessor] Started cache
> [name=ignite-marshaller-sys-cache, mode=REPLICATED]
>
> [02:26:22,657][WARN ][exchange-worker-#46%null%][TcpCommunicationSpi]
> Connect timed out (consider increasing 'failureDetectionTimeout'
> configuration property) [addr=/192.168.168.8:47100,
> failureDetectionTimeout=15000]
>
> [02:26:22,660][WARN ][exchange-worker-#46%null%][TcpCommunicationSpi]
> Connect timed out (consider increasing 'failureDetectionTimeout'
> configuration property) [addr=/127.0.0.1:47100,
> failureDetectionTimeout=15000]
>
> [02:26:22,661][WARN ][exchange-worker-#46%null%][TcpCommunicationSpi]
> Failed to connect to a remote node (make sure that destination node is
> alive and operating system firewall is disabled on local and remote
> hosts) [addrs=[/192.168.168.8:47100, /127.0.0.1:47100,
> 0:0:0:0:0:0:0:1%lo:47100]]
>
> [02:26:36,223][WARN ][main][GridCachePartitionExchangeManager] Failed
> to wait for initial partition map exchange. Possible reasons are:
>
>   ^-- Transactions in deadlock.
>
>   ^-- Long running transactions (ignore if this is the case).
>
>   ^-- Unreleased explicit locks.
>
> [02:26:54,741][ERROR][exchange-worker-#46%null%][GridDhtAssignmentFetchFuture]
> Failed to request affinity assignment from remote node (will continue
> to another node): TcpDiscoveryNode
> [id=337d89f0-c14a-4b5d-8d33-b995331dcf63, addrs=[0:0:0:0:0:0:0:1%lo,
> 127.0.0.1, 192.168.168.8], sockAddrs=[0:0:0:0:0
>
> :0:0:1%lo:48500, /127.0.0.1:48500, /192.168.168.8:48500],
> discPort=48500, order=218, intOrder=110,
> lastExchangeTime=1475627164474, loc=false,
> ver=1.8.0#20161004-sha1:a370bad1, isClient=false]
>
> class org.apache.ignite.IgniteCheckedException: Failed to send message
> (node may have left the grid or TCP connection cannot be established
> due to firewall issues) [node=TcpDiscoveryNode
> [id=337d89f0-c14a-4b5d-8d33-b995331dcf63, addrs=[0:0:0:0:0:0:0:1%lo,
> 127.0.0.1, 192.168.168.8], sockAddrs=[0:0:0:
>
> 0:0:0:0:1%lo:48500, /127.0.0.1:48500, /192.168.168.8:48500],
> discPort=48500, order=218, intOrder=110,
> lastExchangeTime=1475627164474, loc=false,
> ver=1.8.0#20161004-sha1:a370bad1, isClient=false], topic=TOPIC_CACHE,
> msg=GridDhtAffinityAssignmentRequest [topVer=AffinityTopologyVersion
> [topVer=276, min
>
> orTopVer=0]], policy=4]
>
> at 
> org.apache.ignite.internal.managers.communication.GridIoManager.send(GridIoManager.java:1309)
>
> at 
> org.apache.ignite.internal.managers.communication.GridIoManager.send(GridIoManager.java:1373)
>
> at 
> org.apache.ignite.internal.processors.cache.GridCacheIoManager.send(GridCacheIoManager.java:841)
>
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtAssignmentFetchFuture.requestFromNextNode(GridDhtAssignmentFetchFuture.java:185)
>
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.GridDhtAssignmentFetchFuture.init(GridDhtAssignmentFetchFuture.java:107)
>
> at 
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.fetchAffinityOnJoin(CacheAffinitySharedManager.java:953)
>
> at 
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.onClientEvent(CacheAffinitySharedManager.java:639)
>
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.onClientNodeEvent(GridDhtPartitionsExchangeFuture.java:619)
>
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.init(GridDhtPartitionsExchangeFuture.java:464)
>
> at 
> org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$ExchangeWorker.body(GridCachePartitionExchangeManager.java:1447)
>
> at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:110)
>
> at java.lang.Thread.run(Thread.java:745)
>
> Caused by: class org.apache.ignite.spi.IgniteSpiException: Failed to
> send message to remote node: TcpDiscoveryNode
> [id=337d89f0-c14a-4b5d-8d33-b995331dcf63, addrs=[0:0:0:0:0:0:0:1%lo,
> 127.0.0.1, 192.168.168.8], sockAddrs=[0:0:0:0:0:0:0:1%lo:48500,
> /127.0.0.1:48500, /192.168.168.8:48500], discPort=48
>
> 500, order=218, intOrder=110, lastExchangeTime=1475627164474,
> loc=false, ver=1.8.0#20161004-sha1:a370bad1, isClient=false]
>
> at 
> org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi.sendMessage0(TcpCommunicationSpi.java:2013)
>
> at 
> org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi.sendMessage(TcpCommunicationSpi.java:1951)
>
> at 
> org.apache.ignite.internal.managers.communication.GridIoManager.send(GridIoManager.java:1304)
>
> </log>
>
>
> What am I missing?
>
> In 1.8 I am unable to have any DEBUG messages with the config that
> worked with 1.7, both  log4j and ignite-log4j from 1.8 on the
> classpath.
>
> Cheers
> Krzysztof
>
> On Tue, Oct 4, 2016 at 11:27 PM, vkulichenko [via Apache Ignite Users]
> <ml-node+s70518n8090...@n6.nabble.com> wrote:
>> All nodes should run on the same version.
>>
>> What is exactly different in log4j behavior?
>>
>> -Val
>>
>> ________________________________
>> If you reply to this email, your message will be added to the discussion
>> below:
>> http://apache-ignite-users.70518.x6.nabble.com/Ignite-Cluster-Communication-with-SSH-Tunnels-tp273p8090.html
>> To unsubscribe from Ignite Cluster Communication with SSH Tunnels, click
>> here.
>> NAML




--
View this message in context: 
http://apache-ignite-users.70518.x6.nabble.com/Ignite-Cluster-Communication-with-SSH-Tunnels-tp273p8103.html
Sent from the Apache Ignite Users mailing list archive at Nabble.com.

Reply via email to