[ 
https://issues.apache.org/jira/browse/IGNITE-21630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17822422#comment-17822422
 ] 

Ignite TC Bot commented on IGNITE-21630:
----------------------------------------

{panel:title=Branch: [pull/11260/head] Base: [master] : No blockers 
found!|borderStyle=dashed|borderColor=#ccc|titleBGColor=#D6F7C1}{panel}
{panel:title=Branch: [pull/11260/head] Base: [master] : No new tests 
found!|borderStyle=dashed|borderColor=#ccc|titleBGColor=#F7D6C1}{panel}
[TeamCity *--> Run :: All* 
Results|https://ci2.ignite.apache.org/viewLog.html?buildId=7765945&buildTypeId=IgniteTests24Java8_RunAll]

> Cluster falls apart on topology change when DNS service is unavailable 
> -----------------------------------------------------------------------
>
>                 Key: IGNITE-21630
>                 URL: https://issues.apache.org/jira/browse/IGNITE-21630
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Aleksey Plekhanov
>            Assignee: Aleksey Plekhanov
>            Priority: Major
>              Labels: ise
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Requests to DNS service performed synchroniously by some critical discovery 
> threads. Timeout for such requests can't be controlled by java code (see 
> [https://bugs.openjdk.org/browse/JDK-6450279]). This leads to segmentation of 
> nodes and falling apart cluster.
> For example, stack of {{tcp-disco-msg-worker}} thread with request to DNS 
> service:
> {noformat}
>     at java.net.InetAddress$2.lookupAllHostAddr(InetAddress.java:929)
>     at java.net.InetAddress.getAddressesFromNameService(InetAddress.java:1330)
>     at java.net.InetAddress.getAllByName0(InetAddress.java:1283)
>     at java.net.InetAddress.getAllByName(InetAddress.java:1199)
>     at java.net.InetAddress.getAllByName(InetAddress.java:1127)
>     at java.net.InetAddress.getByName(InetAddress.java:1077)
>     at java.net.InetSocketAddress.<init>(InetSocketAddress.java:220)
>     at 
> org.apache.ignite.internal.util.IgniteUtils.createResolved(IgniteUtils.java:9829)
>     at 
> org.apache.ignite.internal.util.IgniteUtils.toSocketAddresses(IgniteUtils.java:9792)
>     at 
> org.apache.ignite.internal.util.IgniteUtils.toSocketAddresses(IgniteUtils.java:9770)
>     at 
> org.apache.ignite.spi.discovery.tcp.internal.TcpDiscoveryNode.socketAddresses(TcpDiscoveryNode.java:392)
>     at 
> org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi.getNodeAddresses(TcpDiscoverySpi.java:1267)
>     at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl.interruptPing(ServerImpl.java:985)
>     at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl.access$6800(ServerImpl.java:206)
>     at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processNodeLeftMessage(ServerImpl.java:5433)
>     at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:3221)
>     at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:2894)
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to