Yakov Zhdanov created IGNITE-5155: ------------------------------------- Summary: Need to improve stats dump on exchange timeout Key: IGNITE-5155 URL: https://issues.apache.org/jira/browse/IGNITE-5155 Project: Ignite Issue Type: Improvement Reporter: Yakov Zhdanov Assignee: Stanilovsky Evgeny Fix For: 2.1
Currently, on large topologies info dumped on "Failed to wait for partition map exchange" (org/apache/ignite/internal/processors/cache/GridCachePartitionExchangeManager.java:1713) floods the log and we need to reduce information dumped. 1. Reduce output for exchange futures that are already done. Keep event, topology version, servers count, clients count (more?) 2. Do not dump the whole communication stats, but send message to exchange coordinator, ask for its status and for number of messages received and for acked messages from local node. 3. we can think of sending new message from cache node to coordinator that may be a sign of a problem on that node (e.g. unreleased tx locks or still renting partitions) and coordinator may include this info to a status thus every Ignite node may point to a problem node in the logs. -- This message was sent by Atlassian JIRA (v6.3.15#6346)