[ https://issues.apache.org/jira/browse/CASSANDRA-10580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15070975#comment-15070975 ]
Joshua McKenzie commented on CASSANDRA-10580:
---------------------------------------------

[Reverted incorrect patch commit (oops)|https://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=commit;h=bd5c8bbc04e017089743b27cce55635dac00b98e]
[Commit correct patch|https://git-wip-us.apache.org/repos/asf?p=cassandra.git;a=commit;h=c9ef25fd81501005b6484baf064081efc557f3f4]

[~pauloricardomg]: care to sanity check me?

> Add latency metrics for dropped messages
> ----------------------------------------
>
> Key: CASSANDRA-10580
> URL: https://issues.apache.org/jira/browse/CASSANDRA-10580
> Project: Cassandra
> Issue Type: Improvement
> Components: Coordination, Observability
> Environment: Production
> Reporter: Anubhav Kale
> Assignee: Anubhav Kale
> Priority: Minor
> Fix For: 3.2
>
> Attachments: 0001-Metrics.patch, 10580-Metrics.patch, 10580.patch,
> 2.2-All-Comments.patch, CASSANDRA-10580-Head.patch, Trunk-All-Comments.patch,
> Trunk.patch
>
>
> In our production cluster, we are seeing a large number of dropped mutations.
> At a minimum, we should print the time the thread took to get scheduled
> thereby dropping the mutation (We should also print the Message / Mutation so
> it helps in figuring out which column family was affected). This will help
> find the right tuning parameter for write_timeout_in_ms.
> The change is small and is in StorageProxy.java and MessagingTask.java. I
> will submit a patch shortly.
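
For readers following the ticket, here is a minimal sketch of the idea in the description: measure how long a message waited before its handler was scheduled, and when that wait exceeds the timeout, drop the message and record the wait. This is not the code from the attached patches or the commits linked above. The class DroppedMessageLatency, its methods, and the plain counters are hypothetical names chosen for illustration, and the timeout value simply stands in for write_timeout_in_ms.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical illustration only; not Cassandra's StorageProxy or MessagingTask code.
final class DroppedMessageLatency {
    private final long timeoutMillis;                       // stands in for write_timeout_in_ms
    private final AtomicLong droppedCount = new AtomicLong();
    private final AtomicLong totalDroppedWaitMillis = new AtomicLong();

    DroppedMessageLatency(long timeoutMillis) {
        this.timeoutMillis = timeoutMillis;
    }

    // Returns true if the message is still within the timeout and should be processed.
    // Otherwise records how long it waited to be scheduled and returns false (dropped).
    boolean shouldProcess(String verb, long enqueuedAtNanos, long nowNanos) {
        long waitedMillis = TimeUnit.NANOSECONDS.toMillis(nowNanos - enqueuedAtNanos);
        if (waitedMillis <= timeoutMillis) {
            return true;
        }
        droppedCount.incrementAndGet();
        totalDroppedWaitMillis.addAndGet(waitedMillis);
        // Log the verb and the wait so operators can compare it against the timeout.
        System.out.printf("Dropped %s after waiting %d ms (timeout %d ms)%n",
                          verb, waitedMillis, timeoutMillis);
        return false;
    }

    long droppedCount() {
        return droppedCount.get();
    }

    // Mean scheduling wait across dropped messages, or 0.0 if nothing was dropped.
    double meanDroppedWaitMillis() {
        long dropped = droppedCount.get();
        return dropped == 0 ? 0.0 : (double) totalDroppedWaitMillis.get() / dropped;
    }
}
```

In a real handler the two timestamps would typically come from System.nanoTime() taken at enqueue time and at execution time. Comparing the mean wait of dropped messages with the configured timeout is one way to judge whether the timeout is tuned too tightly or the node is simply overloaded.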