[ https://issues.apache.org/jira/browse/KUDU-1525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Grant Henke updated KUDU-1525: ------------------------------ Target Version/s: 1.8.0 (was: 1.5.0) > Create metrics for errors > ------------------------- > > Key: KUDU-1525 > URL: https://issues.apache.org/jira/browse/KUDU-1525 > Project: Kudu > Issue Type: Improvement > Components: supportability > Reporter: Jean-Daniel Cryans > Priority: Major > > There's a class of issue that can be hard to debug, namely when things fail > semi-silently on the client-side. We currently have glog_warning_messages and > glog_error_messages, but it could be good to have more granular metrics. A > few I have in mind: > - rpc errors, basically any "recv error" > - server-level errors, like when it says TOO BUSY. > - any kind of insert rejection, right now we have row key duplicates and > memory pressure, but we're missing things like txn_tracker rejections, "not a > leader". > - raft errors like dropping a follower because we don't have the WALs around > and it's lagging too much. > There's probably more but the above would be a good start. -- This message was sent by Atlassian JIRA (v7.6.3#76005)