Robert Joseph Evans created STORM-2610:
------------------------------------------

             Summary: Spout throttelign metrics are unusable
                 Key: STORM-2610
                 URL: https://issues.apache.org/jira/browse/STORM-2610
             Project: Apache Storm
          Issue Type: New Feature
          Components: storm-client, storm-core
    Affects Versions: 1.0.0, 2.0.0
            Reporter: Robert Joseph Evans
            Assignee: Robert Joseph Evans


When helping someone debug an issue with backpressure I realized that the 
metrics we are collecting in the spout are mistakenly being multiplied by the 
rate, even though we are not sub-sampling them.  This results in the values 
being, by default, 20 times higher then they should be.  Thinking about how I 
would use the metrics to debug an issue also showed that some of them.  
skipped-max-spout and skipped-throttle correspond to about 1 ms of sleep, but 
skipped-inactive corresponds to about 100 ms of sleep.  And the 1 ms sleep is 
configurable so it could be different from one topology to another, and even 
the code around it is pluggable, so it could be doing anything from not 
sleeping to sleeping a random amount of time.

I think we just need to scrap what we have been doing and record how long we 
sleep for and use that as the metric instead.

These metrics also don't appear to be documented anywhere so I am going to 
change what they mean and document them to actually be useful, and correct.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to