[jira] [Work logged] (BEAM-7528) Save correctly Python Load Tests metrics according to it's namespace

ASF GitHub Bot (JIRA) Wed, 31 Jul 2019 05:49:38 -0700


     [ 
https://issues.apache.org/jira/browse/BEAM-7528?focusedWorklogId=285729&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-285729
 ]


ASF GitHub Bot logged work on BEAM-7528:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 31/Jul/19 12:48
            Start Date: 31/Jul/19 12:48
    Worklog Time Spent: 10m 
      Work Description: tvalentyn commented on pull request #8941: [BEAM-7528] 
Save load test metrics according to distribution name
URL: https://github.com/apache/beam/pull/8941#discussion_r309199134
 
 

 ##########
 File path: 
sdks/python/apache_beam/testing/load_tests/load_test_metrics_utils.py
 ##########
 @@ -138,8 +143,25 @@ def as_dict(self):
 class CounterMetric(Metric):
   def __init__(self, counter_dict, submit_timestamp, metric_id):
     super(CounterMetric, self).__init__(submit_timestamp, metric_id)
-    self.value = counter_dict.committed
     self.label = str(counter_dict.key.metric.name)
+    self.value = counter_dict.committed
+
+
+class DistributionMetrics(Metric):
 
 Review comment:
   I see, it looks like saving the median would require changes to Beam. 
Regarding unknown distributions, I see the following options:
   1) Save sum, count, min, max, collected across all distributions associated 
with a metric with the same name.  In other words, total sum, total count, min 
of minimums and max of maximums. 
   2) Save `(sum, count, min, max)` for each distribution without aggregation 
(not sure if useful, but since we are exporting to BigQuery, we can run a query 
later to aggregate whatever we need).
   3) Register aggregators with an instance of MetricsReader. Registration will 
take the name of the metric and an aggregator instance that receives a list of 
all distribution readings and computes the desired output to store in BigQuery. 
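
   A rough sketch of how options 1) and 2) differ, not taken from the PR. 
`DistributionReading`, `aggregate_by_name` and `rows_without_aggregation` are 
hypothetical names used only for illustration; the fields simply mirror the 
`(sum, count, min, max)` tuple mentioned above.

```python
from collections import defaultdict
from dataclasses import asdict, dataclass


@dataclass
class DistributionReading:
  """Hypothetical stand-in for one distribution reading (not a Beam class)."""
  name: str
  sum: int
  count: int
  min: int
  max: int


def aggregate_by_name(readings):
  """Option 1: total sum, total count, min of minimums, max of maximums."""
  grouped = defaultdict(list)
  for reading in readings:
    grouped[reading.name].append(reading)
  return {
      name: {
          'sum': sum(r.sum for r in group),
          'count': sum(r.count for r in group),
          'min': min(r.min for r in group),
          'max': max(r.max for r in group),
      }
      for name, group in grouped.items()
  }


def rows_without_aggregation(readings):
  """Option 2: one row per reading; aggregation is left to a later query."""
  return [asdict(r) for r in readings]


readings = [
    DistributionReading('runtime', sum=30, count=3, min=5, max=15),
    DistributionReading('runtime', sum=12, count=2, min=4, max=8),
    DistributionReading('latency', sum=7, count=1, min=7, max=7),
]
totals = aggregate_by_name(readings)
rows = rows_without_aggregation(readings)
```

   Option 1) yields one row per metric name, while option 2) keeps every 
reading and defers any grouping to the query side.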
   
   I think 2) is the most flexible, but perhaps 1) is sufficient and will cover 
most of the use cases. Feel free to ask @robertwb or @pabloem for an opinion. 
3) seems like overengineering at this point. 
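
   For comparison, option 3) could look roughly like the sketch below. This is 
only an illustration under assumptions: `AggregatingReader`, 
`register_aggregator` and `mean_of_all` are hypothetical names, not the 
existing MetricsReader API, and readings are modelled as plain dicts.

```python
from collections import defaultdict


class AggregatingReader(object):
  """Hypothetical reader with per-metric aggregators (not Beam's MetricsReader)."""

  def __init__(self):
    self._aggregators = {}

  def register_aggregator(self, metric_name, aggregator):
    # aggregator: callable that takes a list of readings for one metric name
    # and returns the value to store.
    self._aggregators[metric_name] = aggregator

  def aggregate(self, readings):
    grouped = defaultdict(list)
    for reading in readings:
      grouped[reading['name']].append(reading)
    # Only metrics with a registered aggregator produce output.
    return {
        name: self._aggregators[name](group)
        for name, group in grouped.items()
        if name in self._aggregators
    }


def mean_of_all(readings):
  """Example aggregator: overall mean across all readings of one metric."""
  total = sum(r['sum'] for r in readings)
  count = sum(r['count'] for r in readings)
  return total / float(count)


reader = AggregatingReader()
reader.register_aggregator('runtime', mean_of_all)
result = reader.aggregate([
    {'name': 'runtime', 'sum': 30, 'count': 3},
    {'name': 'runtime', 'sum': 12, 'count': 2},
    {'name': 'other', 'sum': 1, 'count': 1},
])
```

   The extra registration step is the cost of this option: every metric needs 
an aggregator before anything is stored for it.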
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 285729)

> Save correctly Python Load Tests metrics according to it's namespace
> --------------------------------------------------------------------
>
>                 Key: BEAM-7528
>                 URL: https://issues.apache.org/jira/browse/BEAM-7528
>             Project: Beam
>          Issue Type: Bug
>          Components: testing
>            Reporter: Kasia Kucharczyk
>            Assignee: Kasia Kucharczyk
>            Priority: Major
>          Time Spent: 6h 10m
>  Remaining Estimate: 0h
>
> The load test framework treats every distribution metric defined in a 
> pipeline as a `runtime` metric (which is defined by the load test framework), 
> while only the `runtime` distribution metric should be considered as runtime.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
