[DISCUSSION] Runner agnostic metrics extractor?

Etienne Chauchot Mon, 27 Nov 2017 07:56:56 -0800

Hi all,

I came by this ticket https://issues.apache.org/jira/browse/BEAM-2456. Iknow that the metrics subject has already been discussed a lot, but Iwould like to revive the discussion.

The aim in this ticket is to avoid relying on the runner to provide themetrics because they don't have all the same capabilities towardsmetrics. The idea in the ticket is to still use beam metrics API (andnot others like codahale as it has been discussed some time ago) andprovide a way to extract the metrics with a polling thread that would beforked by a PipelineWithMetrics (so, almost invisible to the end user)and then to push to a sink (such as a Http rest sink for example orGraphite sink or anything else...). Nevertheless, a polling thread mightnot work for all the runners because some might not make the metricsavailable before the end of the pipeline. Also, forking a thread wouldbe a bit unconventional, so it could be provided as a beam sdk extension.

Another way, to avoid polling, would be to push metrics values to a sinkwhen they are updated but I don't know if it is feasible in a runnerindependent way.


WDYT about the ideas in this ticket?

Best,
Etienne

[DISCUSSION] Runner agnostic metrics extractor?

Reply via email to