Tracked as https://issues.apache.org/jira/browse/BEAM-7058


On Wed, Apr 10, 2019 at 11:38 AM Pablo Estrada <pabl...@google.com> wrote:

> This sounds like a bug then? +Alex Amato <ajam...@google.com>
>
> On Wed, Apr 10, 2019 at 3:59 AM Maximilian Michels <m...@apache.org> wrote:
>
>> Hi @all,
>>
>>  From a quick debugging session, I conclude that the wiring is in place
>> for the Flink Runner. There is a ProgressReporter that reports
>> MonitoringInfos to Flink, in a similar fashion as the "legacy" Runner.
>>
>> The bundle duration metrics are 0, but the element count gets reported
>> correctly. It appears to be an issue of the Python/Java harness because
>> "ProcessBundleProgressResponse" contains only 0 values for the bundle
>> duration.
>>
>> Thanks,
>> Max
>>
>> On 04.04.19 19:54, Mikhail Gryzykhin wrote:
>> > Hi everyone,
>> >
>> > Quick summary on python and Dataflow Runner:
>> > Python SDK already reports:
>> > - MSec
>> > - User metrics (int64 and distribution)
>> > - PCollection Element Count
>> > - Work on MeanByteCount for pcollection is ongoing here
>> > <https://github.com/apache/beam/pull/8062>.
>> >
>> > Dataflow Runner:
>> > - all metrics listed above are passed through to Dataflow.
>> >
>> > Ryan can give more information on Flink Runner. I also see Maximilian
>> on
>> > some of relevant PRs, so he might comment on this as well.
>> >
>> > Regards,
>> > Mikhail.
>> >
>> >
>> > On Thu, Apr 4, 2019 at 10:43 AM Pablo Estrada <pabl...@google.com
>> > <mailto:pabl...@google.com>> wrote:
>> >
>> >     Hello guys!
>> >     Alex, Mikhail and Ryan are working on support for metrics in the
>> >     portability framework. The support on the SDK is pretty advanced
>> >     AFAIK*, and the next step is to get the metrics back into the
>> >     runner. Lukazs and myself are working on a project that depends on
>> >     this too, so I'm adding everyone so we can get an idea of what's
>> >     missing.
>> >
>> >     I believe:
>> >     - User metrics are fully wired up in the SDK
>> >     - State sampler (timing) metrics are wired up as well (is that
>> >     right, +Alex Amato <mailto:ajam...@google.com>?)
>> >     - Work is ongoing to send the updates back to Flink.
>> >     - What is the plan for making metrics queriable from Flink? +Ryan
>> >     Williams <mailto:r...@runsascoded.com>
>> >
>> >     Thanks!
>> >     -P.
>> >
>> >
>> >
>> >     On Wed, Apr 3, 2019 at 12:02 PM Thomas Weise <t...@apache.org
>> >     <mailto:t...@apache.org>> wrote:
>> >
>> >         I believe this is where the metrics are supplied:
>> >
>> https://github.com/apache/beam/blob/master/sdks/python/apache_beam/runners/worker/operations.py
>> >
>> >         git grep process_bundle_msecs   yields results for dataflow
>> >         worker only
>> >
>> >         There isn't any test coverage for the Flink runner:
>> >
>> >
>> https://github.com/apache/beam/blob/d38645ae8758d834c3e819b715a66dd82c78f6d4/sdks/python/apache_beam/runners/portability/flink_runner_test.py#L181
>> >
>> >
>> >
>> >         On Wed, Apr 3, 2019 at 10:45 AM Akshay Balwally
>> >         <abalwa...@lyft.com <mailto:abalwa...@lyft.com>> wrote:
>> >
>> >             Should have added- I'm using Python sdk, Flink runner
>> >
>> >             On Wed, Apr 3, 2019 at 10:32 AM Akshay Balwally
>> >             <abalwa...@lyft.com <mailto:abalwa...@lyft.com>> wrote:
>> >
>> >                 Hi,
>> >                 I'm hoping to get metrics on the amount of time spent on
>> >                 each operator, so it seams like the stat
>> >
>> >
>>  
>> {organization_specific_prefix}.operator.beam-metric-pardo_execution_time-process_bundle_msecs-v1.gauge.mean
>> >
>> >                 would be pretty helpful. But in practice, this stat
>> >                 always shows 0, which I interpret as 0 milliseconds
>> >                 spent per bundle, which can't be correct (other stats
>> >                 show that the operators are running, and timers within
>> >                 the operators show more reasonable times). Is this a
>> >                 known bug?
>> >
>> >
>> >                 --
>> >                 *Akshay Balwally*
>> >                 Software Engineer
>> >                 937.271.6469 <tel:+19372716469>
>> >                 Lyft <http://www.lyft.com/>
>> >
>> >
>> >
>> >             --
>> >             *Akshay Balwally*
>> >             Software Engineer
>> >             937.271.6469 <tel:+19372716469>
>> >             Lyft <http://www.lyft.com/>
>> >
>>
>

Reply via email to