[
https://issues.apache.org/jira/browse/BEAM-10808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kenneth Knowles updated BEAM-10808:
-----------------------------------
Status: Open (was: Triage Needed)
> StreamingDataflowWorker streaming rpcs due not always observe stream
> failures, until timeout
> --------------------------------------------------------------------------------------------
>
> Key: BEAM-10808
> URL: https://issues.apache.org/jira/browse/BEAM-10808
> Project: Beam
> Issue Type: Bug
> Components: runner-dataflow
> Reporter: Sam Whittle
> Assignee: Sam Whittle
> Priority: P2
>
> The StreamingDataflow worker communicates to Streaming Engine with grpc
> streaming rpcs. Occasionaly with service failures, the error is not observed
> by the streaming rpc until the deadline is exceeded. As the deadline is
> multiple minutes, this adds unnecessary latency to pipelines that are
> otherwise keeping up with seconds delay.
> Testing has shown that grpc channel health checks are not sufficient but that
> application level health checks ensure that streaming rpc health issues are
> detected earlier than the stream timeout.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)