This is your daily summary of Beam's current high priority issues that may need
attention.
See https://beam.apache.org/contribute/issue-priorities for the meaning and
expectations around issue priorities.
Unassigned P0 Issues:
https://github.com/apache/beam/issues/23794 [Bug]: Storage Write API client
hanging forever on shutdown
https://github.com/apache/beam/issues/23747 [Bug]: After JDBCIO read
withRowOutput(), the VARCHAR/TEXT -> LOGICAL_TYPE and not compatible with
SqlTypeName
Unassigned P1 Issues:
https://github.com/apache/beam/issues/23745 [Bug]: Samza
AsyncDoFnRunnerTest.testSimplePipeline is flaky
https://github.com/apache/beam/issues/23709 [Flake]: Spark batch flakes in
ParDoLifecycleTest.testTeardownCalledAfterExceptionInProcessElement and
ParDoLifecycleTest.testTeardownCalledAfterExceptionInStartBundle
https://github.com/apache/beam/issues/23693 [Bug]: apache_beam.io.kinesis
module READ_DATA_URN mismatch
https://github.com/apache/beam/issues/22969 Discrepancy in behavior of
`DoFn.process()` when `yield` is combined with `return` statement, or vice versa
https://github.com/apache/beam/issues/22321
PortableRunnerTestWithExternalEnv.test_pardo_large_input is regularly failing
on jenkins
https://github.com/apache/beam/issues/21713 404s in BigQueryIO don't get output
to Failed Inserts PCollection
https://github.com/apache/beam/issues/21561
ExternalPythonTransformTest.trivialPythonTransform flaky
https://github.com/apache/beam/issues/21469 beam_PostCommit_XVR_Flink flaky:
Connection refused
https://github.com/apache/beam/issues/21462 Flake in
org.apache.beam.sdk.io.mqtt.MqttIOTest.testReadObject: Address already in use
https://github.com/apache/beam/issues/21261
org.apache.beam.runners.dataflow.worker.fn.logging.BeamFnLoggingServiceTest.testMultipleClientsFailingIsHandledGracefullyByServer
is flaky
https://github.com/apache/beam/issues/21260 Python DirectRunner does not emit
data at GC time
https://github.com/apache/beam/issues/21123 Multiple jobs running on Flink
session cluster reuse the persistent Python environment.
https://github.com/apache/beam/issues/21113
testTwoTimersSettingEachOtherWithCreateAsInputBounded flaky
https://github.com/apache/beam/issues/20976
apache_beam.runners.portability.flink_runner_test.FlinkRunnerTestOptimized.test_flink_metrics
is flaky
https://github.com/apache/beam/issues/20975
org.apache.beam.runners.flink.ReadSourcePortableTest.testExecution[streaming:
false] is flaky
https://github.com/apache/beam/issues/20974 Python GHA PreCommits flake with
grpc.FutureTimeoutError on SDK harness startup
https://github.com/apache/beam/issues/20689 Kafka commitOffsetsInFinalize OOM
on Flink
https://github.com/apache/beam/issues/20108 Python direct runner doesn't emit
empty pane when it should
https://github.com/apache/beam/issues/19814 Flink streaming flakes in
ParDoLifecycleTest.testTeardownCalledAfterExceptionInStartBundleStateful and
ParDoLifecycleTest.testTeardownCalledAfterExceptionInProcessElementStateful
https://github.com/apache/beam/issues/19734
WatchTest.testMultiplePollsWithManyResults flake: Outputs must be in timestamp
order (sickbayed)
https://github.com/apache/beam/issues/19241 Python Dataflow integration tests
should export the pipeline Job ID and console output to Jenkins Test Result
section
P1 Issues with no update in the last week:
https://github.com/apache/beam/issues/23489 [Bug]: add DebeziumIO to the
connectors page
https://github.com/apache/beam/issues/22891 [Bug]:
beam_PostCommit_XVR_PythonUsingJavaDataflow is flaky
https://github.com/apache/beam/issues/22605 [Bug]: Beam Python failure for
dataflow_exercise_metrics_pipeline_test.ExerciseMetricsPipelineTest.test_metrics_it
https://github.com/apache/beam/issues/22011 [Bug]:
org.apache.beam.sdk.io.aws2.kinesis.KinesisIOWriteTest.testWriteFailure flaky
https://github.com/apache/beam/issues/21893 [Bug]: BigQuery Storage Write API
implementation does not support table partitioning
https://github.com/apache/beam/issues/21711 Python Streaming job failing to
drain with BigQueryIO write errors
https://github.com/apache/beam/issues/21709
beam_PostCommit_Java_ValidatesRunner_Samza Failing
https://github.com/apache/beam/issues/21708 beam_PostCommit_Java_DataflowV2,
testBigQueryStorageWrite30MProto failing consistently
https://github.com/apache/beam/issues/21707 GroupByKeyTest BasicTests
testLargeKeys100MB flake (on ULR)
https://github.com/apache/beam/issues/21700
--dataflowServiceOptions=use_runner_v2 is broken
https://github.com/apache/beam/issues/21695 DataflowPipelineResult does not
raise exception for unsuccessful states.
https://github.com/apache/beam/issues/21476 WriteToBigQuery Dynamic table
destinations returns wrong tableId
https://github.com/apache/beam/issues/21474 Flaky tests: Gradle build daemon
disappeared unexpectedly
https://github.com/apache/beam/issues/20814 JmsIO is not acknowledging messages
correctly
https://github.com/apache/beam/issues/20812 Cross-language consistency
(RequiresStableInputs) is quietly broken (at least on portable flink runner)