Udi Meiri created BEAM-9487:
-------------------------------

             Summary: GBKs on unbounded pcolls with global windows and no 
triggers should fail
                 Key: BEAM-9487
                 URL: https://issues.apache.org/jira/browse/BEAM-9487
             Project: Beam
          Issue Type: Bug
          Components: sdk-py-core
            Reporter: Udi Meiri


This, according to "4.2.2.1 GroupByKey and unbounded PCollections" in 
https://beam.apache.org/documentation/programming-guide/.

bq. If you do apply GroupByKey or CoGroupByKey to a group of unbounded 
PCollections without setting either a non-global windowing strategy, a trigger 
strategy, or both for each collection, Beam generates an IllegalStateException 
error at pipeline construction time.

Example where this doesn't happen in Python SDK: 
https://stackoverflow.com/questions/60623246/merge-pcollection-with-apache-beam

I also believe that this unit test should fail, since test_stream is unbounded, 
uses global window, and has no triggers.
{code}
  def test_global_window_gbk_fail(self):
    with TestPipeline() as p:
      test_stream = TestStream()
      _ = p | test_stream | GroupByKey()
{code}




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to