chadrik commented on a change in pull request #11038: [BEAM-7746] More typing fixes
URL: https://github.com/apache/beam/pull/11038#discussion_r396690657
 
 

 ##########
 File path: sdks/python/apache_beam/transforms/core.py
 ##########
 @@ -1300,12 +1300,13 @@ def to_runner_api_parameter(self, context):
           common_urns.requirements.REQUIRES_STATEFUL_PROCESSING.urn)
     from apache_beam.runners.common import DoFnSignature
     sig = DoFnSignature(self.fn)
-    is_splittable = sig.is_splittable_dofn()
 
 Review comment:
   > Not sure if checking get_restriction_coder() return type instead of is_splittable_dofn() is future proof.
   
   `get_restriction_coder()` calls `is_splittable_dofn()` and returns `None` if 
it's not splittable.  So I interpreted a `None` result from this method to mean 
"is not splittable". 
   
   ```python
     def get_restriction_coder(self):
       # type: () -> Optional[TupleCoder]
   
       """Get coder for a restriction when processing an SDF. """
       if self.is_splittable_dofn():
         return TupleCoder([
             (self.get_restriction_provider().restriction_coder()),
             (self.get_watermark_estimator_provider().estimator_state_coder())
         ])
       else:
         return None
   ```
   
   > I don't understand the change, from a mypy correctness perspective.
   
   Here's the problem:
   
   ```python
       if is_splittable:
         restriction_coder = sig.get_restriction_coder()  # returns Optional[TupleCoder]
         restriction_coder_id = context.coders.get_id(restriction_coder)  # does not accept Optional!
       else:
         restriction_coder_id = None
   ```
   
   With my changes, we naturally drop the optionality before passing the value 
to `context.coders.get_id()`.  We also avoid a redundant call to 
`is_splittable_dofn()`, FWIW.
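
To make the narrowing concrete, here is a minimal, self-contained sketch of the pattern described above. `FakeCoder`, `FakeCoders`, and `FakeSignature` are hypothetical stand-ins for illustration only, not the real Beam classes; the point is only that an `is not None` check lets mypy treat the value as non-optional inside the branch.

```python
from typing import Optional


class FakeCoder:
    """Hypothetical stand-in for a coder such as TupleCoder."""


class FakeCoders:
    """Hypothetical stand-in for context.coders; get_id() takes a non-optional coder."""

    def get_id(self, coder: FakeCoder) -> str:
        return "fake_coder_id"


class FakeSignature:
    """Hypothetical stand-in for DoFnSignature."""

    def __init__(self, splittable: bool) -> None:
        self._splittable = splittable

    def get_restriction_coder(self) -> Optional[FakeCoder]:
        # Mirrors the documented behavior: None means "is not splittable".
        return FakeCoder() if self._splittable else None


def restriction_coder_id(sig: FakeSignature, coders: FakeCoders) -> Optional[str]:
    restriction_coder = sig.get_restriction_coder()
    if restriction_coder is not None:
        # Inside this branch mypy narrows Optional[FakeCoder] to FakeCoder,
        # so the call type-checks without a separate splittable check.
        return coders.get_id(restriction_coder)
    return None
```

With this shape, a single call to `get_restriction_coder()` both answers "is it splittable?" and supplies the coder.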
   
   I see two options:
   
   1) keep my changes and update the documentation of `get_restriction_coder()` to clarify that a `None` result indicates "is not splittable"
   2) revert my changes and add `assert restriction_coder is not None` before the call to `context.coders.get_id()`
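
For comparison, a rough sketch of what the assert-based alternative could look like. The classes here are again hypothetical stand-ins, not Beam's API. Note that narrowing an `Optional` this way requires asserting that the value is not `None`; the assert is what tells mypy the value is safe to pass on.

```python
from typing import Optional


class StubCoder:
    """Hypothetical stand-in for a coder such as TupleCoder."""


class StubCoders:
    """Hypothetical stand-in for context.coders; get_id() takes a non-optional coder."""

    def get_id(self, coder: StubCoder) -> str:
        return "stub_coder_id"


class StubSignature:
    """Hypothetical stand-in for DoFnSignature."""

    def __init__(self, splittable: bool) -> None:
        self._splittable = splittable

    def is_splittable_dofn(self) -> bool:
        return self._splittable

    def get_restriction_coder(self) -> Optional[StubCoder]:
        return StubCoder() if self.is_splittable_dofn() else None


def restriction_coder_id_with_assert(
    sig: StubSignature, coders: StubCoders) -> Optional[str]:
    if sig.is_splittable_dofn():
        restriction_coder = sig.get_restriction_coder()
        # The assert narrows Optional[StubCoder] to StubCoder for mypy and
        # fails loudly if the two methods ever disagree at runtime.
        assert restriction_coder is not None
        return coders.get_id(restriction_coder)
    return None
```

The trade-off in this version is the extra `is_splittable_dofn()` call and a runtime assertion, in exchange for not relying on `None` as the "not splittable" signal.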
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
