Robert Bradshaw created BEAM-4030:
-------------------------------------

             Summary: Add CombineFn.compact, similar to Java
                 Key: BEAM-4030
                 URL: https://issues.apache.org/jira/browse/BEAM-4030
             Project: Beam
          Issue Type: Bug
          Components: sdk-py-core
            Reporter: Robert Bradshaw
            Assignee: Ahmet Altay


Some CombineFns buffer elements in their add_inputs because a combining 
operation cost can be effectively amortized across many elements. However, this 
introduces the extra (possibly higher) cost of potentially serializing more 
expensive buffers through shuffle. We should add a CombineFn.compact(self, 
accumulator) method (defaulting to the identity) similar to what the Java SDK 
provides which is called when flushing an element from the PGBKCV table. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to