Michael Yzvenn Wolanski created BEAM-8145:
---------------------------------------------
Summary: Pubsub message size limit not taking size increase from
base64 encoding into account
Key: BEAM-8145
URL: https://issues.apache.org/jira/browse/BEAM-8145
Project: Beam
Issue Type: Bug
Components: io-java-gcp
Reporter: Michael Yzvenn Wolanski
In the PubSubIO, the default max batch size is set to `10 * 1024 * 1024` bytes.
This however does not take into account the size increase of base64 encoding
the messages after the flush. Base64 encodes each set of three bytes into four
bytes.
Therefore the 'true' size limit placed on the unencoded batch should be
x = ((10 * 1024 * 1024) / 4) * 3 = 7864320 bytes
--
This message was sent by Atlassian Jira
(v8.3.2#803003)