Key encodings for state requests

Maximilian Michels Tue, 05 Nov 2019 13:26:38 -0800

Hi,

I wanted to get your opinion on something that I have been strugglingwith. It is about the coders for state requests in portable pipelines.

In contrast to "classic" Beam, the Runner is not guaranteed to knowwhich coder is used by the SDK. If the SDK happens to use a standardcoder (also known as model coder), we will also have it available at theRunner, i.e. if the Runner is written in one of the SDK languages (e.g.Java). However, when we do not have a standard coder, we just treat thedata from the SDK as a blob and just pass it around as bytes.


Problem
=======

In the case of state requests which the SDK Harness authors to theRunner, we would like for the key associated with the state request tomatch the key of the element which led to initiating the state request.


Example:

Runner                 SDK Harness
------                 -----------

KV["key","value"]  --> Process Element
                              |
LookupState("key") <-- Request state of "key"
        |
   State["key"]    --> Receive state

For stateful DoFns, the Runner partitions the data based on the key. InFlink, this partitioning must not change during the lifetime of apipeline because the checkpointing otherwise breaks[0]. The key isextracted from the element and stored encoded.

If we have a standard coder, it is basically the same as in the"classic" Runner which takes the key and serializes it. However, when wehave an SDK-specific coder, we basically do not know how it encodes. Sofar, we have been using the coder instantiated from the Proto, which isbasically a LengthPrefixCoder[ByteArrayCoder] or similar[1]. We have hadproblems with this because the key encoding of Java SDK state requestsdid not match the key encoding on the Runner side [2]. In an attempt tofix those, it is now partly broken for portable Python pipelines.Partly, because it "only" affects non-standard coders.

Non-standard coders yield the aforementionedLengthPrefixCoder[ByteArrayCoder]. Now, following the usual encodingscheme, we would simply encode the key using this coder. However, forstate requests, the Python SDK leaves out the length prefix for certaincoders, e.g. for primitives like int or byte. It is possible that onecoder uses a length prefix, while another doesn't. We have no way oftelling from the Runner side, if a length prefix has been used or not.This results in the keys to not match on the Runner side and thepartitioning to be broken.



How to solve this?
==================

(1) Should this simply be fixed on the Python SDK side? One fix would beto always append a length prefix to the key in state requests, even forprimitive coders like VarInt which do not use one.

OR

(2) Should the Runner detect that a non-standard coder is used? If so,it should just pass the bytes from the SDK Harness and never make anattempt to construct a coder based on the Proto.

Thinking about it now, it seems pretty obvious that (2) is the mostfeasible way to avoid complications across all current and future SDKsfor key encodings. Still, it is odd that the Proto contains coderinformation which is not usable.


What do you think?


Thanks,
Max

[0] It is possible to restart the pipeline and repartition thecheckpointed data.[1]https://github.com/apache/beam/blob/c39752af5391fe698a2b4f1489c187ddd4d604c0/runners/flink/src/main/java/org/apache/beam/runners/flink/FlinkStreamingPortablePipelineTranslator.java#L682

[2] https://issues.apache.org/jira/browse/BEAM-8157

Key encodings for state requests

Reply via email to