On 11/14/19 2:32 AM, Jan Lukavský wrote:
Hi Danny,
as Eugene pointed out, there are essentially two "modes of operation"
of CheckpointMark. It can:
a) be used to restore the state of a reader (in the call to
UnboundedSource#createReader)
b) confirm processed elements in CheckpointMark#finalizeCheckpoint
If your source doesn't provide a persistent position in the data stream
that can be referred to and serialized (Kafka offsets would be an
example of this), then what you actually need to serialize is not the
channel, but a way to restore it - e.g. by opening a new channel
with a given 'consumer group name'. Then you just use this checkpoint
to commit your processed data in finalizeCheckpoint.
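Jan's suggestion can be sketched in plain Java. All names here (SubscriptionCheckpointMark, subscriptionId, the Object standing in for a channel) are hypothetical illustrations, not RabbitMqIO code: the mark persists only a serializable identifier, and the live channel is transient, to be re-established after deserialization.

```java
import java.io.*;

// Hypothetical sketch: the mark persists only the subscription id;
// the live channel is transient and must be re-opened on restore.
public class SubscriptionCheckpointMark implements Serializable {
    private final String subscriptionId;  // survives serialization
    private transient Object channel;     // dropped on serialization

    public SubscriptionCheckpointMark(String subscriptionId, Object channel) {
        this.subscriptionId = subscriptionId;
        this.channel = channel;
    }

    public String getSubscriptionId() { return subscriptionId; }
    public Object getChannel() { return channel; }

    // Round-trip helper used to demonstrate what survives serialization.
    static SubscriptionCheckpointMark roundTrip(SubscriptionCheckpointMark mark)
            throws IOException, ClassNotFoundException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ObjectOutputStream oos = new ObjectOutputStream(bos)) {
            oos.writeObject(mark);
        }
        try (ObjectInputStream ois = new ObjectInputStream(
                new ByteArrayInputStream(bos.toByteArray()))) {
            return (SubscriptionCheckpointMark) ois.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        SubscriptionCheckpointMark original =
                new SubscriptionCheckpointMark("consumer-group-1", new Object());
        SubscriptionCheckpointMark restored = roundTrip(original);
        // The identifier survives; the live channel does not and must be re-opened.
        System.out.println(restored.getSubscriptionId()); // consumer-group-1
        System.out.println(restored.getChannel());        // null
    }
}
```

Whether this actually works for RabbitMQ depends on the broker question debated below: it requires that acks be valid on a channel other than the one that received the message.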
Note that finalizeCheckpoint is not guaranteed to be called - that
can happen when an error occurs and the source has to be
rewound - which is what the direct runner emulates with the probability
of 'readerReuseChance'.
I'm reading the RabbitMQ documentation only quickly, but if I
understand it correctly, you have to create a subscription to the
broker, serialize the identifier of the subscription into the
CheckpointMark, and then just recover the subscription in the call to
UnboundedSource#createReader. That should do the trick.
I have not seen any such documentation in rabbit. My understanding is it
has to be the same physical connection and channel. Can you cite the
source you were looking at?
-Danny
Hope this helps. Sorry if I'm not using 100% correct RabbitMQ
terminology; as I said, I'm not quite familiar with it.
Best,
Jan
On 11/14/19 5:26 AM, Daniel Robert wrote:
I believe I've nailed down a situation that happens in practice that
causes Beam and Rabbit to be incompatible. It seems that runners can
and do make assumptions about the serializability (via Coder) of a
CheckpointMark.
To start, these are the semantics of RabbitMQ:
- the client establishes a connection to the server
- client opens a channel on the connection
- messages are either pulled or pushed to the client from the server
along this channel
- when messages are done processing, they are acknowledged
*client-side* and must be acknowledged on the *same channel* that
originally received the message.
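The "same channel" rule can be illustrated with a small fake broker. Every class here (FakeChannel, deliver) is an illustrative stand-in, not the RabbitMQ client API: delivery tags are scoped to the channel that issued them, and acking on any other channel fails the way the broker does with a 406.

```java
import java.util.*;

// Illustrative model of RabbitMQ's ack scoping: delivery tags are only
// meaningful on the channel that delivered the message.
public class AckScopeDemo {
    static class FakeChannel {
        private final Map<Long, String> pending = new HashMap<>();
        private long nextTag = 1;

        long deliver(String message) {
            long tag = nextTag++;
            pending.put(tag, message);
            return tag;
        }

        // Mirrors the broker behavior: unknown tag -> 406 PRECONDITION-FAILED.
        void basicAck(long deliveryTag) {
            if (pending.remove(deliveryTag) == null) {
                throw new IllegalStateException(
                        "406 (PRECONDITION-FAILED) - unknown delivery tag " + deliveryTag);
            }
        }
    }

    public static void main(String[] args) {
        FakeChannel receiving = new FakeChannel();
        FakeChannel other = new FakeChannel();

        long tag = receiving.deliver("msg-1");
        receiving.basicAck(tag);            // OK: same channel

        long tag2 = receiving.deliver("msg-2");
        try {
            other.basicAck(tag2);           // fails: different channel
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```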
Since a channel (or any open connection) is non-serializable, it
means that a CheckpointMark that has been serialized cannot ever be
used to acknowledge these messages and correctly 'finalize' the
checkpoint. It also, as previously discussed in this thread, implies
a rabbit Reader cannot accept an existing CheckpointMark at all; the
Reader and the CheckpointMark must share the same connection to the
rabbit server ("channel").
Next, I've found how DirectRunner (and presumably others) can attempt
to serialize a CheckpointMark that has not been finalized. In
https://github.com/apache/beam/blob/master/runners/direct-java/src/main/java/org/apache/beam/runners/direct/UnboundedReadEvaluatorFactory.java#L150,
the DirectRunner applies a probability and if it hits, it sets the
current reader to 'null' but retains the existing CheckpointMark,
which it then attempts to pass to a new reader via a Coder.
This leaves the shard, the runner, and the reader with differing views
of the world. In UnboundedReadEvaluatorFactory's processElement
function, a call to getReader(shard) (
https://github.com/apache/beam/blob/master/runners/direct-java/src/main/java/org/apache/beam/runners/direct/UnboundedReadEvaluatorFactory.java#L132
) clones the shard's checkpoint mark and passes that to the new
reader. The reader ignores it, creating its own, but even if it
accepted it, it would be accepting a serialized CheckpointMark, which
wouldn't work. Later, the runner calls finishRead (
https://github.com/apache/beam/blob/master/runners/direct-java/src/main/java/org/apache/beam/runners/direct/UnboundedReadEvaluatorFactory.java#L246
). The shard's CheckpointMark (unserialized; which should still be
valid) is finalized. The reader's CheckpointMark (which may be a
different instance) becomes the return value, which is referred to as
"finishedCheckpoint" in the calling code, which is misleading at best
and problematic at worst as *this* checkpoint has not been finalized.
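The hand-off described above reduces to a round-trip sketch, with plain java.io serialization standing in for a Beam Coder (Mark and its fields are hypothetical names): the clone is a distinct instance, and any transient channel it held is gone, so finalizing the clone cannot ack anything.

```java
import java.io.*;

// Sketch of what a coder-based clone does to a checkpoint mark that
// holds a live (transient, non-serializable) channel.
public class CoderCloneDemo {
    static class Mark implements Serializable {
        final long deliveryTag;
        transient AutoCloseable channel;   // stands in for a live rabbit channel
        Mark(long tag, AutoCloseable ch) { deliveryTag = tag; channel = ch; }
    }

    // Encode/decode with java.io serialization, as a SerializableCoder would.
    static Mark roundTrip(Mark m) throws Exception {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ObjectOutputStream oos = new ObjectOutputStream(bos)) {
            oos.writeObject(m);
        }
        try (ObjectInputStream ois = new ObjectInputStream(
                new ByteArrayInputStream(bos.toByteArray()))) {
            return (Mark) ois.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        Mark original = new Mark(42L, () -> {});
        Mark copy = roundTrip(original);
        System.out.println(copy != original);      // true: a different instance
        System.out.println(copy.deliveryTag);      // 42: plain data survives
        System.out.println(copy.channel == null);  // true: nothing left to ack on
    }
}
```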
So, tl;dr: I cannot find any means of maintaining a persistent
connection to the server for finalizing checkpoints that is safe
across runners. If there's a guarantee all of the shards are on the
same JVM instance, I could rely on global, static
collections/instances as a workaround, but if other runners might
serialize this across the wire, I'm stumped. The only workable
situation I can think of right now is to proactively acknowledge
messages as they are received and effectively no-op in
finalizeCheckpoint. This is very different, semantically, and can
lead to dropped messages if a pipeline doesn't finish processing the
given message.
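The workaround described above, ack on receipt and make finalizeCheckpoint a no-op, can be sketched as follows. FakeChannel and both class names are illustrative; as noted, this trades finalize-time acking for possible message loss on failure.

```java
import java.io.*;
import java.util.*;

// Sketch of the eager-ack workaround: messages are acknowledged as they
// are read, so the checkpoint mark has nothing left to do when finalized.
public class EagerAckDemo {
    static class FakeChannel {
        final List<Long> acked = new ArrayList<>();
        void basicAck(long deliveryTag) { acked.add(deliveryTag); }
    }

    // The mark is trivially serializable because it holds no live resources.
    static class EagerCheckpointMark implements Serializable {
        void finalizeCheckpoint() {
            // Intentionally a no-op: every message was already acked on receipt.
            // If the pipeline fails after the ack, those messages are lost,
            // which is the semantic weakening discussed in the thread.
        }
    }

    public static void main(String[] args) {
        FakeChannel channel = new FakeChannel();
        long[] deliveries = {1, 2, 3};
        for (long tag : deliveries) {
            channel.basicAck(tag);      // ack immediately, before processing
        }
        new EagerCheckpointMark().finalizeCheckpoint();  // nothing left to do
        System.out.println(channel.acked);  // [1, 2, 3]
    }
}
```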
Any help would be much appreciated.
Thanks,
-Danny
On 11/7/19 10:27 PM, Eugene Kirpichov wrote:
Hi Daniel,
This is probably insufficiently well documented. The CheckpointMark
is used for two purposes:
1) To persistently store some notion of how much of the stream has
been consumed, so that if something fails we can tell the underlying
streaming system where to start reading when we re-create the
reader. This is why CheckpointMark is Serializable. E.g. this makes
sense for Kafka.
2) To do acks - to let the underlying streaming system know that the
Beam pipeline will never need data up to this CheckpointMark. Acking
does not require serializability - runners call ack() on the same
in-memory instance of CheckpointMark that was produced by the
reader. E.g. this makes sense for RabbitMq or Pubsub.
In practice, these two capabilities tend to be mutually exclusive:
some streaming systems can provide a serializable CheckpointMark,
some can do acks, some can do neither - but very few (or none) can
do both, and it's debatable whether it even makes sense for a system
to provide both capabilities: usually acking is an implicit form of
streaming-system-side checkpointing, i.e. when you re-create the
reader you don't actually need to carry over any information from an
old CheckpointMark - the necessary state (which records should be
delivered) is maintained on the streaming system side.
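Eugene's split between the two capabilities can be sketched with two minimal marks (both hypothetical, not Beam or connector code): one carries a restorable position as plain data, the other only supports acking through an in-memory handle.

```java
import java.io.*;

// Two hypothetical checkpoint marks illustrating the two capabilities.
public class TwoModesDemo {
    // Mode 1 (Kafka-like): the position is plain data, fully serializable;
    // there is no broker-side work to do at finalize time.
    static class OffsetMark implements Serializable {
        final long offset;
        OffsetMark(long offset) { this.offset = offset; }
    }

    // Mode 2 (RabbitMQ/Pubsub-like): nothing restorable to persist; the
    // useful work happens in finalizeCheckpoint on the same in-memory
    // instance the reader produced.
    static class AckMark {
        final Runnable ack;          // in-memory only, not serializable
        boolean finalized = false;
        AckMark(Runnable ack) { this.ack = ack; }
        void finalizeCheckpoint() { ack.run(); finalized = true; }
    }

    public static void main(String[] args) {
        OffsetMark offsets = new OffsetMark(1234L);
        System.out.println(offsets.offset);     // restorable position: 1234

        final boolean[] acked = {false};
        AckMark ackMark = new AckMark(() -> acked[0] = true);
        ackMark.finalizeCheckpoint();           // ack on the in-memory instance
        System.out.println(acked[0]);           // true
    }
}
```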
These two are lumped together into one API simply because that was
the best design option we came up with (not for lack of trying, but
suggestions very much welcome - AFAIK nobody is happy with it).
RabbitMQ is under #2 - it can't do serializable checkpoint marks,
but it can do acks. So you can simply ignore the non-serializability.
On Thu, Nov 7, 2019 at 12:07 PM Daniel Robert <daniel.rob...@acm.org
<mailto:daniel.rob...@acm.org>> wrote:
(Background: I recently upgraded RabbitMqIO from the 4.x to the 5.x
library. As part of this I switched to a pull-based API rather than
the previously-used push-based one. This has caused some nebulous
problems, so I put up a correction PR that I think needs some eyes
fairly quickly, as I'd consider master to be broken for rabbitmq
right now. The PR keeps the upgrade but reverts to the same
push-based implementation as in 4.x:
https://github.com/apache/beam/pull/9977 )
Regardless, in trying to get the pull-based API to work, I'm finding
the interactions between rabbitmq and beam with CheckpointMark to be
fundamentally impossible to implement, so I'm hoping for some input
here.
CheckpointMark itself must be Serializable; presumably this means it
gets shuffled around between nodes. However 'Channel', the tunnel
through which it communicates with Rabbit to ack messages and
finalize the checkpoint, is non-Serializable. Like most other
CheckpointMark implementations, Channel is 'transient'. When a new
CheckpointMark is instantiated, it's given a Channel. If an existing
one is supplied to the Reader's constructor (part of the
'startReader()' interface), the channel is overwritten.
*However*, Rabbit does not support 'ack'ing messages on a channel
other than the one that consumed them in the first place. Attempting
to do so results in a '406 (PRECONDITION-FAILED) - unknown delivery
tag'. (See
https://www.grzegorowski.com/rabbitmq-406-channel-closed-precondition-failed
).
Truthfully, I don't really understand how the current implementation
is working; it seems like a happy accident. But I'm curious if
someone could help me figure out how to bridge the
re-usable/serializable CheckpointMark requirement in Beam with this
limitation of Rabbit.
Thanks,
-Daniel Robert