Thanks Luke, no more comments from me, nice work!

On Mon, Feb 14, 2022 at 5:22 AM Luke Chen <show...@gmail.com> wrote:

> Hi Guozhang,
>
> Thanks for your comments. I've updated the KIP.
> Here's what I've updated:
>
> * In the motivation section, I've added this paragraph after
> cooperativeStickyAssignor like this:
>
> *On the other hand,  `StickyAssignor` is also adding "generation" field
> plus the "ownedPartitions" into subscription userData bytes. the difference
> is that the `StickyAssignor`'s user bytes also encode the prev-owned
> partitions while the `CooperativeStickyAssignor` relies on the prev-owned
> partitions on the subscription protocol directly.*
>
> * In the proposed change section, I've updated the paragraph as:
>
>
> *For built-in CooperativeStickyAssignor, if there are consumers in old
> bytecode and some in the new bytecode, it's totally fine, because the
> subscription data from old consumers will contain \[empty ownedPartitions +
> default generation(-1)] in V0, or \[current ownedPartitions + default
> generation(-1)] in V1. For V0 case, it's quite simple, because we'll just
> ignore the info since they are empty. For V1 case, we'll get the
> "ownedPartitions" data, and then decode the "generation" info in the
> subscription userData bytes. So that we can continue to do assignment with
> these information.*
> * Also, after the "cooperativeStickyAssignor paragraph, I've also mentioned
> stickyAssignor:
>
>
> *For built-in StickyAssignor, if there are consumers in old bytecode and
> some in the new bytecode, it's also fine, because the subscription data
> from old consumers will contain \[empty ownedPartitions + default
> generation(-1)] in V0, or \[current ownedPartitions + default
> generation(-1)] in V1. For both V0 and V1 case, we'll directly use the
> ownedPartition and generation info in the subscription userData bytes. *
>
> Please let me know if you have other comments.
>
> Thank you.
> Luke
>
> On Wed, Feb 9, 2022 at 2:57 PM Guozhang Wang <wangg...@gmail.com> wrote:
>
> > Hello Luke,
> >
> > Thanks for the updated KIP, I've taken a look at it and still LGTM. Just
> a
> > couple minor comments in the wiki:
> >
> > * Both `StickyAssignor` and `CooperativeStickyAssignor` that there's
> > already generation is encoded in user-data bytes, the difference is that
> > the `StickyAssignor`'s user bytes also encode the prev-owned partitions
> > while the `CooperativeStickyAssignor` relies on the prev-owned partitions
> > on the subscription protocol directly. So we can add the `StickyAssignor`
> > in your paragraph talking about `CooperativeStickyAssignor` as well.
> >
> > * This sentence: "otherwise, we'll take the ownedPartitions as default
> > generation(-1)." does not read right to me, maybe need to rephrase a bit?
> >
> >
> > Guozhang
> >
> > On Mon, Feb 7, 2022 at 7:36 PM Luke Chen <show...@gmail.com> wrote:
> >
> > > Hi David,
> > >
> > > Thanks for your comments.
> > > I've updated the KIP to add changes in Subscription class.
> > >
> > > Thank you.
> > > Luke
> > >
> > > On Fri, Feb 4, 2022 at 11:43 PM David Jacot
> <dja...@confluent.io.invalid
> > >
> > > wrote:
> > >
> > > > Hi Luke,
> > > >
> > > > Thanks for updating the KIP. I just have a minor request.
> > > > Could you fully describe the changes to the Subscription
> > > > public class in the KIP? I think that it would be better than
> > > > just saying that the generation will be added to id.
> > > >
> > > > Otherwise, the KIP LGTM.
> > > >
> > > > Thanks,
> > > > David
> > > >
> > > > On Mon, Nov 29, 2021 at 4:29 AM Luke Chen <show...@gmail.com> wrote:
> > > > >
> > > > > Hi devs,
> > > > > Welcome to provide feedback.
> > > > >
> > > > > If there are no other comments, I'll start a vote tomorrow.
> > > > >
> > > > > Thank you.
> > > > > Luke
> > > > >
> > > > >
> > > > > On Mon, Nov 22, 2021 at 4:16 PM Luke Chen <show...@gmail.com>
> wrote:
> > > > >
> > > > > > Hello David,
> > > > > >
> > > > > > For (3):
> > > > > >
> > > > > >
> > > > > >
> > > > > > * I suppose that we could add a `generation` field to the
> > > > JoinGroupRequest
> > > > > > instead to do the fencing that you describe while handling the
> > > > sentinel in
> > > > > > the assignor directly. If we would add the `generation` to the
> > > request
> > > > > > itself, would we need the `generation` in the subscription
> protocol
> > > as
> > > > > > well?*
> > > > > >
> > > > > > On second thought, I think this is not better than adding
> > > `generation`
> > > > > > field in the subscription protocol, because I think we don't have
> > to
> > > > do any
> > > > > > generation validation on joinGroup request. The purpose of
> > > > > > `joinGroupRequest` is to accept any members to join this group,
> > even
> > > > if the
> > > > > > member is new or ever joined or what. As long as we have the
> > > > generationId
> > > > > > in the subscription metadata, the consumer lead can leverage the
> > info
> > > > to
> > > > > > ignore the old ownedPartitions (or do other handling), and the
> > > > rebalance
> > > > > > can still complete successfully and correctly. On the other hand,
> > if
> > > > we did
> > > > > > the generation check on JoinGroupRequest, and return
> > > > `ILLEGAL_GENERATION`
> > > > > > back to consumer, the consumer needs to clear its generation info
> > and
> > > > > > rejoin the group to continue the rebalance. It needs more
> > > > request/response
> > > > > > network and slow down the rebalance.
> > > > > >
> > > > > > So I think we should add the `generationId` field into
> Subscription
> > > > > > protocol to achieve what we want.
> > > > > >
> > > > > > Thank you.
> > > > > > Luke
> > > > > >
> > > > > > On Thu, Nov 18, 2021 at 8:51 PM Luke Chen <show...@gmail.com>
> > wrote:
> > > > > >
> > > > > >> Hi David,
> > > > > >> Thanks for your feedback.
> > > > > >>
> > > > > >> I've updated the KIP for your comments (1)(2).
> > > > > >> For (3), it's a good point! Yes, we didn't deserialize the
> > > > subscription
> > > > > >> metadata on broker side, and it's not necessary to add overhead
> on
> > > > broker
> > > > > >> side. And, yes, I think we can fix the original issue by adding
> a
> > > > > >> "generation" field into `JoinGroupRequest` instead, and also
> add a
> > > > field
> > > > > >> into `JoinGroupResponse` in `JoinGroupResponseMember` field.
> That
> > > > way, the
> > > > > >> broker can identify the old member from `JoinGroupRequest`. And
> > the
> > > > > >> assignor can also get the "generation" info via the
> `Subscription`
> > > > instance.
> > > > > >>
> > > > > >> I'll update the KIP to add "generation" field into
> > > `JoinGroupRequest`
> > > > and
> > > > > >> `JoinGroupResponse`, if there is no other options.
> > > > > >>
> > > > > >> Thank you.
> > > > > >> Luke
> > > > > >>
> > > > > >>
> > > > > >> On Tue, Nov 16, 2021 at 12:31 AM David Jacot
> > > > <dja...@confluent.io.invalid>
> > > > > >> wrote:
> > > > > >>
> > > > > >>> Hi Luke,
> > > > > >>>
> > > > > >>> Thanks for the KIP. Overall, I think that the motivation makes
> > > > sense. I
> > > > > >>> have a couple of comments/questions:
> > > > > >>>
> > > > > >>> 1. In the Public Interfaces section, it would be great if you
> > could
> > > > put
> > > > > >>> the
> > > > > >>> end state not the current one.
> > > > > >>>
> > > > > >>> 2. Do we need to update the Subscription class to expose the
> > > > > >>> generation? If so, it would be great to mention it in the
> Public
> > > > > >>> Interfaces section as well.
> > > > > >>>
> > > > > >>> 3. You mention that the broker will set the generation if the
> > > > > >>> subscription
> > > > > >>> contains a sentinel value (-1). As of today, the broker does
> not
> > > > parse
> > > > > >>> the subscription so I am not sure how/why we would do this. I
> > > suppose
> > > > > >>> that we could add a `generation` field to the JoinGroupRequest
> > > > instead
> > > > > >>> to do the fencing that you describe while handling the sentinel
> > in
> > > > the
> > > > > >>> assignor directly. If we would add the `generation` to the
> > request
> > > > > >>> itself,
> > > > > >>> would we need the `generation` in the subscription protocol as
> > > well?
> > > > > >>>
> > > > > >>> Best,
> > > > > >>> David
> > > > > >>>
> > > > > >>> On Fri, Nov 12, 2021 at 3:31 AM Luke Chen <show...@gmail.com>
> > > wrote:
> > > > > >>> >
> > > > > >>> > Hi all,
> > > > > >>> >
> > > > > >>> > I'd like to start the discussion for KIP-792: Add
> "generation"
> > > > field
> > > > > >>> into
> > > > > >>> > consumer protocol.
> > > > > >>> >
> > > > > >>> > The goal of this KIP is to allow assignor/consumer
> > > > coordinator/group
> > > > > >>> > coordinator to have a way to identify the out-of-date
> > > > > >>> members/assignments.
> > > > > >>> >
> > > > > >>> > Detailed description can be found here:
> > > > > >>> >
> > > > > >>>
> > > >
> > >
> >
> https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=191336614
> > > > > >>> >
> > > > > >>> > Any feedback is welcome.
> > > > > >>> >
> > > > > >>> > Thank you.
> > > > > >>> > Luke
> > > > > >>>
> > > > > >>
> > > >
> > >
> >
> >
> > --
> > -- Guozhang
> >
>


-- 
-- Guozhang

Reply via email to