Re: Partition Count Dilemma

2021-10-01 Thread Antony Stubbs
Hi there, sorry just checking the user list for "parallel consumer" - you
can use Confluent's Parallel Consumer client library to increase your
concurrency far beyond the number of partitions you have available.

For example in your case - you can process 1,000's of messages in parallel
without touching the partition count.

Have a look here:
https://www.confluent.io/confluent-accelerators/#parallel-consumer

Latest talk here:
https://www.confluent.io/events/kafka-summit-europe-2021/introducing-confluent-labs-parallel-consumer-client/

Github here: https://github.com/confluentinc/parallel-consumer

On Thu, Mar 21, 2019 at 9:18 AM Jérémy Thulliez 
wrote:

> In your case,
>
> Increase the number of partitions on an existing topic will have an impact
> on how the message will be dispatched into partitions.
>
> I mean if you use the default partitionner,  and your message have key, the
> algorithm is 'hash(key) % number of partitions'
>
> That means that a message with the same key, won't go into the same
> partition, before and after the change.
>
> IMO, It was important to notice, in case you need ordering by key.
>
> Jeremy
>


-- 

<https://www.confluent.io>

Antony Stubbs


Follow us:  Blog
<https://confluent.io/blog?utm_source=footer&utm_medium=email&utm_campaign=ch.email-signature_type.community_content.blog>
• Slack <https://slackpass.io/confluentcommunity> • Twitter
<https://twitter.com/ConfluentInc> • YouTube <https://youtube.com/confluent>


Re: Confluent's parallel consumer

2021-10-01 Thread Antony Stubbs
Hi Israel, Pushkar, everyone! Sorry I didn't notice this message. Feel
free to collaborate on the github discussions page about the project too.

And yes Israel is correct - offsets are handled correctly regardless of
what modes you choose. Offsets are tracked individually, and committed to
kafka inside the commit metadata payload - there is no message loss.
Here's a conversation on the topic in the discussion forum:
https://github.com/confluentinc/parallel-consumer/discussions/135#discussioncomment-1050437

Let me know if I can help with anything else!

Cheers,
Antony.

On Thu, Jul 15, 2021 at 11:50 PM Israel Ekpo  wrote:

> Hi Pushkar,
>
> Based on what I understand about the library, I don't think you need to
> worry about data loss because there are mechanisms in place to track which
> offsets have been processed in the event something goes wrong during
> processing.
>
> If the processing client goes offline or is unresponsive and has to be
> resumed or restarted, it should restart from where it left off since the
> offsets are being tracked.
>
>
> https://www.confluent.io/events/kafka-summit-europe-2021/introducing-confluent-labs-parallel-consumer-client/
>
> This should be true regardless of the ordering strategy you select.
>
> Take a look at the talk from the most recent Kafka Summit for additional
> details on this
>
> The github documentation for the project also covers this briefly
>
>
> https://github.com/confluentinc/parallel-consumer/blob/master/README.adoc#commit-mode
>
> I hope this was helpful. If you still have additional questions, please
> continue to bring it up.
>
> Thanks.
>
> On Thu, Jul 15, 2021 at 11:21 AM Pushkar Deole 
> wrote:
>
> > It is consumer client node that has received events and is processing
> > those...
> >
> > On Thu, Jul 15, 2021 at 8:49 PM Israel Ekpo 
> wrote:
> >
> > > Hi Pushkar,
> > >
> > > When you use the term “node/instance” are you referring to the Kafka
> > > Brokers or the consuming clients that are retrieving events from the
> > > broker?
> > >
> > > Please could you elaborate/clarify?
> > >
> > > On Thu, Jul 15, 2021 at 10:00 AM Pushkar Deole 
> > > wrote:
> > >
> > > > Well... with key-level ordering, i am mainly concerned about event
> > loss,
> > > if
> > > > any, in below mentioned scenario:
> > > >
> > > > 1. since event1 with key1 and event2 with key2 are both part of the
> > same
> > > > partition1
> > > > 2. key1 event has offset 30 while key2 has offset 40
> > > > 3. key2 is processed by background thread and offset committed which
> is
> > > 40
> > > > 4. before key1 gets processed by background thread, the instance/node
> > > goes
> > > > down
> > > > 5. partition1 gets rebalanced to node2 and start processing ahead of
> > > offset
> > > > 40, thus losing key1
> > > >
> > > >
> > > >
> > > > On Thu, Jul 15, 2021 at 7:18 PM Israel Ekpo 
> > > wrote:
> > > >
> > > > > Hi Pushkar,
> > > > >
> > > > > If you are selecting key-based ordering, you should not be
> concerned
> > > > about
> > > > > the other keys from the same partitions being committed first
> > > > >
> > > > > If that is a concern for your use cases then you should go with
> > > partition
> > > > > based ordering to ensure that the events are processed in the
> > sequence
> > > > they
> > > > > are picked up from the topic partition.
> > > > >
> > > > > For commit mode, you have the asynchronous, synchronous and
> > > transactional
> > > > > modes. I think if you are concerned with the order of commits you
> > > should
> > > > > look into the last two modes.
> > > > >
> > > > > My recommendation would be to go with the partition based ordering
> > with
> > > > > synchronous commits to start with.
> > > > >
> > > > >
> > > > >
> > > > > On Thu, Jul 15, 2021 at 7:36 AM Pushkar Deole <
> pdeole2...@gmail.com>
> > > > > wrote:
> > > > >
> > > > > > Hi All, and Antony (author of below article)
> > > > > >
> > > > > > i came across this article which seemed interesting: Introducing
> > > > > > Confluent’s Parallel Consumer Message Processing Cl

Re: [VOTE] KIP-657: Add Customized Kafka Streams Logo

2020-08-20 Thread Antony Stubbs
 > >
> > > > > @Boyang: as mentioned by Bruno, can we maybe add black/white
> options
> > > for
> > > > > both proposals, too?
> > > > >
> > > > > I also agree that Design B is not ideal with regard to the Kafka
> > logo.
> > > > > Would it be possible to change Design B accordingly?
> > > > >
> > > > > I am not a font expert, but the fonts in both design are different
> > and
> > > I
> > > > > am wondering if there is an official Apache Kafka font that we
> should
> > > > > reuse to make sure that the logos align -- I would expect that both
> > > > > logos (including "Apache Kafka" and "Kafka Streams" names) will be
> > used
> > > > > next to each other and it would look awkward if the font differs.
> > > > >
> > > > >
> > > > > -Matthias
> > > > >
> > > > > On 8/18/20 11:28 AM, Navinder Brar wrote:
> > > > > > Hi,
> > > > > > Thanks for the KIP, really like the idea. I am +1(non-binding)
> on A
> > > > > mainly because I felt like you have to tilt your head to realize
> the
> > > > > otter's head in B.
> > > > > > Regards,Navinder
> > > > > >
> > > > > > On Tuesday, 18 August, 2020, 11:44:20 pm IST, Guozhang Wang <
> > > > > wangg...@gmail.com> wrote:
> > > > > >
> > > > > >  I'm leaning towards design B primarily because it reminds me of
> > the
> > > > > Firefox
> > > > > > logo which I like a lot. But I also share Adam's concern that it
> > > should
> > > > > > better not obscure the Kafka logo --- so if we can tweak a bit to
> > fix
> > > > it
> > > > > my
> > > > > > vote goes to B, otherwise A :)
> > > > > >
> > > > > >
> > > > > > Guozhang
> > > > > >
> > > > > > On Tue, Aug 18, 2020 at 9:48 AM Bruno Cadonna <
> br...@confluent.io>
> > > > > wrote:
> > > > > >
> > > > > >> Thanks for the KIP!
> > > > > >>
> > > > > >> I am +1 (non-binding) for A.
> > > > > >>
> > > > > >> I would also like to hear opinions whether the logo should be
> > > > colorized
> > > > > >> or just black and white.
> > > > > >>
> > > > > >> Best,
> > > > > >> Bruno
> > > > > >>
> > > > > >>
> > > > > >> On 15.08.20 16:05, Adam Bellemare wrote:
> > > > > >>> I prefer Design B, but given that I missed the discussion
> > thread, I
> > > > > think
> > > > > >>> it would be better without the Otter obscuring any part of the
> > > Kafka
> > > > > >> logo.
> > > > > >>>
> > > > > >>> On Thu, Aug 13, 2020 at 6:31 PM Boyang Chen <
> > > > > reluctanthero...@gmail.com>
> > > > > >>> wrote:
> > > > > >>>
> > > > > >>>> Hello everyone,
> > > > > >>>>
> > > > > >>>> I would like to start a vote thread for KIP-657:
> > > > > >>>>
> > > > > >>>>
> > > > > >>
> > > > >
> > > >
> > >
> >
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-657%3A+Add+Customized+Kafka+Streams+Logo
> > > > > >>>>
> > > > > >>>> This KIP is aiming to add a new logo for the Kafka Streams
> > > library.
> > > > > And
> > > > > >> we
> > > > > >>>> prepared two candidates with a cute otter. You could look up
> the
> > > KIP
> > > > > to
> > > > > >>>> find those logos.
> > > > > >>>>
> > > > > >>>>
> > > > > >>>> Please post your vote against these two customized logos. For
> > > > > example, I
> > > > > >>>> would vote for *design-A (binding)*.
> > > > > >>>>
> > > > > >>>> This vote thread shall be open for one week to collect enough
> > > votes
> > > > to
> > > > > >> call
> > > > > >>>> for a winner. Still, feel free to post any question you may
> have
> > > > > >> regarding
> > > > > >>>> this KIP, thanks!
> > > > > >>>>
> > > > > >>>
> > > > > >>
> > > > > >
> > > > > >
> > > > >
> > > > >
> > > >
> > >
> >
>
>
> --
>
> Michael G. Noll
>
> Principal Technologist, Office of the CTO
>
> <https://www.confluent.io>
>


-- 

<https://www.confluent.io>

Antony Stubbs

Principal Consulting Engineer / Solutions Architect


Follow us:  Blog
<https://confluent.io/blog?utm_source=footer&utm_medium=email&utm_campaign=ch.email-signature_type.community_content.blog>
• Slack <https://slackpass.io/confluentcommunity> • Twitter
<https://twitter.com/ConfluentInc> • YouTube <https://youtube.com/confluent>