Re: Kafka Multiple Consumer Group for Same Topic

2017-01-24 Thread Sharninder Khera
I don't have benchmarks but multiple consumer groups are possible. For Kafka 
the performance should be similar or close to as having multiple consumers 
using a single group. 


_
From: Senthil Kumar 
Sent: Tuesday, January 24, 2017 10:38 PM
Subject: Kafka Multiple Consumer Group for Same Topic
To:  
Cc:  


Hi Team ,  Sorry if the same question asked already in this group !

Say we have topic => ad_events ..  I want to read events from ad_events
topic and send it to two different systems... This can be achieved by
creating two  Consume Groups..

Example :  Consumer Group SYS1 with 10 threads
  Consume Group SYS2 with 10 threads

Would like to know having two different Consumer Groups will impact
performance of Kafka Read ??  Also want to see the *Benchmarking Result*(
Numbers ) of   Single Topic Read with *One Consumer Group* Vs Single Topic
with *Two/Three Consumer Group*..


Cheers,
Senthil





Re: Storing Kafka Message JSON to deep storage like S3

2016-12-06 Thread Sharninder Khera
Why not just have a parallel consumer read all messages from whichever topics 
you're interested in and store them wherever you want to? You don't need to 
"backup" Kafka messages. 

_
From: Aseem Bansal 
Sent: Tuesday, December 6, 2016 4:55 PM
Subject: Storing Kafka Message JSON to deep storage like S3
To:  


Hi

Has anyone done a storage of Kafka JSON messages to deep storage like S3.
We are looking to back up all of our raw Kafka JSON messages for
Exploration. S3, HDFS, MongoDB come to mind initially.

I know that it can be stored in kafka itself but storing them in Kafka
itself does not seem like a good option as we won't be able to query it and
the configurations of machines containing kafka will have to be increased
as we go. Something like S3 we won't have to manage.





Re: Need to add & remove consumers dynamically in diffrent group and consume consecutively.

2016-10-19 Thread Sharninder Khera
Do you have only one partition in the topic? The way Kafka works is that all 
messages are first distributed into partitions in the topic and then the 
consumers are distributed among them and they read them sequentially. 
If you have only one partition in the topic, all your messages will be in it 
but only one consumer can read messages per partition. If you have multiple 
partitions, you can have that many consumers reading from it in parallel.
Let me know if that wasn't clear. 

_
From: Kaushil Rambhia/ MUM/CORP/ ENGINEERING 
Sent: Wednesday, October 19, 2016 8:40 PM
Subject: Need to add & remove consumers dynamically in diffrent group and 
consume consecutively.
To:  


Hi guys,
i am using apache kafka with phprd kafka, i want to know how can i use
multiple Kafka consumers on same partition from different groups to consume
message parallel, say if consumer are c1,c2,c3 consuming single partition
0,
than if c1 is consuming from 0 offset than c2 should start from 1 and c3
from 2 and if any new consumer comes up it should start from latest i.e 4th
 offset which is yet to be consumed by any consumer.

So in short all consumers should consume consecutively from kafka partition
and should not consumed same message again no matter any new consumer is
added or removed from different group.It should automatically select
consecutive offsets.

It would be good if anyone can recommend any solution or can this
be achieved by using kafka with some other tools like spark or something
else.

-- 
Regards,
Kaushil Rambhia

-- 



DISCLAIMER: The contents of this message may be legally privileged and 
confidential and are for the use of the intended recipient(s) only. It 
should not be read, copied and used by anyone other than the intended 
recipient(s). If you have received this message in error, please 
immediately notify the sender, preserve its confidentiality and delete it. 
Before opening any attachments please check them for viruses and defects.





Re: Using Kafka without persisting message to disk

2016-07-14 Thread Sharninder Khera
I'd second Tom here. Create a ram disk or just let Kafka write to disk. Use 
compression and batch messages and the OS fscache would take care of the rest. 
Kafka is pretty fast and you probably won't notice. 



_
From: Tom Crayford 
Sent: Friday, July 15, 2016 12:12 AM
Subject: Re: Using Kafka without persisting message to disk
To: Users 


Hi Jack,

No, kafka doesn't support not writing to disk. If you're really 100% sure
of yourself you could use a ramdisk and mount Kafka on it, but that's not
supported. I'd recommend "just" writing to disk, it's plenty fast enough
for nearly all use cases.

Thanks

Tom Crayford
Heroku Kafka

On Thu, Jul 14, 2016 at 7:33 PM, Jack Huang  wrote:

> Hi all,
>
> Is there a way to make a topic to be stored in memory only and not writing
> to disk? If not, what's the best way to minimize writing to disk? For this
> application we only need the notion of partitions and a short retention
> time (1hr or so) from Kafka. We want to use Kafka because we want to keep
> the flexibility to add persistence back if we need to.
>
> Thanks,
> Jack
>





Re: Kafka as master data store

2016-02-15 Thread Sharninder Khera
This topic comes up often on this list. Kafka can be used as a datastore if 
that’s what your application wants with the caveat that Kafka isn’t designed to 
keep data around forever. There is a default retention time after which older 
data gets deleted. The high level consumer essentially reads data as a stream 
and while you can do sort of random access with the low level consumer, its not 
ideal.



> On 15-Feb-2016, at 10:26 PM, Ted Swerve  wrote:
> 
> Hello,
> 
> Is it viable to use infinite-retention Kafka topics as a master data
> store?  I'm not talking massive volumes of data here, but still potentially
> extending into tens of terabytes.
> 
> Are there any drawbacks or pitfalls to such an approach?  It seems like a
> compelling design, but there seem to be mixed messages about its
> suitability for this kind of role.
> 
> Regards,
> Ted



Re: Multiple instances of HL Consumer

2015-06-05 Thread Sharninder Khera
You can have the same consumer id and Kafka will balance partitions across the 
two instances automatically. When one of them dies the partitions are 
rebalanced and assigned to the remaining alive consumers. 



_
From: Panda, Samaresh 
Sent: Friday, June 5, 2015 7:42 pm
Subject: Multiple instances of HL Consumer
To:  


I've a HL consumer receiving messages using four threads (four partitions). 
This is a stand-alone Java client. For fail-safe reasons, I want to run another 
instance of the exact same Java client in a different box.

Here are my questions:

> Can I keep the same consumer group name or it must be different for the 2nd 
> instance?
> If same consumer group, will the 2nd client receive same set of messages 
> again?
> In general what's the best practice to designing fail-safe clients?

Thanks
Sam