Re: Kafka producer does not respect added partitions

2017-03-24 Thread Manikumar
producer refreshes the metadata for every metadata.max.age.ms (default 5min) to discover new partitions. On Sat, Mar 25, 2017 at 2:22 AM, Robert Quinlivan wrote: > Hello, > > I have added partitions to a topic. The new partitions appear in the > consumer assignments and in

Kafka producer does not respect added partitions

2017-03-24 Thread Robert Quinlivan
Hello, I have added partitions to a topic. The new partitions appear in the consumer assignments and in the topics listing they have the correct number of ISRs. However, the producer still does not write to the new partitions. My producer writes in a round-robin fashion, using the Cluster's

Re: YASSQ (yet another state store question)

2017-03-24 Thread Eno Thereska
Hi Jon, This is expected, see this: https://groups.google.com/forum/?pli=1#!searchin/confluent-platform/migrated$20to$20another$20instance%7Csort:relevance/confluent-platform/LglWC_dZDKw/qsPuCRT_DQAJ

YASSQ (yet another state store question)

2017-03-24 Thread Jon Yeargers
I've setup a KTable as follows: KTable outTable = sourceStream.groupByKey(). reduce(rowReducer, TimeWindows.of(5 * 60 * 1000L).advanceBy(1 * 60 * 1000).until(10 * 60 * 1000L), "AggStore"); I can confirm its presence via 'streams.allMetadata()'

Re: Should MirrorMaker produce round robin with this configuration?

2017-03-24 Thread Chris Neal
lol...well, I take it all back. Now I can't get it to work at all :( Here's what I have: *consumer.properties* zookeeper.connect=[server_list] # I was changing the group.id each time in case that was causing some issues group.id=MirrorMakerTest7 client.id=MirrorMakerConsumer

how to get parallel processing with kafka and akka

2017-03-24 Thread Laxmi Narayan
Hi, Kafka has partitions and akka can do parallel processing. I have one perfect use-case where I have to read data in parallel. But seems like partitions does not give me any extra info other than partition number and how do i make sure that data_x should always go to x-partition next time and

Re: Question about kafka-streams task load balancing

2017-03-24 Thread Guozhang Wang
Hi Karthik, I think in the current trunk we do effectively load balance across processes (they are named as "clients" in the partition assignor) already. More specifically: 1. Consumer clients embedded a "client UUID" in its subscription so that the leader can group them into a single client,

Re: APPLICATION_SERVER_CONFIG ?

2017-03-24 Thread Jon Yeargers
You make some great cases for your architecture. To be clear - Ive been proselytizing for kafka since I joined this company last year. I think my largest issue is rethinking some preexisting notions about streaming to make them work in the kstream universe. On Fri, Mar 24, 2017 at 6:07 AM,

Re: Should MirrorMaker produce round robin with this configuration?

2017-03-24 Thread Chris Neal
Again, thank you for the feedback. That link was very helpful! I adjusted my consumer/producer configs to be: consumer: zookeeper.connect=[server_list_here] group.id=MirrorMaker exclude.internal.topics=true client.id=MirrorMakerConsumer producer: metadata.broker.list=[server_list_here]

Re: Should MirrorMaker produce round robin with this configuration?

2017-03-24 Thread Manikumar
producer distributes the non-keyed messages to available target partitions in a round-robin fashion. You don't need to set num.consumer.fetchers, partition.assignment.strategy props. Use the --num.streams option to specify the number of consumer threads to create.

Re: Should MirrorMaker produce round robin with this configuration?

2017-03-24 Thread Chris Neal
Thanks very much for the reply Manikumar! I found that there were a few topics on the source cluster that had more than two partitions, but all topics on the target cluster had 2 partitions. I did a test between one topic that had 2 on each, and I did get messages to both partitions as expected.

Re: Relationship fetch.replica.max.bytes and message.max.bytes

2017-03-24 Thread Kostas Christidis
Cool - thanks for clarifying this! On Thu, Mar 23, 2017 at 10:54 AM, Ismael Juma wrote: > Hi Kostas, > > Yes, equal is fine. The code that prints an error if replication fails due > to this: > > error(s"Replication is failing due to a message that is greater than >

Re: APPLICATION_SERVER_CONFIG ?

2017-03-24 Thread Michael Noll
> If I understand this correctly: assuming I have a simple aggregator > distributed across n-docker instances each instance will _also_ need to > support some sort of communications process for allowing access to its > statestore (last param from KStream.groupby.aggregate). Yes. See

even if i pass key no change in partition

2017-03-24 Thread Laxmi Narayan
Hi, I am passing key in producer but still no change in partition. I can see in producer response key value but no change in partition. *This is how my props looks:* props.put("bootstrap.servers", "localhost:9092"); props.put("group.id", "test");

Re: APPLICATION_SERVER_CONFIG ?

2017-03-24 Thread Jon Yeargers
If I understand this correctly: assuming I have a simple aggregator distributed across n-docker instances each instance will _also_ need to support some sort of communications process for allowing access to its statestore (last param from KStream.groupby.aggregate). How would one go about

Re: Kafka high cpu usage and disconnects

2017-03-24 Thread Paul van der Linden
If I run 3 brokers in a cluster on localhost the cpu usage is virtually zero. Not sure why on other environments the minimum usage of each broker is at least 13% (with zero producers/consumers), that doesn't sound normal. On Thu, Mar 23, 2017 at 4:48 PM, Paul van der Linden

Re: ORC plugin for Kafka HDFS connector

2017-03-24 Thread Manoj Murumkar
>> It wants to extract Avro schema from ORC record. Should say: It wants to extract connect schema from ORC record. On Thu, Mar 23, 2017 at 11:14 PM, Manoj Murumkar wrote: > Hi, > > I am developing a connector to support ORC data type in HDFS connector. > Everything