Re: 0.9.0.0 remaining jiras

2015-09-14 Thread Jiangjie Qin
HI Jun, Can we also include KAFKA-2448 in 0.9 as well? We see this issue a few times before and that cause replica fetcher threads not startup. Thanks, Jiangjie (Becket) Qin On Sat, Sep 12, 2015 at 9:40 AM, Jun Rao wrote: > The following is a candidate list of jiras that

Re: configuring log compaction

2015-08-09 Thread Jiangjie Qin
Actually Kafka only support two mutually exclusive log cleanup policy: 1) delete logs after retention period passed. 2) compact the log to only keep the last updated value of a key. log.retention.hours is only used by (1). For log compaction, currently it is not compacting the logs by its age,

Re: Consumer limit for pub-sub mode

2015-08-08 Thread Jiangjie Qin
That's not true Shaninder. Assuming you are using the high level consumer, if all the consumers have different group id, each of them will get the all the messages. Also, another way to think about this a little bit is that maybe you can produce the same data to many different topics each have

Re: Get last snapshot from compacted topic

2015-08-08 Thread Jiangjie Qin
You are looking for a K-V store here, so the general answer is no. But Kafka does have an internal K-V store but only for consumer offsets. So there are some tricks we can play: If your processing node is consuming from Kafka and the offset of a snapshot is the offset of a Kafka partition, you

Re: New consumer - offset storage options

2015-08-01 Thread Jiangjie Qin
Yes. That is correct. New consumer will only store offsets in Kafka. On Tue, Jul 21, 2015 at 8:57 AM, Stevo Slavić ssla...@gmail.com wrote: Hello Apache Kafka community, It seems new high level consumer coming in 0.8.3 will support only offset storage in Kafka topic. Can somebody please

Re: Regarding using of apache kafka

2015-08-01 Thread Jiangjie Qin
Since Kafka's performance largely depends on the operating system page cache. So usually people want to use dedicated machines for Kafka. But that depends on your performance requirement as well. If you only have 5 million messages per month, I think letting Kafka share the machines with other

Re: Connection to zk shell on Kafka

2015-07-30 Thread Jiangjie Qin
This looks an issue to be fixed. I created KAFKA-2385 for this. Thanks, Jiangjie (Becket) Qin On Wed, Jul 29, 2015 at 10:33 AM, Chris Barlock barl...@us.ibm.com wrote: I'm a user of Kafka/ZooKeeper not one of its developers, so I can't give you a technical explanation. I do agree that Kafka

Re: KAfka Mirror Maker

2015-07-29 Thread Jiangjie Qin
Mirror Maker does not have specific restrictions on cluster size. The error you saw was because consumer was not able to talk to the broker. Can you try to use kafka-console-consumer to consume some data from your source cluster and see if it works? It should be under KAFKA_HOME/bin/ Jiangjie

Re: Idea how to ensure exactly once message deliver without external storage

2015-07-16 Thread Jiangjie Qin
Hey Stefan, I think if you follow the one-commit-per-message approach, you will be able to achieve exact once semantic. However this would be very expensive and also everything has to be synchronous in order to make it work. In that sense, de-duplication on client side seems a more favorable

Re: consumer memory footprint

2015-07-16 Thread Jiangjie Qin
I think the rough calculation of max memory footprint for each high level consumer would be: (Number Of Partitions For All Topics) * fetch.message.max.bytes * queued.max.message.chunks + (some decompression memory cost for a message) In your case, it would be 10 times above. Thanks, Jiangjie

Re: Load Balancing Kafka

2015-07-15 Thread Jiangjie Qin
If you have pretty balanced traffic on each partition and have set auto.leader.rebalance.enabled to true or false, you might not need to do further workload balance. However, in most cases you probably still need to do some sort of load balancing based on the traffic and disk utilization of each

Re: Load Balancing Kafka

2015-07-15 Thread Jiangjie Qin
AhŠ It seems you are more focusing on producer side workload balanceŠ If that is the case, please ignore my previous comments. Jiangjie (Becket) Qin On 7/15/15, 6:01 PM, Jiangjie Qin j...@linkedin.com wrote: If you have pretty balanced traffic on each partition and have set

Re: Java API for fetching Consumer group from Kafka Server(Not Zookeeper)

2015-07-15 Thread Jiangjie Qin
It looks kafka.admin.ConsumerGroupCommand class is what you need. Jiangjie (Becket) Qin On 7/14/15, 8:23 PM, Swati Suman swatisuman1...@gmail.com wrote: Hi Team, Currently, I am able to fetch the Topic,Partition,Leader,Log Size through TopicMetadataRequest API available in Kafka. Is there any

Re: Offset not committed

2015-07-15 Thread Jiangjie Qin
=trace/ in my conf file I could not squeeze out any logging from kafka. Logging for all other libs (like zookeeper e.g.) work perfectly. Am I doing something wrong? On Tue, Jul 14, 2015 at 6:55 PM, Jiangjie Qin j...@linkedin.com.invalid wrote: Hi Vadim, Can you turn on trace level logging

Re: Offset not committed

2015-07-15 Thread Jiangjie Qin
commits are coming through during the period you see no moving offsets). - The alternative is to just consume the offsets topic. On Wed, Jul 15, 2015 at 05:30:17PM +, Jiangjie Qin wrote: I am not sure how your project was setup. But I think it depends on what log4j property file you

Re: Offset not committed

2015-07-15 Thread Jiangjie Qin
forever On Wed, Jul 15, 2015 at 3:43 PM, Jiangjie Qin j...@linkedin.com.invalid wrote: Is there anything on the broker log? Is it possible that your client and broker are not running on the same version? Jiangjie (Becket) Qin On 7/15/15, 11:40 AM, Vadim Bobrov vadimbob...@gmail.com wrote

Re: Offset not committed

2015-07-15 Thread Jiangjie Qin
If that is the case, I guess that might still be some value to try to run broker and clients locally and see if the issue still exist. Thanks, Jiangjie (Becket) Qin On 7/15/15, 1:23 PM, Vadim Bobrov vadimbob...@gmail.com wrote: it is pretty random On Wed, Jul 15, 2015 at 4:22 PM, Jiangjie Qin

Re: Custom topic metadata

2015-07-12 Thread Jiangjie Qin
Hi Stevo, Kafka does not support customized topic metadata. What we are doing now is having topic metadata store in a separate zookeeper structure. But I agree this is probably a good candidate feature to consider. Thanks, Jiangjie (Becket) Qin On 7/12/15, 4:16 PM, Stevo Slavić

Re: New consumer API used in mirror maker

2015-07-12 Thread Jiangjie Qin
Yes, we are going to use new consumer after it is ready. Jiangjie (Becket) Qin On 7/12/15, 8:21 PM, tao xiao xiaotao...@gmail.com wrote: Hi team, The trunk code of mirror maker now uses the old consumer API, Is there any plan to use new Java consumer api in mirror maker?

Re: Got conflicted ephemeral node exception for several hours

2015-07-12 Thread Jiangjie Qin
Hi Tao, We see this error from time to time but did not think of this as a big issue. Any reason it bothers you much? I¹m not sure if throwing exception to user on this exception is a good handling or not. What are user supposed to do in that case other than retry? Thanks, Jiangjie (Becket) Qin

Re: How to monitor Kafka offset

2015-07-09 Thread Jiangjie Qin
You can take a look at Burrow. We use it in LinkedIn. Thanks, Jiangjie (Becket) Qin On 7/9/15, 8:53 PM, Anandh Kumar anandhz...@gmail.com wrote: Hi We have any monitor tool which monitor kafka offset? I need some opensource admin tool for kafka. Please guide me. Regards, -Anandh Kumar

Re: Kafka settings for (more) reliable/durable messaging

2015-07-07 Thread Jiangjie Qin
The replica lag definition now is time based, so as long as a replica can catch up with leader in replica.lag.time.max.ms, it is in ISR, no matter how many messages it is behind. And yes, your understanding is correct - ACK is sent back either when all replica in ISR got the message or the

Re: How to achieve Failover or HighAvailable in SimpleConsumer?

2015-07-06 Thread Jiangjie Qin
I guess it is called SimpleConsumer for a reason. Simple consumer is really simple and does not support any failure recovery. You might need to implement you own logic, it is probably not trivial though. As a reference, high level consumer uses Zookeeper ephemeral path to monitor the liveliness of

Re: Origin of product name Kafka

2015-07-06 Thread Jiangjie Qin
:), Maybe we should put this in to Kafka FAQ? On 7/6/15, 7:25 PM, Gwen Shapira gshap...@cloudera.com wrote: Nice :) I always thought its a reference to the Kafkaesque process of building data pipelines in a large organization :) On Mon, Jul 6, 2015 at 6:52 PM, luo.fucong bayinam...@gmail.com

Re: Questions re: auto-commit and camel-kafka

2015-07-04 Thread Jiangjie Qin
From: Michael J. Kitchin mcoyote...@gmail.commailto:mcoyote...@gmail.com Reply-To: mcoy...@mcoyote.commailto:mcoy...@mcoyote.com mcoy...@mcoyote.commailto:mcoy...@mcoyote.com Date: Friday, July 3, 2015 at 3:01 PM To: Jiangjie Qin j...@linkedin.commailto:j...@linkedin.com Cc: users

Re: Questions re: auto-commit and camel-kafka

2015-07-03 Thread Jiangjie Qin
Hi Michael, For the consumer side question. Yes, turning off auto offset commit is what you want. But you should also commit offset manually after you have written the processed data to somewhere else. Also, the offset is only associated with a particular consumer group. So if you restart your

Re: EOL JDK 1.6 for Kafka

2015-07-01 Thread Jiangjie Qin
+1 On 7/1/15, 1:00 PM, Gwen Shapira gshap...@cloudera.com wrote: Huge +1. I don't think there is any other project that still supports 1.6. On Wed, Jul 1, 2015 at 8:05 AM, Harsha ka...@harsha.io wrote: Hi, During our SSL Patch KAFKA-1690. Some of the reviewers/users asked

Re: querying messages based on timestamps

2015-06-30 Thread Jiangjie Qin
Yes, we have plan to add better support for this. Thanks, Jiangjie (Becket) Qin On 6/30/15, 4:23 PM, Zaiming Shi zmst...@gmail.com wrote: Hi Jiangjie ! Does the word 'Currently' imply any plan in introducing timestamp in log entries? Regards /Zaiming On Jun 30, 2015 11:08 PM, Jiangjie Qin j

Re: querying messages based on timestamps

2015-06-30 Thread Jiangjie Qin
Currently Kafka only have a very coarse solution to find offset by time, which is based on the segment last modified time. This interface is only available in simple consumer. You may issue an OffsetRequest to specify a timestamp. The offset returned will be the first offset of segments whose last

Re: At-least-once guarantees with high-level consumer

2015-06-22 Thread Jiangjie Qin
, when autocommit is enabled. So if current users of the high level consumer switch to the new consumer, they might get more duplicates on rebalance, right? Regards Carl On Sun, Jun 21, 2015 at 1:43 AM, Jiangjie Qin j...@linkedin.com.invalid wrote: Hi Carl, Generally, you approach works

Re: OutOfMemoryError in mirror maker

2015-06-21 Thread Jiangjie Qin
. On Mon, Jun 22, 2015 at 5:26 AM Jiangjie Qin j...@linkedin.com.invalid wrote: Yes, we can expose a user callback in MM, just like we did for rebalance listener. I still think ErrorLoggingCallback needs some change, though. Can we only store the value bytes when logAsString is set to true

Re: OutOfMemoryError in mirror maker

2015-06-21 Thread Jiangjie Qin
://issues.apache.org/jira/browse/KAFKA-2281 On Mon, 15 Jun 2015 at 02:16 Jiangjie Qin j...@linkedin.com.invalid wrote: Hi Tao, Yes, the issue that ErrorLoggingCallback keeps value as local variable is known for a while and we probably should fix it as the value is not used

Re: Manual Offset Commits with High Level Consumer skipping messages

2015-06-21 Thread Jiangjie Qin
: loop on consume - process - commit offset every N messages. So we can make sure there is no weird race condition. Thanks, Jiangjie (Becket) Qin On 6/21/15, 6:23 AM, noah iamn...@gmail.com wrote: On Sun, Jun 21, 2015 at 1:10 AM Jiangjie Qin j...@linkedin.com.invalid wrote: Hey Noah, Carl

Re: Broker Fails to restart

2015-06-20 Thread Jiangjie Qin
It seems that your log.index.size.max.bytes was 1K and probably was too small. This will cause your index file to reach its upper limit before fully index the log segment. Jiangjie (Becket) Qin On 6/18/15, 4:52 PM, Zakee kzak...@netzero.net wrote: Any ideas on why one of the brokers which was

Re: At-least-once guarantees with high-level consumer

2015-06-20 Thread Jiangjie Qin
Hi Carl, Generally, you approach works to guarantee at least once consumption - basically people have to commit offset only after they have processed the message. The only problem is that in old high level consumer, during consumer rebalance consumer will (and should) commit offsets. To

Re: Kafka 0.8.3 - New Consumer - user implemented partition.assignment.strategies?

2015-06-18 Thread Jiangjie Qin
Hi Olof, I am just wondering what is the benefit of rebalancing with minimal number of reassignments here? I am asking this because in new consumer, the rebalance actually is quite cheap on the consumer side - just updating a topic partition set. That means the actually rebalance time on

Re: OutOfMemoryError in mirror maker

2015-06-14 Thread Jiangjie Qin
Hi Tao, Yes, the issue that ErrorLoggingCallback keeps value as local variable is known for a while and we probably should fix it as the value is not used except logging the its size. Can you open a ticket and maybe also submit a patch? For unreachable objects I don¹t think it is memory leak. As

Re: Mirror maker doesn't rebalance after getting ZkNoNodeException

2015-06-09 Thread Jiangjie Qin
Which version of MM are you running? On 6/9/15, 4:49 AM, tao xiao xiaotao...@gmail.com wrote: Hi, I have two mirror makers A and B both subscripting to the same whitelist. During topic rebalancing one of the mirror maker A encountered ZkNoNodeException and then stopped all connections. but

Re: New producer very slow to call Callback on error

2015-06-08 Thread Jiangjie Qin
KIP-19 should address this issue. https://cwiki.apache.org/confluence/display/KAFKA/KIP-19+-+Add+a+request+ti meout+to+NetworkClient Thanks, Jiangjie (Becket) Qin On 6/8/15, 10:44 AM, Christofer Hedbrandh christo...@knewton.com wrote: I think this question might relate to the very recently

Re: callback handler is not getting called if cluster is down

2015-06-08 Thread Jiangjie Qin
while sending messages ??* On Mon, Jun 8, 2015 at 10:46 PM, Jiangjie Qin j...@linkedin.com.invalid wrote: What replication factor are you using? Currently if a partition is offline, the message in producer will not be sent but sit in accumulator until the partition comes back online. Do you mean

Re: High CPU usage for idle kafka server

2015-06-08 Thread Jiangjie Qin
@kafka.apache.orgmailto:users@kafka.apache.org Subject: Re: High CPU usage for idle kafka server Seems to be LogCleaner and FetchRequest threads. Attached is the profiling screenshot On Fri, Jun 5, 2015 at 3:06 PM, Jiangjie Qin j...@linkedin.com.invalidmailto:j...@linkedin.com.invalid mailto:j

Re: simple consumer giving error always

2015-06-08 Thread Jiangjie Qin
That is offset out of range error. So you are fetching from an offset either greater than latest offset or smaller than earliest offset available on broker. Jiangjie (Becket) Qin On 6/8/15, 2:21 AM, sunil kalva sambarc...@gmail.com wrote: Hi For few topics i always get FetchResponse.error code

Re: High CPU usage for idle kafka server

2015-06-05 Thread Jiangjie Qin
Has this to do with KAFKA-1461? Can you see which thread is taking a lot of cpu? Some jconsole plugin can get that information. Jiangjie (Becket) Qin On 6/5/15, 2:57 PM, pundlik.anuja pundlik.an...@gmail.com wrote: Hi Jay, Good to hear from you. I met you at the kafka meetup at linkedin. -

Re: How to prevent custom Partitioner from increasing the number of producer's requests?

2015-06-04 Thread Jiangjie Qin
= new ProducerConfig(properties) val producer : Producer[Integer,String] = new Producer[Integer,String](prodConfig) How can I know which producer I am using? And what is the behavior of the new producer? Thanks, Sébastien 2015-06-03 20:04 GMT+02:00 Jiangjie Qin j...@linkedin.com.invalid

Re: How to prevent custom Partitioner from increasing the number of producer's requests?

2015-06-03 Thread Jiangjie Qin
Are you using new producer or old producer? The old producer has 10 min sticky partition behavior while the new producer does not. Thanks, Jiangjie (Becket) Qin On 6/2/15, 11:58 PM, Sebastien Falquier sebastien.falqu...@teads.tv wrote: Hi Jason, The default partitioner does not make the job

Re: Kafka Not Commiting Messages

2015-05-28 Thread Jiangjie Qin
@gmail.com Date: Thursday, May 28, 2015 at 1:09 PM To: Jiangjie Qin j...@linkedin.commailto:j...@linkedin.com Cc: users@kafka.apache.orgmailto:users@kafka.apache.org users@kafka.apache.orgmailto:users@kafka.apache.org Subject: Re: Kafka Not Commiting Messages Hi Jiangjie, Thanks for you message

Re: Java - High Level Consumer

2015-05-28 Thread Jiangjie Qin
Auto.offset.reset only comes into place when 1. the consumer fetches message from offset out of range - available offsets on broker side 2. The consumer group has no offset committed. So in your case, I guess high level consumer has already committed its offset before and that offset is not out

Re: KafkaException: Size of FileMessageSet has been truncated during write

2015-05-27 Thread Jiangjie Qin
This should be just a message fetch failure. The socket was disconnected when broker was writing to it. There should not be data loss. Jiangjie (Becket) Qin On 5/27/15, 11:00 AM, Andrey Yegorov andrey.yego...@gmail.com wrote: I've noticed a few exceptions in the logs like the one below, does it

Re: Compatibility of 0.8.2 client API (new Producer API) and 0.8.1 Kafka server

2015-05-27 Thread Jiangjie Qin
It should work, but usually we prefer the server version to be not lower than client version. On 5/27/15, 3:12 PM, Zhuo Liu zzl0...@tigermail.auburn.edu wrote: Dear all, In 0.8.2.1 Kafka, there is new Producer API (KafkaProducer etc.). My question is: will 0.8.2.1 new Producer API be able to

Re: Kafka Not Commiting Messages

2015-05-27 Thread Jiangjie Qin
Can you turn on TRACE level logging for kafka-request.log and see if broker received the producer request or not? You can go to KAKFA_FOLDER/config/log4j.properties and change log4j.logger.kafka.network.RequestChannels to TRACE. Jiangjie (Becket) Qin On 5/27/15, 12:12 PM, Charlie Mason

Re: Delete topic pending

2015-05-18 Thread Jiangjie Qin
Hmm, which Kafka version are you running? From the code it looks we should have already ignored the deletion of a topic if it does not exist. Jiangjie (Becket) Qin On 5/18/15, 10:15 AM, Dillian Murphey crackshotm...@gmail.com wrote: If a broker doesn't have the topic, and I run delete topic,

Re: Consumers are not load balanced against partitions

2015-05-14 Thread Jiangjie Qin
The default range partition assignment algorithm will assign partition on per topic basis. If you have more consumer threads than number of partitions in a topic, some threads will not be assigned any partition. If you are consuming from multiple topics, You might want to set the

Re: New Producer Async - Metadata Fetch Timeout

2015-05-13 Thread Jiangjie Qin
brokers are actually down ). Since, no metadata is available the send will block. Right? On Wed, May 13, 2015 at 10:51 AM, Jiangjie Qin j...@linkedin.com.invalid wrote: Application will not block on each metadata refresh or metadata is expired. Application will only be blocked

Re: Compression and batching

2015-05-13 Thread Jiangjie Qin
If you are sending in sync mode, producer will just group by partition the list of messages you provided as argument of send() and send them out. You don¹t need to worry about batch.num.messages. There is a potential that compressed message is even bigger than uncompressed message, though. I¹m

Re: Experiences testing new producer performance across multiple threads/producer counts

2015-05-13 Thread Jiangjie Qin
Thanks for sharing this, Garry. I actually did similar tests before but unfortunately lost the test data because my laptop rebooted and I forgot to save the dataŠ Anyway, several things to verify: 1. Remember KafkaProducer holds lock per partition. So if you have only one partition in the target

Re: Auto-rebalance not triggering in 2.10-0.8.1.1

2015-05-13 Thread Jiangjie Qin
Automatic preferred leader election hasn¹t been turned on in 0.8.1.1. It¹s been turned on in latest trunk though. The config name is ³auto.leader.rebalance.enable. Jiangjie (Becket) Qin On 5/13/15, 10:50 AM, Stephen Armstrong stephen.armstr...@linqia.com wrote: Does anyone have any insight

Re: Getting NotLeaderForPartitionException in kafka broker

2015-05-13 Thread Jiangjie Qin
Does this topic exist in Zookeeper? On 5/12/15, 11:35 PM, tao xiao xiaotao...@gmail.com wrote: Hi, Any updates on this issue? I keep seeing this issue happening over and over again On Thu, May 7, 2015 at 7:28 PM, tao xiao xiaotao...@gmail.com wrote: Hi team, I have a 12 nodes cluster that

Re: Compression and batching

2015-05-13 Thread Jiangjie Qin
Yes, in old producer we don¹t control the compressed message size. In new producer, we estimate the compressed size heuristically and decide whether to close the batch or not. It is not perfect but at least better than the old one. Jiangjie (Becket) Qin On 5/13/15, 4:00 PM, Jamie X

Re: New Producer Async - Metadata Fetch Timeout

2015-05-12 Thread Jiangjie Qin
That¹s right. Send() will first try to get metadata of a topic, that is a blocking operation. On 5/12/15, 2:48 AM, Rendy Bambang Junior rendy.b.jun...@gmail.com wrote: Hi, sorry if my understanding is incorrect. I am integrating kafka producer with application, when i try to shutdown all kafka

Re: New Producer Async - Metadata Fetch Timeout

2015-05-12 Thread Jiangjie Qin
to a particular broker in its current metadata (This might happen in some cases like if the leader moves). It cannot produce without having the right metadata. Thanks, Mayuresh On Tue, May 12, 2015 at 10:09 AM, Jiangjie Qin j...@linkedin.com.invalid wrote: That¹s right. Send

Re: Compression and batching

2015-05-12 Thread Jiangjie Qin
Mayuresh, this is about the old producer instead of the new Java producer. Jamie, In the old producer, if you use sync mode, the list of message will be sent as a batch. On the other hand, if you are using async mode, the messages are just put into the queue and batched with other messages.

Re: New Java producer broker metadata update stuck

2015-05-08 Thread Jiangjie Qin
Dan, Just to make sure I understand it correctly. What do you mean by different ip - broker mapping? Do you mean you changed your broker ip? We have different mechanism in and producer to get the cluster information. Consumer get all the information from Zookeeper while producer has to talk to

Re: Kafka behind AWS ELB

2015-05-04 Thread Jiangjie Qin
Hi Dillian, Yeah, ELB + ASG will is pretty popular however might need further tricks to use them for Kafka brokers. As I replied to Chandrashekhar in another email. You can use ELB as bootstrap.servers/metadata.broker.list to serve as client bootstrap use case. But all the producing/consuming

Re: Kafka brokers behind AWS ELB

2015-05-03 Thread Jiangjie Qin
Yes. On 5/3/15, 10:57 AM, Chandrashekhar Kotekar shekhar.kote...@gmail.com wrote: Thanks a lot for the information. So basically each broker needs to have his own public IP address? Regards, Chandrash3khar Kotekar Mobile - +91 8600011455 On Sun, May 3, 2015 at 11:09 PM, Jiangjie Qin j

Re: Data replication and zero data loss

2015-04-30 Thread Jiangjie Qin
Which mirror maker version did you look at? The MirrorMaker in trunk should not have data loss if you just use the default setting. On 4/30/15, 7:53 PM, Joong Lee jo...@me.com wrote: Hi, We are exploring Kafka to keep two data centers (primary and DR) running hosts of elastic search nodes in

Re: New Producer API - batched sync mode support

2015-04-30 Thread Jiangjie Qin
Roshan, If I understand correctly, you just want to make sure a number of messages has been sent successfully. Using callback might be easier to do so. Public class MyCallback implements Callback { public SetRecordMetadata failedSend; @Override Public void

Re: Kafka offset using kafka topic - not consuming messages

2015-04-29 Thread Jiangjie Qin
OK, so you turned off auto.offset.commit, and set the auto.offset.reset to largest. That means when you consume, 1. If you did not commit offsets manually, no offsets will be committed to Kafka. 2. If you do not have an offset stored in Kafka, you will start from the log end and ignore the

Re: Kafka commit offset

2015-04-28 Thread Jiangjie Qin
Yes, if you set the offset storage to Kafka, high level consumer will be using Kafka for all offset related operations. Jiangjie (Becket) Qin On 4/27/15, 7:03 PM, Gomathivinayagam Muthuvinayagam sankarm...@gmail.com wrote: I am trying to commit offset request in a background thread. I am able

Re: Getting java.lang.IllegalMonitorStateException in mirror maker when building fetch request

2015-04-27 Thread Jiangjie Qin
Hi Tao, KAFKA-2150 has been filed. Jiangjie On 4/24/15, 12:38 PM, tao xiao xiaotao...@gmail.com wrote: Hi team, I observed java.lang.IllegalMonitorStateException thrown from AbstractFetcherThread in mirror maker when it is trying to build the fetchrequst. Below is the error [2015-04-23

Re: New Java Producer: Single Producer vs multiple Producers

2015-04-27 Thread Jiangjie Qin
Hi Jay, Does o.a.k.clients.tools.ProducerPerformance provide multi-thread test? I did not find it. I tweaked the test a little bit to make it multi-threaded and what I found is that in a single thread case, with each message of 10 bytes, single caller thread has ~2M messages/second throughput.

Re: partition reassignment stuck

2015-04-21 Thread Jiangjie Qin
This means that the broker 0 thought broker 28 was leader for that partition but broker 28 has actually already received StopReplicaRequest from controller and stopped serving as a replica for that partition. This might happen transiently but broker 0 will be able to find the new leader for the

Re: partition reassignment stuck

2015-04-21 Thread Jiangjie Qin
this morning. Do the other brokers have to be bounced after the controller situation is fixed? I did not do that for all brokers. Wes On Apr 21, 2015 8:25 PM, Jiangjie Qin j...@linkedin.com.invalid wrote: Yes, should be broker 25 thread 0 from the log. This needs to be resolved, you might need

Re: Kafka Zookeeper queries

2015-04-21 Thread Jiangjie Qin
brings down write load on ZKs. Regards, Pushkar On Tue, Apr 21, 2015 at 1:13 PM, Jiangjie Qin j...@linkedin.com.invalid wrote: 2 partitions should be OK. On 4/21/15, 12:33 AM, Achanta Vamsi Subhash achanta.va...@flipkart.com wrote: We are planning to have ~2 partitions

Re: partition reassignment stuck

2015-04-21 Thread Jiangjie Qin
Yes, should be broker 25 thread 0 from the log. This needs to be resolved, you might need to bounce both of the brokers who think itself as controller respectively. The new controller should be able to continue the partition reassignment. From: Wes Chow

Re: Kafka Zookeeper queries

2015-04-20 Thread Jiangjie Qin
Producers usually do not query zookeeper at all. Consumers usually query zookeeper at beginning or rebalance. It is supposed to be in frequent if you don¹t have consumers come and go all the time. One exception is that if you are using zookeeper based consumer offset commit, it will commit offset

Re: ReplicaFetcherThread Error, Massive Logging, and Leader Flapping

2015-04-16 Thread Jiangjie Qin
It seems there are many different symptoms you see... Maybe we can start from leader flapping issue. Any findings in controller log? Jiangjie (Becket) Qin On 4/16/15, 12:09 PM, Kyle Banker kyleban...@gmail.com wrote: Hi, I've run into a pretty serious production issue with Kafka 0.8.2, and

Re: Fetch Request Purgatory and Mirrormaker

2015-04-14 Thread Jiangjie Qin
Hey Evan, Is this issue only observed when mirror maker is consuming? It looks that for Cluster A you have some other consumers. Do you mean if you stop mirror maker the problem goes away? Jiangjie (Becket) Qin On 4/14/15, 6:55 AM, Evan Huus evan.h...@shopify.com wrote: Any ideas on this? It's

Re: Consumer offsets in offsets topic 0.8.2

2015-04-13 Thread Jiangjie Qin
/config/topics [my-control] ls /consumers [] quit Thanks. Mayank. On Fri, Mar 20, 2015 at 9:54 AM, Jiangjie Qin j...@linkedin.com.invalid wrote: Hi Vamsi, The ConsumerOffsetChecker.scala or kafka-consumer-offset-checker.sh still works. You can use them to check the offsets. If you need

Re: question about Kafka

2015-04-07 Thread Jiangjie Qin
producer? Thanks Liang -Original Message- From: Jiangjie Qin [mailto:j...@linkedin.com.INVALID] Sent: Monday, April 06, 2015 11:46 AM To: users@kafka.apache.org Subject: Re: question about Kafka Hey Liang, Have you looked at the quick start here: https://kafka.apache.org

Re: question about Kafka

2015-04-06 Thread Jiangjie Qin
Hey Liang, Have you looked at the quick start here: https://kafka.apache.org/documentation.html#quickstart In Kafka, on the producer side, there is no concept of ³commit². If you are producing using KafkaProducer, you can do a send.get(), this is a synchronized send so if no exception was

Re: question about Kafka

2015-04-06 Thread Jiangjie Qin
Also if you are using Kafka from the latest trunk, KafkaProducer has a flush() interface that you may call. This will ensure all the message previously sent from send() methods are sent to Kafka server. On 4/3/15, 3:38 PM, Sun, Joey joey@emc.com wrote: Hello, group I am a newbie to Kafka. I

Re: Problem with node after restart no partitions?

2015-04-03 Thread Jiangjie Qin
This sounds a very serious issueŠ Could you provide the controller log and the log for the first broker on which you tried controlled shutdown and upgrade? On 4/3/15, 8:57 AM, Jason Rosenberg j...@squareup.com wrote: I'm preparing a longer post here, but we recently ran into a similar scenario.

Re: delete.retention.ms in 0.8.1

2015-04-03 Thread Jiangjie Qin
Another thing is that the active log segment would not be deleted, so if there is only one log segment, it will not honor the retention policy. You may config log.roll.ms to make sure you have a rolled over log segment. On 4/3/15, 5:20 AM, Madhukar Bharti bhartimadhu...@gmail.com wrote: Hi

Re: Which version works for kafka 0.8.2 as consumer?

2015-04-01 Thread Jiangjie Qin
Yes, KafkaConsumer in 0.8.2 is still in development. You probably still want to use ZookeeperConsumerConnector for now. On 4/1/15, 9:28 AM, Mark Zang deepnight...@gmail.com wrote: I found the 0.8.2.0 and 0.8.2.1 has a KafkaConsumer. But this class seems not completed and not functional. Lots of

Re: Consumer in Java client

2015-03-31 Thread Jiangjie Qin
Hi, KafkaConsumer is still under development and not ready for wide use case. Currently, it can be used to replace SimpleConsumer (low level consumer), but can not replace ZookeeperConsumerConnector(high level consumer). So if you need to use simple consumer, I would suggest using KafkaConsumer

Re: Async producer using Sync producer for send

2015-03-31 Thread Jiangjie Qin
The async send() put the message into a message queue then returns. When the messages are pulled out of the queue by the sender thread, it still uses SyncProducer to send ProducerRequests to brokers. Jiangjie (Becket) Qin On 3/30/15, 10:44 PM, Madhukar Bharti bhartimadhu...@gmail.com wrote: Hi

Re: Can Mirroring Preserve Every Topic's Partition?

2015-03-31 Thread Jiangjie Qin
The Mirror maker in trunk now supports mirroring with preserved partition. You can wire in a message handler to assign partitions for each producer record before handing them to producer. Jiangjie (Becket) Qin On 3/31/15, 3:41 AM, Ivan Balashov ibalas...@gmail.com wrote: Alex, Just wondering,

Re: Kafka server relocation

2015-03-25 Thread Jiangjie Qin
If you want to do a seamless migration. I think a better way is to build a cross datacenter Kafka cluster temporarily. So the process is: 1. Add several new Kafka brokers in your new datacenter and add them to the old cluster. 2. Use replica assignment tool to reassign all the partitions to

Re: Is replicating partitions cover disaster recovery

2015-03-24 Thread Jiangjie Qin
Yes, Kafka use replica to tolerate node failures. Depending on which level of durability and availability guarantee you need, you might need different settings on the broker and producer. Kafka cluster will automatically take care of node failure in a cluster for you. Jiangjie (Becket) Qin On

Re: Mirror maker fetcher thread unexpectedly stopped

2015-03-24 Thread Jiangjie Qin
Hi Xiao, I think the fix for IllegalStateExcepetion is correct. Can you also create a ticket and submit a patch? Thanks. Jiangjie (Becket) Qin On 3/24/15, 4:31 PM, tao xiao xiaotao...@gmail.com wrote: Hi community, I wanted to know if the solution I supplied can fix the

Re: Mirror maker fetcher thread unexpectedly stopped

2015-03-24 Thread Jiangjie Qin
fetcher thread is down. On Wed, Mar 25, 2015 at 8:00 AM, tao xiao xiaotao...@gmail.com wrote: Thanks JIanjie. Can I reuse KAFKA-1997 or should I create a new ticket? On Wed, Mar 25, 2015 at 7:58 AM, Jiangjie Qin j...@linkedin.com.invalid wrote: Hi Xiao, I think the fix

Re: 'roundrobin' partition assignment strategy restrictions

2015-03-20 Thread Jiangjie Qin
with a heterogeneous set of selected topics between consumers. Jason On Thu, Mar 19, 2015 at 8:07 PM, Mayuresh Gharat gharatmayures...@gmail.com wrote: Hi Becket, Can you list down an example for this. It would be easier to understand :) Thanks, Mayuresh On Thu, Mar 19, 2015 at 4:46 PM, Jiangjie

Re: Consumer offsets in offsets topic 0.8.2

2015-03-20 Thread Jiangjie Qin
Hi Vamsi, The ConsumerOffsetChecker.scala or kafka-consumer-offset-checker.sh still works. You can use them to check the offsets. If you need to check the offsets programmatically, you can send OffsetsFetcheRequest to broker using simple consumer. You may refer the ConsumerOffsetChecker.scala to

Re: New Java Producer Client handling case where Kafka is unreachable

2015-03-20 Thread Jiangjie Qin
This is correct when you send to a topic for the first time. After that the metadata will be cached, the metadata cache has an age and after it expires, metadata will be refreshed. So the time a producer found a broker is not reachable is the minimum value of the following times: 1. Linger.ms +

Re: 'roundrobin' partition assignment strategy restrictions

2015-03-19 Thread Jiangjie Qin
Hi Jason, The round-robin strategy first takes the partitions of all the topics a consumer is consuming from, then distributed them across all the consumers. If different consumers are consuming from different topics, the assigning algorithm will generate different answers on different consumers.

Re: Random failure testing

2015-03-16 Thread Jiangjie Qin
We are planning to develop a Chaos Monkey test in LinkedIn and will open source it. You can check out KAFKA-2014. Jiangjie (Becket) Qin On 3/16/15, 1:24 PM, John Lonergan john.loner...@gmail.com wrote: Re kafka-1539 Is the community executing random failure testing for Kafka? It would seem

Re: How to shutdown mirror maker safely

2015-03-13 Thread Jiangjie Qin
ctrl+c should work. Did you see any issue for that? On 3/12/15, 11:49 PM, tao xiao xiaotao...@gmail.com wrote: Hi, I wanted to know that how I can shutdown mirror maker safely (ctrl+c) when there is no message coming to consume. I am using mirror maker off trunk code. -- Regards, Tao

Re: Broker Restart failed w/ Corrupt index found

2015-03-13 Thread Jiangjie Qin
Can you reproduce this problem? Although the the fix is strait forward we would like to understand why this happened. On 3/13/15, 3:56 PM, Zakee kzak...@netzero.net wrote: Just found there is a known issue to be resolved in future kafka version: https://issues.apache.org/jira/browse/KAFKA-1554

Re: Does consumer support combination of whitelist and blacklist topic filtering

2015-03-12 Thread Jiangjie Qin
No sure if it is an option. But does filtering out topics with message handler works for you? Are you going to resume consuming from a topic after you stop consuming from it? Jiangjie (Becket) Qin On 3/12/15, 8:05 AM, tao xiao xiaotao...@gmail.com wrote: Yes, you are right. a dynamic

Re: How replicas catch up the leader

2015-03-10 Thread Jiangjie Qin
xiao and Jiangjie Qin, Thank you very much I try to run kafka-reassign-partitions.sh, but the issue still exists… this the log info: [2015-03-11 11:00:40,086] ERROR Conditional update of path /brokers/topics/ad_click_sts/partitions/4/state with data {controller_epoch:23,leader:1,version:1

  1   2   >