consumer offset not saved in zk

2013-05-22 Thread rk vishu
Hello All, I recently started experimenting Kafka for my usecase. I am running 0.8 in two node kafka setup. I produced 20messages using a java program(1 partition with 2 replicas) and I am running the consumer code as given in the example https://cwiki.apache.org/KAFKA/consumer-group-example.html

Re: large amount of disk space freed on restart

2013-05-22 Thread Jason Rosenberg
No, nothing outside of kafka would look at those files I'm wondering if it's an os level thing too On Wed, May 22, 2013 at 10:25 PM, Jonathan Creasy wrote: > Well, it sounds like files were deleted while Kafka still had them open. > Or something else opened them while Kafka deleted the

Re: large amount of disk space freed on restart

2013-05-22 Thread Jonathan Creasy
Well, it sounds like files were deleted while Kafka still had them open. Or something else opened them while Kafka deleted them. I haven't noticed this on our systems but we haven't looked for it either. Is anything outside of Kafka deleting or reading those files? On May 23, 2013 1:17 AM, "Jason

Re: large amount of disk space freed on restart

2013-05-22 Thread Jason Rosenberg
So, does this indicate kafka (or the jvm itself) is not aggressively closing file handles of deleted files? Is there a fix for this? Or is there not likely anything to be done? What happens if the disk fills up with file handles for phantom deleted files? Jason On Wed, May 22, 2013 at 9:50 PM

Re: large amount of disk space freed on restart

2013-05-22 Thread Jonathan Creasy
It isn't uncommon if a process has an open file handle on a file that is deleted, the space is not freed until the handle is closed. So restarting the process that has a handle on the file would cause the space to be freed also. You can troubleshoot that with lsof. Normally, I see 2-4 log segments

large amount of disk space freed on restart

2013-05-22 Thread Jason Rosenberg
Normally, I see 2-4 log segments deleted every hour in my brokers. I see log lines like this: 2013-05-23 04:40:06,857 INFO [kafka-logcleaner-0] log.LogManager - Deleting log segment 035434043157.kafka from However, it seems like if I restart the broker, a massive amount of disk space i

Re: Apache Kafka in AWS

2013-05-22 Thread Jun Rao
Jason, Thanks for sharing. This is very interesting. Normally, Kafka brokers don't use too much CPU. Are most of the 750% CPU actually used by Kafka brokers? Jun On Wed, May 22, 2013 at 6:11 PM, Jason Weiss wrote: > >>Did you check that you were using all cores? > > top was reporting over 750

RE: Apache Kafka in AWS

2013-05-22 Thread Jason Weiss
>>Did you check that you were using all cores? top was reporting over 750% Jason From: Ken Krugler [kkrugler_li...@transpac.com] Sent: Wednesday, May 22, 2013 20:59 To: users@kafka.apache.org Subject: Re: Apache Kafka in AWS Hi Jason, On May 22, 2013, a

Re: Apache Kafka in AWS

2013-05-22 Thread Ken Krugler
Hi Jason, On May 22, 2013, at 3:35pm, Jason Weiss wrote: > Ken, > > Great question! I should have indicated I was using EBS, 500GB with 2000 > provisioned IOPs. OK, thanks. Sounds like you were pegged on CPU usage. But that does surprise me a bit. Did you check that you were using all cores?

Re: Apache Kafka in AWS

2013-05-22 Thread Scott Clasen
Thanks. FWIW this one has been fine so far java version "1.7.0_13" OpenJDK Runtime Environment (IcedTea7 2.3.6) (Ubuntu build 1.7.0_13-b20) OpenJDK 64-Bit Server VM (build 23.7-b01, mixed mode) though not running at the load in your tests. On Wed, May 22, 2013 at 4:51 PM, Jason Weiss wrote:

RE: Apache Kafka in AWS

2013-05-22 Thread Jason Weiss
[ec2-user@ip-10-194-5-76 ~]$ java -version java version "1.6.0_24" OpenJDK Runtime Environment (IcedTea6 1.11.11) (amazon-61.1.11.11.53.amzn1-x86_64) OpenJDK 64-Bit Server VM (build 20.0-b12, mixed mode) Yes, as soon as I put it under heavy load, it would buckle almost consistently. I knew it w

Re: message ordering guarantees

2013-05-22 Thread Ross Black
Thanks for the explanation. Ross On 23 May 2013 07:19, Neha Narkhede wrote: > Thanks, > Neha > On May 21, 2013 5:42 PM, "Ross Black" wrote: > > > Hi, > > > > I am using Kafka 0.7.1, and using SyncProducer and SimpleConsumer with a > > single broker service process. > > > > I am occasionally

Re: Partitioning and scale

2013-05-22 Thread Neha Narkhede
Not automatically as of today. You have to run the reassign-partitions tool and explicitly move selected partitions to the new brokers. If you use this tool, you can move partitions to the new broker without any downtime. Thanks, Neha On Wed, May 22, 2013 at 2:20 PM, Timothy Chen wrote: > Hi N

Re: Apache Kafka in AWS

2013-05-22 Thread Scott Clasen
Hey Jason, question what openjdk version did you have issues with? Im running kafka on it now and has been ok. Was it a crash only at load? Thanks SC On Wed, May 22, 2013 at 1:42 PM, Jason Weiss wrote: > All, > > I asked a number of questions of the group over the last week, and I'm > happy

Re: Offset in high level consumer

2013-05-22 Thread Neha Narkhede
You can run the ConsumerOffsetChecker tool that ships with Kafka. Thanks, Neha On Wed, May 22, 2013 at 2:02 PM, arathi maddula wrote: > Hi, > > Could you tell me how to find the offset in a high level Java consumer ? > > Thanks > Arathi >

RE: Apache Kafka in AWS

2013-05-22 Thread Jason Weiss
Jonathan, Using 0.7.2, with just a single EBS volume per broker instance - negative on the RAID 10. I would speculate that if we used RAID 10 and we went with AWS's maximum provisioned IOPS (5000??) we probably could have squeaked out some more eps. I have no doubt, BTW, that if we would have

Re: Apache Kafka in AWS

2013-05-22 Thread Jonathan Hodges
Awesome right up Jason! Very helpful as we are also looking to build a Kafka environment in AWS. I am curious, are you using Kafka 0.7.2 or 0.8 in your tests? Did you have just one EBS volume per broker instance or RAID 10 across EBS volumes per broker? Thanks again for the great info! -Jonath

RE: Apache Kafka in AWS

2013-05-22 Thread Jason Weiss
Ken, Great question! I should have indicated I was using EBS, 500GB with 2000 provisioned IOPs. Jason From: Ken Krugler [kkrugler_li...@transpac.com] Sent: Wednesday, May 22, 2013 17:23 To: users@kafka.apache.org Subject: Re: Apache Kafka in AWS Hi Jaso

Re: Apache Kafka in AWS

2013-05-22 Thread Ken Krugler
Hi Jason, Thanks for the notes. I'm curious whether you went with using local drives (ephemeral storage) or EBS, and if with EBS then what IOPS. Thanks, -- Ken On May 22, 2013, at 1:42pm, Jason Weiss wrote: > All, > > I asked a number of questions of the group over the last week, and I'm ha

Re: Partitioning and scale

2013-05-22 Thread Timothy Chen
Hi Neha/Chris, Thanks for the reply, so if I set a fixed number of partitions and just add brokers to the broker pool, does it rebalance the load to the new brokers (along with the data)? Tim On Wed, May 22, 2013 at 1:15 PM, Neha Narkhede wrote: > - I see that Kafka server.properties allows on

Re: message ordering guarantees

2013-05-22 Thread Neha Narkhede
Thanks, Neha On May 21, 2013 5:42 PM, "Ross Black" wrote: > Hi, > > I am using Kafka 0.7.1, and using SyncProducer and SimpleConsumer with a > single broker service process. > > I am occasionally seeing messages (from a *single* partition) being > processed out of order to what I expect and I am

Offset in high level consumer

2013-05-22 Thread arathi maddula
Hi, Could you tell me how to find the offset in a high level Java consumer ? Thanks Arathi

Re: Apache Kafka in AWS

2013-05-22 Thread Neha Narkhede
Thanks for sharing your experience with the community, Jason! -Neha On Wed, May 22, 2013 at 1:42 PM, Jason Weiss wrote: > All, > > I asked a number of questions of the group over the last week, and I'm > happy to report that I've had great success getting Kafka up and running in > AWS. I am us

Apache Kafka in AWS

2013-05-22 Thread Jason Weiss
All, I asked a number of questions of the group over the last week, and I'm happy to report that I've had great success getting Kafka up and running in AWS. I am using 3 EC2 instances, each of which is a M2 High-Memory Quadruple Extra Large with 8 cores and 58.4 GiB of memory according to the A

Re: Partitioning and scale

2013-05-22 Thread Neha Narkhede
- I see that Kafka server.properties allows one to specify the number of partitions it supports. However, when we want to scale I wonder if we add # of partitions or # of brokers, will the same partitioner start distributing the messages to different partitions? And if it does, how can that same c

temporary files

2013-05-22 Thread Yu, Libo
Hi, After I run "./sbt assembly-package-dependency" A bunch of temporary files are created in core/target/scala-2.8.0/cache/core/global/assembly I think they should be deleted after that. Regards, Libo

Re: Partitioning and scale

2013-05-22 Thread Chris Curtin
Hi Tim, On Wed, May 22, 2013 at 3:25 PM, Timothy Chen wrote: > Hi, > > I'm currently trying to understand how Kafka (0.8) can scale with our usage > pattern and how to setup the partitioning. > > We want to route the same messages belonging to the same id to the same > queue, so its consumer wi

Partitioning and scale

2013-05-22 Thread Timothy Chen
Hi, I'm currently trying to understand how Kafka (0.8) can scale with our usage pattern and how to setup the partitioning. We want to route the same messages belonging to the same id to the same queue, so its consumer will able to consume all the messages of that id. My questions: - From my un

Re: orders of launching kafka servers and zookeepers

2013-05-22 Thread Neha Narkhede
First launch the zookeeper cluster completely followed by the kafka cluster. Thanks, Neha On May 22, 2013 8:43 AM, "Yu, Libo" wrote: > Hi, > > I want to launch kafka on three machines. I can launch zookeepers > on the three machines first. After that, start kafka server on each > machine. Or for

orders of launching kafka servers and zookeepers

2013-05-22 Thread Yu, Libo
Hi, I want to launch kafka on three machines. I can launch zookeepers on the three machines first. After that, start kafka server on each machine. Or for each machine, I start a zookeeper followed by the kafka. I believe the first way is the right way to go. But I want to confirm it. Regards, L

Re: message ordering guarantees

2013-05-22 Thread Jun Rao
Yes, reconnect.time.interval.ms is a new config that we added to 0.7. Will update the website. VIP is a virtual IP on a load balancer. You can associate multiple brokers with a VIP and the VIP will randomly select one of the brokers to connect to during socket establish time. The issue is that if