partition reassignment

Wes Chow Fri, 03 Apr 2015 10:09:11 -0700

I'm in the process of reassigning partitions away from failing machinesand it appears to be stuck. One thought is because our machines arefailing at a very high rate and so some partitions no longer have anylive replicas at all. At this point I don't care about the data, I justwant to get all partitions onto the set of machines that I know work. Isthere some way I can do this? I am happy to manipulate ZooKeeper andbounce nodes if need be.

And a warning... this is due to Amazon EC2 d2 instance type failures. Wespun up 9 d2.xlarge instances and within a few hours 6 have failed undera Kafka workload. So yeah, bleeding edge.

One thing I've done is rebuilt one of these nodes with the same brokerid and name but under a known working instance type. It came up and nowis spewing this in the logs:

[2015-04-03 13:05:30,275] 805497 [kafka-request-handler-0] WARNkafka.server.KafkaApis - [KafkaApi-29] Produce request with correlationid 5849 from client ping_partitioner on partition [pings,245] failed dueto Topic pings either doesn't exist or is in the process of being deleted

The topic most certainly should exist, however I'm guessing it'scomplaining because there are no live replicas for that partition. Isthere some way to get it to just become leader?

Wes

partition reassignment

Reply via email to