OOM when applying migrations

Vanger Thu, 20 Sep 2012 06:12:03 -0700

Hello,

We are trying to add new nodes to our *6-node* cassandra cluster withRF=3 cassandra version 1.0.11. We are *adding 18 new nodes* one-by-one.

First strange thing, I've noticed, is the number of completedMigrationStage in nodetool tpstats grows for every new node, whileschema is not changed. For now with 21-nodes ring, for final join itshows 184683 migrations, while with 7-nodes it was about 50k migrations.In fact it seems that this number is not a number of applied migrations.When i grep log file with

grep "Applying migration" /var/log/cassandra/system.log -c

For each new node result is pretty much the same - around 7500 "Applyingmigration" found in log.

And the real problem is that now new nodes fail with Out Of Memory whilebuilding schema from migrations. In logs we can find the following:

WARN [ScheduledTasks:1] 2012-09-19 18:51:22,497 GCInspector.java (line145) Heap is 0.7712290960125684 full. You may need to reduce memtableand/or cache sizes. Cassandra will now flush up to the two largestmemtables to free up memory. Adjust flush_largest_memtables_atthreshold in cassandra.yaml if you don't want Cassandra to do thisautomaticallyINFO [ScheduledTasks:1] 2012-09-19 18:51:22,498 StorageService.java(line 2658) Unable to reduce heap usage since there are no dirty columnfamilies

....

WARN [ScheduledTasks:1] 2012-09-19 18:51:29,500 GCInspector.java (line139) Heap is 0.853078131310858 full. You may need to reduce memtableand/or cache sizes. Cassandra is now reducing cache sizes to free upmemory. Adjust reduce_cache_sizes_at threshold in cassandra.yaml if youdon't want Cassandra to do this automaticallyWARN [ScheduledTasks:1] 2012-09-19 18:51:29,500 AutoSavingCache.java(line 187) Reducing AppUser RowCache capacity from 100000 to 0 to reducememory pressureWARN [ScheduledTasks:1] 2012-09-19 18:51:29,500 AutoSavingCache.java(line 187) Reducing AppUser KeyCache capacity from 100000 to 0 to reducememory pressureWARN [ScheduledTasks:1] 2012-09-19 18:51:29,500 AutoSavingCache.java(line 187) Reducing PaymentClaim KeyCache capacity from 50000 to 0 toreduce memory pressureWARN [ScheduledTasks:1] 2012-09-19 18:51:29,500 AutoSavingCache.java(line 187) Reducing Organization RowCache capacity from 1000 to 0 toreduce memory pressure

 .....

INFO [main] 2012-09-19 18:57:14,181 StorageService.java (line 668)JOINING: waiting for schema information to completeERROR [Thread-28] 2012-09-19 18:57:14,198 AbstractCassandraDaemon.java(line 139) Fatal exception in thread Thread[Thread-28,5,main]

java.lang.OutOfMemoryError: Java heap space

atorg.apache.cassandra.net.IncomingTcpConnection.receiveMessage(IncomingTcpConnection.java:140)atorg.apache.cassandra.net.IncomingTcpConnection.run(IncomingTcpConnection.java:115)

...

ERROR [ReadStage:353] 2012-09-19 18:57:20,453AbstractCassandraDaemon.java (line 139) Fatal exception in threadThread[ReadStage:353,5,main]

java.lang.OutOfMemoryError: Java heap space

atorg.apache.cassandra.service.MigrationManager.makeColumns(MigrationManager.java:256)atorg.apache.cassandra.db.DefinitionsUpdateVerbHandler.doVerb(DefinitionsUpdateVerbHandler.java:51)atorg.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:59)

Originally "max heap size" was set to 6G. Then we increased heap sizelimit to 8G and it works. But warnings still present

WARN [ScheduledTasks:1] 2012-09-20 11:39:11,373 GCInspector.java (line145) Heap is 0.7760745735786222 full. You may need to reduce memtableand/or cache sizes. Cassandra will now flush up to the two largestmemtables to free up memory. Adjust flush_largest_memtables_atthreshold in cassandra.yaml if you don't want Cassandra to do thisautomaticallyINFO [ScheduledTasks:1] 2012-09-20 11:39:11,374 StorageService.java(line 2658) Unable to reduce heap usage since there are no dirty columnfamilies


It is probably a bug in applying migrations.

Could anyone explain why cassandra behaves this way? Could you pleaserecommend us smth to cope with this situation?

Thank you in advance.

--
W/ best regards,
Sergey B.

OOM when applying migrations

Reply via email to