[ 
https://issues.apache.org/jira/browse/CASSANDRA-11010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

T Jake Luciani updated CASSANDRA-11010:
---------------------------------------
    Priority: Blocker  (was: Major)

> Bootstrap hangs on 3.2
> ----------------------
>
>                 Key: CASSANDRA-11010
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11010
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: T Jake Luciani
>            Priority: Blocker
>
> Doing some testing I've noticed with 3.2 nodes can hang when bootstrapping. 
> below is the snippet from the log of a hung node.
> {quote}
> INFO  [main] 2016-01-14 02:53:25,537 StorageService.java:1281 - JOINING: 
> Starting to bootstrap...
> INFO  [main] 2016-01-14 02:53:25,564 StreamResultFuture.java:88 - [Stream 
> #fb512640-ba69-11e5-8f71-f38b96ae2156] Executing streaming plan for Bootstrap
> DEBUG [main] 2016-01-14 02:53:25,564 StreamCoordinator.java:144 - Connecting 
> next session fb512640-ba69-11e5-8f71-f38b96ae2156 with 172.31.4.27.
> INFO  [StreamConnectionEstablisher:1] 2016-01-14 02:53:25,565 
> StreamSession.java:238 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Starting streaming to /172.31.4.27
> DEBUG [StreamConnectionEstablisher:1] 2016-01-14 02:53:25,566 
> ConnectionHandler.java:82 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Sending stream init for incoming stream
> DEBUG [StreamConnectionEstablisher:1] 2016-01-14 02:53:25,568 
> ConnectionHandler.java:87 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Sending stream init for outgoing stream
> INFO  [StreamConnectionEstablisher:1] 2016-01-14 02:53:25,574 
> StreamCoordinator.java:266 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156, 
> ID#0] Beginning stream session with /172.31.4.27
> DEBUG [STREAM-OUT-/172.31.4.27] 2016-01-14 02:53:25,574 
> ConnectionHandler.java:334 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Sending Prepare (2 requests,  0 files}
> DEBUG [STREAM-IN-/172.31.4.27] 2016-01-14 02:53:25,597 
> ConnectionHandler.java:262 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Received Prepare (0 requests,  0 files}
> DEBUG [STREAM-OUT-/172.31.4.27] 2016-01-14 02:53:25,597 
> ConnectionHandler.java:334 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Sending Complete
> DEBUG [STREAM-IN-/172.31.4.27] 2016-01-14 02:53:25,597 
> ConnectionHandler.java:262 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Received Complete
> DEBUG [STREAM-IN-/172.31.4.27] 2016-01-14 02:53:25,598 
> ConnectionHandler.java:110 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Closing stream connection handler on /172.31.4.27
> INFO  [STREAM-IN-/172.31.4.27] 2016-01-14 02:53:25,598 
> StreamResultFuture.java:185 - [Stream #fb512640-ba69-11e5-8f71-f38b96ae2156] 
> Session with /172.31.4.27 is complete
> {quote}
> Vs the log of a successful bootstrap.
> {quote}
> INFO  [main] 2016-01-14 03:54:13,068 StreamResultFuture.java:88 - [Stream 
> #7963f460-ba72-11e5-b8d5-834198444b31] Executing streaming plan for Bootstrap
> INFO  [StreamConnectionEstablisher:1] 2016-01-14 03:54:13,070 
> StreamSession.java:238 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31] 
> Starting streaming to /172.31.18.61
> INFO  [StreamConnectionEstablisher:1] 2016-01-14 03:54:13,086 
> StreamCoordinator.java:266 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31, 
> ID#0] Beginning stream session with /172.31.18.61
> INFO  [STREAM-IN-/172.31.18.61] 2016-01-14 03:54:13,198 
> StreamResultFuture.java:171 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31 
> ID#0] Prepare completed. Receiving 1 files(102 bytes), sending 0 files(0 
> bytes)
> INFO  [StreamConnectionEstablisher:2] 2016-01-14 03:54:13,206 
> StreamSession.java:238 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31] 
> Starting streaming to /172.31.21.58
> INFO  [StreamConnectionEstablisher:2] 2016-01-14 03:54:13,209 
> StreamCoordinator.java:266 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31, 
> ID#0] Beginning stream session with /172.31.21.58
> INFO  [STREAM-IN-/172.31.21.58] 2016-01-14 03:54:13,255 
> StreamResultFuture.java:185 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31] 
> Session with /172.31.21.58 is complete
> INFO  [StreamReceiveTask:1] 2016-01-14 03:54:13,590 
> StreamResultFuture.java:185 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31] 
> Session with /172.31.18.61 is complete
> INFO  [StreamReceiveTask:1] 2016-01-14 03:54:13,594 
> StreamResultFuture.java:217 - [Stream #7963f460-ba72-11e5-b8d5-834198444b31] 
> All sessions completed
> INFO  [StreamReceiveTask:1] 2016-01-14 03:54:13,597 StorageService.java:1234 
> - Bootstrap completed! for the tokens [3074457345618258602]
> INFO  [main] 2016-01-14 03:54:14,424 StorageService.java:1924 - Node 
> /172.31.29.26 state jump to NORMAL
> {quote}
> The jstack of the bad bootstrap shows it waiting for the stream to complete 
> even though there are no outstanding streams.
> {quote}
> "main" #1 prio=5 os_prio=0 tid=0x00007f0c8c00afc0 nid=0x747a waiting on 
> condition [0x00007f0c95e31000]
>    java.lang.Thread.State: WAITING (parking)
>       at sun.misc.Unsafe.park(Native Method)
>       - parking to wait for  <0x00000006d8addd18> (a 
> com.google.common.util.concurrent.AbstractFuture$Sync)
>       at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
>       at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
>       at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
>       at 
> java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
>       at 
> com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:285)
>       at 
> com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:116)
>       at 
> org.apache.cassandra.service.StorageService.bootstrap(StorageService.java:1345)
>       at 
> org.apache.cassandra.service.StorageService.joinTokenRing(StorageService.java:933)
>       at 
> org.apache.cassandra.service.StorageService.initServer(StorageService.java:707)
>       - locked <0x00000006eee44028> (a 
> org.apache.cassandra.service.StorageService)
>       at 
> org.apache.cassandra.service.StorageService.initServer(StorageService.java:578)
>       - locked <0x00000006eee44028> (a 
> org.apache.cassandra.service.StorageService)
>       at 
> org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:349)
>       at 
> org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:551)
>       at 
> org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:680)
> {quote}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to