On 26/11/14 09:24, Tan Ban Wee wrote:
Hi,
This is a 2 nodes cluster and they are randomly rebooting itself. I hope
someone can help me to narrow down to the cause.
Nov 25 23:40:17 qdiskd Node 1 missed an update (3/4)
Nov 25 23:40:18 qdiskd Node 1 missed an update (4/4)
Nov 25 23:40:19 qdiskd Node 1 missed an update (5/4)
Nov 25 23:40:19 qdiskd Node 1 DOWN
Nov 25 23:40:19 qdiskd Writing eviction notice for node 1
Nov 25 23:40:19 qdiskd Telling CMAN to kill the node
Nov 25 23:40:20 qdiskd Node 1 evicted
Nov 25 23:44:19 qdiskd Node 1 is UP
Nov 25 23:44:20 qdiskd Node 1 shutdown
Nov 25 23:44:26 qdiskd Node 1 is UP
Nov 25 23:44:37 qdiskd Node 1 missed an update (2/4)
Nov 25 23:44:38 qdiskd Node 1 missed an update (3/4)
Nov 25 23:44:39 qdiskd Node 1 missed an update (4/4)
Nov 25 23:44:40 qdiskd Node 1 missed an update (5/4)
Nov 25 23:44:40 qdiskd Node 1 DOWN
Nov 25 23:44:40 qdiskd Writing eviction notice for node 1
Nov 25 23:44:40 qdiskd Telling CMAN to kill the node
Nov 25 23:44:41 qdiskd Node 1 evicted
Nov 25 23:50:48 qdiskd Loading dynamic configuration
Nov 25 23:50:49 qdiskd Setting autocalculated votes to 1
Nov 25 23:50:49 qdiskd Loading static configuration
Nov 25 23:50:49 qdiskd Auto-configured TKO as 4 based on token=10000
interval=1
Nov 25 23:50:49 qdiskd Timings: 4 tko, 1 interval
Nov 25 23:50:49 qdiskd Timings: 2 tko_up, 3 master_wait, 2 upgrade_wait
Nov 25 23:50:49 qdiskd Heuristic: 'ping -c3 -w5 10.101.210.250' score=1
interval=3 tko=5
Nov 25 23:50:49 qdiskd 1 heuristics loaded
Nov 25 23:50:49 qdiskd Quorum Daemon: 1 heuristics, 1 interval, 4 tko, 1
votes
Nov 25 23:50:49 qdiskd Run Flags: 00000231
Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad
Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512
Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad
Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512
Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad
Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512
Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad
Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512
Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad
Nov 25 23:50:49 qdiskd diskRawReadShadow: bad CRC32, offset = 0 len = 512
Nov 25 23:50:49 qdiskd Header CRC32 mismatch; Exp: 0x00000000 Got:
0x190a55ad
From that limited information I would guess that your quorum disk
partition is either offline or corrupted. First check that the drive is
online and if it seems OK physically then check that it's not been
formatted as a filesystem or something else by mistake and rebuild the
header using mkqdisk.
Chrissie
_______________________________________________
discuss mailing list
[email protected]
http://lists.corosync.org/mailman/listinfo/discuss