Vladislav Pyatkov created IGNITE-8710: -----------------------------------------
Summary: Applying WAL works long time or fail at all, when *.wal files been removed Key: IGNITE-8710 URL: https://issues.apache.org/jira/browse/IGNITE-8710 Project: Ignite Issue Type: Bug Reporter: Vladislav Pyatkov In specific cases when removed *.wal files or unmounted wal directories we got some warning message on start: {noformat} 2018-06-02 12:10:06.127[INFO ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Checking memory state [lastValidPos=FileWALPointer [idx=0, fileOff=0, len=0], lastMarked=FileWALPointer [idx=0, fileOff=0, len=0], lastCheckpointId=00000000-0000-0000-0000-000000000000] 2018-06-02 12:10:06.546[WARN ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected checkpoint marker, skipping [cpId=94b5ce03-87b7-489e-b08b-b4c5dc522bd5, expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=0, fileOff=44266869, len=977]] 2018-06-02 12:10:57.860[WARN ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected checkpoint marker, skipping [cpId=3f6ab238-23f7-4924-b4ef-0cb68d914a04, expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=7, fileOff=872888269, len=460112]] 2018-06-02 12:11:46.600[INFO ][Thread-100][o.a.i.i.p.c.p.w.FileWriteAheadLogManager] Stopping WAL iteration due to an exception: EOF at position [1073741824] expected to read [1] bytes, ptr=FileWALPointer [idx=15, fileOff=1073741824, len=0] 2018-06-02 12:12:21.181[WARN ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected checkpoint marker, skipping [cpId=3fe33806-ee11-49b7-8c47-648cd1adacbc, expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=23, fileOff=693360866, len=460112]] {noformat} And trying to recovery from WAL hangs a long try without success. Should to stop the node and print message about not found necessary wal-files. -- This message was sent by Atlassian JIRA (v7.6.3#76005)