[jira] [Assigned] (IOTDB-3409) Snapshot error about UDFInfo

2022-06-09 Thread Jialin Qiao (Jira)


 [ 
https://issues.apache.org/jira/browse/IOTDB-3409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jialin Qiao reassigned IOTDB-3409:
--

Assignee: Steve Yurong Su  (was: Yongzao Dan)

> Snapshot error about UDFInfo
> 
>
> Key: IOTDB-3409
> URL: https://issues.apache.org/jira/browse/IOTDB-3409
> Project: Apache IoTDB
>  Issue Type: Bug
>  Components: mpp-cluster
>Affects Versions: 0.14.0-SNAPSHOT
>Reporter: 刘珍
>Assignee: Steve Yurong Su
>Priority: Major
> Attachments: config.properties.first, 
> image-2022-06-07-10-24-31-654.png, screenshot-1.png, screenshot-2.png
>
>
> master_0606_c2afca1
> 3confignode,3datanode,schemaRegion/dataRegion 3副本。
> Restart recovery, the third confignode error :
> 2022-06-07 09:28:38,086 [pool-2-thread-1] ERROR o.a.i.c.r.SnapshotStorage:108 
> - read file info failed for snapshot file
> java.io.FileNotFoundException: 
> /data/liuzhen_test/master_0606_c2afca1/confignode/sbin/../data/consensus/47474747-4747-4747-4747-/sm/1_7/udf
>  (Is a directory)
> at java.io.FileInputStream.open0(Native Method)
> at java.io.FileInputStream.open(FileInputStream.java:195)
> at java.io.FileInputStream.(FileInputStream.java:138)
> at 
> org.apache.ratis.util.MD5FileUtil.computeMd5ForFile(MD5FileUtil.java:125)
> at 
> org.apache.iotdb.consensus.ratis.SnapshotStorage.getLatestSnapshot(SnapshotStorage.java:106)
> at 
> org.apache.ratis.statemachine.impl.BaseStateMachine.getLatestSnapshot(BaseStateMachine.java:96)
> at 
> org.apache.ratis.server.impl.ServerState.lambda$new$2(ServerState.java:145)
> at 
> org.apache.ratis.server.raftlog.RaftLogBase.(RaftLogBase.java:87)
> at 
> org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.(SegmentedRaftLog.java:194)
> at 
> org.apache.ratis.server.impl.ServerState.initRaftLog(ServerState.java:205)
> at 
> org.apache.ratis.server.impl.ServerState.(ServerState.java:152)
> at 
> org.apache.ratis.server.impl.RaftServerImpl.(RaftServerImpl.java:201)
> at 
> org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$5(RaftServerProxy.java:274)
> at 
> java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> 2022-06-07 09:28:38,087 [pool-2-thread-1] ERROR o.a.i.c.r.SnapshotStorage:108 
> - read file info failed for snapshot file
> java.io.FileNotFoundException: 
> /data/liuzhen_test/master_0606_c2afca1/confignode/sbin/../data/consensus/47474747-4747-4747-4747-/sm/1_7/ext
>  (Is a directory)
> at java.io.FileInputStream.open0(Native Method)
> at java.io.FileInputStream.open(FileInputStream.java:195)
> at java.io.FileInputStream.(FileInputStream.java:138)
> at 
> org.apache.ratis.util.MD5FileUtil.computeMd5ForFile(MD5FileUtil.java:125)
> at 
> org.apache.iotdb.consensus.ratis.SnapshotStorage.getLatestSnapshot(SnapshotStorage.java:106)
> at 
> org.apache.ratis.statemachine.impl.BaseStateMachine.getLatestSnapshot(BaseStateMachine.java:96)
> at 
> org.apache.ratis.server.impl.ServerState.lambda$new$2(ServerState.java:145)
> at 
> org.apache.ratis.server.raftlog.RaftLogBase.(RaftLogBase.java:87)
> at 
> org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.(SegmentedRaftLog.java:194)
> at 
> org.apache.ratis.server.impl.ServerState.initRaftLog(ServerState.java:205)
> at 
> org.apache.ratis.server.impl.ServerState.(ServerState.java:152)
> at 
> org.apache.ratis.server.impl.RaftServerImpl.(RaftServerImpl.java:201)
> at 
> org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$5(RaftServerProxy.java:274)
> at 
> java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> ~
> ~
>  !image-2022-06-07-10-24-31-654.png! 
> 复现流程
> 1. 192.168.10.62/66/68 72C256G
> bm 连ip-62 写数据
> 停止3个datanode,备份数据
> /data/liuzhen_test/master_0606_c2afca1/datanode/data_2022_06_07-08-55-06_for_rec
> 停止3个confignode,备份数据/data/liuzhen_test/master_0606_c2afca1/confignode/data_2022_06_07-08-55-39_for_rec
> 2. 顺序启动ip-62,66,68
> 只有68的confignode有error
> 另:3个datanode启动失败
> -
> master_0607_7cb2d3f 与上面相同配置,干净环境跑测试,有1个confignode 
> 共识组下存在下面的文件夹信息(另2个confignode相同sm下为空)
>   !sc

[jira] [Assigned] (IOTDB-3409) Snapshot error about UDFInfo

2022-06-09 Thread Jialin Qiao (Jira)


 [ 
https://issues.apache.org/jira/browse/IOTDB-3409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jialin Qiao reassigned IOTDB-3409:
--

Assignee: Song Ziyang  (was: Steve Yurong Su)

> Snapshot error about UDFInfo
> 
>
> Key: IOTDB-3409
> URL: https://issues.apache.org/jira/browse/IOTDB-3409
> Project: Apache IoTDB
>  Issue Type: Bug
>  Components: mpp-cluster
>Affects Versions: 0.14.0-SNAPSHOT
>Reporter: 刘珍
>Assignee: Song Ziyang
>Priority: Major
> Attachments: config.properties.first, 
> image-2022-06-07-10-24-31-654.png, screenshot-1.png, screenshot-2.png
>
>
> master_0606_c2afca1
> 3confignode,3datanode,schemaRegion/dataRegion 3副本。
> Restart recovery, the third confignode error :
> 2022-06-07 09:28:38,086 [pool-2-thread-1] ERROR o.a.i.c.r.SnapshotStorage:108 
> - read file info failed for snapshot file
> java.io.FileNotFoundException: 
> /data/liuzhen_test/master_0606_c2afca1/confignode/sbin/../data/consensus/47474747-4747-4747-4747-/sm/1_7/udf
>  (Is a directory)
> at java.io.FileInputStream.open0(Native Method)
> at java.io.FileInputStream.open(FileInputStream.java:195)
> at java.io.FileInputStream.(FileInputStream.java:138)
> at 
> org.apache.ratis.util.MD5FileUtil.computeMd5ForFile(MD5FileUtil.java:125)
> at 
> org.apache.iotdb.consensus.ratis.SnapshotStorage.getLatestSnapshot(SnapshotStorage.java:106)
> at 
> org.apache.ratis.statemachine.impl.BaseStateMachine.getLatestSnapshot(BaseStateMachine.java:96)
> at 
> org.apache.ratis.server.impl.ServerState.lambda$new$2(ServerState.java:145)
> at 
> org.apache.ratis.server.raftlog.RaftLogBase.(RaftLogBase.java:87)
> at 
> org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.(SegmentedRaftLog.java:194)
> at 
> org.apache.ratis.server.impl.ServerState.initRaftLog(ServerState.java:205)
> at 
> org.apache.ratis.server.impl.ServerState.(ServerState.java:152)
> at 
> org.apache.ratis.server.impl.RaftServerImpl.(RaftServerImpl.java:201)
> at 
> org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$5(RaftServerProxy.java:274)
> at 
> java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> 2022-06-07 09:28:38,087 [pool-2-thread-1] ERROR o.a.i.c.r.SnapshotStorage:108 
> - read file info failed for snapshot file
> java.io.FileNotFoundException: 
> /data/liuzhen_test/master_0606_c2afca1/confignode/sbin/../data/consensus/47474747-4747-4747-4747-/sm/1_7/ext
>  (Is a directory)
> at java.io.FileInputStream.open0(Native Method)
> at java.io.FileInputStream.open(FileInputStream.java:195)
> at java.io.FileInputStream.(FileInputStream.java:138)
> at 
> org.apache.ratis.util.MD5FileUtil.computeMd5ForFile(MD5FileUtil.java:125)
> at 
> org.apache.iotdb.consensus.ratis.SnapshotStorage.getLatestSnapshot(SnapshotStorage.java:106)
> at 
> org.apache.ratis.statemachine.impl.BaseStateMachine.getLatestSnapshot(BaseStateMachine.java:96)
> at 
> org.apache.ratis.server.impl.ServerState.lambda$new$2(ServerState.java:145)
> at 
> org.apache.ratis.server.raftlog.RaftLogBase.(RaftLogBase.java:87)
> at 
> org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.(SegmentedRaftLog.java:194)
> at 
> org.apache.ratis.server.impl.ServerState.initRaftLog(ServerState.java:205)
> at 
> org.apache.ratis.server.impl.ServerState.(ServerState.java:152)
> at 
> org.apache.ratis.server.impl.RaftServerImpl.(RaftServerImpl.java:201)
> at 
> org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$5(RaftServerProxy.java:274)
> at 
> java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> ~
> ~
>  !image-2022-06-07-10-24-31-654.png! 
> 复现流程
> 1. 192.168.10.62/66/68 72C256G
> bm 连ip-62 写数据
> 停止3个datanode,备份数据
> /data/liuzhen_test/master_0606_c2afca1/datanode/data_2022_06_07-08-55-06_for_rec
> 停止3个confignode,备份数据/data/liuzhen_test/master_0606_c2afca1/confignode/data_2022_06_07-08-55-39_for_rec
> 2. 顺序启动ip-62,66,68
> 只有68的confignode有error
> 另:3个datanode启动失败
> -
> master_0607_7cb2d3f 与上面相同配置,干净环境跑测试,有1个confignode 
> 共识组下存在下面的文件夹信息(另2个confignode相同sm下为空)
>   !screen