[ https://issues.apache.org/jira/browse/IOTDB-4040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Song Ziyang reassigned IOTDB-4040:
----------------------------------

    Assignee: Yongzao Dan  (was: Song Ziyang)

> Sync SchemaRegion Failed: o.a.ratis.util.LogUtils:124 - : Failed requestVote ..._50010#0
> ------------------------------------------------------------------------------------------
>
>                 Key: IOTDB-4040
>                 URL: https://issues.apache.org/jira/browse/IOTDB-4040
>             Project: Apache IoTDB
>          Issue Type: Bug
>          Components: mpp-cluster
>    Affects Versions: 0.14.0-SNAPSHOT
>            Reporter: 刘珍
>            Assignee: Yongzao Dan
>            Priority: Major
>         Attachments: image-2022-08-04-17-51-22-725.png, image-2022-08-04-17-52-13-492.png, ip15_2000_config-1.properties, ip15_2000_config.properties, ip3_2000_config.properties, ip4_2000_config.properties
>
>
> master_0804_f900a34
> schema/data : RatisConsensus
> 3 replicas, 3C7D (3 confignodes, 7 datanodes), clean environment. After ip5 registers with the cluster successfully, kill its datanode process.
> Register the schema and write data. ip5 holds 2 schemaregions. Start ip5, and syncing the schemaregions fails:
> 2022-08-04 17:23:30,895 [grpc-default-executor-0] WARN o.a.ratis.util.LogUtils:124 - 172.20.70.5_50010: Failed requestVote 172.20.70.16_50010->172.20.70.5_50010#0
> org.apache.ratis.protocol.exceptions.GroupMismatchException: 172.20.70.5_50010: group-000200000001 not found.
>     at org.apache.ratis.server.impl.RaftServerProxy$ImplMap.get(RaftServerProxy.java:148)
>     at org.apache.ratis.server.impl.RaftServerProxy.getImplFuture(RaftServerProxy.java:347)
>     at org.apache.ratis.server.impl.RaftServerProxy.getImpl(RaftServerProxy.java:356)
>     at org.apache.ratis.server.impl.RaftServerProxy.getImpl(RaftServerProxy.java:351)
>     at org.apache.ratis.server.impl.RaftServerProxy.requestVote(RaftServerProxy.java:603)
>     at org.apache.ratis.grpc.server.GrpcServerProtocolService.requestVote(GrpcServerProtocolService.java:172)
>     at org.apache.ratis.proto.grpc.RaftServerProtocolServiceGrpc$MethodHandlers.invoke(RaftServerProtocolServiceGrpc.java:382)
>     at org.apache.ratis.thirdparty.io.grpc.stub.ServerCalls$UnaryServerCallHandler$UnaryServerCallListener.onHalfClose(ServerCalls.java:182)
>     at org.apache.ratis.thirdparty.io.grpc.PartialForwardingServerCallListener.onHalfClose(PartialForwardingServerCallListener.java:35)
>     at org.apache.ratis.thirdparty.io.grpc.ForwardingServerCallListener.onHalfClose(ForwardingServerCallListener.java:23)
>     at org.apache.ratis.thirdparty.io.grpc.internal.ServerCallImpl$ServerStreamListenerImpl.halfClosed(ServerCallImpl.java:340)
>     at org.apache.ratis.thirdparty.io.grpc.internal.ServerImpl$JumpToApplicationThreadServerStreamListener$1HalfClosed.runInContext(ServerImpl.java:866)
>     at org.apache.ratis.thirdparty.io.grpc.internal.ContextRunnable.run(ContextRunnable.java:37)
>     at org.apache.ratis.thirdparty.io.grpc.internal.SerializingExecutor.run(SerializingExecutor.java:133)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748)
> Steps to reproduce
> 1. Private cloud hosts 172.20.70.2/3/4/5/13/14/16
> benchmark runs on ip 15
> confignode :
> MAX_HEAP_SIZE="4G"
> schema_region_consensus_protocol_class=org.apache.iotdb.consensus.ratis.RatisConsensus
> data_region_consensus_protocol_class=org.apache.iotdb.consensus.ratis.RatisConsensus
> schema_replication_factor=3
> data_replication_factor=3
> datanode:
> MAX_HEAP_SIZE="16G"
> max_waiting_time_when_insert_blocked=3600000
> query_timeout_threshold=3600000
> 2. On ip5, start a script that watches for the datanode process and kills it
> while true
> do
>   jps|grep -i datanode|awk '{print "kill -9 " $1}'|sh
>   sleep 2
> done
> 3. Start 3 confignodes
> Start 7 datanodes
> !image-2022-08-04-17-51-22-725.png!
> 4. Start 3 bm (benchmark) instances
> See the attachments for the configuration
> Check the regions information
> !image-2022-08-04-17-52-13-492.png!
> 5. Start the datanode on ip5
> The log shows the WARN above, and the schemaregion sync fails.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)