[jira] [Commented] (IOTDB-4350) [ MultiLeader Throttle Down] Performance does not return to normal after “Throttle Down“
[ https://issues.apache.org/jira/browse/IOTDB-4350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17601136#comment-17601136 ] 刘珍 commented on IOTDB-4350: --- 不制造故障,运行完成4546.75 second。 此issue场景(72故障3分钟,73故障2分钟),运行完成 :13210.52 second,13210.52/4546.75=2.9。 > [ MultiLeader Throttle Down] Performance does not return to normal after > “Throttle Down“ > - > > Key: IOTDB-4350 > URL: https://issues.apache.org/jira/browse/IOTDB-4350 > Project: Apache IoTDB > Issue Type: Bug > Components: mpp-cluster >Affects Versions: 0.14.0-SNAPSHOT >Reporter: 刘珍 >Assignee: 张洪胤 >Priority: Major > Attachments: net_restart.conf, screenshot-1.png, screenshot-2.png, > screenshot-3.png, screenshot-4.png, screenshot-5.png, screenshot-6.png > > > m_0905_0095eb3,3副本3C3D > 3个dataregion , 每个node上有1个leader。 > ip72 断网3分钟(16:52 ~ 16:55),查看集群状态,切主成功后, > ip73断网2分钟,之后不执行故障操作。 > 同步慢,multiLeader一直在合并限流,但是限流性能也回不去,如下,统计1分钟的写入数据量(bm中的batch) > !screenshot-6.png! > IoTDB> select count(latency) from > root.result.moresession_2022_09_06_04_47_03.INGESTION where okPoint>0 group > by ([1662454041076000186,1662459764764000179),1m); > +---++ > | > Time|count(root.result.moresession_2022_09_06_04_47_03.INGESTION.latency)| > +---++ > |2022-09-06T16:47:21.076000186+08:00| >5544| > |2022-09-06T16:48:21.076000186+08:00| >6282| > |2022-09-06T16:49:21.076000186+08:00| >5671| > |2022-09-06T16:50:21.076000186+08:00| >4589| > |2022-09-06T16:51:21.076000186+08:00| >5350| > |2022-09-06T16:52:21.076000186+08:00| >1121| > |2022-09-06T16:53:21.076000186+08:00| > 901| > |2022-09-06T16:54:21.076000186+08:00| > 201| > |2022-09-06T16:55:21.076000186+08:00| > 334| > |2022-09-06T16:56:21.076000186+08:00| >3501| > |2022-09-06T16:57:21.076000186+08:00| >3677| > |2022-09-06T16:58:21.076000186+08:00| >3111| > |2022-09-06T16:59:21.076000186+08:00| >1948| > |2022-09-06T17:00:21.076000186+08:00| >3889| > |2022-09-06T17:01:21.076000186+08:00| >2982| > |2022-09-06T17:02:21.076000186+08:00| >4465| > |2022-09-06T17:03:21.076000186+08:00| >4871| > |2022-09-06T17:04:21.076000186+08:00| >4478| > |2022-09-06T17:05:21.076000186+08:00| >3242| > |2022-09-06T17:06:21.076000186+08:00| >2545| > |2022-09-06T17:07:21.076000186+08:00| >2579| > |2022-09-06T17:08:21.076000186+08:00| > 133| > |2022-09-06T17:09:21.076000186+08:00| > 488| > |2022-09-06T17:10:21.076000186+08:00| > 253| > |2022-09-06T17:11:21.076000186+08:00| > 445| > |2022-09-06T17:12:21.076000186+08:00| >2122| > |2022-09-06T17:13:21.076000186+08:00| >1799| > |2022-09-06T17:14:21.076000186+08:00| >1568| > |2022-09-06T17:15:21.076000186+08:00| > 355| > |2022-09-06T17:16:21.076000186+08:00|
[jira] [Created] (IOTDB-4356) Filter reduplicative device paths in PathPatternTree
Yongzao Dan created IOTDB-4356: -- Summary: Filter reduplicative device paths in PathPatternTree Key: IOTDB-4356 URL: https://issues.apache.org/jira/browse/IOTDB-4356 Project: Apache IoTDB Issue Type: Improvement Reporter: Yongzao Dan Assignee: yanze chen Fix For: 0.14.0 Attachments: image-2022-09-07-11-42-50-076.png !image-2022-09-07-11-42-50-076.png! -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4355) Support index skip in PlanNodeIterator
Haiming Zhu created IOTDB-4355: -- Summary: Support index skip in PlanNodeIterator Key: IOTDB-4355 URL: https://issues.apache.org/jira/browse/IOTDB-4355 Project: Apache IoTDB Issue Type: Improvement Components: Core/WAL Reporter: Haiming Zhu Assignee: Haiming Zhu PlanNodeIterator reads plan node by index order, if index has gap, this iterator will block forever. We should move this gap process logic to consensus layer. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4354) [ confignode ] The leader of the schemaregion is displayed on the node that has been down
刘珍 created IOTDB-4354: - Summary: [ confignode ] The leader of the schemaregion is displayed on the node that has been down Key: IOTDB-4354 URL: https://issues.apache.org/jira/browse/IOTDB-4354 Project: Apache IoTDB Issue Type: Bug Components: mpp-cluster Affects Versions: 0.14.0-SNAPSHOT Reporter: 刘珍 Assignee: Yongzao Dan Attachments: image-2022-09-07-10-36-30-819.png m_0905_0095eb3, 3副本3C3D 下图,74节点的datanode已经down掉(72也down了),show regions|grep Leader ,SchemaRegion的Leader还在74,应该切主。 !image-2022-09-07-10-36-30-819.png! 测试环境同https://issues.apache.org/jira/browse/IOTDB-4350 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (IOTDB-4294) [ mem leak ] Request metadata timed out, causing memory leak
[ https://issues.apache.org/jira/browse/IOTDB-4294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17601086#comment-17601086 ] 刘珍 commented on IOTDB-4294: --- OOM的issue ,见 https://issues.apache.org/jira/browse/IOTDB-4251 > [ mem leak ] Request metadata timed out, causing memory leak > > > Key: IOTDB-4294 > URL: https://issues.apache.org/jira/browse/IOTDB-4294 > Project: Apache IoTDB > Issue Type: Bug > Components: mpp-cluster >Affects Versions: 0.14.0-SNAPSHOT >Reporter: 刘珍 >Assignee: Yuan Tian >Priority: Major > Labels: pull-request-available > Attachments: add_dn.conf, screenshot-1.png, screenshot-2.png, > screenshot-3.png > > > master_0830_42fcbfc > schema_region_consensus_protocol_class=org.apache.iotdb.consensus.standalone.{color:#DE350B}*StandAloneConsensus*{color} > {color:#DE350B}*schemaregion 1副本*{color} > dataregion 3副本 > 先启动1confignode,3datanode,启动benchmark,只有写入,{color:#DE350B}15小时后,大量写入失败{color},下图是统计每小时的写入数据量: > !screenshot-1.png! > ip5 error > 2022-08-30 18:01:04,256 [20220830_094446_42966_3.1.0-1068] ERROR > o.a.i.d.m.e.f.FragmentInstanceManager:157 - Execute error caused by > org.apache.iotdb.db.mpp.exception.MemoryNotEnoughException: There is not > enough memory to execute current fragment instance, current remaining free > memory is 1014007, estimated memory usage for current fragment instance is > 1048576 > at > org.apache.iotdb.db.mpp.plan.planner.LocalExecutionPlanner.checkMemory(LocalExecutionPlanner.java:132) > at > org.apache.iotdb.db.mpp.plan.planner.LocalExecutionPlanner.plan(LocalExecutionPlanner.java:104) > at > org.apache.iotdb.db.mpp.execution.fragment.FragmentInstanceManager.lambda$execSchemaQueryFragmentInstance$3(FragmentInstanceManager.java:147) > at > java.util.concurrent.ConcurrentHashMap.computeIfAbsent(ConcurrentHashMap.java:1660) > at > org.apache.iotdb.db.mpp.execution.fragment.FragmentInstanceManager.execSchemaQueryFragmentInstance(FragmentInstanceManager.java:133) > at > org.apache.iotdb.db.consensus.statemachine.SchemaRegionStateMachine.read(SchemaRegionStateMachine.java:94) > at > org.apache.iotdb.consensus.standalone.StandAloneServerImpl.read(StandAloneServerImpl.java:72) > at > org.apache.iotdb.consensus.standalone.StandAloneConsensus.read(StandAloneConsensus.java:135) > at > org.apache.iotdb.db.service.thrift.impl.DataNodeInternalRPCServiceImpl.sendFragmentInstance(DataNodeInternalRPCServiceImpl.java:169) > at > org.apache.iotdb.mpp.rpc.thrift.IDataNodeRPCService$Processor$sendFragmentInstance.getResult(IDataNodeRPCService.java:2136) > at > org.apache.iotdb.mpp.rpc.thrift.IDataNodeRPCService$Processor$sendFragmentInstance.getResult(IDataNodeRPCService.java:2116) > at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:38) > at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:38) > at > org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:248) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > at java.lang.Thread.run(Thread.java:748) > 复现流程 > 1. 172.20.70.3/4/5 8c32G datanode > 172.20.70.31 confignode 8c32G > benchmark在ip15 /data/benchmark/bm_0620_7ec96c1 > 集群与regions信息 > !screenshot-2.png! > 2. 数据库配置参数 > confignode > MAX_HEAP_SIZE="16G" > schema_region_consensus_protocol_class=org.apache.iotdb.consensus.standalone.StandAloneConsensus > data_region_consensus_protocol_class=org.apache.iotdb.consensus.multileader.MultiLeaderConsensus > schema_replication_factor=1 > data_replication_factor=3 > datanode > MAX_HEAP_SIZE="16G" > wal_buffer_size_in_byte=1048576 > max_waiting_time_when_insert_blocked=360 > 3. 启动bm 配置文件见附件 > 4. 后续还有add datanode(20分钟 加1个datanode,共加6个) > ip 2/13/14/16/18/19 > 但是没有新的写入操作,所以这些新的datanode上没有数据。 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (IOTDB-4176) Research Procedure Framework
[ https://issues.apache.org/jira/browse/IOTDB-4176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17601080#comment-17601080 ] Jinrui Zhang commented on IOTDB-4176: - Please paste the doc link here > Research Procedure Framework > > > Key: IOTDB-4176 > URL: https://issues.apache.org/jira/browse/IOTDB-4176 > Project: Apache IoTDB > Issue Type: Task >Reporter: 伊丹翔 >Assignee: 伊丹翔 >Priority: Major > -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4353) [Benchmark] Add new strategy of dividing device into client.
张洪胤 created IOTDB-4353: -- Summary: [Benchmark] Add new strategy of dividing device into client. Key: IOTDB-4353 URL: https://issues.apache.org/jira/browse/IOTDB-4353 Project: Apache IoTDB Issue Type: New Feature Reporter: 张洪胤 Assignee: 张洪胤 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4352) Fix PatialPath#overlapWith PartialPath contains MULTI_LEVEL_PATH_WILDCARD
yanze chen created IOTDB-4352: - Summary: Fix PatialPath#overlapWith PartialPath contains MULTI_LEVEL_PATH_WILDCARD Key: IOTDB-4352 URL: https://issues.apache.org/jira/browse/IOTDB-4352 Project: Apache IoTDB Issue Type: Bug Reporter: yanze chen Assignee: yanze chen PatialPath#overlapWith should not return true directly if PartialPath contains MULTI_LEVEL_PATH_WILDCARD. For example, root.**.s1 is not overlap with root.**.s2 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Assigned] (IOTDB-4351) Enhance client-cpp for better performance
[ https://issues.apache.org/jira/browse/IOTDB-4351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Zhang reassigned IOTDB-4351: --- Assignee: Bo Zhang > Enhance client-cpp for better performance > - > > Key: IOTDB-4351 > URL: https://issues.apache.org/jira/browse/IOTDB-4351 > Project: Apache IoTDB > Issue Type: Improvement >Reporter: Bo Zhang >Assignee: Bo Zhang >Priority: Major > -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4351) Enhance client-cpp for better performance
Bo Zhang created IOTDB-4351: --- Summary: Enhance client-cpp for better performance Key: IOTDB-4351 URL: https://issues.apache.org/jira/browse/IOTDB-4351 Project: Apache IoTDB Issue Type: Improvement Reporter: Bo Zhang -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4350) [ MultiLeader Throttle Down] Performance does not return to normal after “Throttle Down“
刘珍 created IOTDB-4350: - Summary: [ MultiLeader Throttle Down] Performance does not return to normal after “Throttle Down“ Key: IOTDB-4350 URL: https://issues.apache.org/jira/browse/IOTDB-4350 Project: Apache IoTDB Issue Type: Bug Components: mpp-cluster Affects Versions: 0.14.0-SNAPSHOT Reporter: 刘珍 Assignee: 张洪胤 m_0905_0095eb3,3副本3C3D 3个dataregion , 每个node上有1个leader。 ip72 断网2分钟,查看集群状态,切主成功后。 ip73断网2分钟,之后不执行故障操作。 同步慢,multiLeader一直在合并限流,但是限流性能也回不去,如下,统计1分钟的写入数据量(bm中的batch) IoTDB> select count(latency) from root.result.moresession_2022_09_06_04_47_03.INGESTION where okPoint>0 group by ([1662454041076000186,1662459764764000179),1m); +---++ | Time|count(root.result.moresession_2022_09_06_04_47_03.INGESTION.latency)| +---++ |2022-09-06T16:47:21.076000186+08:00| 5544| |2022-09-06T16:48:21.076000186+08:00| 6282| |2022-09-06T16:49:21.076000186+08:00| 5671| |2022-09-06T16:50:21.076000186+08:00| 4589| |2022-09-06T16:51:21.076000186+08:00| 5350| |2022-09-06T16:52:21.076000186+08:00| 1121| |2022-09-06T16:53:21.076000186+08:00| 901| |2022-09-06T16:54:21.076000186+08:00| 201| |2022-09-06T16:55:21.076000186+08:00| 334| |2022-09-06T16:56:21.076000186+08:00| 3501| |2022-09-06T16:57:21.076000186+08:00| 3677| |2022-09-06T16:58:21.076000186+08:00| 3111| |2022-09-06T16:59:21.076000186+08:00| 1948| |2022-09-06T17:00:21.076000186+08:00| 3889| |2022-09-06T17:01:21.076000186+08:00| 2982| |2022-09-06T17:02:21.076000186+08:00| 4465| |2022-09-06T17:03:21.076000186+08:00| 4871| |2022-09-06T17:04:21.076000186+08:00| 4478| |2022-09-06T17:05:21.076000186+08:00| 3242| |2022-09-06T17:06:21.076000186+08:00| 2545| |2022-09-06T17:07:21.076000186+08:00| 2579| |2022-09-06T17:08:21.076000186+08:00| 133| |2022-09-06T17:09:21.076000186+08:00| 488| |2022-09-06T17:10:21.076000186+08:00| 253| |2022-09-06T17:11:21.076000186+08:00| 445| |2022-09-06T17:12:21.076000186+08:00| 2122| |2022-09-06T17:13:21.076000186+08:00| 1799| |2022-09-06T17:14:21.076000186+08:00| 1568| |2022-09-06T17:15:21.076000186+08:00| 355| |2022-09-06T17:16:21.076000186+08:00| 1127| |2022-09-06T17:17:21.076000186+08:00| 803| |2022-09-06T17:18:21.076000186+08:00| 674| |2022-09-06T17:19:21.076000186+08:00| 621| |2022-09-06T17:20:21.076000186+08:00| 361| |2022-09-06T17:21:21.076000186+08:00| 367| |2022-09-06T17:22:21.076000186+08:00| 999|
[jira] [Commented] (IOTDB-4328) Replica add/remove for multiLeader
[ https://issues.apache.org/jira/browse/IOTDB-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17600695#comment-17600695 ] Jinrui Zhang commented on IOTDB-4328: - The implementation is completed. The basic test is passed. !image-2022-09-06-17-23-35-762.png|width=2071,height=576! > Replica add/remove for multiLeader > -- > > Key: IOTDB-4328 > URL: https://issues.apache.org/jira/browse/IOTDB-4328 > Project: Apache IoTDB > Issue Type: Bug >Reporter: Jinrui Zhang >Assignee: Jinrui Zhang >Priority: Major > Attachments: image-2022-09-06-17-22-34-700.png, > image-2022-09-06-17-23-35-762.png > > -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Assigned] (IOTDB-4027) ERROR o.a.i.d.e.s.SnapshotLoader:94 - Exception occurs when creating links from snapshot directory to data directory
[ https://issues.apache.org/jira/browse/IOTDB-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liuxuxin reassigned IOTDB-4027: --- Assignee: Song Ziyang (was: Liuxuxin) > ERROR o.a.i.d.e.s.SnapshotLoader:94 - Exception occurs when creating links > from snapshot directory to data directory > - > > Key: IOTDB-4027 > URL: https://issues.apache.org/jira/browse/IOTDB-4027 > Project: Apache IoTDB > Issue Type: Bug > Components: mpp-cluster >Affects Versions: 0.14.0-SNAPSHOT >Reporter: 刘珍 >Assignee: Song Ziyang >Priority: Major > Labels: pull-request-available > Fix For: 0.14.0 > > Attachments: image-2022-08-03-09-39-10-230.png, > image-2022-08-03-09-39-48-739.png, image-2022-09-06-17-05-21-387.png, > ip18_befor_stop_datanode_log.tar.gz, ip18_restart_with-error_log.tar.gz, > ip4_2000_config.properties, screenshot-1.png > > > master_0801_55b5b17 > 问题描述 > RatisConsensus,3副本3C9D,1个bm连1个datanode执行并发写入,停止1个follower节点,5分钟后启动;{color:#DE350B}*然后停止另1个follower节点10分钟后启动,此节点启动过程中报错,此节点少数据*{color}: > 2022-08-02 18:04:17,376 [pool-4-thread-1] ERROR o.a.i.d.e.s.SnapshotLoader:94 > - Exception occurs when creating links from snapshot directory to data > directory > java.io.IOException: Cannot find > /data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-00010001/sm/1_354536/sequence/root.ip4.g_0 > or > /data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-00010001/sm/1_354536/unsequence/root.ip4.g_0 > at > org.apache.iotdb.db.engine.snapshot.SnapshotLoader.createLinksFromSnapshotDirToDataDir(SnapshotLoader.java:163) > at > org.apache.iotdb.db.engine.snapshot.SnapshotLoader.loadSnapshotForStateMachine(SnapshotLoader.java:91) > at > org.apache.iotdb.db.consensus.statemachine.DataRegionStateMachine.loadSnapshot(DataRegionStateMachine.java:93) > at > org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.loadSnapshot(ApplicationStateMachineProxy.java:188) > at > org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.lambda$initialize$0(ApplicationStateMachineProxy.java:73) > at > org.apache.ratis.util.LifeCycle.startAndTransition(LifeCycle.java:270) > at > org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.initialize(ApplicationStateMachineProxy.java:69) > at > org.apache.ratis.server.impl.ServerState.(ServerState.java:136) > at > org.apache.ratis.server.impl.RaftServerImpl.(RaftServerImpl.java:201) > at > org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$5(RaftServerProxy.java:274) > at > java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > at java.lang.Thread.run(Thread.java:748) > 2022-08-02 18:04:17,376 [pool-4-thread-1] ERROR > o.a.i.d.c.s.DataRegionStateMachine:95 - Fail to load snapshot from > /data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-00010001/sm/1_354536 > ip18少数据,期望序列的count值是2点 > !screenshot-1.png! > 1. 复现流程 > 私有云172.20.70.2/3/4/5/13/14/16/18/19 > benchmark 在ip15(连ip4) > 停ip4/启动ip4 , 停ip18/启动ip18,ip18报错 > !image-2022-08-03-09-39-10-230.png! > !image-2022-08-03-09-39-48-739.png! > 2. 启动benchmark > 2022-08-02 17:34:57 启动bm > 3. 停止ip4的datanode > 2022-08-02 17:45:42停止datanode > sleep 300 > 启动ip4 > 4. 停止ip18的datanode > 2022-08-02 17:54:11 停止ip18的datanode > sleep 600 > 启动ip18 > {color:#DE350B}*启动过程中,报错*{color}: > 见问题描述 > bm写入完成,各节点同步完成,{color:#DE350B}*ip18节点少数据*{color},ip16,ip4 的数据正确。 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Reopened] (IOTDB-4027) ERROR o.a.i.d.e.s.SnapshotLoader:94 - Exception occurs when creating links from snapshot directory to data directory
[ https://issues.apache.org/jira/browse/IOTDB-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 刘珍 reopened IOTDB-4027: --- master_0906_0095eb3 在snapshot前 ,down follower ip2 ,ip2在线后,raft log不同步 ip2日志 2022-09-06 15:53:31,056 [null-request--thread1] INFO o.a.r.g.s.GrpcClientProtocolService$UnorderedRequestStreamObserver:284 - Failed RaftClientRequest:client-0DF34DD58B52->172.20.70.2_40010@group-00010002, cid=36, seq=0, RW, Message:000d0012726f6f74...(size=68644), reply=RaftClientReply:client-0DF34DD58B52->172.20.70.2_40010@group-00010002, cid=36, FAILED org.apache.ratis.protocol.exceptions.NotLeaderException: Server 172.20.70.2_40010@group-00010002 is not the leader, logIndex=0, commits[172.20.70.2_40010:c202840, 172.20.70.14_40010:c556712, 172.20.70.18_40010:c556711] > ERROR o.a.i.d.e.s.SnapshotLoader:94 - Exception occurs when creating links > from snapshot directory to data directory > - > > Key: IOTDB-4027 > URL: https://issues.apache.org/jira/browse/IOTDB-4027 > Project: Apache IoTDB > Issue Type: Bug > Components: mpp-cluster >Affects Versions: 0.14.0-SNAPSHOT >Reporter: 刘珍 >Assignee: Liuxuxin >Priority: Major > Labels: pull-request-available > Fix For: 0.14.0 > > Attachments: image-2022-08-03-09-39-10-230.png, > image-2022-08-03-09-39-48-739.png, image-2022-09-06-17-05-21-387.png, > ip18_befor_stop_datanode_log.tar.gz, ip18_restart_with-error_log.tar.gz, > ip4_2000_config.properties, screenshot-1.png > > > master_0801_55b5b17 > 问题描述 > RatisConsensus,3副本3C9D,1个bm连1个datanode执行并发写入,停止1个follower节点,5分钟后启动;{color:#DE350B}*然后停止另1个follower节点10分钟后启动,此节点启动过程中报错,此节点少数据*{color}: > 2022-08-02 18:04:17,376 [pool-4-thread-1] ERROR o.a.i.d.e.s.SnapshotLoader:94 > - Exception occurs when creating links from snapshot directory to data > directory > java.io.IOException: Cannot find > /data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-00010001/sm/1_354536/sequence/root.ip4.g_0 > or > /data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-00010001/sm/1_354536/unsequence/root.ip4.g_0 > at > org.apache.iotdb.db.engine.snapshot.SnapshotLoader.createLinksFromSnapshotDirToDataDir(SnapshotLoader.java:163) > at > org.apache.iotdb.db.engine.snapshot.SnapshotLoader.loadSnapshotForStateMachine(SnapshotLoader.java:91) > at > org.apache.iotdb.db.consensus.statemachine.DataRegionStateMachine.loadSnapshot(DataRegionStateMachine.java:93) > at > org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.loadSnapshot(ApplicationStateMachineProxy.java:188) > at > org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.lambda$initialize$0(ApplicationStateMachineProxy.java:73) > at > org.apache.ratis.util.LifeCycle.startAndTransition(LifeCycle.java:270) > at > org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.initialize(ApplicationStateMachineProxy.java:69) > at > org.apache.ratis.server.impl.ServerState.(ServerState.java:136) > at > org.apache.ratis.server.impl.RaftServerImpl.(RaftServerImpl.java:201) > at > org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$5(RaftServerProxy.java:274) > at > java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > at java.lang.Thread.run(Thread.java:748) > 2022-08-02 18:04:17,376 [pool-4-thread-1] ERROR > o.a.i.d.c.s.DataRegionStateMachine:95 - Fail to load snapshot from > /data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-00010001/sm/1_354536 > ip18少数据,期望序列的count值是2点 > !screenshot-1.png! > 1. 复现流程 > 私有云172.20.70.2/3/4/5/13/14/16/18/19 > benchmark 在ip15(连ip4) > 停ip4/启动ip4 , 停ip18/启动ip18,ip18报错 > !image-2022-08-03-09-39-10-230.png! > !image-2022-08-03-09-39-48-739.png! > 2. 启动benchmark > 2022-08-02 17:34:57 启动bm > 3. 停止ip4的datanode > 2022-08-02 17:45:42停止datanode > sleep 300 > 启动ip4 > 4. 停止ip18的datanode > 2022-08-02 17:54:11 停止ip18的datanode > sleep 600 > 启动ip18 > {color:#DE350B}*启动过程中,报错*{color}: > 见问题描述 > bm写入完成,各节点同步完成,{color:#DE350B}*ip18节点少数据*{color},ip16,ip4 的数据正确。 -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4349) Remove error status in data node
Haiming Zhu created IOTDB-4349: -- Summary: Remove error status in data node Key: IOTDB-4349 URL: https://issues.apache.org/jira/browse/IOTDB-4349 Project: Apache IoTDB Issue Type: New Feature Reporter: Haiming Zhu Assignee: Haiming Zhu Error status is useless, remove it. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Assigned] (IOTDB-4343) [master]After starting mqtt, after executing Trigger's SQL, there is an NPE error in the log
[ https://issues.apache.org/jira/browse/IOTDB-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 张洪胤 reassigned IOTDB-4343: -- Assignee: 张洪胤 (was: liaolanyu) > [master]After starting mqtt, after executing Trigger's SQL, there is an NPE > error in the log > > > Key: IOTDB-4343 > URL: https://issues.apache.org/jira/browse/IOTDB-4343 > Project: Apache IoTDB > Issue Type: Bug > Components: Core/Server >Affects Versions: master branch >Reporter: xiaozhihong >Assignee: 张洪胤 >Priority: Major > Attachments: log_all.log > > > commit 518dcfbfa461ffe4912fba18cb29299416301588 > After enabling MQTT, execute the trigger use case, and find that there is an > NPE authorization authentication error in the log. > Step 1: In the iotdb-datanode.properties ,modify parameter: > enable_mqtt_service=true > Step 2: Start server, Enter CLI > Step 3: Execute SQL: > {code:java} > set storage group to root.sg1; > CREATE TIMESERIES root.sg1.dev1.s_1 WITH DATATYPE=INT32, ENCODING=GORILLA; > CREATE TIMESERIES root.sg1.dev1.s_2 WITH DATATYPE=INT64, ENCODING=PLAIN; > CREATE TIMESERIES root.sg1.dev1.s_3 WITH DATATYPE=float, ENCODING=RLE; > CREATE TIMESERIES root.sg1.dev1.s_4 WITH DATATYPE=double, ENCODING=TS_2DIFF; > CREATE TIMESERIES root.sg1.dev1.s_5 WITH DATATYPE=text, ENCODING=DICTIONARY; > CREATE TIMESERIES root.sg1.dev1.s_6 WITH DATATYPE=boolean, ENCODING=PLAIN; > CREATE TRIGGER trig1 > AFTER INSERT > ON root.sg1.dev1.s_1 > AS 'TriggerTest' > WITH ( > 'ts_type' = 'int32', > 'remote_ip' = '127.0.0.1', > 'trig_name'='trig1' > ); > CREATE TRIGGER trig2 > AFTER INSERT > ON root.sg1.dev1.s_2 > AS 'TriggerTest' > WITH ( > 'ts_type' = 'int64', > 'remote_ip' = '127.0.0.1', > 'trig_name'='trig2' > ); > insert into root.sg1.dev1(time,s_1,s_2) values(1,100,200); > insert into root.sg1.dev1(time,s_1,s_2) values(2,300,400); > select s_1,s_2,s_3,s_4,s_5,s_6 from root.sg1.dev1; > select local_trig1,local_trig2,remotetrig1,remotetrig2 from > root.target.alerting; > > delete timeseries root.sg1.dev1.s_1; > insert into root.sg1.dev1(time,s_2) values(100,1600);{code} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Assigned] (IOTDB-4343) [master]After starting mqtt, after executing Trigger's SQL, there is an NPE error in the log
[ https://issues.apache.org/jira/browse/IOTDB-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 张洪胤 reassigned IOTDB-4343: -- Assignee: liaolanyu (was: 张洪胤) > [master]After starting mqtt, after executing Trigger's SQL, there is an NPE > error in the log > > > Key: IOTDB-4343 > URL: https://issues.apache.org/jira/browse/IOTDB-4343 > Project: Apache IoTDB > Issue Type: Bug > Components: Core/Server >Affects Versions: master branch >Reporter: xiaozhihong >Assignee: liaolanyu >Priority: Major > Attachments: log_all.log > > > commit 518dcfbfa461ffe4912fba18cb29299416301588 > After enabling MQTT, execute the trigger use case, and find that there is an > NPE authorization authentication error in the log. > Step 1: In the iotdb-datanode.properties ,modify parameter: > enable_mqtt_service=true > Step 2: Start server, Enter CLI > Step 3: Execute SQL: > {code:java} > set storage group to root.sg1; > CREATE TIMESERIES root.sg1.dev1.s_1 WITH DATATYPE=INT32, ENCODING=GORILLA; > CREATE TIMESERIES root.sg1.dev1.s_2 WITH DATATYPE=INT64, ENCODING=PLAIN; > CREATE TIMESERIES root.sg1.dev1.s_3 WITH DATATYPE=float, ENCODING=RLE; > CREATE TIMESERIES root.sg1.dev1.s_4 WITH DATATYPE=double, ENCODING=TS_2DIFF; > CREATE TIMESERIES root.sg1.dev1.s_5 WITH DATATYPE=text, ENCODING=DICTIONARY; > CREATE TIMESERIES root.sg1.dev1.s_6 WITH DATATYPE=boolean, ENCODING=PLAIN; > CREATE TRIGGER trig1 > AFTER INSERT > ON root.sg1.dev1.s_1 > AS 'TriggerTest' > WITH ( > 'ts_type' = 'int32', > 'remote_ip' = '127.0.0.1', > 'trig_name'='trig1' > ); > CREATE TRIGGER trig2 > AFTER INSERT > ON root.sg1.dev1.s_2 > AS 'TriggerTest' > WITH ( > 'ts_type' = 'int64', > 'remote_ip' = '127.0.0.1', > 'trig_name'='trig2' > ); > insert into root.sg1.dev1(time,s_1,s_2) values(1,100,200); > insert into root.sg1.dev1(time,s_1,s_2) values(2,300,400); > select s_1,s_2,s_3,s_4,s_5,s_6 from root.sg1.dev1; > select local_trig1,local_trig2,remotetrig1,remotetrig2 from > root.target.alerting; > > delete timeseries root.sg1.dev1.s_1; > insert into root.sg1.dev1(time,s_2) values(100,1600);{code} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Assigned] (IOTDB-4347) NPE error when session is expired
[ https://issues.apache.org/jira/browse/IOTDB-4347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yifu Zhou reassigned IOTDB-4347: Assignee: Yifu Zhou > NPE error when session is expired > - > > Key: IOTDB-4347 > URL: https://issues.apache.org/jira/browse/IOTDB-4347 > Project: Apache IoTDB > Issue Type: Bug >Reporter: Yifu Zhou >Assignee: Yifu Zhou >Priority: Major > Labels: pull-request-available > Fix For: master branch, 0.14.0-SNAPSHOT > > Attachments: image-2022-09-06-14-31-32-982.png > > > When session is expired or session is closed, it will occurs NPE when > obtaining the username. > !image-2022-09-06-14-31-32-982.png! -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4348) NoSuchFileExeception when creating snapshot
Jinrui Zhang created IOTDB-4348: --- Summary: NoSuchFileExeception when creating snapshot Key: IOTDB-4348 URL: https://issues.apache.org/jira/browse/IOTDB-4348 Project: Apache IoTDB Issue Type: Bug Reporter: Jinrui Zhang Assignee: Liuxuxin Attachments: image-2022-09-06-14-22-05-045.png NoSuchFileException is threw when taking snapshot in DataRegion migration !image-2022-09-06-14-22-05-045.png|width=1270,height=409! -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (IOTDB-4347) NPE error when session is expired
Yifu Zhou created IOTDB-4347: Summary: NPE error when session is expired Key: IOTDB-4347 URL: https://issues.apache.org/jira/browse/IOTDB-4347 Project: Apache IoTDB Issue Type: Bug Reporter: Yifu Zhou Fix For: master branch, 0.14.0-SNAPSHOT -- This message was sent by Atlassian Jira (v8.20.10#820010)