[
https://issues.apache.org/jira/browse/HDFS-17589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ruiliang updated HDFS-17589:
----------------------------
Attachment: FindDirtyBlk.py
> hdfs EC data new blk reconstruct old blk not delete
> ------------------------------------------------------
>
> Key: HDFS-17589
> URL: https://issues.apache.org/jira/browse/HDFS-17589
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 3.1.1
> Reporter: ruiliang
> Priority: Major
> Attachments: DiskBlkAPI.py, FindDirtyBlk.py
>
>
> The reason is that the cluster was faulty before, and Datanodes kept losing
> connections and recovering, resulting in a lot of EC data reconstruct, but a
> lot of old blk failed to clean up correctly. Has this been repaired? What
> patch do I need to add, thank you
> The following is a detailed check log
>
> ok: blk_-9223372036371044652 in 10.12.66.225
> {color:#de350b}error: blk_-9223372036371044652 in
> 10.12.66.154(/data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652)
> {color}
> {color:#de350b}Why didn't you delete it?{color}
>
> {code:java}
> ====datanode delete data ec blk ?
> grep blk_-9223372036371044656
> hadoop-hdfs-root-datanode-fs-hiido-dn-12-66-111.hiido.host.xxx.com.log
> 2024-07-18 17:25:07,879 INFO datanode.DataNode
> (DataXceiver.java:writeBlock(738)) - Receiving
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793
> src: /10.12.66.111:25066 dest: /10.12.66.111:1019
> 2024-07-18 17:25:17,396 INFO datanode.DataNode
> (StripedBlockReconstructor.java:run(86)) - ok EC reconstruct striped block:
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793
> blockId: -9223372036371044656
> 2024-07-18 17:25:17,396 INFO datanode.DataNode
> (DataXceiver.java:writeBlock(914)) - Received
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793
> src: /10.12.66.111:25066 dest: /10.12.66.111:1019 of size 193986560
> 2024-07-18 17:25:25,465 INFO impl.FsDatasetAsyncDiskService
> (FsDatasetAsyncDiskService.java:deleteAsync(225)) - Scheduling
> blk_-9223372036371044656_1688858793 replica FinalizedReplica,
> blk_-9223372036371044656_1688858793, FINALIZED
> getBlockURI() =
> file:/data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656
> for deletion
> 2024-07-18 17:25:25,746 INFO impl.FsDatasetAsyncDiskService
> (FsDatasetAsyncDiskService.java:run(333)) - Deleted
> BP-1822992414-10.12.65.48-1660893388633 blk_-9223372036371044656_1688858793
> URI
> file:/data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656=============my
> config
> dfs.blockreport.intervalMsec =21600000============namenode3 log
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
> 04:34:39,523 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
> 04:34:40,131 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
> 10:34:38,950 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
> 10:34:39,559 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log:2024-07-18
> 16:34:38,564 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log:2024-07-18
> 16:34:39,190 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 04:34:39,462 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 04:34:40,083 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 10:34:39,686 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 10:34:40,295 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 16:34:39,667 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 16:34:40,301 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 22:34:38,187 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
> 22:34:38,794 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238))
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to
> storageType DISK on node 10.12.66.154:1019
> =====namenode2 log active
> grep blk_-9223372036371044656
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn2.hiido.host.xxx.com.log.10
> 2024-07-18 17:25:04,786 WARN BlockStateChange
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node
> 10.12.66.154:1019
> 2024-07-18 17:25:05,703 WARN BlockStateChange
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node
> 10.12.66.154:1019
> ========namenode2 log
> root@fs-hiido-yycluster06-yynn1:/data/logs/hadoop/hdfs# grep
> blk_-9223372036371044656
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn1.hiido.host.xxx.com.log
> 2024-07-18 07:20:41,525 WARN BlockStateChange
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node
> 10.12.66.154:1019
> 2024-07-18 07:20:42,049 WARN BlockStateChange
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node
> 10.12.66.154:1019
> 2024-07-18 13:20:40,726 WARN BlockStateChange
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node
> 10.12.66.154:1019
> 2024-07-18 13:20:41,251 WARN BlockStateChange
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node
> 10.12.66.154:1019
> ================
> hdfs fsck -fs hdfs://yycluster06 -blockId blk_-9223372036371044656
> Connecting to namenode via
> http://fs-hiido-yycluster06-yynn2.hiido.host.xxx.com:50070/fsck?ugi=hdfs&blockId=blk_-9223372036371044656+&path=%2F
> FSCK started by hdfs (auth:KERBEROS_SSL) from /10.12.19.4 at Thu Jul 18
> 17:57:06 CST 2024Block Id: blk_-9223372036371044656
> Block belongs to:
> /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz
> No. of Expected Replica: 5
> No. of live Replica: 5
> No. of excess Replica: 0
> No. of stale Replica: 0
> No. of decommissioned Replica: 0
> No. of decommissioning Replica: 0
> No. of corrupted Replica: 0
> Block replica on datanode/rack:
> fs-hiido-dn-12-66-225.hiido.host.xxxyy.com/4F08-02-03 is HEALTHY
> Block replica on datanode/rack:
> fs-hiido-dn-12-67-38.hiido.host.xxxyy.com/4F08-02-15 is HEALTHY
> Block replica on datanode/rack:
> fs-hiido-dn-12-66-191.hiido.host.xxx.com/4F08-12-06 is HEALTHY
> Block replica on datanode/rack:
> fs-hiido-dn-12-67-5.hiido.host.xxxyy.com/4F08-12-09 is HEALTHY
> Block replica on datanode/rack:
> fs-hiido-dn-12-66-154.hiido.host.xxxyy.com/4F08-02-13 is HEALTHYhdfs fsck -fs
> hdfs://yycluster06
> /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz
> -files -blocks -locations
> /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz
> 581647843 bytes, erasure-coded: policy=RS-3-2-1024k, 1 block(s): OK
> 0.
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793
> len=581647843 Live_repl=5 [
> blk_-9223372036371044656:DatanodeInfoWithStorage[10.12.66.154:1019,DS-4b66fe61-93ca-4f8d-8fe0-e00f2ed09e82,DISK],
>
> blk_-9223372036371044655:DatanodeInfoWithStorage[10.12.67.5:1019,DS-af799695-8e9b-4884-a741-7a0742db6f79,DISK],
>
> blk_-9223372036371044654:DatanodeInfoWithStorage[10.12.66.191:1019,DS-cef171fa-8e8e-43bb-9fd1-aab84ea046cd,DISK],
>
> blk_-9223372036371044653:DatanodeInfoWithStorage[10.12.67.38:1019,DS-d2346a26-14c4-41e9-b349-e198ab5b684e,DISK],
>
> blk_-9223372036371044652:DatanodeInfoWithStorage[10.12.66.225:1019,DS-13c19c1d-9221-45cc-919f-b54ada9cab15,DISK]]
> ========================================================================================================================================
> root@fs-hiido-dn-12-66-154:/data/logs/hadoop/hdfs# ll
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Feb 26 19:54
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656
> -rw-r--r-- 1 hdfs hdfs 1515527 Feb 26 19:54
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656_1688858793.meta
> -rw-r--r-- 1 hdfs hdfs 193986560 Feb 27 21:49
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> -rw-r--r-- 1 hdfs hdfs 1515527 Feb 27 21:49
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652_1688858793.meta
> root@fs-hiido-dn-12-67-5:/home/liangrui# ll
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Nov 29 2023
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044655
> -rw-r--r-- 1 hdfs hdfs 1515527 Nov 29 2023
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044655_1688858793.meta
> root@fs-hiido-dn-12-66-191:/home/liangrui# ll
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193674723 Aug 30 2023
> /data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044654
> -rw-r--r-- 1 hdfs hdfs 1513091 Aug 30 2023
> /data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044654_1688858793.meta
> root@fs-hiido-dn-12-67-38:/home/liangrui# ll
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Apr 19 18:20
> /data1/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044653
> -rw-r--r-- 1 hdfs hdfs 1515527 Apr 19 18:20
> /data1/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044653_1688858793.meta
> root@fs-hiido-dn-12-66-225:/home/liangrui# ll
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Dec 1 2023
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> -rw-r--r-- 1 hdfs hdfs 1515527 Dec 1 2023
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652_1688858793.meta
> ========
> root@fs-hiido-dn-12-66-154:/data/logs/hadoop/hdfs# md5sum
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> b661b6d711d753a82c3bf42bb2ceec51
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> 10.12.66.225
> md5sum
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> b661b6d711d753a82c3bf42bb2ceec51
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> {code}
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]