[ 
https://issues.apache.org/jira/browse/HDFS-12204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

HanRyong,Jung updated HDFS-12204:
---------------------------------
    Attachment:     (was: HDFS-12204.1.patch)

> Dfsclient Do not close file descriptor when using shortcircuit
> --------------------------------------------------------------
>
>                 Key: HDFS-12204
>                 URL: https://issues.apache.org/jira/browse/HDFS-12204
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: hdfs-client
>    Affects Versions: 2.7.3, 3.1.4
>         Environment: HDFS 2.7.3, HBASE 1.2.6, centOS 6.8
>            Reporter: HanRyong,Jung
>            Priority: Major
>         Attachments: HDFS-12204.branch-2.7.001.patch, 
> HDFS-12204.branch-3.1.001.patch, HDFS-12204.branch-3.1.002.patch
>
>
> I am running HDFS 2.7.3 and HBase 1.2.6 on CentOS 6.8.
> The regionserver uses 11 hard disks (JBOD) with HBase short-circuit reads.
> When one disk failed in HDFS, I hot-swapped it and found that HBase did not 
> close the file descriptors it held on that disk.
> The fd paths on the unmounted disk also changed to incorrect paths.
> For example, in /proc/regionserver_pid/fd, a file that was under /data1/volumn 
> is shown under /volumn after data1 is unmounted.
> In addition, many file descriptors used for short-circuit reads are in the 
> (deleted) state.
> Example:
> ls -al /proc/regionserver_pid/fd
> lr-x------ 1 dragonboy dragonboy 64 2017-07-26 20:54 946 -> /data8/volumn/hdfs/datanode/current/BP-199986352-10.114.243.73-1490077615453/current/finalized/subdir111/subdir21/blk_1215239490 (deleted)
> lr-x------ 1 dragonboy dragonboy 64 2017-07-26 20:54 947 -> /data8/volumn/hdfs/datanode/current/BP-199986352-10.114.243.73-1490077615453/current/finalized/subdir111/subdir21/blk_1215239490_141511919.meta (deleted)
> lr-x------ 1 dragonboy dragonboy 64 2017-07-26 20:54 948 -> /data7/volumn/hdfs/datanode/current/BP-199986352-10.114.243.73-1490077615453/current/finalized/subdir111/subdir27/blk_1215241080 (deleted)
> lr-x------ 1 dragonboy dragonboy 64 2017-07-26 20:54 949 -> /data7/volumn/hdfs/datanode/current/BP-199986352-10.114.243.73-1490077615453/current/finalized/subdir111/subdir27/blk_1215241080_141513509.meta (deleted)
> lr-x------ 1 dragonboy dragonboy 64 2017-07-26 20:54 *192 -> /volumn/hdfs/datanode/current/BP-199986352-10.114.243.73-1490077615453/current/finalized/subdir244/subdir160/blk_1257545757 (deleted)*
> ...
>
> Running fuser after data4 fails:
> /sbin/fuser -cu /data4
> Cannot stat file /proc/regionserver_pid/fd/*192*: input/output error
> Cannot stat file /proc/regionserver_pid/fd/1282: input/output error
> Cannot stat file /proc/regionserver_pid/fd/1283: input/output error
> ...
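
The manual `ls -al /proc/regionserver_pid/fd` check in the description can be scripted. The following is a minimal sketch, not part of the attached patches: it assumes a Linux procfs, where the link target of an fd whose file has been unlinked ends with " (deleted)". The function name `find_deleted_fds` is hypothetical, and a file whose real name ends in that suffix would be reported as a false positive.

```python
import os

# Suffix that Linux procfs appends to the link target of an unlinked file.
DELETED_SUFFIX = " (deleted)"


def find_deleted_fds(fd_dir):
    """Return {fd_name: link_target} for entries in fd_dir whose
    symlink target ends with ' (deleted)'."""
    deleted = {}
    for name in sorted(os.listdir(fd_dir)):
        path = os.path.join(fd_dir, name)
        try:
            # readlink does not follow the link, so it also works when
            # the target no longer exists.
            target = os.readlink(path)
        except OSError:
            # The fd was closed in the meantime, or the entry is not a symlink.
            continue
        if target.endswith(DELETED_SUFFIX):
            deleted[name] = target
    return deleted
```

Calling `find_deleted_fds("/proc/<pid>/fd")` with the regionserver PID, as the same user or as root, would return entries such as fd 946 from the listing above. A count that keeps growing after a disk hot-swap would be consistent with the leak described in this issue.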



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

