[jira] Commented: (HDFS-487) HDFS should expose a fileid to uniquely identify a file

Sanjay Radia (JIRA) Wed, 15 Jul 2009 12:30:42 -0700

    [ 
https://issues.apache.org/jira/browse/HDFS-487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731640#action_12731640
 ]


Sanjay Radia commented on HDFS-487:
-----------------------------------

It would be useful to articulate the various use cases for the file id plus the 
mapping data structures that would be needed.

Dhruba has given one use case.
 Another one is that if we propagate the fileId to the DNs than one can 
reconstruct files from blocks (though without their file names.)
 Hard links is another use case though I am not a fan of hard links.
Others?
Would fileIds simplify things inside the NN?

Mappings:
Currently, inside a NN we have mappings from filename->inode,, inode->blocks.

To make the file id useful we would have to add a mapping from fileId->inode. 
This mapping would use up memory which is scarce inside a NN, but perhaps it 
could replace
other data structures. 
Wouldn't one need the fileId to inode->blocks for  puggable block placement 
policies?

> HDFS should expose a fileid to uniquely identify a file
> -------------------------------------------------------
>
>                 Key: HDFS-487
>                 URL: https://issues.apache.org/jira/browse/HDFS-487
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: dhruba borthakur
>            Assignee: dhruba borthakur
>         Attachments: fileid1.txt
>
>
> HDFS should expose a id that uniquely identifies a file. This helps in 
> developing  applications that work correctly even when files are moved from 
> one directory to another. A typical use-case is to make the Pluggable Block 
> Placement Policy (HDFS-385) use fileid instead of filename.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HDFS-487) HDFS should expose a fileid to uniquely identify a file

Reply via email to