[jira] [Commented] (HDFS-6673) Add Delimited format supports for PB OIV tool

Haohui Mai (JIRA) Mon, 19 Jan 2015 17:29:29 -0800

    [ 
https://issues.apache.org/jira/browse/HDFS-6673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14283272#comment-14283272
 ]


Haohui Mai commented on HDFS-6673:
----------------------------------

The current patch stores the map from {{inodeid}} to {{parent}} in LevelDB in 
the first pass. In the second pass, it iterates over all inodes in the fsimage 
and prints the results.
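
As a rough illustration of the two-pass flow described above, here is a minimal sketch. It is not the code from the patch: a plain dict stands in for LevelDB, the records, the {{name_of}} helper map, and the tab delimiter are all hypothetical and only there so the example can print full paths.

```python
# Hypothetical fsimage records: (inode_id, parent_id, local_name).
# The root inode has parent None and an empty name (an assumption of this sketch).
records = [(3, 2, "a.txt"), (1, None, ""), (2, 1, "home"), (4, 1, "etc")]

# Pass 1: build the inode id -> parent id map (a dict stands in for LevelDB).
parent_of = {inode_id: parent_id for inode_id, parent_id, _ in records}
# Assumed helper map so that ancestor names are available when building paths.
name_of = {inode_id: name for inode_id, _, name in records}


def full_path(inode_id):
    """Walk parent links up to the root; each step is one point lookup."""
    parts = []
    while inode_id is not None:
        parts.append(name_of[inode_id])
        inode_id = parent_of[inode_id]
    return "/".join(reversed(parts)) or "/"


# Pass 2: iterate the inodes in fsimage order and emit one delimited line each.
lines = [f"{full_path(inode_id)}\t{inode_id}" for inode_id, _, _ in records]
print("\n".join(lines))
```

Every inode visited in the second pass triggers at least one lookup keyed by an unrelated inode id, which is the access pattern the rest of this comment is concerned with.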

However, because (1) LevelDB stores KV pairs in sorted order on disk, and (2) 
the inodes are stored in random order in the fsimage, this scheme requires one 
seek per file. It makes more sense to adopt the scheme demonstrated in 
HDFS-6293, that is, to use {{IN || parent_id || localName}} as the key. All 
entries of a directory then share the same key prefix and are stored 
contiguously, so it requires at most one seek per directory instead of one 
seek per file.
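
To make the key layout concrete, here is a minimal sketch of why such keys keep a directory's entries together. It is only an illustration: a sorted Python list stands in for LevelDB's on-disk order, the sample inodes are hypothetical, and the fixed-width big-endian encoding of {{parent_id}} is an assumption of this sketch rather than the encoding used in HDFS-6293.

```python
import bisect
import struct

PREFIX = b"IN"  # key tag from the comment; the byte layout below is assumed


def make_key(parent_id: int, local_name: str) -> bytes:
    """Build IN || parent_id || localName with a fixed-width big-endian id."""
    return PREFIX + struct.pack(">Q", parent_id) + local_name.encode("utf-8")


def children(sorted_keys: list, parent_id: int) -> list:
    """List child names with one positioned lookup plus a sequential scan."""
    start = PREFIX + struct.pack(">Q", parent_id)
    i = bisect.bisect_left(sorted_keys, start)  # the single "seek"
    names = []
    while i < len(sorted_keys) and sorted_keys[i].startswith(start):
        names.append(sorted_keys[i][len(start):].decode("utf-8"))
        i += 1
    return names


# Hypothetical inodes as (parent_id, local_name), in arbitrary fsimage order.
inodes = [(2, "b.txt"), (1, "etc"), (2, "a.txt"), (1, "home"), (3, "c.txt")]
# Sorting the keys mimics the sorted order in which LevelDB stores them.
sorted_keys = sorted(make_key(p, n) for p, n in inodes)

print(children(sorted_keys, 2))  # ['a.txt', 'b.txt']
```

The fixed-width id matters for the prefix scan: with a variable-width encoding, the prefix for one directory could also match keys that belong to another.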

> Add Delimited format supports for PB OIV tool
> ---------------------------------------------
>
>                 Key: HDFS-6673
>                 URL: https://issues.apache.org/jira/browse/HDFS-6673
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>    Affects Versions: 2.4.0
>            Reporter: Lei (Eddy) Xu
>            Assignee: Lei (Eddy) Xu
>            Priority: Minor
>         Attachments: HDFS-6673.000.patch, HDFS-6673.001.patch, 
> HDFS-6673.002.patch, HDFS-6673.003.patch, HDFS-6673.004.patch, 
> HDFS-6673.005.patch
>
>
> The new oiv tool, which is designed for the Protobuf fsimage, lacks a few 
> features supported in the old {{oiv}} tool. 
> This task adds support for the _Delimited_ processor to the oiv tool. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
