[ https://issues.apache.org/jira/browse/HDFS-6673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14283272#comment-14283272 ]

Haohui Mai commented on HDFS-6673:
----------------------------------

The current patch stores the map from {{inodeid}} to {{parent}} in LevelDB in the first pass; in the second pass it iterates over all inodes in the fsimage and prints the results. However, given that (1) LevelDB stores KV pairs in sorted order on disk, and (2) the inodes are stored in random order in the fsimage, the scheme requires one seek per file.

It makes more sense to adopt the scheme demonstrated in HDFS-6293, that is, using {{IN || parent_id || localName}} as the key. That way all entries of a directory are adjacent in key order, so it requires at most one seek per directory instead of one seek per file.

> Add Delimited format supports for PB OIV tool
> ---------------------------------------------
>
>                 Key: HDFS-6673
>                 URL: https://issues.apache.org/jira/browse/HDFS-6673
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>    Affects Versions: 2.4.0
>            Reporter: Lei (Eddy) Xu
>            Assignee: Lei (Eddy) Xu
>            Priority: Minor
>         Attachments: HDFS-6673.000.patch, HDFS-6673.001.patch,
> HDFS-6673.002.patch, HDFS-6673.003.patch, HDFS-6673.004.patch,
> HDFS-6673.005.patch
>
>
> The new oiv tool, which is designed for Protobuf fsimage, lacks a few
> features supported in the old {{oiv}} tool.
> This task adds support for the _Delimited_ processor to the oiv tool.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
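
For illustration only, the {{IN || parent_id || localName}} key layout proposed in the comment above might look roughly like the sketch below. This is not code from the patch or from HDFS-6293. The 8-byte big-endian encoding of the parent id and the helper names {{make_key}} and {{children}} are assumptions, and a sorted Python list stands in for LevelDB's sorted on-disk order.

```python
import struct
from bisect import bisect_left

# Namespace tag from the comment's "IN || parent_id || localName" key.
PREFIX = b"IN"


def make_key(parent_id: int, local_name: str) -> bytes:
    """Build a composite key (hypothetical encoding, not the patch's).

    The parent id is packed as a fixed-width big-endian integer so that
    byte-wise key order matches numeric order of the ids.
    """
    return PREFIX + struct.pack(">Q", parent_id) + local_name.encode("utf-8")


def children(sorted_keys, parent_id):
    """Return the local names stored under parent_id.

    One binary search positions the cursor (the single "seek"); the
    directory's entries are then read sequentially because they are
    adjacent in key order.
    """
    start = PREFIX + struct.pack(">Q", parent_id)
    i = bisect_left(sorted_keys, start)
    names = []
    while i < len(sorted_keys) and sorted_keys[i].startswith(start):
        names.append(sorted_keys[i][len(start):].decode("utf-8"))
        i += 1
    return names


# Inodes arrive in arbitrary order, as they do in the fsimage.
inodes = [(2, "b.txt"), (1, "etc"), (2, "a.txt"), (1, "home"), (3, "c.log")]

# A sorted list stands in for the store's sorted key order.
sorted_keys = sorted(make_key(p, n) for p, n in inodes)

print(children(sorted_keys, 2))  # ['a.txt', 'b.txt']
```

With keys grouped by parent id, listing a directory costs one positioned lookup followed by a sequential scan, whereas a per-inode {{inodeid}} to {{parent}} lookup touches an unrelated position for every file.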