[ 
https://issues.apache.org/jira/browse/HDFS-8833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14699965#comment-14699965
 ] 

Andrew Wang commented on HDFS-8833:
-----------------------------------

I mentioned a [possible 
design|https://issues.apache.org/jira/browse/HDFS-8833?focusedCommentId=14649618&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14649618]
 earlier that unifies SP and EC semantics, it basically comes down to the 
default policy on file creation being either "inherit" vs. "set". SP also can 
be set on files, so mixed policies within a directory can still happen. I don't 
know how confusing mixing is, since replication and SP for instance are already 
per-file, and EC feels a lot more like those two than directory-level policies 
like snapshots and encryption zones.

We also had a similar question come up during SP design, of if we needed 
per-file SP settings at all. At the time Jing convinced me we needed the 
flexibility since it's useful for automatic storage tiering. Said system would 
automatically adjust things like replication, SP, EC, etc based on usage 
information. In this situation it's not okay to require a rename since it's not 
transparent to the user.

> Erasure coding: store EC schema and cell size in INodeFile and eliminate 
> notion of EC zones
> -------------------------------------------------------------------------------------------
>
>                 Key: HDFS-8833
>                 URL: https://issues.apache.org/jira/browse/HDFS-8833
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: namenode
>    Affects Versions: HDFS-7285
>            Reporter: Zhe Zhang
>            Assignee: Zhe Zhang
>         Attachments: HDFS-8833-HDFS-7285-merge.00.patch
>
>
> We have [discussed | 
> https://issues.apache.org/jira/browse/HDFS-7285?focusedCommentId=14357754&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14357754]
>  storing EC schema with files instead of EC zones and recently revisited the 
> discussion under HDFS-8059.
> As a recap, the _zone_ concept has severe limitations including renaming and 
> nested configuration. Those limitations are valid in encryption for security 
> reasons and it doesn't make sense to carry them over in EC.
> This JIRA aims to store EC schema and cell size on {{INodeFile}} level. For 
> simplicity, we should first implement it as an xattr and consider memory 
> optimizations (such as moving it to file header) as a follow-on. We should 
> also disable changing EC policy on a non-empty file / dir in the first phase.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to