[ 
https://issues.apache.org/jira/browse/HDDS-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17728287#comment-17728287
 ] 

Hongbing Wang commented on HDDS-8716:
-------------------------------------

distcp -p or -pr will preserve replication number. HDFS ec rep is 1.
In this case, it may be better for ozone to use the default replicationFactor 
if ozone dir does not has ec policy. 

ONE replicationFactor maybe risks.

>  ReplicationFactor will be ONE when distcp hdfs ec file to ozone
> ----------------------------------------------------------------
>
>                 Key: HDDS-8716
>                 URL: https://issues.apache.org/jira/browse/HDDS-8716
>             Project: Apache Ozone
>          Issue Type: Bug
>    Affects Versions: 1.3.0
>            Reporter: guangbao zhao
>            Assignee: guangbao zhao
>            Priority: Major
>         Attachments: clipboard_image_1685349424008.png, 
> clipboard_image_1685349999645.png
>
>
> When I used distcp to copy data from hdfs to ozone, I found that some data 
> has three replication, and some data has one replication
> I submit with the following command:
> {code:java}
> hadoop distcp \
> -i -p \
> hdfs://xxx/user/hadoop/warehouse/bigdata_admin.db/hive_audit_res/year=2020 \
> ofs://xxx/vol-bigdata-offline/prod-bigdata-offline/hive_audit_res/year=2020 
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to