[ https://issues.apache.org/jira/browse/SPARK-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14174224#comment-14174224 ]
Apache Spark commented on SPARK-3979:
-------------------------------------

User 'vanzin' has created a pull request for this issue:
https://github.com/apache/spark/pull/2831

> Yarn backend's default file replication should match HDFS's default one
> -----------------------------------------------------------------------
>
>                 Key: SPARK-3979
>                 URL: https://issues.apache.org/jira/browse/SPARK-3979
>             Project: Spark
>          Issue Type: Bug
>          Components: YARN
>            Reporter: Marcelo Vanzin
>            Assignee: Marcelo Vanzin
>            Priority: Minor
>
> This code in ClientBase.scala sets the replication used for files uploaded to
> HDFS:
> {code}
> val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort
> {code}
> Instead of a hardcoded "3" (which is the default value for HDFS), it should
> be using the default value from the HDFS conf ("dfs.replication").

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
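
For illustration only, here is a minimal sketch of the lookup order the issue asks for: use "spark.yarn.submit.file.replication" when it is set, otherwise fall back to "dfs.replication" from the HDFS conf, and only then to 3. This is not the patch from the pull request. The helper name resolveReplication is hypothetical, and plain Maps stand in for SparkConf and the Hadoop Configuration so the snippet runs without either library.

```scala
object ReplicationDefault {
  // Hypothetical helper. The two maps are stand-ins (assumptions) for
  // SparkConf and the Hadoop Configuration, not the real classes.
  def resolveReplication(sparkSettings: Map[String, String],
                         hdfsSettings: Map[String, String]): Short = {
    // HDFS's own default when "dfs.replication" is not configured is 3.
    val hdfsDefault: Int =
      hdfsSettings.get("dfs.replication").map(_.toInt).getOrElse(3)

    // An explicit Spark setting wins; otherwise use the HDFS value.
    sparkSettings
      .get("spark.yarn.submit.file.replication")
      .map(_.toInt)
      .getOrElse(hdfsDefault)
      .toShort
  }

  def main(args: Array[String]): Unit = {
    val hdfs = Map("dfs.replication" -> "2")
    // No Spark override: the HDFS-configured value is used.
    println(resolveReplication(Map.empty, hdfs))
    // Spark override present: it takes precedence.
    println(resolveReplication(Map("spark.yarn.submit.file.replication" -> "5"), hdfs))
    // Neither is set: fall back to 3.
    println(resolveReplication(Map.empty, Map.empty))
  }
}
```

With real configuration objects, the same order could presumably be expressed by passing the value read from the Hadoop conf as the default argument of sparkConf.getInt, in place of the literal 3.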