[ https://issues.apache.org/jira/browse/SPARK-13628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15177512#comment-15177512 ]
Sean Owen commented on SPARK-13628:
-----------------------------------

I don't think that's a good idea in general, because you leave the file in its final location while it's still being written. The rename is on purpose.

> Temporary intermediate output file should be renamed before copying to destination filesystem
> ---------------------------------------------------------------------------------------------
>
>                 Key: SPARK-13628
>                 URL: https://issues.apache.org/jira/browse/SPARK-13628
>             Project: Spark
>          Issue Type: Improvement
>          Components: Input/Output
>    Affects Versions: 1.6.0
>            Reporter: Chen He
>
> Spark executors write a temporary file to the local temp dir, copy it to the
> destination filesystem, and then rename it. This can be costly for blob stores
> (such as OpenStack Swift), which perform an actual copy when a file is renamed.
> If it does not affect other components, we could switch the order of the copy
> and the rename so that Spark can use a blob store as the final output destination.
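The reason for renaming last can be illustrated with a minimal sketch of the write-then-rename pattern on a local POSIX filesystem. This is not Spark's or Hadoop's committer code; the function name `publish_atomically` and the `_temporary_` prefix are hypothetical and chosen only for illustration.

```python
import os
import tempfile


def publish_atomically(data: bytes, final_path: str) -> None:
    """Write data under a temporary name, then rename it to final_path.

    Hypothetical helper for illustration only; not part of Spark.
    """
    dest_dir = os.path.dirname(os.path.abspath(final_path))
    # Create the temporary file in the destination directory so the
    # final rename stays within one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=dest_dir, prefix="_temporary_")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())
        # The file becomes visible under its final name only after it
        # is fully written. On a local POSIX filesystem this rename is
        # atomic, so a reader never finds a partial file at final_path.
        os.replace(tmp_path, final_path)
    except BaseException:
        # Clean up the temporary file if writing or renaming failed.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

Swapping the order, so that the file is written directly under its final name, would expose a half-written file to any reader that lists the directory in the meantime. On a store where rename is implemented as a copy, the last step is no longer cheap, which is the cost the reporter describes.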