[ 
https://issues.apache.org/jira/browse/HIVE-4773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13757365#comment-13757365
 ] 

Shuaishuai Nie commented on HIVE-4773:
--------------------------------------

The problem seems not exclusive for asv. According to "Hadoop the definitive 
guide" 3nd edition P75, "HDFS trades off some POSIX requirements for 
performance, so some operations
may behave differently than you expect them to." "any content written to the 
file is not guaranteed to be visible, even if the stream is flushed".
Not sure if this will break Yarn if it does container reuse. One safer way is 
to use "FSDataOutputStream" instead of "PrintWriter" which implement function 
sync() to ensure data written up to that point in the file is visible to user 
in HDFS.
                
> Templeton intermittently fail to commit output to file system
> -------------------------------------------------------------
>
>                 Key: HIVE-4773
>                 URL: https://issues.apache.org/jira/browse/HIVE-4773
>             Project: Hive
>          Issue Type: Bug
>          Components: WebHCat
>            Reporter: Shuaishuai Nie
>            Assignee: Shuaishuai Nie
>         Attachments: HIVE-4773.1.patch
>
>
> With ASV as a default FS, we saw instances where output is not fully flushed 
> to storage before the Templeton controller process exits. This results in 
> stdout and stderr being empty even though the job completed successfully.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to