[ 
https://issues.apache.org/jira/browse/HDFS-9607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15082848#comment-15082848
 ] 

Dinesh S. Atreya commented on HDFS-9607:
----------------------------------------

Regarding the comment below: we want to be clear from the outset that these 
proposals differ from standard POSIX, hence the use of different names such as 
"writeInPlace". It is up to the HDFS team to indicate its API naming 
preferences later on.
{quote}
We have something called the Posix specification to tell us what to call 
things. The write() operation is the one to consider here, simply having it 
fail if an update condition cannot be met; possibly add a non-side-effecting 
canWrite call which checks the conditions.
{quote}
I believe HDFS is currently not POSIX compliant. However, using 
"canWriteInPlace" instead of "checkWriteInPlace" is a good suggestion, and I 
will use it.
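
To make the intended semantics concrete, here is a minimal sketch on a local 
file, not an HDFS API. The class name "InPlaceWriteSketch" and both method 
signatures are assumptions for illustration only; the real API shape and naming 
remain up to the HDFS team. The check has no side effects, and the write fails 
rather than change the file length.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.file.Files;
import java.nio.file.Path;

/** Hypothetical illustration on a local file; this is not an HDFS API. */
final class InPlaceWriteSketch {

    /** Side-effect-free check: the update must fit inside the existing file. */
    static boolean canWriteInPlace(Path file, long offset, byte[] data)
            throws IOException {
        long length = Files.size(file);
        return offset >= 0
                && data.length <= length
                && offset <= length - data.length;
    }

    /** Overwrites bytes at the offset; fails if the length would change. */
    static void writeInPlace(Path file, long offset, byte[] data)
            throws IOException {
        if (!canWriteInPlace(file, offset, data)) {
            throw new IOException("in-place write would change the file length");
        }
        try (RandomAccessFile raf = new RandomAccessFile(file.toFile(), "rw")) {
            raf.seek(offset);   // position at the region to overwrite
            raf.write(data);    // stays within the current end of file
        }
    }
}
```

For a file containing "Hello World", writeInPlace(file, 6, "HDFS!") yields 
"Hello HDFS!" with the length unchanged, whereas the same write at offset 7 
would be rejected because it would extend the file.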

I had also thought along the following lines:
{quote}
I know that the core HDFS write pipeline is one of the most critical parts of 
HDFS, so leave it alone.
{quote}
There are a number of compelling reasons for *write-in-place* enhancements; the 
detailed list of business capabilities in [parent/umbrella JIRA HADOOP-12620 | 
https://issues.apache.org/jira/browse/HADOOP-12620] is one.
Those interested can also check out 
https://www.mapr.com/resources/videos/whiteboard-walkthrough-append-only-vs-read-write-file-system
 
My thinking has been that there should be an easier way to provide 
equivalent capabilities while furthering HDFS, without attempting to provide 
full-fledged POSIX compliance. This JIRA and its theme are an attempt towards 
that end.

Clearly, the whole process has to be very careful and deliberate. I 
will add the implementation design in due course, covering how to tackle 
reliable in-place updates in HDFS, including for encrypted data, and how to 
cope with failures of one or more nodes during the update process. (I have not 
been following erasure-coding developments so far.)

> Advance Hadoop Architecture (AHA) - HDFS
> ----------------------------------------
>
>                 Key: HDFS-9607
>                 URL: https://issues.apache.org/jira/browse/HDFS-9607
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Dinesh S. Atreya
>
> Link to Umbrella JIRA
> https://issues.apache.org/jira/browse/HADOOP-12620 
> Provide the capability to carry out in-place writes/updates. Only in-place 
> writes that leave the existing length unchanged are supported.
> For example, "Hello World" can be replaced by "Hello HDFS!"
> See 
> https://issues.apache.org/jira/browse/HADOOP-12620?focusedCommentId=15046300&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15046300
>  for more details.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
