[
https://issues.apache.org/jira/browse/HDFS-9020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729751#comment-14729751
]
Chris Douglas commented on HDFS-9020:
-------------------------------------
Talking with [~jghoman] and [~daryn], the following seems like the least
objectionable modification to WebHDFS.
# Change the server response for {{PUT}} to return a session cookie for the
stream, addressing a client cache at the DN
# Subsequent {{POST}} operations including this cookie address a client
instance, maintained per user (the session cookie is _not_ used for auth)
# If the instance is not found (possibly because the client is retrying and
directed to another server), a new client is created and the lease recovered
for the file
# {{POST}} (append) operations include an optional, expected offset for the
stream
# {{POST}} (append) operations include an optional header/trailer specifying
whether the stream should be flushed, synced, or closed
Backwards compatibility is maintained by using only the chunked {{PUT}}
operation if the server does not return a session cookie (i.e., the existing
code path).
Thoughts? I'll also attach a brief list of alternative designs for discussion.
> Support hflush/hsync in WebHDFS
> -------------------------------
>
> Key: HDFS-9020
> URL: https://issues.apache.org/jira/browse/HDFS-9020
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: webhdfs
> Reporter: Chris Douglas
> Attachments: HDFS-9020-alt.txt
>
>
> In the current implementation, hflush/hsync have no effect on WebHDFS
> streams, particularly w.r.t. visibility to other clients. This proposes to
> extend the protocol and implementation to enable this functionality.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)