[ 
https://issues.apache.org/jira/browse/HDFS-2316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100891#comment-13100891
 ] 

Alejandro Abdelnur commented on HDFS-2316:
------------------------------------------

IMO, the nice thing about #2 is that the file path of an *HDFS:* and an *HTTP:* 
URI will be exactly the same, and in the case of the NN/DD deployment of 
HOOP it will even be the same host. 

In addition, it is intuitive, without any caveat: a given path will just work 
by replacing the SCHEME://HOST:PORT part of it. 
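
To illustrate the point, here is a minimal sketch in plain Java of swapping 
only the SCHEME://HOST:PORT part of a URI. The class name, host name, ports and 
file path are all made up for the example; nothing here is HOOP or Hadoop API.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper for illustration only; not part of Hadoop or HOOP.
final class SchemeSwap {

    /** Replaces scheme, host and port; path, query and fragment are kept as-is. */
    static URI withEndpoint(URI original, String scheme, String host, int port) {
        try {
            return new URI(scheme, null, host, port,
                    original.getPath(), original.getQuery(), original.getFragment());
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        // Assumed example endpoint and path.
        URI hdfs = URI.create("hdfs://nn.example.com:8020/user/alice/data.txt");
        URI http = withEndpoint(hdfs, "http", "nn.example.com", 14000);
        System.out.println(http);
    }
}
```

With no prefix on the HTTP side, the path component is carried over untouched, 
so no per-scheme path rewriting is needed.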

Finally, and IMO this is very important from the usability perspective, user 
applications that are designed to take the URI of the FS as a parameter and 
operate via HDFS: or HTTP: will otherwise be difficult to code. Hadoop's 
*Path(String parent, String child)* uses *URI.resolve(...)*, which applies 
well-defined logic to resolve URIs against other URIs [ 
http://download.oracle.com/javase/6/docs/api/java/net/URI.html#resolve(java.net.URI)
 ]. If we use a prefix for HTTP URIs then it will become difficult and 
error-prone to compose HDFS: URIs from HTTP: URIs and vice versa. (And I 
believe the same is true for libraries in other languages.)
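
As a rough sketch of why a prefix is error-prone, the following uses 
java.net.URI.resolve directly (not Hadoop's Path). The host, ports, paths and 
the "/prefix" segment are assumptions chosen for the example, not the actual 
proposal.

```java
import java.net.URI;

// Hypothetical demo class; it only exercises the standard java.net.URI API.
final class ResolveDemo {

    /** Resolves a child reference against a base URI using URI.resolve. */
    static URI child(String base, String child) {
        return URI.create(base).resolve(child);
    }

    public static void main(String[] args) {
        // Same path namespace: both schemes resolve children identically.
        System.out.println(child("hdfs://nn.example.com:8020/user/alice/", "logs/part-0"));
        System.out.println(child("http://nn.example.com:14000/user/alice/", "logs/part-0"));

        // With an assumed "/prefix" on the HTTP side, an absolute child path
        // replaces the whole base path, so the prefix is silently dropped.
        System.out.println(child("http://nn.example.com:14000/prefix/user/alice/", "/tmp/out"));
    }
}
```

In the last call the result no longer carries the prefix, so code that works 
for HDFS: URIs would need special handling to produce a valid HTTP: URI.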

Also, I have not seen HDFS files under */data* as a common practice, so the 
name collision should not be that common.


> webhdfs: a complete FileSystem implementation for accessing HDFS over HTTP
> --------------------------------------------------------------------------
>
>                 Key: HDFS-2316
>                 URL: https://issues.apache.org/jira/browse/HDFS-2316
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Tsz Wo (Nicholas), SZE
>            Assignee: Tsz Wo (Nicholas), SZE
>
> We currently have hftp for accessing HDFS over HTTP.  However, hftp is a 
> read-only FileSystem and does not provide "write" access.
> In HDFS-2284, we propose to have webhdfs for providing a complete FileSystem 
> implementation for accessing HDFS over HTTP.  This is the umbrella JIRA for 
> the tasks.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
