[ https://issues.apache.org/jira/browse/HADOOP-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Doug Cutting updated HADOOP-1563:
---------------------------------
Attachment: httpfs.patch
Attaching a FileSystem for HTTP-browsable directories.
This assumes that:
* the URL space is hierarchical and slash-delimited;
* URLs of directories always end in a slash;
* a GET of a non-slash-terminated directory URL redirects to the
slash-terminated URL;
* the content of a directory URL contains the URLs of its children;
* child URLs can be extracted from parent content with a regular expression.
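To make the assumptions above concrete, here is a minimal sketch of how child URLs might be pulled out of a directory page. It is written in Python for brevity and is not the code in httpfs.patch; the pattern `HREF_PATTERN`, the function `list_children`, and the example URLs are all hypothetical.

```python
import re
from urllib.parse import urljoin

# Hypothetical pattern: captures href values in an HTML index page,
# skipping links that carry a query string or fragment (e.g. sort links).
HREF_PATTERN = re.compile(r'href="([^"?#]+)"', re.IGNORECASE)


def list_children(parent_url, content):
    """Return (url, is_directory) pairs for the direct children of parent_url."""
    if not parent_url.endswith("/"):
        raise ValueError("directory URLs must end in a slash")
    children = []
    for href in HREF_PATTERN.findall(content):
        # Resolve relative links against the slash-terminated parent.
        child = urljoin(parent_url, href)
        # Keep only URLs strictly below the parent (drops "../" and self links).
        if not child.startswith(parent_url) or child == parent_url:
            continue
        relative = child[len(parent_url):]
        # A direct child has no slash other than an optional trailing one.
        if "/" in relative.rstrip("/"):
            continue
        entry = (child, child.endswith("/"))  # trailing slash marks a directory
        if entry not in children:
            children.append(entry)
    return children


if __name__ == "__main__":
    page = '<a href="../">Parent</a> <a href="sub/">sub/</a> <a href="part-0">part-0</a>'
    for url, is_dir in list_children("http://example.org/data/", page):
        print(url, "(dir)" if is_dir else "(file)")
```

The trailing-slash convention is what lets a listing distinguish files from directories without an extra request per child.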
This seems to work for directory listings produced by Apache, Tomcat, Jetty,
and Subversion. If we make HDFS browsable over HTTP in the above manner, then
this will work for HDFS too.
I've also added default definitions for a bunch of abstract FileSystem methods,
and removed definitions in implementations that matched these default
definitions, simplifying most FileSystem implementations.
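The general shape of that refactoring can be illustrated as follows. This is only a sketch in Python with hypothetical class and method names; which FileSystem methods actually received defaults in the patch, and what those defaults do, is not stated in this message.

```python
class FileSystem:
    """Hypothetical base class; method names are illustrative only."""

    def open(self, path):
        # No sensible default: every implementation must supply this.
        raise NotImplementedError

    def create(self, path):
        # Assumed default: treat the file system as read-only.
        raise IOError("create not supported by " + type(self).__name__)

    def get_replication(self, path):
        # Assumed default for file systems without replication.
        return 1


class HttpFileSystem(FileSystem):
    """Overrides only the behavior that differs from the defaults."""

    def open(self, path):
        return "stream for " + path  # placeholder for a real HTTP GET
```

With defaults in the base class, a read-only implementation such as this one no longer needs to repeat identical stub methods.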
> Create FileSystem implementation to read HDFS data via http
> -----------------------------------------------------------
>
> Key: HADOOP-1563
> URL: https://issues.apache.org/jira/browse/HADOOP-1563
> Project: Hadoop
> Issue Type: New Feature
> Components: fs
> Affects Versions: 0.14.0
> Reporter: Owen O'Malley
> Assignee: Chris Douglas
> Attachments: httpfs.patch
>
>
> There should be a FileSystem implementation that can read from a Namenode's
> http interface. This would have a couple of useful abilities:
> 1. Copy using distcp between different versions of HDFS.
> 2. Use map/reduce inputs from a different version of HDFS.