Re: using distcp for http source files

2009-01-23 Thread Doug Cutting
Can you please attach your latest version of this to https://issues.apache.org/jira/browse/HADOOP-496?
Thanks, Doug
Boris Musykantski wrote:
> We have fixed some patches in JIRA for supporting a WebDAV server on top of HDFS, updated them to work with a newer version (0.18.0 IIRC), and added support for

Re: using distcp for http source files

2009-01-22 Thread Boris Musykantski
We have fixed some patches in JIRA for supporting a WebDAV server on top of HDFS, updated them to work with a newer version (0.18.0 IIRC), and added support for permissions. See code and description here: http://www.hadoop.iponweb.net/Home/hdfs-over-webdav
Hope it is useful,
Regards, Boris, IPonWeb
On

Re: using distcp for http source files

2009-01-22 Thread Doug Cutting
Aaron Kimball wrote:
> Is anyone aware of an OSS WebDAV library that could be wrapped in a FileSystem implementation?
We'd need a Java WebDAV client to talk to foreign filesystems. But to expose HDFS to foreign filesystems (i.e., to better support mounting HDFS) we'd need a Java WebDAV server

Re: using distcp for http source files

2009-01-22 Thread Doug Cutting
Aaron Kimball wrote:
> Doesn't the WebDAV protocol use http for file transfer, and support reads / writes / listings / etc?
Yes. Getting a WebDAV-based FileSystem in Hadoop has long been a goal. It could replace libhdfs, since there are already WebDAV-based FUSE filesystems for Linux (wdfs,
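To make the "listings" part concrete: WebDAV lists a directory with a PROPFIND request carrying a `Depth: 1` header, and the server answers with a 207 Multi-Status XML body containing one `response` element per entry. The following is only a minimal Python sketch of that exchange, not the Hadoop FileSystem wrapper discussed in this thread. The function names `parse_listing` and `list_directory` are hypothetical, and the host and path passed in are whatever WebDAV server you happen to have.

```python
import http.client
import xml.etree.ElementTree as ET

# Request body asking the server for all properties of each entry.
PROPFIND_BODY = (
    '<?xml version="1.0" encoding="utf-8"?>'
    '<D:propfind xmlns:D="DAV:"><D:allprop/></D:propfind>'
)


def parse_listing(multistatus_xml):
    """Return the href of every <response> in a 207 Multi-Status body."""
    root = ET.fromstring(multistatus_xml)
    hrefs = []
    for response in root.findall("{DAV:}response"):
        href = response.find("{DAV:}href")
        if href is not None and href.text:
            hrefs.append(href.text)
    return hrefs


def list_directory(host, path, port=80):
    """Hypothetical helper: list one directory level over plain http."""
    conn = http.client.HTTPConnection(host, port)
    try:
        conn.request(
            "PROPFIND",
            path,
            body=PROPFIND_BODY,
            # Depth: 1 means the collection itself plus its direct children.
            headers={"Depth": "1", "Content-Type": "application/xml"},
        )
        resp = conn.getresponse()
        if resp.status != 207:
            raise RuntimeError("unexpected status %d" % resp.status)
        return parse_listing(resp.read())
    finally:
        conn.close()
```

Reads and writes map onto ordinary GET and PUT requests, so a client wrapper would mostly need PROPFIND for listings and file status on top of plain http.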

Re: using distcp for http source files

2009-01-22 Thread Aaron Kimball
Doesn't the WebDAV protocol use http for file transfer, and support reads / writes / listings / etc? Is anyone aware of an OSS WebDAV library that could be wrapped in a FileSystem implementation?
- Aaron
On Wed, Jan 21, 2009 at 3:30 PM, Doug Cutting wrote:
> Derek Young wrote:
> >> Reading htt

Re: using distcp for http source files

2009-01-21 Thread Doug Cutting
Derek Young wrote:
> Reading http://issues.apache.org/jira/browse/HADOOP-341 it sounds like this should be supported, but the http URLs are not working for me. Are http source URLs still supported?
No. They used to be supported, but when distcp was converted to accept any Path this stopped wo

Re: using distcp for http source files

2009-01-21 Thread Derek Young
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
    at org.apache.hadoop.tools.DistCp.main(DistCp.java:871)
> > - Original Message
> > From: Derek Young
> > To: core-u...@...
> > Sent: Wednesday, January 21, 2009 1:23:56 PM
> > Su

Re: using distcp for http source files

2009-01-21 Thread Tsz Wo (Nicholas), Sze
@hadoop.apache.org
> Sent: Wednesday, January 21, 2009 1:23:56 PM
> Subject: using distcp for http source files
>
> I plan to use hadoop to do some log processing and I'm working on a method to
> load the files (probably nightly) into hdfs. My plan is to have a web serve

using distcp for http source files

2009-01-21 Thread Derek Young
I plan to use hadoop to do some log processing and I'm working on a method to load the files (probably nightly) into hdfs. My plan is to have a web server on each machine with logs that serves up the log directories. Then I would give distcp a list of http URLs of the log files and have it co
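Since the replies in this thread say distcp no longer accepts http sources, one possible workaround is to fetch each URL yourself and pipe it into `hadoop fs -put - <dst>`, which reads from stdin. The following is a rough Python sketch of that idea, not distcp and not anything proposed by the posters. The helper names, the example host, and the destination layout (`<dest_dir>/<host>-<file>`) are assumptions made for illustration, and the sketch assumes the `hadoop` command is on the PATH and that `dest_dir` already exists in HDFS.

```python
import posixpath
import shutil
import subprocess
import urllib.request
from urllib.parse import urlparse


def hdfs_destination(url, dest_dir):
    """Map an http URL to dest_dir/<host>-<file name> (assumed layout).

    The host name is kept so same-named logs from different machines
    do not overwrite each other.
    """
    parsed = urlparse(url)
    name = posixpath.basename(parsed.path)
    if not parsed.hostname or not name:
        raise ValueError("URL needs a host and a file name: " + url)
    return posixpath.join(dest_dir, parsed.hostname + "-" + name)


def put_command(dest):
    """Build the command line that writes stdin to an HDFS path."""
    return ["hadoop", "fs", "-put", "-", dest]


def copy_url_to_hdfs(url, dest_dir):
    """Stream one URL into HDFS without staging it on local disk."""
    dest = hdfs_destination(url, dest_dir)
    with urllib.request.urlopen(url) as response:
        proc = subprocess.Popen(put_command(dest), stdin=subprocess.PIPE)
        shutil.copyfileobj(response, proc.stdin)
        proc.stdin.close()
        if proc.wait() != 0:
            raise RuntimeError("hadoop fs -put failed for " + dest)
    return dest
```

A nightly job would call `copy_url_to_hdfs` once per log URL. Unlike distcp, this copies sequentially from a single machine, so it only suits modest volumes unless the URL list is split across several workers.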