Hi Roopa,
Firstly, can you get fuse-dfs working against a plain HDFS instance?
There is also a debug mode for fuse: enable this by adding -d on the
command line.
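For example, with a mount like this (namenode address and mount point
are placeholders):

    ./fuse_dfs dfs://namenode:9000 /mnt/hdfs -d

-d keeps the fuse process in the foreground and prints each file
system operation as it is received, which should show where things go
wrong.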
C
Roopa Sudheendra wrote:
Hey Craig,
I tried the way you suggested, but I get a "transport endpoint not
connected" error. Can I see the logs anywhere? I don't see anything in
/var/log/messages either.
It looks like it tries to create the file system in hdfs.c, but I'm
not sure where it fails.
I have the Hadoop home set, so I believe it gets the config info.
Any ideas?
Thanks,
Roopa
On Jan 28, 2009, at 1:59 PM, Craig Macdonald wrote:
In theory, yes.
On inspection of libhdfs, which underlies fuse-dfs, I note that:
* libhdfs takes a host and port number as input when connecting, but
not a scheme (hdfs, etc.). The easiest option would be to set S3 as
your default file system in your hadoop-site.xml, then use the host
"default". That should get libhdfs to use the S3 file system, i.e.
set fuse-dfs to mount dfs://default:0/ and all should work as planned
(see the example after this list).
* libhdfs also casts the FileSystem to a DistributedFileSystem for
the df command. This would fail in your case. This issue is currently
being worked on - see HADOOP-4368
https://issues.apache.org/jira/browse/HADOOP-4368.
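For example, with fs.default.name in your hadoop-site.xml pointing at
your s3:// bucket (as described in your original mail), the mount
would be roughly (mount point is a placeholder):

    ./fuse_dfs dfs://default:0/ /mnt/s3 -d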
C
Roopa Sudheendra wrote:
Thanks for the response, Craig.
I looked at the fuse-dfs C code, and it looks like it does not accept
anything other than "dfs://". Given that Hadoop can connect to the S3
file system, would allowing the s3 scheme solve my problem?
Roopa
On Jan 28, 2009, at 1:03 PM, Craig Macdonald wrote:
Hi Roopa,
I can't comment on the S3 specifics. However, fuse-dfs is based on a
C interface called libhdfs which allows C programs (such as
fuse-dfs) to connect to the Hadoop file system Java API. This being
the case, fuse-dfs should (theoretically) be able to connect to any
file system that Hadoop can. Your mileage may vary, but if you find
issues, please do report them through the normal channels.
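For illustration, a minimal libhdfs client looks roughly like this (a
sketch against the hdfs.h API; error handling mostly elided):

    #include <stdio.h>
    #include "hdfs.h"

    int main(void) {
        /* Host "default" and port 0 tell libhdfs to use whatever
         * fs.default.name in the Hadoop config points at, regardless
         * of the underlying file system implementation. */
        hdfsFS fs = hdfsConnect("default", 0);
        if (fs == NULL) {
            fprintf(stderr, "hdfsConnect failed\n");
            return 1;
        }
        /* ... normal file operations via the libhdfs API ... */
        hdfsDisconnect(fs);
        return 0;
    }

fuse-dfs sits on top of exactly this interface, which is why the
backing file system is, in principle, interchangeable.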
Craig
Roopa Sudheendra wrote:
I am experimenting with Hadoop backed by the Amazon S3 file system as
one of our backup storage solutions. So far, just Hadoop and S3 (block
based, since it overcomes the 5 GB limit) seems to be fine.
My problem is that I want to mount this file system using fuse-dfs
(since then I don't have to worry about how the file is written on the
system). Since the namenode does not get started with an S3-backed
Hadoop system, how can I connect fuse-dfs to this setup?
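(For reference, the S3 block store is configured along these lines in
hadoop-site.xml; bucket name and credentials are placeholders:

    <property>
      <name>fs.default.name</name>
      <value>s3://your-backup-bucket</value>
    </property>
    <property>
      <name>fs.s3.awsAccessKeyId</name>
      <value>YOUR_ACCESS_KEY</value>
    </property>
    <property>
      <name>fs.s3.awsSecretAccessKey</name>
      <value>YOUR_SECRET_KEY</value>
    </property>
)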
Appreciate your help.
Thanks,
Roopa