Re: accessing hdfs cluster through ssh tunnel

2019-09-16 Thread saurabh pratap singh
Hi all So I was not satisfied with the above mentioned approach and tried hadoop socks server config at client end and used ssh with -D option as mentioned by Hariharan Iyer (Thank you for that) and it worked as expected without the need of opening separate ssh tunnels for data nodes. Thanks. On

Re: accessing hdfs cluster through ssh tunnel

2019-09-13 Thread saurabh pratap singh
Thank you all for your help . Solution that worked for me is as follows: I opened ssh tunnel for namenode which ensure that hadoop fs -ls works In order for hadoop fs -put to work (as it was timing out because namenode was returning private ip addresses of datanode which cant be resolved by edge ma

Re: accessing hdfs cluster through ssh tunnel

2019-09-13 Thread Hariharan Iyer
You will have to use a socks proxy (-D option in ssh tunnel). In addition, when invoking hadoop fs command, you will have to add -Dsocks.proxyHost and - Dsocks.proxyPort. Thanks, Hariharan On Thu, 12 Sep 2019, 23:26 saurabh pratap singh, wrote: > Thank you so much for your reply . > I have furt

Re: accessing hdfs cluster through ssh tunnel

2019-09-13 Thread Julien Laurenceau
Hi Hadoop is designed to avoid proxy as it will act as a bottleneck. Namenodes are used to obtain a direct socket client / datanodes that is specific to each job. Le ven. 13 sept. 2019 à 14:21, Tony S. Wu a écrit : > You need connectivity from edge node to the entire cluster, not just > namenode

Re: accessing hdfs cluster through ssh tunnel

2019-09-12 Thread saurabh pratap singh
Thank you so much for your reply . I have further question there are some blogs which talks about some similar setup like this one https://github.com/vkovalchuk/hadoop-2.6.0-windows/wiki/How-to-access-HDFS-behind-firewall-using-SOCKS-proxy I am just curious how does that works. On Thu, Sep 12,

Re: accessing hdfs cluster through ssh tunnel

2019-09-12 Thread Tony S. Wu
You need connectivity from edge node to the entire cluster, not just namenode. Your topology, unfortunately, probably won’t work too well. A proper VPN / IPSec tunnel might be a better idea. On Thu, Sep 12, 2019 at 12:04 AM saurabh pratap singh < saurabh.cs...@gmail.com> wrote: > Hadoop version :

Re: accessing HDFS Snapshot using HDFS NFS Gateway

2019-05-15 Thread Mukul Kumar Singh
Hi Markus, HDFS NFS gateway currently does not support snapshots. Following issues are tracking this in HDFS. https://issues.apache.org/jira/browse/HDFS-5084 https://issues.apache.org/jira/browse/HDFS-11315 However we do not have a fix for these jira. Thanks, Mukul On 5/9/19 1:16 AM, mark

RE: Accessing HDFS

2013-07-15 Thread John Lilley
Thanks! They are fine, I was just confused seeing them talked about in forums. John -Original Message- From: Harsh J [mailto:ha...@cloudera.com] Sent: Friday, July 05, 2013 8:01 PM To: Subject: Re: Accessing HDFS These APIs (ClientProtocol, DFSClient) are not for Public access

Re: Accessing HDFS

2013-07-05 Thread Harsh J
These APIs (ClientProtocol, DFSClient) are not for Public access. Please do not use them in production. The only API we care not to change incompatibly are the FileContext and the FileSystem APIs. They provide much of what you want - if not, log a JIRA. On Fri, Jul 5, 2013 at 11:40 PM, John Lilley