Typically users ssh edge node which is co-located with the cluster. It also minimizes latency between client and cluster.
— Sent from Mailbox On Sat, Jun 7, 2014 at 7:12 AM, Peyman Mohajerian <mohaj...@gmail.com> wrote: > In my experience you build a node called Edge Node which has all the > libraries and configuration setting in XML to connect to the cluster, it > just doesn't have any of the Hadoop daemons running. > On Wed, Jun 4, 2014 at 2:46 PM, John Lilley <john.lil...@redpoint.net> > wrote: >> We’ve found that much of the Hadoop samples assume that running is being >> done form a cluster node, and that the connection information can be >> gleaned directly from a configuration object. However, we always run our >> client from a remote computer, and our users must manually specify the >> NN/RM addresses and ports. We’ve found this varies maddeningly between >> distros and especially on hosted virtual implementations. Getting the >> wrong port results in various inscrutable errors with red-herring messages >> about security. Is there a prescribed way to get the correct connection >> information more easily, like from a web API (where at least we’d only need >> one address and port)? >> >> >> >> john >> >> >>