On Aug 18, 2011, at 1:25 AM, Ted Dunning wrote: > It should be noted that the public names resolve to internal address inside > EC2. That means that traffic runs as cheaply as possible.
Oh cool, that's good to know. > Regarding the identification of a host, the idea that there is a DNS name on > the far side of the connections that matches anything on the local side is > just incorrect. A single machine has to be considered to have multiple > interfaces, multiple versions of host name and multiple names that resolve > to addresses that reach it from other machines. That these don't match up > very well is only to be expected. This would definitely be ideal, but the problem is that our apps depend on a system that makes this assumption (HDFS). Hostnames are how HDFS communicates location information to clients such as the JobTracker. Matei
