Hi, We have deployed hadoop cluster on EC2, hadoop version 0.20.1. We are having couple of data nodes. We want to get some files from the data node which is there on the amazon ec2 instance to our local instance using java application, which in turn use SequentialFile.reader to read file. The problem is amazon uses private IP for host communication, but to connect form the environment other than amazon we will be using public IP. So when we try to connect to the data nodes via name node, it will report data node's private IP and using the same we are not able to reach the data node. Is there any way we can set name node to send data nodes public NAT IP not the internal IP, or any other work around is there to overcome this problem.
Thanks Viral.