Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change 
notification.

The "HadoopIPv6" page has been changed by SomeOtherAccount:
http://wiki.apache.org/hadoop/HadoopIPv6?action=diff&rev1=4&rev2=5

Comment:
Removed a bunch of pontification, wrong information, and second guessing about 
the design.  Thanks to  Jan Schaumann for pointing it out.

  = Hadoop and IPv6 =
  
- Apache Hadoop does not currently support IPv6 networks, it uses IPv4 
addresses for communicating between nodes. This is because Hadoop is designed 
to work in private datacenters, which usually have private IP addresses in the 
10.x.x.x address space.  
+ Apache Hadoop is not currently supported on IPv6 networks. It has only been 
tested and developed on IPv4 stacks.  Hadoop needs IPv4 to work, and only IPv4 
clients can talk to the cluster.If your organisation moves to IPv6 only, you 
will encounter problems.
  
+ Some things to be aware of:
-  1. Using IPv4 addresses everywhere provides a single form of TCP addressing 
for all our tests. Different network configurations (DNS, reverse DNS, DNS 
caching) still provide lots of problems and performance issues, but there is no 
need to worry about which IP protocol version is used.
-  1. Shorter addresses make for shorter packets, which can have a benefit on 
busy networks. 
- 
- This does not mean that the Hadoop team thinks that IPv4 is the best ever 
network protocol and that there is no reason to upgrade ever, only that it 
works well in datacenters. If you are using Hadoop in other places you may 
encounter problems. A key limitation of this design decision is that it means 
Hadoop needs IPv4 to work, and only IPv4 clients can talk to the cluster. 
Equally critically, MapReduce jobs cannot talk to services, including web 
services, that only work on IPv6. If your organisation moves to IPv6 only, you 
will encounter problems.
- 
- 
- In the mean time, the main concern is that the linux distribution tries to 
force Hadoop to use IPv6, which does not work.
-  1. Many recent Linux distributions do not allow you to turn IPv6 off. There 
is a risk that Hadoop or Jetty-under-Hadoop has picked up an IPv6 address, 
which is why other machines may not be able to talk to it.
-  1. Later Linux releases default to being IPv6 only. That means unless the 
systems are configured to re-enable IPv4, some machines break. As of Jan 2010, 
this was causing problems in Debian 
[[http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=560044|1]], 
[[http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=560056|2]], which is then 
leading to bug reports in other programs 
[[http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6342561|Sun bug 
database]], [[https://issues.apache.org/jira/browse/HADOOP-6056|Apache Jira]].
+  1. Some Linux releases default to being IPv6 only. That means unless the 
systems are configured to re-enable IPv4, some machines will break. As of Jan 
2010, this was causing problems in Debian 
[[http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=560044|1]], 
[[http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=560056|2]], which is then 
leading to bug reports in other programs 
[[http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6342561|Sun bug 
database]], [[https://issues.apache.org/jira/browse/HADOOP-6056|Apache Jira]].
+  1. Binding Hadoop to an IPv6-only interface/name will not work.
   1. If you see NoRouteToHost error messages, this may be the cause.
  
- == Fixing ==
+ == Fixing the Debian Problem ==
  
  To fix this, you need to change your system to allow IPv4 addressing. 
  

Reply via email to