Support efficient Hadoop distcp from external clusters
------------------------------------------------------

                 Key: WHIRR-81
                 URL: https://issues.apache.org/jira/browse/WHIRR-81
             Project: Whirr
          Issue Type: New Feature
          Components: service/hadoop
            Reporter: Tom White


On EC2 currently all external traffic to a Hadoop cluster is proxied through 
the namenode, which make distcp impractical. This JIRA is to explore ways to 
improve this operation, possible candidates include a SocketFactory 
implementation that is aware of the cloud provider's networking (and can supply 
the public addresses appropriately), or a VPN. Ideally this would support 
different cloud providers, although it is possible that different providers 
need different solutions.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to