Hi Andrei,
I changed it to return the hostIP if the DNSResolution fails. The
Authorization went fine after that but another exception was thrown. Just
figuring out if the issue is because of this change.
Authorizing firewall ingress to [us-east-1/i-7e9af91c] on ports [2181] for [
122.172.3.56/32]
Authorizing firewall ingress to [us-east-1/i-7e9af91c] on ports [50070] for
[122.172.3.56/32]
Authorizing firewall ingress to [us-east-1/i-7e9af91c] on ports [8020,
8021] for [50.16.100.176/32]
Unable to start the cluster. Terminating all nodes.
java.lang.NullPointerException
at
org.apache.whirr.service.hadoop.HadoopConfigurationBuilder.buildMapReduceConfiguration(HadoopConfigurationBuilder.java:113)
at
org.apache.whirr.service.hadoop.HadoopConfigurationBuilder.buildMapReduce(HadoopConfigurationBuilder.java:71)
at
org.apache.whirr.service.hadoop.HadoopClusterActionHandler.createHadoopConfigFiles(HadoopClusterActionHandler.java:102)
at
org.apache.whirr.service.hadoop.HadoopClusterActionHandler.beforeConfigure(HadoopClusterActionHandler.java:88)
at
org.apache.whirr.service.ClusterActionHandlerSupport.beforeAction(ClusterActionHandlerSupport.java:53)
at
org.apache.whirr.actions.ScriptBasedClusterAction.execute(ScriptBasedClusterAction.java:100)
at
org.apache.whirr.ClusterController.configureServices(ClusterController.java:148)
at
org.apache.whirr.ClusterController.launchCluster(ClusterController.java:110)
at
org.apache.whirr.cli.command.LaunchClusterCommand.run(LaunchClusterCommand.java:63)
at org.apache.whirr.cli.Main.run(Main.java:64)
at org.apache.whirr.cli.Main.main(Main.java:97)
Finished running destroy phase scripts on all cluster instances
Cheers,
Akash A
On Mon, Jan 2, 2012 at 9:50 PM, Akash Ashok <[email protected]> wrote:
>
>
> On Mon, Jan 2, 2012 at 12:45 AM, Andrei Savu <[email protected]>wrote:
>
>> On Jan 1, 2012 7:02 PM, "Akash Ashok" <[email protected]> wrote:
>> >
>> > Hi Andrei,
>> > Does it try to connect back to the system from which Whirr is run by any
>> > chance? I am not aware as to why reverse DNS resolution happens in this
>> > case. But if its tries to connect back that doesn't work as my system is
>> > behind a NAT.
>>
>> No. We are using the hostnames only for configuration.
>>
>> >
>> > I am asking this because the cluster launch fails during
>> > "Authorizing firewall ingress to [us-east-1/i-aec053cc] on ports [2181]
>> for
>> > [122.172.3.56/32]"
>>
>> It should not fail.
>>
>> >
>> > If its trying to establish a connection then even if the DNS resolution
>> > part is fixed, it would fail when it tries to connect during the
>> subsequent
>> > steps right ?
>>
>> If there is no known hostname Hbase may not start (not sure)
>>
>> Hmmm. HBase cluster is starting fine. But after creating the cluster,
> because of the exception it is destroying the cluster. Will take a deeper
> look into this.
>
>> >
>> > Cheers,
>> > Akash A
>> >
>> >
>> > On Sat, Dec 31, 2011 at 1:50 PM, Andrei Savu <[email protected]>
>> wrote:
>> >
>> > > I think that when reverse DNS resolution fails we should just return
>> the
>> > > raw IP address. What do you think?
>> > > On Dec 31, 2011 2:08 AM, "Akash Ashok" <[email protected]>
>> wrote:
>> > >
>> > > > I have filed an issue for this
>> > > >
>> > > > https://issues.apache.org/jira/browse/WHIRR-459
>> > > >
>> > > > Cheers,
>> > > > AKash A
>> > > >
>> > > > On Fri, Dec 30, 2011 at 11:12 AM, Andrei Savu <
>> > > > [email protected]
>> > > > > wrote:
>> > > >
>> > > > > Please file an issue. It looks like a DNS failure and we should
>> do a
>> > > > better
>> > > > > job at handling this.
>> > > > > On Dec 30, 2011 5:37 AM, "Akash Ashok" <[email protected]>
>> wrote:
>> > > > >
>> > > > > > Looping In Whirr Dev.
>> > > > > > Is there a patch available for the below issue ? If not can I
>> file a
>> > > > > JIRA ?
>> > > > > >
>> > > > > > Cheers,
>> > > > > > Akash A
>> > > > > >
>> > > > > > On Tue, Dec 27, 2011 at 10:44 AM, Akash Ashok <
>> > > [email protected]
>> > > > > > >wrote:
>> > > > > >
>> > > > > > > Hi All,
>> > > > > > > I am trying to launch a hbase cluter. It creates the cluster
>> but
>> > > > fails
>> > > > > > > which configuring the security configurations. Could some1
>> please
>> > > > help
>> > > > > me
>> > > > > > > out with this ?
>> > > > > > >
>> > > > > > > [akash@localhost whirr-0.7.0]$ bin/whirr launch-cluster
>> --config
>> > > > > > > hbase-ec2.properties
>> > > > > > > Bootstrapping cluster
>> > > > > > > Configuring template
>> > > > > > > Configuring template
>> > > > > > > Starting 1 node(s) with roles [zookeeper, hadoop-namenode,
>> > > > > > > hadoop-jobtracker, hbase-master]
>> > > > > > > Starting 2 node(s) with roles [hadoop-datanode,
>> hadoop-tasktracker,
>> > > > > > > hbase-regionserver]
>> > > > > > > Nodes started: [[id=us-east-1/i-5890203a,
>> providerId=i-5890203a,
>> > > > > > > group=hbase, name=hbase-5890203a, location=[id=us-east-1c,
>> > > > scope=ZONE,
>> > > > > > > description=us-east-1c, parent=us-east-1,
>> iso3166Codes=[US-VA],
>> > > > > > > metadata={}], uri=null, imageId=us-east-1/ami-da0cf8b3,
>> > > > os=[name=null,
>> > > > > > > family=ubuntu, version=10.04, arch=paravirtual, is64Bit=true,
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> description=ubuntu-images-us/ubuntu-lucid-10.04-amd64-server-20101020.manifest.xml],
>> > > > > > > state=RUNNING, loginPort=22, hostname=domU-12-31-39-0F-94-D1,
>> > > > > > > privateAddresses=[10.193.151.31],
>> > > publicAddresses=[204.236.208.250],
>> > > > > > > hardware=[id=c1.xlarge, providerId=c1.xlarge, name=null,
>> > > > > > > processors=[[cores=8.0, speed=2.5]], ram=7168,
>> volumes=[[id=null,
>> > > > > > > type=LOCAL, size=10.0, device=/dev/sda1, durable=false,
>> > > > > > isBootDevice=true],
>> > > > > > > [id=null, type=LOCAL, size=420.0, device=/dev/sdb,
>> durable=false,
>> > > > > > > isBootDevice=false], [id=null, type=LOCAL, size=420.0,
>> > > > device=/dev/sdc,
>> > > > > > > durable=false, isBootDevice=false], [id=null, type=LOCAL,
>> > > size=420.0,
>> > > > > > > device=/dev/sdd, durable=false, isBootDevice=false], [id=null,
>> > > > > > type=LOCAL,
>> > > > > > > size=420.0, device=/dev/sde, durable=false,
>> isBootDevice=false]],
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> supportsImage=And(ALWAYS_TRUE,Or(isWindows(),requiresVirtualizationType(paravirtual)),ALWAYS_TRUE,is64Bit()),
>> > > > > > > tags=[]], loginUser=ubuntu,
>> userMetadata={Name=hbase-5890203a},
>> > > > > tags=[]]]
>> > > > > > > Nodes started: [[id=us-east-1/i-54902036,
>> providerId=i-54902036,
>> > > > > > > group=hbase, name=hbase-54902036, location=[id=us-east-1c,
>> > > > scope=ZONE,
>> > > > > > > description=us-east-1c, parent=us-east-1,
>> iso3166Codes=[US-VA],
>> > > > > > > metadata={}], uri=null, imageId=us-east-1/ami-da0cf8b3,
>> > > > os=[name=null,
>> > > > > > > family=ubuntu, version=10.04, arch=paravirtual, is64Bit=true,
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> description=ubuntu-images-us/ubuntu-lucid-10.04-amd64-server-20101020.manifest.xml],
>> > > > > > > state=RUNNING, loginPort=22, hostname=ip-10-7-29-242,
>> > > > > > > privateAddresses=[10.7.29.242],
>> publicAddresses=[75.101.240.254],
>> > > > > > > hardware=[id=c1.xlarge, providerId=c1.xlarge, name=null,
>> > > > > > > processors=[[cores=8.0, speed=2.5]], ram=7168,
>> volumes=[[id=null,
>> > > > > > > type=LOCAL, size=10.0, device=/dev/sda1, durable=false,
>> > > > > > isBootDevice=true],
>> > > > > > > [id=null, type=LOCAL, size=420.0, device=/dev/sdb,
>> durable=false,
>> > > > > > > isBootDevice=false], [id=null, type=LOCAL, size=420.0,
>> > > > device=/dev/sdc,
>> > > > > > > durable=false, isBootDevice=false], [id=null, type=LOCAL,
>> > > size=420.0,
>> > > > > > > device=/dev/sdd, durable=false, isBootDevice=false], [id=null,
>> > > > > > type=LOCAL,
>> > > > > > > size=420.0, device=/dev/sde, durable=false,
>> isBootDevice=false]],
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> supportsImage=And(ALWAYS_TRUE,Or(isWindows(),requiresVirtualizationType(paravirtual)),ALWAYS_TRUE,is64Bit()),
>> > > > > > > tags=[]], loginUser=ubuntu,
>> userMetadata={Name=hbase-54902036},
>> > > > > tags=[]],
>> > > > > > > [id=us-east-1/i-5a902038, providerId=i-5a902038, group=hbase,
>> > > > > > > name=hbase-5a902038, location=[id=us-east-1c, scope=ZONE,
>> > > > > > > description=us-east-1c, parent=us-east-1,
>> iso3166Codes=[US-VA],
>> > > > > > > metadata={}], uri=null, imageId=us-east-1/ami-da0cf8b3,
>> > > > os=[name=null,
>> > > > > > > family=ubuntu, version=10.04, arch=paravirtual, is64Bit=true,
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> description=ubuntu-images-us/ubuntu-lucid-10.04-amd64-server-20101020.manifest.xml],
>> > > > > > > state=RUNNING, loginPort=22, hostname=ip-10-108-182-53,
>> > > > > > > privateAddresses=[10.108.182.53],
>> publicAddresses=[50.16.48.211],
>> > > > > > > hardware=[id=c1.xlarge, providerId=c1.xlarge, name=null,
>> > > > > > > processors=[[cores=8.0, speed=2.5]], ram=7168,
>> volumes=[[id=null,
>> > > > > > > type=LOCAL, size=10.0, device=/dev/sda1, durable=false,
>> > > > > > isBootDevice=true],
>> > > > > > > [id=null, type=LOCAL, size=420.0, device=/dev/sdb,
>> durable=false,
>> > > > > > > isBootDevice=false], [id=null, type=LOCAL, size=420.0,
>> > > > device=/dev/sdc,
>> > > > > > > durable=false, isBootDevice=false], [id=null, type=LOCAL,
>> > > size=420.0,
>> > > > > > > device=/dev/sdd, durable=false, isBootDevice=false], [id=null,
>> > > > > > type=LOCAL,
>> > > > > > > size=420.0, device=/dev/sde, durable=false,
>> isBootDevice=false]],
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> supportsImage=And(ALWAYS_TRUE,Or(isWindows(),requiresVirtualizationType(paravirtual)),ALWAYS_TRUE,is64Bit()),
>> > > > > > > tags=[]], loginUser=ubuntu,
>> userMetadata={Name=hbase-5a902038},
>> > > > > tags=[]]]
>> > > > > > > Authorizing firewall ingress to [us-east-1/i-5890203a] on
>> ports
>> > > > [2181]
>> > > > > > for
>> > > > > > > [122.172.0.45/32]
>> > > > > > > Unable to start the cluster. Terminating all nodes.
>> > > > > > > org.apache.whirr.net.DnsException: java.net.ConnectException:
>> > > > > Connection
>> > > > > > > refused
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.net.FastDnsResolver.apply(FastDnsResolver.java:83)
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.net.FastDnsResolver.apply(FastDnsResolver.java:40)
>> > > > > > > at
>> > > > > > >
>> > > org.apache.whirr.Cluster$Instance.getPublicHostName(Cluster.java:112)
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.Cluster$Instance.getPublicAddress(Cluster.java:94)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.service.hadoop.HadoopNameNodeClusterActionHandler.doBeforeConfigure(HadoopNameNodeClusterActionHandler.java:58)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.service.hadoop.HadoopClusterActionHandler.beforeConfigure(HadoopClusterActionHandler.java:86)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.service.ClusterActionHandlerSupport.beforeAction(ClusterActionHandlerSupport.java:53)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.actions.ScriptBasedClusterAction.execute(ScriptBasedClusterAction.java:100)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.ClusterController.launchCluster(ClusterController.java:109)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.cli.command.LaunchClusterCommand.run(LaunchClusterCommand.java:63)
>> > > > > > > at org.apache.whirr.cli.Main.run(Main.java:64)
>> > > > > > > at org.apache.whirr.cli.Main.main(Main.java:97)
>> > > > > > > Caused by: java.net.ConnectException: Connection refused
>> > > > > > > at sun.nio.ch.SocketChannelImpl.checkConnect(Native
>> Method)
>> > > > > > > at
>> > > > > > >
>> > > >
>> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592)
>> > > > > > > at org.xbill.DNS.TCPClient.connect(TCPClient.java:30)
>> > > > > > > at org.xbill.DNS.TCPClient.sendrecv(TCPClient.java:118)
>> > > > > > > at
>> org.xbill.DNS.SimpleResolver.send(SimpleResolver.java:254)
>> > > > > > > at
>> > > > > > >
>> > > > >
>> > >
>> org.xbill.DNS.ExtendedResolver$Resolution.start(ExtendedResolver.java:95)
>> > > > > > > at
>> > > org.xbill.DNS.ExtendedResolver.send(ExtendedResolver.java:358)
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.net.FastDnsResolver.apply(FastDnsResolver.java:69)
>> > > > > > > ... 11 more
>> > > > > > > Unable to load cluster state, assuming it has no running
>> nodes.
>> > > > > > > java.io.FileNotFoundException:
>> /home/akash/.whirr/hbase/instances
>> > > (No
>> > > > > > such
>> > > > > > > file or directory)
>> > > > > > > at java.io.FileInputStream.open(Native Method)
>> > > > > > > at
>> java.io.FileInputStream.<init>(FileInputStream.java:137)
>> > > > > > > at com.google.common.io.Files$1.getInput(Files.java:100)
>> > > > > > > at com.google.common.io.Files$1.getInput(Files.java:97)
>> > > > > > > at
>> > > > com.google.common.io.CharStreams$2.getInput(CharStreams.java:91)
>> > > > > > > at
>> > > > com.google.common.io.CharStreams$2.getInput(CharStreams.java:88)
>> > > > > > > at
>> > > > com.google.common.io.CharStreams.readLines(CharStreams.java:306)
>> > > > > > > at com.google.common.io.Files.readLines(Files.java:580)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.state.FileClusterStateStore.load(FileClusterStateStore.java:54)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.state.ClusterStateStore.tryLoadOrEmpty(ClusterStateStore.java:58)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.ClusterController.destroyCluster(ClusterController.java:143)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.ClusterController.launchCluster(ClusterController.java:118)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.cli.command.LaunchClusterCommand.run(LaunchClusterCommand.java:63)
>> > > > > > > at org.apache.whirr.cli.Main.run(Main.java:64)
>> > > > > > > at org.apache.whirr.cli.Main.main(Main.java:97)
>> > > > > > > Starting to run scripts on cluster for phase destroyinstances:
>> > > > > > > Starting to run scripts on cluster for phase destroyinstances:
>> > > > > > > Finished running destroy phase scripts on all cluster
>> instances
>> > > > > > > Destroying hbase cluster
>> > > > > > > Cluster hbase destroyed
>> > > > > > > Exception in thread "main" java.lang.RuntimeException:
>> > > > > > > org.apache.whirr.net.DnsException: java.net.ConnectException:
>> > > > > Connection
>> > > > > > > refused
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.ClusterController.launchCluster(ClusterController.java:125)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.cli.command.LaunchClusterCommand.run(LaunchClusterCommand.java:63)
>> > > > > > > at org.apache.whirr.cli.Main.run(Main.java:64)
>> > > > > > > at org.apache.whirr.cli.Main.main(Main.java:97)
>> > > > > > > Caused by: org.apache.whirr.net.DnsException:
>> > > > > java.net.ConnectException:
>> > > > > > > Connection refused
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.net.FastDnsResolver.apply(FastDnsResolver.java:83)
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.net.FastDnsResolver.apply(FastDnsResolver.java:40)
>> > > > > > > at
>> > > > > > >
>> > > org.apache.whirr.Cluster$Instance.getPublicHostName(Cluster.java:112)
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.Cluster$Instance.getPublicAddress(Cluster.java:94)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.service.hadoop.HadoopNameNodeClusterActionHandler.doBeforeConfigure(HadoopNameNodeClusterActionHandler.java:58)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.service.hadoop.HadoopClusterActionHandler.beforeConfigure(HadoopClusterActionHandler.java:86)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.service.ClusterActionHandlerSupport.beforeAction(ClusterActionHandlerSupport.java:53)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.actions.ScriptBasedClusterAction.execute(ScriptBasedClusterAction.java:100)
>> > > > > > > at
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>> org.apache.whirr.ClusterController.launchCluster(ClusterController.java:109)
>> > > > > > > ... 3 more
>> > > > > > > Caused by: java.net.ConnectException: Connection refused
>> > > > > > > at sun.nio.ch.SocketChannelImpl.checkConnect(Native
>> Method)
>> > > > > > > at
>> > > > > > >
>> > > >
>> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592)
>> > > > > > > at org.xbill.DNS.TCPClient.connect(TCPClient.java:30)
>> > > > > > > at org.xbill.DNS.TCPClient.sendrecv(TCPClient.java:118)
>> > > > > > > at
>> org.xbill.DNS.SimpleResolver.send(SimpleResolver.java:254)
>> > > > > > > at
>> > > > > > >
>> > > > >
>> > >
>> org.xbill.DNS.ExtendedResolver$Resolution.start(ExtendedResolver.java:95)
>> > > > > > > at
>> > > org.xbill.DNS.ExtendedResolver.send(ExtendedResolver.java:358)
>> > > > > > > at
>> > > > > >
>> org.apache.whirr.net.FastDnsResolver.apply(FastDnsResolver.java:69)
>> > > > > > > ... 11 more
>> > > > > > >
>> > > > > > > Cheers,
>> > > > > > > Akash A
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>>
>
>