Hi Guys,
Needed help with Twill. I am trying to run a simple Distributed Shell
application on a single node cluster. When I run it, in the standard out logs I
get a ton of kafka related errors. I tee’ed the logs, but each run was making
25MBs of it. The only main exception I see is this
20:57:42.382 [YarnTwillRunnerService
STARTING-SendThread(localhost.localdomain:2181)] DEBUG
org.apache.zookeeper.ClientCnxn - Reading
reply sessionid:0x144e1a859d40052, packet::
clientPath:/MY_BASE_APP/c47fd263-a5c1-48ef-8c76-a91cf8009431/state
serverPath:/MY_BASE_A
PP/c47fd263-a5c1-48ef-8c76-a91cf8009431/state finished:false header:: 15,4
replyHeader:: 15,652,0 request:: '/MY_BASE_APP/c47fd263-
a5c1-48ef-8c76-a91cf8009431/state,T response::
#7b227374617465223a2253544f5050494e47227d,s{627,652,1395363459875,1395363462375,3,0,0
,0,20,0,627}
20:57:42.639 [Kafka-Consumer-log-0] INFO o.a.t.i.k.client.SimpleKafkaConsumer
- Exception when fetching message on TopicPartition{to
pic=log, partition=0}.
java.net.ConnectException: Connection refused
at sun.nio.ch.Net.connect0(Native Method) ~[na:1.7.0_45]
at sun.nio.ch.Net.connect(Net.java:465) ~[na:1.7.0_45]
at sun.nio.ch.Net.connect(Net.java:457) ~[na:1.7.0_45]
at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:666)
~[na:1.7.0_45]
at kafka.network.BlockingChannel.connect(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.consumer.SimpleConsumer.connect(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.consumer.SimpleConsumer.reconnect(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.consumer.SimpleConsumer.liftedTree1$1(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at
kafka.consumer.SimpleConsumer.kafka$consumer$SimpleConsumer$$sendRequest(Unknown
Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(Unknown
Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(Unknown
Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(Unknown
Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.metrics.KafkaTimer.time(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply$mcV$sp(Unknown
Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.metrics.KafkaTimer.time(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.consumer.SimpleConsumer.fetch(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at kafka.javaapi.consumer.SimpleConsumer.fetch(Unknown Source)
~[kafka_2.10-0.8.0.jar:0.8.0]
at
org.apache.twill.internal.kafka.client.SimpleKafkaConsumer$ConsumerThread.fetchMessages(SimpleKafkaConsumer.java:419)
~[twill-core-0.2.0-incubating-SNAPSHOT.jar:0.2.0-incubating-SNAPSHOT]
at
org.apache.twill.internal.kafka.client.SimpleKafkaConsumer$ConsumerThread.run(SimpleKafkaConsumer.java:355)
~[twill-core-0.2.0-incubating-SNAPSHOT.jar:0.2.0-incubating-SNAPSHOT]
20:57:42.642 [Kafka-Consumer-log-0] INFO o.a.t.i.k.client.SimpleKafkaConsumer
- Exception when fetching message on TopicPartition{topic=log, partition=0}.
java.net.ConnectException: Connection refused
I also attached the application logs on the yarn end. That is showing a
different exception.
[main] ERROR o.apache.twill.internal.ServiceMain - Exception when starting
service org.apache.twill.internal.appmaster.ApplicationMasterService@1d16eaf2.
java.util.concurrent.ExecutionException:
java.util.concurrent.ExecutionException:
org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode =
NodeExists for /c47fd263-a5c1-48ef-8c76-a91cf8009431/state
at
com.google.common.util.concurrent.AbstractFuture$Sync.getValue(AbstractFuture.java:294)
~[guava-13.0.1.jar:na]
at
com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:281)
~[guava-13.0.1.jar:na]
at
com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:116)
~[guava-13.0.1.jar:na]
at org.apache.twill.internal.ServiceMain.doMain(ServiceMain.java:80)
~[twill-yarn-0.2.0-incubating-SNAPSHOT.jar:0.2.0-incubating-SNAPSHOT]
at
org.apache.twill.internal.appmaster.ApplicationMasterMain.main(ApplicationMasterMain.java:69)
[twill-yarn-0.2.0-incubating-SNAPSHOT.jar:0.2.0-incubating-SNAPSHOT]
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
~[na:1.7.0_45]
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
~[na:1.7.0_45]
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
~[na:1.7.0_45]
at java.lang.reflect.Method.invoke(Method.java:606) ~[na:1.7.0_45]
at org.apache.twill.launcher.TwillLauncher.main(TwillLauncher.java:86)
[launcher.71cb0f5e-fc14-43e7-8149-71e57defd89f.jar:na]
java.util.concurrent.ExecutionException:
org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode =
NodeExists for /c47fd263-a5c1-48ef-8c76-a91cf8009431/state
Please help!
Safder