Thanks Terence for your clarification. I tried to remove the node from the cluster by removing an entry from "slaves" file and then ran "hadoop dfsadmin -refreshNodes" but looks like this is not the right command. Is there any specific command I need to use to remove or add a node , *without restarting the services* ?
I coded in such a way that new Runnables should get launched if they see new nodes but I am stuck with basic yarn command. Thanks and regards, Srini On Tue, May 20, 2014 at 12:02 AM, Terence Yim <[email protected]> wrote: > Hi Srinivas, > > Sorry for the late reply. BTW, I just noticed that this discussion is > not on the dev@ mailing list, hence I CC my reply to the mailing list. > You could subscribe to the list by sending an email to > [email protected] > > To your question about rebalancing, currently Twill won't stop > executing Runnable and move it to run on newly available resource, as > it doesn't know what the Runnable is doing and whether it is close to > finish or not. After you added a new node to the cluster, only newly > launched runnable (either a new application run or increase number of > instances of existing runnable) may runs on the new node (up to YARN > to allocate). > > Terence > > > On Fri, May 16, 2014 at 1:31 PM, Srinivas Reddy Kancharla > <[email protected]> wrote: > > HI Terence, > > > > Thanks for the information you have provided and now I could execute my > > programs. I am trying to experiment on re-balance behavior, your input > will > > really help me to test further: > > > > - I created a my own TwillApplication which launches 3 > > AbstractTwillRunnables (say this program is time consuming job). > > - I have a setup of 3 nodes (one master and 2 slave). When I launch my > > program, I could see that : > > > First slave node has launched ApplicationMaster and one Runnable. > > > Second Slave node has taken care of launch other 2 runnables. > > > > - During execution of above application, If I add 3rd slave node to > cluster, > > and configure it for re-balance, will this re-balance process will take > care > > of re-distributing of runnables again? i.e. now in this scenario the > > second slave node will have only one runnable and third new slave node > > should take care of one of the runnable. This way the load is > distributed. > > > > Thanks and regards, > > Srini > > > > > > > > > > On Fri, May 9, 2014 at 12:15 AM, Terence Yim <[email protected]> wrote: > >> > >> Hi Srinivas, > >> > >> First of all, through I never tried, I won't expect a YARN app could > >> work correctly on local cluster after computer sleep and wake. > >> > >> The exception is about RM tries to restart the AM after wake up (maybe > >> it though the AM is dead, as it has't been heartbeating when the > >> computer sleep, and RM uses wall clock to check), however the restart > >> failed due to token expiration (when someone asked RM for a container, > >> it comes with a timed token). The expiration time is governed by the > >> setting > "yarn.resourcemanager.rm.container-allocation.expiry-interval-ms" > >> and default is 600 seconds. > >> > >> Terence > >> > >> On Thu, May 8, 2014 at 11:45 AM, Srinivas Reddy Kancharla > >> <[email protected]> wrote: > >> > HI Terence, > >> > > >> > Yesterda the same program was working. Today when I opened my MacBook > >> > and so > >> > my 3 VM nodes are running back, I am seeing below exception as shown: > >> > > >> > I am getting below exception, is there any configuration which can > >> > ignore > >> > such exception??: > >> > > >> > Got exception: org.apache.hadoop.yarn.exceptions.YarnException: > >> > Unauthorized > >> > request to start container. > >> > This token is expired. current time is 1399573775978 found > 1399573627677 > >> > > >> > 2014-05-08 11:17:07,682 INFO > >> > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher: > >> > Command > >> > to launch container container_1399572736534_0002_02_000001 : > >> > $JAVA_HOME/bin/java -Djava.io.tmpdir=tmp -Dyarn.appId=$YARN_APP_ID_STR > >> > -Dtwill.app=$TWILL_APP_NAME -cp launcher.jar:$HADOOP_CONF_DIR -Xmx362m > >> > org.apache.twill.launcher.TwillLauncher appMaster.jar > >> > org.apache.twill.internal.appmaster.ApplicationMasterMain false > >> > 1><LOG_DIR>/stdout 2><LOG_DIR>/stderr > >> > 2014-05-08 11:17:07,694 INFO > >> > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher: > >> > Error > >> > launching appattempt_1399572736534_0002_000002. Got exception: > >> > org.apache.hadoop.yarn.exceptions.YarnException: Unauthorized request > to > >> > start container. > >> > This token is expired. current time is 1399573775978 found > 1399573627677 > >> > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native > >> > Method) > >> > at > >> > > >> > > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > >> > at > >> > > >> > > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > >> > at java.lang.reflect.Constructor.newInstance(Constructor.java: > >> > 534) > >> > at > >> > > >> > > org.apache.hadoop.yarn.api.records.impl.pb.SerializedExceptionPBImpl.instantiateException(SerializedExceptionPBImpl.java:152) > >> > at > >> > > >> > > org.apache.hadoop.yarn.api.records.impl.pb.SerializedExceptionPBImpl.deSerialize(SerializedExceptionPBImpl.java:106) > >> > at > >> > > >> > > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.launch(AMLauncher.java:122) > >> > at > >> > > >> > > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.run(AMLauncher.java:249) > >> > at > >> > > >> > > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146) > >> > at > >> > > >> > > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > >> > at java.lang.Thread.run(Thread.java:701) > >> > > >> > 2014-05-08 11:17:07,695 INFO > >> > > org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService: > >> > Unregistering app attempt : appattempt_1399572736534_0002_000002 > >> > 2014-05-08 11:17:07,695 INFO > >> > > >> > > org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: > >> > appattempt_1399572736534_0002_000002 State change from ALLOCATED to > >> > FAILED > >> > 2014-05-08 11:17:07,695 INFO > >> > org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: > >> > Application > >> > application_1399572736534_0002 failed 2 times due to Error launching > >> > appattempt_1399572736534_0002_000002. Got exception: > >> > org.apache.hadoop.yarn.exceptions.YarnException: Unauthorized request > to > >> > start container. > >> > This token is expired. current time is 1399573775978 found > 1399573627677 > >> > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native > >> > Method) > >> > at > >> > > >> > > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > >> > at > >> > > >> > > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > >> > at > java.lang.reflect.Constructor.newInstance(Constructor.java:534) > >> > at > >> > > >> > > org.apache.hadoop.yarn.api.records.impl.pb.SerializedExceptionPBImpl.instantiateException(SerializedExceptionPBImpl.java:152) > >> > at > >> > > >> > > org.apache.hadoop.yarn.api.records.impl.pb.SerializedExceptionPBImpl.deSerialize(SerializedExceptionPBImpl.java:106) > >> > at > >> > > >> > > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.launch(AMLauncher.java:122) > >> > at > >> > > >> > > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.run(AMLauncher.java:249) > >> > at > >> > > >> > > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146) > >> > at > >> > > >> > > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > >> > at java.lang.Thread.run(Thread.java:701) > >> > . Failing the application. > >> > 2014-05-08 11:17:07,695 INFO > >> > org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: > >> > application_1399572736534_0002 State change from ACCEPTED to FAILED > >> > 2014-05-08 11:17:07,695 WARN > >> > org.apache.hadoop.yarn.server.resourcemanager.RMAuditLogger: > >> > USER=srini OPERATION=Application Finished - > >> > Failed TARGET=RMAppManager RESULT=FAILURE DESCRIPTION=App > >> > failed with state: > >> > FAILED PERMISSIONS=Application application_1399572736534_0002 > >> > failed 2 times > >> > due to Error launching appattempt_1399572736534_0002_000002. Got > >> > exception: > >> > org.apache.hadoop.yarn.exceptions.YarnException: Unauthorized request > to > >> > start container. > >> > This token is expired. current time is 1399573775978 found > 1399573627677 > >> > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native > >> > Method) > >> > at > >> > > >> > > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > >> > at > >> > > >> > > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > >> > at > java.lang.reflect.Constructor.newInstance(Constructor.java:534) > >> > at > >> > > >> > > org.apache.hadoop.yarn.api.records.impl.pb.SerializedExceptionPBImpl.instantiateException(SerializedExceptionPBImpl.java:152) > >> > at > >> > > >> > > org.apache.hadoop.yarn.api.records.impl.pb.SerializedExceptionPBImpl.deSerialize(SerializedExceptionPBImpl.java:106) > >> > at > >> > > >> > > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.launch(AMLauncher.java:122) > >> > at > >> > > >> > > org.apache.hadoop.yarn.server.resourcemanager.amlauncher.AMLauncher.run(AMLauncher.java:249) > >> > at > >> > > >> > > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146) > >> > at > >> > > >> > > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > >> > at java.lang.Thread.run(Thread.java:701) > >> > > >> > > >> > > >> > On Wed, May 7, 2014 at 1:35 PM, Srinivas Reddy Kancharla > >> > <[email protected]> wrote: > >> >> > >> >> I got answer for one my own question: > >> >> Can I expect "Hello world" on master node where I launched the > program > >> >> ? > >> >> > >> >> After I copied jopt-simple.jar of proper version, it worked and I can > >> >> see > >> >> "Hello world" output on master node. Sorry for the spam. > >> >> > >> >> Srini > >> >> > >> >> > >> >> On Wed, May 7, 2014 at 1:12 PM, Srinivas Reddy Kancharla > >> >> <[email protected]> wrote: > >> >>> > >> >>> Exciting.. it worked after I got all required jars. Advantage of not > >> >>> using maven project is , faced all these issues and exposed to all > the > >> >>> required jars and exceptions. > >> >>> > >> >>> Now when I launched my program, it got executed in one of my slave > >> >>> node. > >> >>> Both application master and task ran on the same node and I could > see > >> >>> "Hello > >> >>> world" in "stdout" log. > >> >>> > >> >>> Can I expect "Hello world" on master node where I launched the > program > >> >>> ? > >> >>> > >> >>> Thanks again for all your help. From here I will try different > >> >>> programs > >> >>> with different options and will see how it goes. > >> >>> > >> >>> Is there any particular forum where I can ask questions or should be > >> >>> fine > >> >>> to send you questions ? It was a great help from you. > >> >>> > >> >>> I am doing all this during my free time (i.e. after office hours). I > >> >>> would like to try more and so if possible please let me know if I > can > >> >>> be > >> >>> helpful in anyway. > >> >>> > >> >>> Regards, > >> >>> Srini > >> >>> > >> >>> > >> >>> > >> >>> On Wed, May 7, 2014 at 1:06 AM, Terence Yim <[email protected]> > wrote: > >> >>>> > >> >>>> Hi Srinivas, > >> >>>> > >> >>>> It’s the ASM library version issue. Try to include the > >> >>>> asm-4.0-all.jar > >> >>>> in your classpath before the hadoop classpath. > >> >>>> > >> >>>> http://mvnrepository.com/artifact/org.ow2.asm/asm-all/4.0 > >> >>>> > >> >>>> Terence > >> >>>> > >> >>>> On May 6, 2014, at 4:22 PM, Srinivas Reddy Kancharla > >> >>>> <[email protected]> wrote: > >> >>>> > >> >>>> Hi Terence, > >> >>>> > >> >>>> After all step-by-step downloading of required jar files (b'cos I > am > >> >>>> not > >> >>>> using maven for now), I am able to pass through the zookeeper issue > >> >>>> (have a > >> >>>> setup of 3 nodes i.e. one leader and 2 followers) and now I am > >> >>>> seeing below > >> >>>> exception: (Any pointer for this would be helpful for me). > >> >>>> > >> >>>> I suspect on the hadoop libraries I am using, b'cos from the pom > >> >>>> files > >> >>>> which you have created for hello world examples is referring to > >> >>>> hadoop 2.3 > >> >>>> ... whereas I am using Hadoop 2.2 .. Do you think below exception > is > >> >>>> due to > >> >>>> that reason? > >> >>>> > >> >>>> > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:java.io.tmpdir=/tmp > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:java.compiler=<NA> > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:os.name=Linux > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:os.arch=amd64 > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:os.version=3.11.0-12-generic > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:user.name=srini > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:user.home=/home/srini > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Client > >> >>>> environment:user.dir=/home/srini/twill/twilljars > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ZooKeeper: Initiating client > >> >>>> connection, connectString=localhost:2181 sessionTimeout=10000 > >> >>>> watcher=ServiceDelegate [STARTING] > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ClientCnxn: Opening socket > >> >>>> connection > >> >>>> to server localhost/127.0.0.1:2181. Will not attempt to > authenticate > >> >>>> using > >> >>>> SASL (unknown error) > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ClientCnxn: Socket connection > >> >>>> established to localhost/127.0.0.1:2181, initiating session > >> >>>> 14/05/06 15:53:39 INFO zookeeper.ClientCnxn: Session establishment > >> >>>> complete on server localhost/127.0.0.1:2181, sessionid = > >> >>>> 0x145d3a544bd0006, > >> >>>> negotiated timeout = 10000 > >> >>>> 14/05/06 15:53:39 INFO zookeeper.DefaultZKClientService: Connected > to > >> >>>> ZooKeeper: localhost:2181 > >> >>>> Exception in thread " STARTING" > >> >>>> java.lang.IncompatibleClassChangeError: > >> >>>> class > >> >>>> org.apache.twill.internal.utils.Dependencies$DependencyClassVisitor > >> >>>> has interface org.objectweb.asm.ClassVisitor as super class > >> >>>> at java.lang.ClassLoader.defineClass1(Native Method) > >> >>>> at java.lang.ClassLoader.defineClass(ClassLoader.java:643) > >> >>>> at > >> >>>> > >> >>>> > java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142) > >> >>>> at java.net.URLClassLoader.defineClass(URLClassLoader.java:277) > >> >>>> at java.net.URLClassLoader.access$000(URLClassLoader.java:73) > >> >>>> at java.net.URLClassLoader$1.run(URLClassLoader.java:212) > >> >>>> at java.security.AccessController.doPrivileged(Native Method) > >> >>>> at java.net.URLClassLoader.findClass(URLClassLoader.java:205) > >> >>>> at java.lang.ClassLoader.loadClass(ClassLoader.java:323) > >> >>>> at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:294) > >> >>>> at java.lang.ClassLoader.loadClass(ClassLoader.java:268) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.internal.utils.Dependencies.findClassDependencies(Dependencies.java:102) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.internal.ApplicationBundler.findDependencies(ApplicationBundler.java:179) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.internal.ApplicationBundler.createBundle(ApplicationBundler.java:136) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.internal.ApplicationBundler.createBundle(ApplicationBundler.java:106) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.yarn.YarnTwillPreparer.createAppMasterJar(YarnTwillPreparer.java:366) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.yarn.YarnTwillPreparer.access$2(YarnTwillPreparer.java:350) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.yarn.YarnTwillPreparer$1.call(YarnTwillPreparer.java:263) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.yarn.YarnTwillPreparer$1.call(YarnTwillPreparer.java:1) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.yarn.YarnTwillController.doStartUp(YarnTwillController.java:98) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.internal.AbstractZKServiceController.startUp(AbstractZKServiceController.java:82) > >> >>>> at > >> >>>> > >> >>>> > org.apache.twill.internal.AbstractExecutionServiceController$ServiceDelegate.startUp(AbstractExecutionServiceController.java:109) > >> >>>> at > >> >>>> > >> >>>> > com.google.common.util.concurrent.AbstractIdleService$1$1.run(AbstractIdleService.java:43) > >> >>>> at java.lang.Thread.run(Thread.java:701) > >> >>>> > >> >>>> > >> >>>> Thanks and regards, > >> >>>> Srini > >> >>>> > >> >>>> > >> >>>> On Tue, May 6, 2014 at 2:40 PM, Srinivas Reddy Kancharla > >> >>>> <[email protected]> wrote: > >> >>>>> > >> >>>>> Got it. I will do that and will update you. Earlier my assumption > >> >>>>> was > >> >>>>> my hadoop cluster would be starting zookeeper as part of Namenode, > >> >>>>> Datanode, > >> >>>>> resourcemanager , Nodemanager initialization. Seems like I was > wrong > >> >>>>> and I > >> >>>>> have to start zookeeper as a separate process. > >> >>>>> > >> >>>>> Thanks again for this information. > >> >>>>> > >> >>>>> Regards, > >> >>>>> Srini > >> >>>>> > >> >>>>> > >> >>>>> > >> >>>>> On Tue, May 6, 2014 at 2:32 PM, Terence Yim <[email protected]> > >> >>>>> wrote: > >> >>>>>> > >> >>>>>> Hi Srinivas, > >> >>>>>> > >> >>>>>> Yes you'll need to start zookeeper manually before executing the > >> >>>>>> twill > >> >>>>>> program. The assumption is that zookeeper is a long running > service > >> >>>>>> in the > >> >>>>>> cluster > >> >>>>>> > >> >>>>>> Terence > >> >>>>>> > >> >>>>>> Sent from my iPhone > >> >>>>>> > >> >>>>>> On May 6, 2014, at 2:14 PM, Srinivas Reddy Kancharla > >> >>>>>> <[email protected]> wrote: > >> >>>>>> > >> >>>>>> HI Terence, > >> >>>>>> > >> >>>>>> Thank you very much for the pointer. So i have used "hadoop > >> >>>>>> classpath" > >> >>>>>> command and copied that list to my "java" command and atleast > now I > >> >>>>>> am out > >> >>>>>> of classpath issues. So this shows that I am fine with my current > >> >>>>>> version of > >> >>>>>> Hadoop 2.2 jars. > >> >>>>>> > >> >>>>>> Now as I asked in my previous mail, Do I need to start > "zookeeper" > >> >>>>>> separately or its a part of my existing hadoop running cluster ?? > >> >>>>>> b'cos I am > >> >>>>>> getting below exception for my "Hello world" example (I have > taken > >> >>>>>> your > >> >>>>>> example of "localhost:2181" for ZKServer string: > >> >>>>>> > >> >>>>>> > >> >>>>>> > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> > >> >>>>>> > environment:java.library.path=/usr/lib/jvm/java-6-openjdk-amd64/jre/lib/amd64/server:/usr/lib/jvm/java-6-openjdk-amd64/jre/lib/amd64:/usr/lib/jvm/java-6-openjdk-amd64/jre/../lib/amd64:/usr/java/packages/lib/amd64:/usr/lib/jni:/lib:/usr/lib > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:java.io.tmpdir=/tmp > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:java.compiler=<NA> > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:os.name=Linux > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:os.arch=amd64 > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:os.version=3.11.0-12-generic > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:user.name=srini > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:user.home=/home/srini > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Client > >> >>>>>> environment:user.dir=/home/srini/twill/twilljars > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ZooKeeper: Initiating client > >> >>>>>> connection, connectString=localhost:2181 sessionTimeout=10000 > >> >>>>>> watcher=ServiceDelegate [STARTING] > >> >>>>>> 14/05/06 14:08:11 INFO zookeeper.ClientCnxn: Opening socket > >> >>>>>> connection > >> >>>>>> to server localhost/127.0.0.1:2181. Will not attempt to > >> >>>>>> authenticate using > >> >>>>>> SASL (unknown error) > >> >>>>>> 14/05/06 14:08:11 WARN zookeeper.ClientCnxn: Session 0x0 for > server > >> >>>>>> null, unexpected error, closing socket connection and attempting > >> >>>>>> reconnect > >> >>>>>> java.net.ConnectException: Connection refused > >> >>>>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:601) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > >> >>>>>> 14/05/06 14:08:12 INFO zookeeper.ClientCnxn: Opening socket > >> >>>>>> connection > >> >>>>>> to server localhost/127.0.0.1:2181. Will not attempt to > >> >>>>>> authenticate using > >> >>>>>> SASL (unknown error) > >> >>>>>> 14/05/06 14:08:12 WARN zookeeper.ClientCnxn: Session 0x0 for > server > >> >>>>>> null, unexpected error, closing socket connection and attempting > >> >>>>>> reconnect > >> >>>>>> java.net.ConnectException: Connection refused > >> >>>>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:601) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > >> >>>>>> 14/05/06 14:08:13 INFO zookeeper.ClientCnxn: Opening socket > >> >>>>>> connection > >> >>>>>> to server localhost/127.0.0.1:2181. Will not attempt to > >> >>>>>> authenticate using > >> >>>>>> SASL (unknown error) > >> >>>>>> 14/05/06 14:08:13 WARN zookeeper.ClientCnxn: Session 0x0 for > server > >> >>>>>> null, unexpected error, closing socket connection and attempting > >> >>>>>> reconnect > >> >>>>>> java.net.ConnectException: Connection refused > >> >>>>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:601) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > >> >>>>>> 14/05/06 14:08:14 INFO zookeeper.ClientCnxn: Opening socket > >> >>>>>> connection > >> >>>>>> to server localhost/127.0.0.1:2181. Will not attempt to > >> >>>>>> authenticate using > >> >>>>>> SASL (unknown error) > >> >>>>>> 14/05/06 14:08:14 WARN zookeeper.ClientCnxn: Session 0x0 for > server > >> >>>>>> null, unexpected error, closing socket connection and attempting > >> >>>>>> reconnect > >> >>>>>> java.net.ConnectException: Connection refused > >> >>>>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:601) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > >> >>>>>> 14/05/06 14:08:15 INFO zookeeper.ClientCnxn: Opening socket > >> >>>>>> connection > >> >>>>>> to server localhost/127.0.0.1:2181. Will not attempt to > >> >>>>>> authenticate using > >> >>>>>> SASL (unknown error) > >> >>>>>> 14/05/06 14:08:15 WARN zookeeper.ClientCnxn: Session 0x0 for > server > >> >>>>>> null, unexpected error, closing socket connection and attempting > >> >>>>>> reconnect > >> >>>>>> java.net.ConnectException: Connection refused > >> >>>>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:601) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) > >> >>>>>> at > >> >>>>>> > >> >>>>>> > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) > >> >>>>>> > >> >>>>>> > >> >>>>>> Thank you again for your help and hopefully once I am out of this > >> >>>>>> initial setup issues, I will not bother you much unless its very > >> >>>>>> technical. > >> >>>>>> > >> >>>>>> Thanks and regards, > >> >>>>>> Srini > >> >>>>>> > >> >>>>>> > >> >>>>>> On Mon, May 5, 2014 at 10:34 PM, Terence Yim <[email protected]> > >> >>>>>> wrote: > >> >>>>>>> > >> >>>>>>> Hi Srinivas, > >> >>>>>>> > >> >>>>>>> Looks like you missed some hadoop classes in your classpath. > >> >>>>>>> You’ll > >> >>>>>>> need the hadoop classpath in your classpath. Have you try > running > >> >>>>>>> like this? > >> >>>>>>> > >> >>>>>>> HDCP=`hadoop classpath`; java -cp > >> >>>>>>> > >> >>>>>>> > ./SriniTwillYarnClasses.jar:twill-api-0.3.0-incubating-SNAPSHOT.jar:……:$HDCP > >> >>>>>>> com.srini.hadoopTwill.HelloTwill > >> >>>>>>> > >> >>>>>>> Terence > >> >>>>>>> > >> >>>>>>> On May 5, 2014, at 9:07 PM, Srinivas Reddy Kancharla > >> >>>>>>> <[email protected]> wrote: > >> >>>>>>> > >> >>>>>>> Hello Terence, > >> >>>>>>> > >> >>>>>>> I am Srini and new to twill. I am very sorry for sending you > email > >> >>>>>>> like this, b'cos I could not find any other discussion forum to > >> >>>>>>> post this > >> >>>>>>> message. My bad, please let me know if there is a forum exist > and > >> >>>>>>> I can get > >> >>>>>>> some help in future instead of direct mails to you. Below is the > >> >>>>>>> issue I am > >> >>>>>>> facing while executing my first Twill program: > >> >>>>>>> > >> >>>>>>> - I have a setup of hadoop-2.2.0 which has total 3 nodes. one > >> >>>>>>> master > >> >>>>>>> and 2 slave. > >> >>>>>>> - I could execute the DistributedShell program successfully. > >> >>>>>>> - Now I downloaded twill project, generated required jar files > >> >>>>>>> using > >> >>>>>>> mvn commands. > >> >>>>>>> - I replicated Helloworld sample program and during execution, I > >> >>>>>>> am > >> >>>>>>> getting below exception: > >> >>>>>>> > >> >>>>>>> srini@ubuntu:~/twill/twilljars$ java -classpath > >> >>>>>>> > >> >>>>>>> > ./SriniTwillYarnClasses.jar:twill-api-0.3.0-incubating-SNAPSHOT.jar:guava-13.0.1.jar:slf4j-api-1.7.7.jar:twill-yarn-0.3.0-incubating-SNAPSHOT.jar:hadoop-common-2.2.0.jar:hadoop-yarn-api-2.2.0.jar:twill-ext-0.3.0-incubating-SNAPSHOT.jar:twill-core-0.3.0-incubating-SNAPSHOT.jar:commons-logging-1.1.1.jar:commons-configuration-1.6.jar:commons-lang-2.5.jar:twill-common-0.3.0-incubating-SNAPSHOT.jar:twill-zookeeper-0.3.0-incubating-SNAPSHOT.jar:hadoop-auth-2.2.0.jar > >> >>>>>>> com.srini.hadoopTwill.HelloTwill > >> >>>>>>> > >> >>>>>>> SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder". > >> >>>>>>> SLF4J: Defaulting to no-operation (NOP) logger implementation > >> >>>>>>> SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinderfor > >> >>>>>>> further details. > >> >>>>>>> May 5, 2014 8:49:53 PM org.apache.hadoop.util.NativeCodeLoader > >> >>>>>>> <clinit> > >> >>>>>>> WARNING: Unable to load native-hadoop library for your > platform... > >> >>>>>>> using builtin-java classes where applicable > >> >>>>>>> Exception in thread "main" java.lang.RuntimeException: > >> >>>>>>> java.lang.reflect.InvocationTargetException > >> >>>>>>> at > >> >>>>>>> com.google.common.base.Throwables.propagate(Throwables.java:160) > >> >>>>>>> at > >> >>>>>>> > >> >>>>>>> > org.apache.twill.internal.yarn.VersionDetectYarnAppClientFactory.create(VersionDetectYarnAppClientFactory.java:47) > >> >>>>>>> at > >> >>>>>>> > >> >>>>>>> > org.apache.twill.yarn.YarnTwillRunnerService.<init>(YarnTwillRunnerService.java:143) > >> >>>>>>> at > >> >>>>>>> > >> >>>>>>> > org.apache.twill.yarn.YarnTwillRunnerService.<init>(YarnTwillRunnerService.java:138) > >> >>>>>>> at com.srini.hadoopTwill.HelloTwill.main(HelloTwill.java:37) > >> >>>>>>> Caused by: java.lang.reflect.InvocationTargetException > >> >>>>>>> at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native > >> >>>>>>> Method) > >> >>>>>>> at > >> >>>>>>> > >> >>>>>>> > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > >> >>>>>>> at > >> >>>>>>> > >> >>>>>>> > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > >> >>>>>>> at > java.lang.reflect.Constructor.newInstance(Constructor.java:534) > >> >>>>>>> at > >> >>>>>>> > >> >>>>>>> > org.apache.twill.internal.yarn.VersionDetectYarnAppClientFactory.create(VersionDetectYarnAppClientFactory.java:44) > >> >>>>>>> ... 3 more > >> >>>>>>> Caused by: java.lang.Error: Unresolved compilation problems: > >> >>>>>>> The import org.apache.hadoop.yarn.api.records.DelegationToken > >> >>>>>>> cannot > >> >>>>>>> be resolved > >> >>>>>>> The import org.apache.hadoop.yarn.client.YarnClient cannot be > >> >>>>>>> resolved > >> >>>>>>> The import org.apache.hadoop.yarn.client.YarnClientImpl cannot > be > >> >>>>>>> resolved > >> >>>>>>> The import org.apache.hadoop.yarn.exceptions.YarnRemoteException > >> >>>>>>> cannot be resolved > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClientImpl cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> The method setUser(String) is undefined for the type > >> >>>>>>> ApplicationSubmissionContext > >> >>>>>>> The method getUser() is undefined for the type > >> >>>>>>> ApplicationSubmissionContext > >> >>>>>>> The method setResource(Resource) is undefined for the type > >> >>>>>>> ContainerLaunchContext > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnRemoteException cannot be resolved to a type > >> >>>>>>> The method getMinimumResourceCapability() is undefined for the > >> >>>>>>> type > >> >>>>>>> GetNewApplicationResponse > >> >>>>>>> The method getContainerTokens() is undefined for the type > >> >>>>>>> ContainerLaunchContext > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> The method setContainerTokens(ByteBuffer) is undefined for the > >> >>>>>>> type > >> >>>>>>> ContainerLaunchContext > >> >>>>>>> DelegationToken cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnRemoteException cannot be resolved to a type > >> >>>>>>> YarnClient cannot be resolved to a type > >> >>>>>>> YarnRemoteException cannot be resolved to a type > >> >>>>>>> > >> >>>>>>> at > >> >>>>>>> > >> >>>>>>> > org.apache.twill.internal.yarn.Hadoop20YarnAppClient.<init>(Hadoop20YarnAppClient.java:33) > >> >>>>>>> ... 8 more > >> >>>>>>> > >> >>>>>>> > >> >>>>>>> > >> >>>>>>> Could you please let me know if I am missing anything here to > >> >>>>>>> execute > >> >>>>>>> this program. In my program, "localhost:2181" is hard coded for > >> >>>>>>> zookeeper > >> >>>>>>> string. > >> >>>>>>> > >> >>>>>>> My suspect: > >> >>>>>>> - My setup is having hadoop-2.2.0 , to execute this program, do > I > >> >>>>>>> need to provide hadoop-2.0 libraries instead of 2.2 . > >> >>>>>>> - Do I need to start zookeeper server separately ? > >> >>>>>>> > >> >>>>>>> > >> >>>>>>> > >> >>>>>>> Thanks for your any help, > >> >>>>>>> > >> >>>>>>> Srini > >> >>>>>>> > >> >>>>>>> > >> >>>>>>> > >> >>>>>>> > >> >>>>>>> > >> >>>>>> > >> >>>>> > >> >>>> > >> >>>> > >> >>> > >> >> > >> > > > > > >
