Thanks Newton , I have installed ambari-server on namenode and on other server have installed ambari-agent and then tried to setup the cluster.
At the deploy stage it is at hung state at Prepearing to deploy Task 3 of 61 state. Below are some messages which I could find out at one of the amabari-agent log files. Please advise . ********************************** ERROR 2015-01-21 20:46:46,871 PackagesAnalyzer.py:48 - Task timed out and will be killed INFO 2015-01-21 20:46:52,622 HostCheckReportFileHandler.py:43 - Host check report at /var/lib/ambari-agent/data/hostcheck.result INFO 2015-01-21 20:46:52,627 HostCheckReportFileHandler.py:104 - Removing old host check file at /var/lib/ambari-agent/data/hostcheck.result INFO 2015-01-21 20:46:52,629 HostCheckReportFileHandler.py:109 - Creating host check file at /var/lib/ambari-agent/data/hostcheck.result INFO 2015-01-21 20:46:52,785 Controller.py:214 - Heartbeat response received (id = 379) INFO 2015-01-21 20:46:52,788 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:47:02,794 Heartbeat.py:78 - Building Heartbeat: {responseId = 379, timestamp = 1421891222793, commandsInProgress = False, componentsMapped = False} INFO 2015-01-21 20:47:02,877 Controller.py:214 - Heartbeat response received (id = 380) INFO 2015-01-21 20:47:02,879 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:47:12,937 Heartbeat.py:78 - Building Heartbeat: {responseId = 380, timestamp = 1421891232936, commandsInProgress = False, componentsMapped = False} INFO 2015-01-21 20:47:13,129 Controller.py:214 - Heartbeat response received (id = 381) INFO 2015-01-21 20:47:13,132 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:47:23,139 Heartbeat.py:78 - Building Heartbeat: {responseId = 381, timestamp = 1421891243138, commandsInProgress = False, componentsMapped = False} INFO 2015-01-21 20:47:23,394 Controller.py:214 - Heartbeat response received (id = 382) INFO 2015-01-21 20:47:23,397 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:47:33,442 Heartbeat.py:78 - Building Heartbeat: {responseId = 382, timestamp = 1421891253426, commandsInProgress = False, componentsMapped = False} INFO 2015-01-21 20:47:33,576 Controller.py:214 - Heartbeat response received (id = 383) INFO 2015-01-21 20:47:33,578 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:47:43,592 Heartbeat.py:78 - Building Heartbeat: {responseId = 383, timestamp = 1421891263591, commandsInProgress = False, componentsMapped = False} INFO 2015-01-21 20:47:43,665 Controller.py:214 - Heartbeat response received (id = 384) INFO 2015-01-21 20:47:43,667 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:47:53,801 Heartbeat.py:78 - Building Heartbeat: {responseId = 384, timestamp = 1421891273800, commandsInProgress = False, componentsMapped = False} ERROR 2015-01-21 20:48:40,795 PackagesAnalyzer.py:48 - Task timed out and will be killed INFO 2015-01-21 20:49:21,066 HostCheckReportFileHandler.py:43 - Host check report at /var/lib/ambari-agent/data/hostcheck.result INFO 2015-01-21 20:49:21,070 HostCheckReportFileHandler.py:104 - Removing old host check file at /var/lib/ambari-agent/data/hostcheck.result INFO 2015-01-21 20:49:21,072 HostCheckReportFileHandler.py:109 - Creating host check file at /var/lib/ambari-agent/data/hostcheck.result INFO 2015-01-21 20:49:21,298 Controller.py:214 - Heartbeat response received (id = 385) INFO 2015-01-21 20:49:21,300 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:49:31,305 Heartbeat.py:78 - Building Heartbeat: {responseId = 385, timestamp = 1421891371304, commandsInProgress = False, componentsMapped = False} INFO 2015-01-21 20:49:31,431 Controller.py:214 - Heartbeat response received (id = 386) INFO 2015-01-21 20:49:31,434 Controller.py:249 - No commands sent from master.my.com INFO 2015-01-21 20:49:41,438 Heartbeat.py:78 - Building Heartbeat: {responseId = 386, timestamp = 1421891381437, commandsInProgress = False, componentsMapped = False} INFO 2015-01-21 20:49:41,543 Controller.py:214 - Heartbeat response received (id = 387) INFO 2015-01-21 20:49:41,546 Controller.py:249 - No commands sent from master.my.com On Wed, Jan 21, 2015 at 3:21 PM, Newton Alex <na...@pivotal.io> wrote: > My next step would be to manually run the command that failed and see what > is the error. > > From the logs it looks like the first failure happened for scp, so just > run that command. I've copy pasted it here for your convenience. > > "scp -o ConnectTimeout=60 -o BatchMode=yes -o StrictHostKeyChecking=no -i > /var/run/ambari-server/bootstrap/1/sshKey /usr/lib/python2.6/site > -packages/ambari_server/os_type_check.sh r...@slave3.my.com:/tmp/os_ > type_check.sh" > > BTW, do this for the first failure you see on the log file (if scp was not > the first cmd that failed). Also try using verbose flag (scp -v) to get > more information. > > - > Newton > > > On Wed, Jan 21, 2015 at 12:05 PM, Krish Donald <gotomyp...@gmail.com> > wrote: > >> Yes I am running using root only >> >> On Wed, Jan 21, 2015 at 10:20 AM, Newton Alex <na...@pivotal.io> wrote: >> >>> Sorry, couldn't figure out if you are running it as "root" user. Can you >>> please confirm? >>> >>> - >>> Newton >>> >>> On Tue, Jan 20, 2015 at 11:08 PM, Krish Donald <gotomyp...@gmail.com> >>> wrote: >>> >>>> Few more messages: >>>> >>>> >>>> 02:04:50,993 INFO Configuration:261 - Reading password from existing >>>> file >>>> 02:04:51,029 INFO Configuration:378 - Hosts Mapping File null >>>> 02:04:51,029 INFO HostsMap:60 - Using hostsmap file null >>>> 02:04:57,939 INFO AmbariServer:428 - Getting the controller >>>> 02:05:00,403 INFO CertificateManager:65 - Initialization of root >>>> certificate >>>> 02:05:00,404 INFO CertificateManager:69 - Certificate exists:true >>>> 02:05:00,421 INFO ClustersImpl:88 - Initializing the ClustersImpl >>>> 02:05:04,235 INFO AmbariServer:383 - Checking DB store version >>>> 02:05:04,297 INFO AmbariServer:394 - DB store version is compatible >>>> 02:05:04,361 INFO ClassPathXmlApplicationContext:456 - Refreshing >>>> org.springframework.context.support.ClassPathXmlApplicationContext@4838462a: >>>> startup date [Wed Jan 21 >>>> 02:05:04 EST 2015]; root of context hierarchy >>>> 02:05:04,489 INFO DefaultListableBeanFactory:557 - Pre-instantiating >>>> singletons in >>>> org.springframework.beans.factory.support.DefaultListableBeanFactory@f292738: >>>> defini >>>> ng beans []; root of factory hierarchy >>>> 02:05:04,506 INFO ClassPathXmlApplicationContext:456 - Refreshing >>>> org.springframework.context.support.ClassPathXmlApplicationContext@65cb048e: >>>> startup date [Wed Jan 21 >>>> 02:05:04 EST 2015]; parent: >>>> org.springframework.context.support.ClassPathXmlApplicationContext@4838462a >>>> 02:05:04,523 INFO XmlBeanDefinitionReader:315 - Loading XML bean >>>> definitions from class path resource [webapp/WEB-INF/spring-security.xml] >>>> 02:05:05,243 INFO SpringSecurityCoreVersion:33 - You are running with >>>> Spring Security Core 3.1.2.RELEASE >>>> 02:05:05,243 INFO SecurityNamespaceHandler:59 - Spring Security >>>> 'config' module version is 3.1.2.RELEASE >>>> 02:05:05,477 INFO FilterInvocationSecurityMetadataSourceParser:134 - >>>> Creating access control expression attribute 'isAuthenticated()' for /** >>>> 02:05:05,486 INFO FilterInvocationSecurityMetadataSourceParser:134 - >>>> Creating access control expression attribute 'hasRole('ADMIN')' for /** >>>> 02:05:05,676 INFO HttpSecurityBeanDefinitionParser:264 - Checking >>>> sorted filter chain: [Root bean: class >>>> [org.springframework.security.web.context.SecurityContextPersi >>>> stenceFilter]; scope=; abstract=false; lazyInit=false; autowireMode=0; >>>> dependencyCheck=0; autowireCandidate=true; primary=false; >>>> factoryBeanName=null; factoryMethodName >>>> =null; initMethodName=null; destroyMethodName=null, order = 300, Root >>>> bean: class >>>> [org.springframework.security.web.authentication.www.BasicAuthenticationFilter]; >>>> scope >>>> =; abstract=false; lazyInit=false; autowireMode=0; dependencyCheck=0; >>>> autowireCandidate=true; primary=false; factoryBeanName=null; >>>> factoryMethodName=null; initMethodNam >>>> e=null; destroyMethodName=null, order = 1200, Root bean: class >>>> [org.springframework.security.web.savedrequest.RequestCacheAwareFilter]; >>>> scope=; abstract=false; lazyInit >>>> =false; autowireMode=0; dependencyCheck=0; autowireCandidate=true; >>>> primary=false; factoryBeanName=null; factoryMethodName=null; >>>> initMethodName=null; destroyMethodName=n >>>> ull, order = 1300, Root bean: class >>>> [org.springframework.security.web.servletapi.SecurityContextHolderAwareRequestFilter]; >>>> scope=; abstract=false; lazyInit=false; autow >>>> ireMode=0; dependencyCheck=0; autowireCandidate=true; primary=false; >>>> factoryBeanName=null; factoryMethodName=null; initMethodName=null; >>>> destroyMethodName=null, order = >>>> 1400, Root bean: class >>>> [org.springframework.security.web.authentication.AnonymousAuthenticationFilter]; >>>> scope=; abstract=false; lazyInit=false; autowireMode=0; dependen >>>> cyCheck=0; autowireCandidate=true; primary=false; factoryBeanName=null; >>>> factoryMethodName=null; initMethodName=null; destroyMethodName=null, order >>>> = 1700, Root bean: cl >>>> ass [org.springframework.security.web.session.SessionManagementFilter]; >>>> scope=; abstract=false; lazyInit=false; autowireMode=0; dependencyCheck=0; >>>> autowireCandidate=tru >>>> e; primary=false; factoryBeanName=null; factoryMethodName=null; >>>> initMethodName=null; destroyMethodName=null, order = 1800, Root bean: class >>>> [org.springframework.securit >>>> y.web.access.ExceptionTranslationFilter]; scope=; abstract=false; >>>> lazyInit=false; autowireMode=0; dependencyCheck=0; autowireCandidate=true; >>>> primary=false; factoryBeanN >>>> ame=null; factoryMethodName=null; initMethodName=null; >>>> destroyMethodName=null, order = 1900, >>>> <org.springframework.security.web.access.intercept.FilterSecurityIntercepto >>>> r#0>, order = 2000] >>>> 02:05:05,714 INFO DefaultListableBeanFactory:557 - Pre-instantiating >>>> singletons in >>>> org.springframework.beans.factory.support.DefaultListableBeanFactory@251c135c: >>>> defin >>>> ing beans >>>> [org.springframework.security.filterChains,org.springframework.security.filterChainProxy,org.springframework.security.web.PortMapperImpl#0,org.springframework >>>> >>>> .security.web.PortResolverImpl#0,org.springframework.security.config.authentication.AuthenticationManagerFactoryBean#0,org.springframework.security.authentication.Provi >>>> >>>> derManager#0,org.springframework.security.web.context.HttpSessionSecurityContextRepository#0,org.springframework.security.web.authentication.session.SessionFixationProt >>>> >>>> ectionStrategy#0,org.springframework.security.web.savedrequest.HttpSessionRequestCache#0,org.springframework.security.web.access.expression.DefaultWebSecurityExpression >>>> >>>> Handler#0,org.springframework.security.access.vote.AffirmativeBased#0,org.springframework.security.web.access.intercept.FilterSecurityInterceptor#0,org.springframework. >>>> >>>> security.web.access.DefaultWebInvocationPrivilegeEvaluator#0,org.springframework.security.authentication.AnonymousAuthenticationProvider#0,org.springframework.security. >>>> >>>> userDetailsServiceFactory,org.springframework.security.web.DefaultSecurityFilterChain#0,org.springframework.security.authentication.dao.DaoAuthenticationProvider#0,org. >>>> springframework.security.authentication.DefaultAuthenticationEventPublisher#0,org.springframework.security.authenticationManager,ambariEntryPoint]; >>>> parent: org.springfr >>>> amework.beans.factory.support.DefaultListableBeanFactory@f292738 >>>> 02:05:06,206 INFO DefaultSecurityFilterChain:28 - Creating filter >>>> chain: org.springframework.security.web.util.AnyRequestMatcher@1, >>>> [org.springframework.security.web.c >>>> ontext.SecurityContextPersistenceFilter@78304648, >>>> org.springframework.security.web.authentication.www.BasicAuthenticationFilter@29565e9d, >>>> org.springframework.security.w >>>> eb.savedrequest.RequestCacheAwareFilter@655daa9e, >>>> org.springframework.security.web.servletapi.SecurityContextHolderAwareRequestFilter@686c9f0f, >>>> org.springframework.secu >>>> rity.web.authentication.AnonymousAuthenticationFilter@333ec758, >>>> org.springframework.security.web.session.SessionManagementFilter@5e917cb0, >>>> org.springframework.security. >>>> web.access.ExceptionTranslationFilter@21913751, >>>> org.springframework.security.web.access.intercept.FilterSecurityInterceptor@44c6b80e >>>> ] >>>> 02:05:06,455 INFO Configuration:369 - Web App DIR test >>>> /usr/lib/ambari-server/web >>>> 02:05:06,466 INFO CertificateManager:65 - Initialization of root >>>> certificate >>>> 02:05:06,466 INFO CertificateManager:69 - Certificate exists:true >>>> 02:05:06,573 INFO AmbariServer:309 - ********* Initializing Meta Info >>>> ********** >>>> 02:05:08,741 INFO AmbariServer:319 - ********* Initializing Clusters >>>> ********** >>>> 02:05:08,742 INFO AmbariServer:323 - ********* Current Clusters State >>>> ********* >>>> 02:05:08,742 INFO AmbariServer:324 - Clusters=[ ] >>>> 02:05:08,742 INFO AmbariServer:326 - ********* Initializing >>>> ActionManager ********** >>>> 02:05:08,742 INFO AmbariServer:328 - ********* Initializing Controller >>>> ********** >>>> 02:05:08,752 INFO AmbariManagementControllerImpl:166 - Initializing >>>> the AmbariManagementControllerImpl >>>> 02:05:08,758 INFO Server:266 - jetty-7.6.7.v20120910 >>>> 02:05:08,870 INFO ContextHandler:744 - started >>>> o.e.j.s.ServletContextHandler{/,file:/usr/lib/ambari-server/web/} >>>> 02:05:15,807 INFO AbstractConnector:338 - Started >>>> SelectChannelConnector@0.0.0.0:8080 >>>> 02:05:15,808 INFO Server:266 - jetty-7.6.7.v20120910 >>>> 02:05:15,809 INFO ContextHandler:744 - started >>>> o.e.j.s.ServletContextHandler{/,null} >>>> 02:05:18,510 INFO AmbariLocalUserDetailsService:62 - Loading user by >>>> name: admin >>>> 02:05:20,271 INFO SslContextFactory:300 - Enabled Protocols >>>> [SSLv2Hello, SSLv3, TLSv1] of [SSLv2Hello, SSLv3, TLSv1] >>>> 02:05:20,310 INFO AbstractConnector:338 - Started >>>> SslSelectChannelConnector@0.0.0.0:8440 >>>> 02:05:20,558 INFO SslContextFactory:300 - Enabled Protocols >>>> [SSLv2Hello, SSLv3, TLSv1] of [SSLv2Hello, SSLv3, TLSv1] >>>> 02:05:20,610 INFO AbstractConnector:338 - Started >>>> SslSelectChannelConnector@0.0.0.0:8441 >>>> 02:05:20,611 INFO AmbariServer:343 - ********* Started Server >>>> ********** >>>> 02:05:20,611 INFO ActionManager:64 - Starting scheduler thread >>>> 02:05:20,614 INFO AmbariServer:346 - ********* Started ActionManager >>>> ********** >>>> 02:05:21,874 INFO PersistKeyValueService:82 - Looking for keyName >>>> CLUSTER_CURRENT_STATUS >>>> 02:05:22,014 INFO PersistKeyValueService:82 - Looking for keyName >>>> CLUSTER_CURRENT_STATUS >>>> 02:06:08,967 INFO BootStrapImpl:101 - BootStrapping hosts >>>> master.my.com:slave1.my.com:slave2.my.com:slave3.my.com: >>>> 02:06:08,991 INFO BSRunner:196 - Host= master.my.com,slave1.my.com, >>>> slave2.my.com,slave3.my.com >>>> bs=/usr/lib/python2.6/site-packages/ambari_server/bootstrap.py requestDi >>>> r=/var/run/ambari-server/bootstrap/1 user=root >>>> keyfile=/var/run/ambari-server/bootstrap/1/sshKey passwordFile null server= >>>> master.my.com version=1.3.0 >>>> 02:06:09,043 INFO BSRunner:227 - Kicking off the scheduler for polling >>>> on logs in /var/run/ambari-server/bootstrap/1 >>>> 02:06:09,043 INFO BSRunner:231 - Bootstrap output, >>>> log=/var/run/ambari-server/bootstrap/1/bootstrap.err >>>> /var/run/ambari-server/bootstrap/1/bootstrap.out >>>> 02:06:09,046 INFO BSHostStatusCollector:55 - Request directory >>>> /var/run/ambari-server/bootstrap/1 >>>> 02:06:09,046 INFO BSHostStatusCollector:62 - HostList for polling on [ >>>> master.my.com, slave1.my.com, slave2.my.com, slave3.my.com] >>>> 02:06:09,578 INFO BSRunner:243 - Script log Mesg >>>> INFO:root:BootStrapping hosts ['master.my.com', 'slave1.my.com', ' >>>> slave2.my.com', 'slave3.my.com']using >>>> /usr/lib/python2.6/site-packages/ambari_server cluster primary O >>>> S: redhat6 with user 'root' sshKey File >>>> /var/run/ambari-server/bootstrap/1/sshKey password File null using tmp dir >>>> /var/run/ambari-server/bootstrap/1 ambari: master.my. >>>> com; ambari version: 1.3.0 >>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o BatchMode=yes >>>> -o StrictHostKeyChecking=no -i /var/run/ambari-server/bootstrap/1/sshKey >>>> /usr/lib/python2.6/site >>>> -packages/ambari_server/os_type_check.sh r...@master.my.com: >>>> /tmp/os_type_check.sh >>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o BatchMode=yes >>>> -o StrictHostKeyChecking=no -i /var/run/ambari-server/bootstrap/1/sshKey >>>> /usr/lib/python2.6/site >>>> -packages/ambari_server/os_type_check.sh r...@slave1.my.com: >>>> /tmp/os_type_check.sh >>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o BatchMode=yes >>>> -o StrictHostKeyChecking=no -i /var/run/ambari-server/bootstrap/1/sshKey >>>> /usr/lib/python2.6/site >>>> -packages/ambari_server/os_type_check.sh r...@slave2.my.com: >>>> /tmp/os_type_check.sh >>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o BatchMode=yes >>>> -o StrictHostKeyChecking=no -i /var/run/ambari-server/bootstrap/1/sshKey >>>> /usr/lib/python2.6/site >>>> -packages/ambari_server/os_type_check.sh r...@slave3.my.com: >>>> /tmp/os_type_check.sh >>>> INFO:root:scp >>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>> host master.my.com, exitcode=1 >>>> INFO:root:scp >>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>> host slave1.my.com, exitcode=1 >>>> INFO:root:scp >>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>> host slave2.my.com, exitcode=1 >>>> INFO:root:scp >>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>> host slave3.my.com, exitcode=1 >>>> INFO:root:Parallel scp returns for os type check script. Failed hosts >>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com >>>> '] >>>> INFO:root:Copying os type check script finished >>>> INFO:root:Running os type check... >>>> INFO:root:Parallel ssh returns for setup agent. All failed hosts are: [' >>>> slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>> Failed on last step: [] >>>> INFO:root:Running os type check finished >>>> INFO:root:Parallel ssh returns for checking 'sudo' package. All failed >>>> hosts are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>> slave1.my.com']. Failed on last s >>>> tep: [] >>>> INFO:root:Checking 'sudo' package finished >>>> INFO:root:Copying repo file to 'tmp' folder... >>>> INFO:root:Parallel scp returns for copying repo file. All failed hosts >>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>> Failed on last step: [ >>>> ] >>>> INFO:root:Moving repo file... >>>> INFO:root:Parallel scp returns for moving repo file. All failed hosts >>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>> Failed on last step: [] >>>> INFO:root:Parallel scp returns for agent script. All failed hosts are: >>>> ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>> Failed on last step: [] >>>> INFO:root:Copying files finished >>>> INFO:root:Running setup agent... >>>> INFO:root:Parallel ssh returns for setup agent. All failed hosts are: [' >>>> slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>> Failed on last step: [] >>>> INFO:root:Running ssh command finished >>>> 02:06:09,579 INFO BSHostStatusCollector:55 - Request directory >>>> /var/run/ambari-server/bootstrap/1 >>>> 02:06:09,580 INFO BSHostStatusCollector:62 - HostList for polling on [ >>>> master.my.com, slave1.my.com, slave2.my.com, slave3.my.com] >>>> 02:06:10,595 INFO BSHostStatusCollector:55 - Request directory >>>> /var/run/ambari-server/bootstrap/1 >>>> 02:06:10,595 INFO BSHostStatusCollector:62 - HostList for polling on [ >>>> master.my.com, slave1.my.com, slave2.my.com, slave3.my.com] >>>> >>>> >>>> On Tue, Jan 20, 2015 at 10:00 PM, Krish Donald <gotomyp...@gmail.com> >>>> wrote: >>>> >>>>> It failed and I saw one more message in the log: >>>>> >>>>> 00:57:36,564 INFO PersistKeyValueService:82 - Looking for keyName >>>>> CLUSTER_CURRENT_STATUS >>>>> >>>>> On Tue, Jan 20, 2015 at 9:48 PM, Krish Donald <gotomyp...@gmail.com> >>>>> wrote: >>>>> >>>>>> Below are the last few messages which I can find out: >>>>>> >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host slave3.my.com, exitcode=1 >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host slave1.my.com, exitcode=1 >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host master.my.com, exitcode=1 >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host slave2.my.com, exitcode=1 >>>>>> INFO:root:Parallel scp returns for os type check script. Failed hosts >>>>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com'] >>>>>> INFO:root:Copying os type check script finished >>>>>> INFO:root:Running os type check... >>>>>> INFO:root:Parallel ssh returns for setup agent. All failed hosts are: >>>>>> ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>>>> Failed on last step: [] >>>>>> INFO:root:Running os type check finished >>>>>> INFO:root:Parallel ssh returns for checking 'sudo' package. All >>>>>> failed hosts are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', >>>>>> 'slave1.my.com']. Failed on last s >>>>>> tep: [] >>>>>> INFO:root:Checking 'sudo' package finished >>>>>> INFO:root:Copying repo file to 'tmp' folder... >>>>>> INFO:root:Parallel scp returns for copying repo file. All failed >>>>>> hosts are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com']. Failed on last step: [ >>>>>> ] >>>>>> INFO:root:Moving repo file... >>>>>> INFO:root:Parallel scp returns for moving repo file. All failed hosts >>>>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com']. Failed on last step: [] >>>>>> INFO:root:Parallel scp returns for agent script. All failed hosts >>>>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com']. Failed on last step: [] >>>>>> INFO:root:Copying files finished >>>>>> INFO:root:Running setup agent... >>>>>> INFO:root:Parallel ssh returns for setup agent. All failed hosts are: >>>>>> ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>>>> Failed on last step: [] >>>>>> INFO:root:Running ssh command finished >>>>>> 21:22:01,847 INFO BSHostStatusCollector:55 - Request directory >>>>>> /var/run/ambari-server/bootstrap/2 >>>>>> 21:22:01,848 INFO BSHostStatusCollector:62 - HostList for polling on >>>>>> [master.my.com, slave1.my.com, slave2.my.com, slave3.my.com] >>>>>> 21:57:32,153 INFO PersistKeyValueService:82 - Looking for keyName >>>>>> CLUSTER_CURRENT_STATUS >>>>>> 22:57:32,784 INFO PersistKeyValueService:82 - Looking for keyName >>>>>> CLUSTER_CURRENT_STATUS >>>>>> 23:57:34,304 INFO PersistKeyValueService:82 - Looking for keyName >>>>>> CLUSTER_CURRENT_STATUS >>>>>> 00:16:34,862 INFO BootStrapImpl:101 - BootStrapping hosts >>>>>> master.my.com:slave1.my.com:slave2.my.com:slave3.my.com: >>>>>> 00:16:34,868 INFO BSRunner:196 - Host= master.my.com,slave1.my.com, >>>>>> slave2.my.com,slave3.my.com >>>>>> bs=/usr/lib/python2.6/site-packages/ambari_server/bootstrap.py requestDi >>>>>> r=/var/run/ambari-server/bootstrap/3 user=root >>>>>> keyfile=/var/run/ambari-server/bootstrap/3/sshKey passwordFile null >>>>>> server= >>>>>> master.my.com version=1.3.0 >>>>>> 00:16:34,877 INFO BSRunner:227 - Kicking off the scheduler for >>>>>> polling on logs in /var/run/ambari-server/bootstrap/3 >>>>>> 00:16:34,878 INFO BSRunner:231 - Bootstrap output, >>>>>> log=/var/run/ambari-server/bootstrap/3/bootstrap.err >>>>>> /var/run/ambari-server/bootstrap/3/bootstrap.out >>>>>> 00:16:34,891 INFO BSHostStatusCollector:55 - Request directory >>>>>> /var/run/ambari-server/bootstrap/3 >>>>>> 00:16:34,892 INFO BSHostStatusCollector:62 - HostList for polling on >>>>>> [master.my.com, slave1.my.com, slave2.my.com, slave3.my.com] >>>>>> 00:16:35,532 INFO BSRunner:243 - Script log Mesg >>>>>> INFO:root:BootStrapping hosts ['master.my.com', 'slave1.my.com', ' >>>>>> slave2.my.com', 'slave3.my.com']using >>>>>> /usr/lib/python2.6/site-packages/ambari_server cluster primary O >>>>>> S: redhat6 with user 'root' sshKey File >>>>>> /var/run/ambari-server/bootstrap/3/sshKey password File null using tmp >>>>>> dir >>>>>> /var/run/ambari-server/bootstrap/3 ambari: master.my. >>>>>> com; ambari version: 1.3.0 >>>>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o >>>>>> BatchMode=yes -o StrictHostKeyChecking=no -i >>>>>> /var/run/ambari-server/bootstrap/3/sshKey /usr/lib/python2.6/site >>>>>> -packages/ambari_server/os_type_check.sh r...@master.my.com: >>>>>> /tmp/os_type_check.sh >>>>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o >>>>>> BatchMode=yes -o StrictHostKeyChecking=no -i >>>>>> /var/run/ambari-server/bootstrap/3/sshKey /usr/lib/python2.6/site >>>>>> -packages/ambari_server/os_type_check.sh r...@slave1.my.com: >>>>>> /tmp/os_type_check.sh >>>>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o >>>>>> BatchMode=yes -o StrictHostKeyChecking=no -i >>>>>> /var/run/ambari-server/bootstrap/3/sshKey /usr/lib/python2.6/site >>>>>> -packages/ambari_server/os_type_check.sh r...@slave2.my.com: >>>>>> /tmp/os_type_check.sh >>>>>> INFO:root:Running scp command scp -o ConnectTimeout=60 -o >>>>>> BatchMode=yes -o StrictHostKeyChecking=no -i >>>>>> /var/run/ambari-server/bootstrap/3/sshKey /usr/lib/python2.6/site >>>>>> -packages/ambari_server/os_type_check.sh r...@slave3.my.com: >>>>>> /tmp/os_type_check.sh >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host master.my.com, exitcode=1 >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host slave2.my.com, exitcode=1 >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host slave1.my.com, exitcode=1 >>>>>> INFO:root:scp >>>>>> /usr/lib/python2.6/site-packages/ambari_server/os_type_check.sh done for >>>>>> host slave3.my.com, exitcode=1 >>>>>> INFO:root:Parallel scp returns for os type check script. Failed hosts >>>>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com'] >>>>>> INFO:root:Copying os type check script finished >>>>>> INFO:root:Running os type check... >>>>>> INFO:root:Parallel ssh returns for setup agent. All failed hosts are: >>>>>> ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>>>> Failed on last step: [] >>>>>> INFO:root:Running os type check finished >>>>>> INFO:root:Parallel ssh returns for checking 'sudo' package. All >>>>>> failed hosts are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', >>>>>> 'slave1.my.com']. Failed on last s >>>>>> tep: [] >>>>>> INFO:root:Checking 'sudo' package finished >>>>>> INFO:root:Copying repo file to 'tmp' folder... >>>>>> INFO:root:Parallel scp returns for copying repo file. All failed >>>>>> hosts are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com']. Failed on last step: [ >>>>>> ] >>>>>> INFO:root:Moving repo file... >>>>>> INFO:root:Parallel scp returns for moving repo file. All failed hosts >>>>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com']. Failed on last step: [] >>>>>> INFO:root:Parallel scp returns for agent script. All failed hosts >>>>>> are: ['slave3.my.com', 'slave2.my.com', 'master.my.com', ' >>>>>> slave1.my.com']. Failed on last step: [] >>>>>> INFO:root:Copying files finished >>>>>> INFO:root:Running setup agent... >>>>>> INFO:root:Parallel ssh returns for setup agent. All failed hosts are: >>>>>> ['slave3.my.com', 'slave2.my.com', 'master.my.com', 'slave1.my.com']. >>>>>> Failed on last step: [] >>>>>> INFO:root:Running ssh command finished >>>>>> 00:16:35,537 INFO BSHostStatusCollector:55 - Request directory >>>>>> /var/run/ambari-server/bootstrap/3 >>>>>> 00:16:35,537 INFO BSHostStatusCollector:62 - HostList for polling on >>>>>> [master.my.com, slave1.my.com, slave2.my.com, slave3.my.com] >>>>>> >>>>>> >>>>>> On Tue, Jan 20, 2015 at 8:07 PM, Devopam Mittra <devo...@gmail.com> >>>>>> wrote: >>>>>> >>>>>>> Please check if you have installed ambari as "root" user ? And as >>>>>>> Newton has asked before, see the log file for details. >>>>>>> >>>>>>> regards >>>>>>> Dev >>>>>>> >>>>>>> On Wed, Jan 21, 2015 at 7:34 AM, Newton Alex <na...@pivotal.io> >>>>>>> wrote: >>>>>>> >>>>>>>> What does the ambari-server.log say? >>>>>>>> >>>>>>>> On Ambari Server: /var/log/ambari-server/ambari-server.log >>>>>>>> >>>>>>>> - >>>>>>>> >>>>>>>> Newton >>>>>>>> >>>>>>>> On Tue, Jan 20, 2015 at 6:00 PM, Krish Donald <gotomyp...@gmail.com >>>>>>>> > wrote: >>>>>>>> >>>>>>>>> >>>>>>>>> Hi, >>>>>>>>> >>>>>>>>> I am trying to setup multimode cluster using Ambari. >>>>>>>>> I have setup password less ssh using root . >>>>>>>>> I have installed ambari server on the master node . >>>>>>>>> Setup ntp, disabled iptables. >>>>>>>>> >>>>>>>>> 1. When I am going to masternode ipaddress:8080 and after logining >>>>>>>>> using admin userid with admin password, I am not getting option to >>>>>>>>> select >>>>>>>>> the stack . >>>>>>>>> >>>>>>>>> 2. When I am trying to setup Hadoop cluster I am getting below >>>>>>>>> error : >>>>>>>>> >>>>>>>>> >>>>>>>>> STDOUT >>>>>>>>> >>>>>>>>> STDERR >>>>>>>>> Permission denied (publickey,gssapi-keyex,gssapi-with-mic,password). >>>>>>>>> lost connection >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Devopam Mittra >>>>>>> Life and Relations are not binary >>>>>>> >>>>>> >>>>>> >>>>> >>>> >>> >> >