[
https://issues.apache.org/jira/browse/AMBARI-9372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Levas reassigned AMBARI-9372:
------------------------------------
Assignee: Robert Levas
> Kerberos: start + test fails on NodeManager due to ambari-qa usercache
> ----------------------------------------------------------------------
>
> Key: AMBARI-9372
> URL: https://issues.apache.org/jira/browse/AMBARI-9372
> Project: Ambari
> Issue Type: Bug
> Components: security
> Affects Versions: 2.0.0
> Reporter: Erik Bergenholtz
> Assignee: Robert Levas
> Fix For: 2.0.0
>
>
> This is on a two-node cluster. The NodeManager on one of the nodes (c6402)
> starts just fine. It's this second node where the nodemanager fails (c6403)
> with the following:
> Build #398
> [root@c6402 keytabs]# ambari-server --hash
> c0167f89d7c293f39f62fbf9df0b6640bde25b51
> [root@c6402 keytabs]#
> NodeManager fails to start, with this error.
> {code}
> 2015-01-24 18:12:33,968 - Error while executing command 'start':
> Traceback (most recent call last):
> File
> "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py",
> line 184, in execute
> method(env)
> File
> "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/scripts/nodemanager.py",
> line 58, in start
> self.configure(env) # FOR SECURITY
> File
> "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/scripts/nodemanager.py",
> line 45, in configure
> yarn(name="nodemanager")
> File
> "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/scripts/yarn.py",
> line 77, in yarn
> Execute(format("chown -R {params.smokeuser} {directory}"))
> File "/usr/lib/python2.6/site-packages/resource_management/core/base.py",
> line 148, in __init__
> self.env.run()
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
> line 151, in run
> self.run_action(resource, action)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
> line 117, in run_action
> provider_action()
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py",
> line 277, in action_run
> raise ex
> Fail: Execution of 'chown -R ambari-qa
> /hadoop/yarn/local/usercache/ambari-qa' returned 1. chown: cannot access
> `/hadoop/yarn/local/usercache/ambari-qa': No such file or directory
> {code}
> *ON C6402 (where NODEMANAGER STARTS FINE):*
> {code}
> [vagrant@c6402 ~]$ ls -l /hadoop/yarn/local/
> total 12
> drwxr-xr-x 4 yarn hadoop 4096 Jan 24 17:42 filecache
> drwx------ 2 yarn hadoop 4096 Jan 24 17:44 nmPrivate
> drwxr-xr-x 3 yarn hadoop 4096 Jan 24 17:41 usercache
> [vagrant@c6402 ~]$ ls -l /hadoop/yarn/local/usercache/
> total 4
> drwxr-x--- 4 ambari-qa hadoop 4096 Jan 24 17:41 ambari-qa
> [vagrant@c6402 ~]$
> {code}
> *ON C6403 (where NODEMANAGER DOES NOT START):*
> {code}
> [vagrant@c6403 ~]$ ls -l /hadoop/yarn/local/
> total 12
> drwxr-xr-x 2 yarn hadoop 4096 Jan 24 17:40 filecache
> drwx------ 2 yarn hadoop 4096 Jan 24 17:40 nmPrivate
> drwxr-xr-x 2 yarn hadoop 4096 Jan 24 17:40 usercache
> [vagrant@c6403 ~]$ ls -l /hadoop/yarn/local/usercache/
> total 0
> [vagrant@c6403 ~]$
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)