Dmitry Lysnichenko created AMBARI-6748:
------------------------------------------
Summary: Nimbus start failed after deployment
Key: AMBARI-6748
URL: https://issues.apache.org/jira/browse/AMBARI-6748
Project: Ambari
Issue Type: Bug
Components: agent
Affects Versions: 1.7.0
Reporter: Dmitry Lysnichenko
Assignee: Dmitry Lysnichenko
Fix For: 1.7.0
Deployed HDP-2.1, start all services failed. Nimbus cannot start.
error log:
{code}
stderr:
2014-08-04 18:12:28,049 - Error while executing command 'start':
Traceback (most recent call last):
File
"/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py",
line 122, in execute
method(env)
File
"/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/nimbus.py",
line 43, in start
service("nimbus", action="start")
File
"/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/service.py",
line 64, in service
try_sleep=10
File "/usr/lib/python2.6/site-packages/resource_management/core/base.py",
line 148, in __init__
self.env.run()
File
"/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
line 149, in run
self.run_action(resource, action)
File
"/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
line 115, in run_action
provider_action()
File
"/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py",
line 241, in action_run
raise ex
Fail: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
stdout:
2014-08-04 18:11:37,197 - Execute['mkdir -p /tmp/HDP-artifacts/; curl -kf
-x "" --retry 10
http://ambsmoke6-4-1407193726-1.cs1cloud.internal:8080/resources//UnlimitedJCEPolicyJDK7.zip
-o /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip'] {'environment': ...,
'not_if': 'test -e /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip',
'ignore_failures': True, 'path': ['/bin', '/usr/bin/']}
2014-08-04 18:11:37,211 - Skipping Execute['mkdir -p /tmp/HDP-artifacts/;
curl -kf -x "" --retry 10
http://ambsmoke6-4-1407193726-1.cs1cloud.internal:8080/resources//UnlimitedJCEPolicyJDK7.zip
-o /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip'] due to not_if
2014-08-04 18:11:37,212 - Execute['rm -f local_policy.jar; rm -f
US_export_policy.jar; unzip -o -j -q
/tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip'] {'path': ['/bin/',
'/usr/bin'], 'only_if': 'test -e /usr/jdk64/jdk1.7.0_45/jre/lib/security &&
test -f /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip', 'cwd':
'/usr/jdk64/jdk1.7.0_45/jre/lib/security'}
2014-08-04 18:11:37,390 - Directory['/etc/hadoop/conf.empty'] {'owner': 'root',
'group': 'root', 'recursive': True}
2014-08-04 18:11:37,391 - Link['/etc/hadoop/conf'] {'not_if': 'ls
/etc/hadoop/conf', 'to': '/etc/hadoop/conf.empty'}
2014-08-04 18:11:37,404 - Skipping Link['/etc/hadoop/conf'] due to not_if
2014-08-04 18:11:37,419 - File['/etc/hadoop/conf/hadoop-env.sh'] {'content':
InlineTemplate(...), 'owner': 'root'}
2014-08-04 18:11:37,419 - XmlConfig['core-site.xml'] {'owner': 'hdfs', 'group':
'hadoop', 'conf_dir': '/etc/hadoop/conf', 'configuration_attributes': ...,
'configurations': ...}
2014-08-04 18:11:37,429 - Generating config: /etc/hadoop/conf/core-site.xml
2014-08-04 18:11:37,430 - File['/etc/hadoop/conf/core-site.xml'] {'owner':
'hdfs', 'content': InlineTemplate(...), 'group': 'hadoop', 'mode': None,
'encoding': 'UTF-8'}
2014-08-04 18:11:37,431 - Writing File['/etc/hadoop/conf/core-site.xml']
because contents don't match
2014-08-04 18:11:37,443 - Execute['/bin/echo 0 > /selinux/enforce'] {'only_if':
'test -f /selinux/enforce'}
2014-08-04 18:11:37,456 - Skipping Execute['/bin/echo 0 > /selinux/enforce']
due to only_if
2014-08-04 18:11:37,457 - Execute['mkdir -p
/usr/lib/hadoop/lib/native/Linux-i386-32; ln -sf /usr/lib/libsnappy.so
/usr/lib/hadoop/lib/native/Linux-i386-32/libsnappy.so'] {}
2014-08-04 18:11:37,488 - Execute['mkdir -p
/usr/lib/hadoop/lib/native/Linux-amd64-64; ln -sf /usr/lib64/libsnappy.so
/usr/lib/hadoop/lib/native/Linux-amd64-64/libsnappy.so'] {}
2014-08-04 18:11:37,500 - Directory['/grid/0/log/hadoop'] {'owner': 'root',
'group': 'root', 'recursive': True}
2014-08-04 18:11:37,501 - Directory['/var/run/hadoop'] {'owner': 'root',
'group': 'root', 'recursive': True}
2014-08-04 18:11:37,502 - Directory['/tmp/hadoop-hdfs'] {'owner': 'hdfs',
'recursive': True}
2014-08-04 18:11:37,506 - File['/etc/hadoop/conf/commons-logging.properties']
{'content': Template('commons-logging.properties.j2'), 'owner': 'root'}
2014-08-04 18:11:37,508 - File['/etc/hadoop/conf/health_check'] {'content':
Template('health_check-v2.j2'), 'owner': 'root'}
2014-08-04 18:11:37,509 - File['/etc/hadoop/conf/log4j.properties'] {'content':
'...', 'owner': 'hdfs', 'group': 'hadoop', 'mode': 0644}
2014-08-04 18:11:37,516 - File['/etc/hadoop/conf/hadoop-metrics2.properties']
{'content': Template('hadoop-metrics2.properties.j2'), 'owner': 'hdfs'}
2014-08-04 18:11:37,517 - File['/etc/hadoop/conf/task-log4j.properties']
{'content': StaticFile('task-log4j.properties'), 'mode': 0755}
2014-08-04 18:11:37,517 - File['/etc/hadoop/conf/configuration.xsl'] {'owner':
'hdfs', 'group': 'hadoop'}
2014-08-04 18:11:37,705 - Directory['/var/log/storm'] {'owner': 'storm',
'group': 'hadoop', 'recursive': True}
2014-08-04 18:11:37,707 - Directory['/var/run/storm'] {'owner': 'storm',
'group': 'hadoop', 'recursive': True}
2014-08-04 18:11:37,707 - Directory['/grid/0/hadoop/storm'] {'owner': 'storm',
'group': 'hadoop', 'recursive': True}
2014-08-04 18:11:37,707 - Directory['/etc/storm/conf'] {'owner': 'storm',
'group': 'hadoop', 'recursive': True}
2014-08-04 18:11:37,714 - File['/etc/storm/conf/config.yaml'] {'owner':
'storm', 'content': Template('config.yaml.j2'), 'group': 'hadoop'}
2014-08-04 18:11:37,719 - File['/etc/storm/conf/storm.yaml'] {'owner': 'storm',
'content': InlineTemplate(...), 'group': 'hadoop', 'mode': None}
2014-08-04 18:11:37,721 - Writing File['/etc/storm/conf/storm.yaml'] because
contents don't match
2014-08-04 18:11:37,722 - File['/etc/storm/conf/storm-env.sh'] {'content':
'\n#!/bin/bash\n\n# Set Storm specific environment variables here.\n\n# The
java implementation to use.\nexport JAVA_HOME={{java_home}}\n\n# export
STORM_CONF_DIR=""', 'owner': 'storm'}
2014-08-04 18:11:37,722 - TemplateConfig['/etc/storm/conf/storm_jaas.conf']
{'owner': 'storm'}
2014-08-04 18:11:37,724 - File['/etc/storm/conf/storm_jaas.conf'] {'content':
Template('storm_jaas.conf.j2'), 'owner': 'storm', 'group': None, 'mode': None}
2014-08-04 18:11:37,725 - Execute['env JAVA_HOME=/usr/jdk64/jdk1.7.0_45
PATH=$PATH:/usr/jdk64/jdk1.7.0_45/bin /usr/bin/storm nimbus >
/var/log/storm/nimbus.out 2>&1'] {'wait_for_finish': False, 'not_if': 'ls
/var/run/storm/nimbus.pid >/dev/null 2>&1 && ps `cat /var/run/storm/nimbus.pid`
>/dev/null 2>&1', 'user': 'storm'}
2014-08-04 18:11:37,752 - Execute['pgrep -f
"^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid']
{'logoutput': True, 'tries': 6, 'user': 'storm', 'try_sleep': 10}
2014-08-04 18:11:37,790 - Retrying after 10 seconds. Reason: Execution of
'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
2014-08-04 18:11:47,827 - Retrying after 10 seconds. Reason: Execution of
'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
2014-08-04 18:11:57,881 - Retrying after 10 seconds. Reason: Execution of
'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
2014-08-04 18:12:07,946 - Retrying after 10 seconds. Reason: Execution of
'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
2014-08-04 18:12:17,989 - Retrying after 10 seconds. Reason: Execution of
'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
2014-08-04 18:12:28,049 - Error while executing command 'start':
Traceback (most recent call last):
File
"/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py",
line 122, in execute
method(env)
File
"/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/nimbus.py",
line 43, in start
service("nimbus", action="start")
File
"/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/service.py",
line 64, in service
try_sleep=10
File "/usr/lib/python2.6/site-packages/resource_management/core/base.py",
line 148, in __init__
self.env.run()
File
"/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
line 149, in run
self.run_action(resource, action)
File
"/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
line 115, in run_action
provider_action()
File
"/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py",
line 241, in action_run
raise ex
Fail: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f
"^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
{code}
Nimbus.out
{code}
Traceback (most recent call last):
File "/usr/lib/storm/bin/storm.py", line 463, in <module>
main()
File "/usr/lib/storm/bin/storm.py", line 460, in main
(COMMANDS.get(COMMAND, unknown_command))(*ARGS)
File "/usr/lib/storm/bin/storm.py", line 276, in nimbus
jvmopts = parse_args(confvalue("nimbus.childopts", cppaths)) + [
File "/usr/lib/storm/bin/storm.py", line 87, in confvalue
p = sub.Popen(command, stdout=sub.PIPE)
File "/usr/lib64/python2.6/subprocess.py", line 642, in __init__
errread, errwrite)
File "/usr/lib64/python2.6/subprocess.py", line 1234, in _execute_child
raise child_exception
OSError: [Errno 2] No such file or directory
{code}
--
This message was sent by Atlassian JIRA
(v6.2#6252)