I think this is a bug that was fixed in the latest version of Slider, 0.92.
I didn't figure out why this happens sometimes, but it seems to be resolved
by changing the following:

diff --git a/slider-agent/src/main/python/agent/ActionQueue.py
b/slider-agent/src/main/python/agent/ActionQueue.py
index 7514337..e973337 100644
--- a/slider-agent/src/main/python/agent/ActionQueue.py
+++ b/slider-agent/src/main/python/agent/ActionQueue.py
@@ -161,7 +161,7 @@ class ActionQueue(threading.Thread):
     self.commandStatuses.put_command_status(command, in_progress_status,
reportResult)

     store_config = False
-    if ActionQueue.STORE_APPLIED_CONFIG in command['commandParams']:
+    if 'commandParams' in command and ActionQueue.STORE_APPLIED_CONFIG in
command['commandParams']:
       store_config = 'true' ==
command['commandParams'][ActionQueue.STORE_APPLIED_CONFIG]
     store_command = False
     if 'roleParams' in command and command['roleParams'] is not None and
ActionQueue.AUTO_RESTART in command['roleParams']:


On Thu, Mar 23, 2017 at 6:25 PM, David.Serafini <david.seraf...@target.com>
wrote:

> Can anyone tell me what this error means and whether it is significant?
> I have a slider job that seems to randomly fail, and I don't see anything
> interesting in the AppMaster logs  except this.  (That doesn't mean there
> isn't an error elsewhere: yarn is wiping out the job directories as soon as
> the containter terminates: I haven't figured out how to fix that).
>
> In case it matters, my job is a shell script specified in  metainfo.json
> in  application.components.commands.exec .  The script does some setup
> and then runs tomcat.
>
> thanks in advance,
> david
>
>
> Connecting to the server at https://brdn1088.target.com:
> 42721/ws/v1/slider/agents/...
> Registered with the server
> Traceback (most recent call last):
>   File "./infra/agent/slider-agent/agent/main.py", line 318, in <module>
>     main()
>   File "./infra/agent/slider-agent/agent/main.py", line 311, in main
>     controller.join(timeout=1.0)
>   File "/usr/lib64/python2.6/threading.py", line 655, in join
>     self.__block.wait(delay)
>   File "/usr/lib64/python2.6/threading.py", line 258, in wait
>     _sleep(delay)
>   File "./infra/agent/slider-agent/agent/main.py", line 66, in
> signal_handler
>     controller.actionQueue.execute_command(controller.stopCommand)
>   File "/grid/4/hadoop/yarn/local/usercache/Z002JSF/appcache/
> application_1490038663882_9176/filecache/11/slider-
> agent.tar.gz/slider-agent/agent/ActionQueue.py", line 164, in
> execute_command
>     if ActionQueue.STORE_APPLIED_CONFIG in command['commandParams']:
> KeyError: 'commandParams'
>
>
>

Reply via email to