[ 
https://issues.apache.org/jira/browse/MESOS-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15868485#comment-15868485
 ] 

Anand Mazumdar commented on MESOS-7102:
---------------------------------------

Commit to 1.2.x branch
{noformat}
commit 7e5439d55fd89cb9336220d9a1847391384ea8d5
Author: Anand Mazumdar <an...@apache.org>
Date:   Fri Feb 10 15:41:11 2017 -0800

    Fixed a crash on the agent when handling the SIGUSR1 signal.

    There were some actors that were not being destructed when
    `finalize()` was being invoked. Also fixed the order of the
    destruction of objects i.e., in the reverse order of their
    creation.

    Review: https://reviews.apache.org/r/56525/
{noformat}

> Crash when sending a SIGUSR1 signal to the agent.
> -------------------------------------------------
>
>                 Key: MESOS-7102
>                 URL: https://issues.apache.org/jira/browse/MESOS-7102
>             Project: Mesos
>          Issue Type: Bug
>          Components: agent
>    Affects Versions: 1.2.0
>         Environment: ubuntu 16.04
>            Reporter: Anand Mazumdar
>            Assignee: Anand Mazumdar
>            Priority: Critical
>              Labels: mesosphere
>             Fix For: 1.3.0
>
>
> Looks like sending a {{SIGUSR1}} to the agent crashes it. This is a 
> regression and used to work fine in the 1.1 release. Note that the agent does 
> unregisters with the master and the crash happens after that.
> Steps to reproduce:
> - Start the agent.
> - Send it a {{SIGUSR1}} signal.
> The agent should crash with a stack trace similar to this:
> {noformat}
> I0209 16:19:46.210819 31977472 slave.cpp:851] Received SIGUSR1 signal from 
> user gmann; unregistering and shutting down
> I0209 16:19:46.210960 31977472 slave.cpp:803] Agent terminating
> *** Aborted at 1486685986 (unix time) try "date -d @1486685986" if you are 
> using GNU date ***
> PC: @     0x7fffbc4904fc _pthread_key_global_init
> *** SIGSEGV (@0x38) received by PID 88894 (TID 0x7fffc50c83c0) stack trace: 
> ***
>     @     0x7fffbc488bba _sigtramp
>     @     0x7fe8a5d03f38 (unknown)
>     @        0x10b6d67d9 
> _ZZ11synchronizeINSt3__115recursive_mutexEE12SynchronizedIT_EPS3_ENKUlPS1_E_clES6_
>     @        0x10b6d67b8 
> _ZZ11synchronizeINSt3__115recursive_mutexEE12SynchronizedIT_EPS3_ENUlPS1_E_8__invokeES6_
>     @        0x10b6d6889 Synchronized<>::Synchronized()
>     @        0x10b6d678d Synchronized<>::Synchronized()
>     @        0x10b6a708a synchronize<>()
>     @        0x10e2f148d process::ProcessManager::wait()
>     @        0x10e2e9a78 process::wait()
>     @        0x10b30614f process::wait()
>     @        0x10c9619dc 
> mesos::internal::slave::StatusUpdateManager::~StatusUpdateManager()
>     @        0x10c961a55 
> mesos::internal::slave::StatusUpdateManager::~StatusUpdateManager()
>     @        0x10b1ab035 main
>     @     0x7fffbc27b255 start
> [1]    88894 segmentation fault  bin/mesos-agent.sh —master=127.0.0.1:5050
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to