[ https://issues.apache.org/jira/browse/MESOS-7686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jack Crawford updated MESOS-7686:
---------------------------------
    Description:
Mesos master fails to update registrar and fails.

Running with mesos 1.2.0, 1 master

{code}
Jun 16 02:20:22 mesosmaster mesos-master[9415]: E0616 02:20:22.562098 9422 registrar.cpp:528] Registrar aborting: Failed to update registry: Failed to perform store within 20secs
Jun 16 02:20:32 mesosmaster mesos-master[9415]: F0616 02:20:32.965498 9419 master.cpp:6420] Failed to mark agent acc02700-53c8-4961-b8f4-d952e58432c3-S742 at slave(1)@10.0.239.60:5051 (ip-10-0-239-60) un
Jun 16 02:20:53 mesosmaster mesos-master[9415]: E0616 02:20:36.198673 9426 process.cpp:2426] Failed to shutdown socket with fd 18: Transport endpoint is not connected
Jun 16 02:20:53 mesosmaster mesos-master[9415]: *** Check failure stack trace: ***
Jun 16 02:21:24 mesosmaster mesos-master[9415]: @ 0x7f696c7923cd google::LogMessage::Fail()
Jun 16 02:21:34 mesosmaster mesos-master[9415]: @ 0x7f696c794180 google::LogMessage::SendToLog()
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c791fb3 google::LogMessage::Flush()
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c794ba9 google::LogMessageFatal::~LogMessageFatal()
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696bb48e6d mesos::internal::master::Master::_markUnreachable()
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c6f1f0c process::ProcessBase::visit()
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c704933 process::ProcessManager::resume()
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c70f537 _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696a950c80 (unknown)
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696a1636ba start_thread
Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f6969e9982d (unknown)
Jun 16 02:23:40 mesosmaster systemd[1]: mesos-master.service: Main process exited, code=killed, status=6/ABRT
Jun 16 02:23:40 mesosmaster systemd[1]: mesos-master.service: Unit entered failed state.
{code}

  was:
Mesos master fails to update registrar and fails.

Running with mesos 1.2.0, 1 master

```
Jun 16 02:20:22 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:22.562098 9422 registrar.cpp:528] Registrar aborting: Failed to update registry: Failed to perform store within 20secs
Jun 16 02:20:32 ava-mesosmasterl001 mesos-master[9415]: F0616 02:20:32.965498 9419 master.cpp:6420] Failed to mark agent acc02700-53c8-4961-b8f4-d952e58432c3-S742 at slave(1)@10.0.239.60:5051 (ip-10-0-239-60) un
Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:36.198673 9426 process.cpp:2426] Failed to shutdown socket with fd 18: Transport endpoint is not connected
Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: *** Check failure stack trace: ***
Jun 16 02:21:24 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696c7923cd google::LogMessage::Fail()
Jun 16 02:21:34 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696c794180 google::LogMessage::SendToLog()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696c791fb3 google::LogMessage::Flush()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696c794ba9 google::LogMessageFatal::~LogMessageFatal()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696bb48e6d mesos::internal::master::Master::_markUnreachable()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696c6f1f0c process::ProcessBase::visit()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696c704933 process::ProcessManager::resume()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696c70f537 _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696a950c80 (unknown)
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f696a1636ba start_thread
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]: @ 0x7f6969e9982d (unknown)
Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Main process exited, code=killed, status=6/ABRT
Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Unit entered failed state.
```

> registrar aborting, failed to mark agent causes fatal error
> -----------------------------------------------------------
>
>                 Key: MESOS-7686
>                 URL: https://issues.apache.org/jira/browse/MESOS-7686
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Jack Crawford
>
> Mesos master fails to update registrar and fails.
> Running with mesos 1.2.0, 1 master
> {code}
> Jun 16 02:20:22 mesosmaster mesos-master[9415]: E0616 02:20:22.562098 9422
> registrar.cpp:528] Registrar aborting: Failed to update registry: Failed to
> perform store within 20secs
> Jun 16 02:20:32 mesosmaster mesos-master[9415]: F0616 02:20:32.965498 9419
> master.cpp:6420] Failed to mark agent
> acc02700-53c8-4961-b8f4-d952e58432c3-S742 at slave(1)@10.0.239.60:5051
> (ip-10-0-239-60) un
> Jun 16 02:20:53 mesosmaster mesos-master[9415]: E0616 02:20:36.198673 9426
> process.cpp:2426] Failed to shutdown socket with fd 18: Transport endpoint is
> not connected
> Jun 16 02:20:53 mesosmaster mesos-master[9415]: *** Check failure stack
> trace: ***
> Jun 16 02:21:24 mesosmaster mesos-master[9415]: @ 0x7f696c7923cd
> google::LogMessage::Fail()
> Jun 16 02:21:34 mesosmaster mesos-master[9415]: @ 0x7f696c794180
> google::LogMessage::SendToLog()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c791fb3
> google::LogMessage::Flush()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c794ba9
> google::LogMessageFatal::~LogMessageFatal()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696bb48e6d
> mesos::internal::master::Master::_markUnreachable()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c6f1f0c
> process::ProcessBase::visit()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c704933
> process::ProcessManager::resume()
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696c70f537
> _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696a950c80
> (unknown)
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f696a1636ba
> start_thread
> Jun 16 02:21:44 mesosmaster mesos-master[9415]: @ 0x7f6969e9982d
> (unknown)
> Jun 16 02:23:40 mesosmaster systemd[1]: mesos-master.service: Main process
> exited, code=killed, status=6/ABRT
> Jun 16 02:23:40 mesosmaster systemd[1]: mesos-master.service: Unit entered
> failed state.
> {code}

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)