Hello again,

2008/2/18, Mark Sapiro <[EMAIL PROTECTED]>:
> Zbigniew Szalbot wrote:
> >
> >Feb 18 18:26:53 2008 (560) ArchRunner qrunner started.
> >Feb 18 18:26:53 2008 (563) IncomingRunner qrunner started.
> >Feb 18 18:26:53 2008 (567) RetryRunner qrunner started.
> >Feb 18 18:26:53 2008 (565) OutgoingRunner qrunner started.
> >Feb 18 18:26:53 2008 (564) NewsRunner qrunner started.
> >Feb 18 18:26:53 2008 (566) VirginRunner qrunner started.
> >Feb 18 18:26:53 2008 (562) CommandRunner qrunner started.
> >Feb 18 18:26:53 2008 (561) BounceRunner qrunner started.
> >Feb 18 18:27:06 2008 (718) CommandRunner qrunner started.
> >Feb 18 18:27:06 2008 (716) ArchRunner qrunner started.
> >Feb 18 18:27:06 2008 (717) BounceRunner qrunner started.
> >Feb 18 18:27:06 2008 (719) IncomingRunner qrunner started.
> >Feb 18 18:27:06 2008 (722) VirginRunner qrunner started.
> >Feb 18 18:27:06 2008 (721) OutgoingRunner qrunner started.
> >Feb 18 18:27:06 2008 (720) NewsRunner qrunner started.
> >Feb 18 18:27:07 2008 (723) RetryRunner qrunner started.
> >Feb 18 18:28:08 2008 (559) Master qrunner detected subprocess exit
> >(pid: 565, sig: None, sts: 1, class: OutgoingRunner, slice: 1/1) [restarting]
> >
> The above is from the first OutgoingRunner (pid 565) dying on its own
> and being restarted by its master (pid 559).
>
> >Feb 18 18:28:09 2008 (1242) OutgoingRunner qrunner started.
>
> It restarts as pid 1242.
>
> This next bunch is from mailmanctl stop which signals the second master
> (pid 711) and all its children.
> >
> >Feb 18 18:29:59 2008 (716) ArchRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (716) ArchRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (717) BounceRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (717) BounceRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (719) IncomingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (719) IncomingRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (718) CommandRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (718) CommandRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (720) NewsRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (720) NewsRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (721) OutgoingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (721) OutgoingRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (722) VirginRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (722) VirginRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (723) RetryRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (723) RetryRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (711) Master watcher caught SIGTERM. Exiting.
> >Feb 18 18:29:59 2008 (711) Master qrunner detected subprocess exit
> >(pid: 716, sig: None, sts: 15, class: ArchRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 718, sig: None, sts: 15, class: CommandRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 717, sig: None, sts: 15, class: BounceRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 719, sig: None, sts: 15, class: IncomingRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 720, sig: None, sts: 15, class: NewsRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 721, sig: None, sts: 15, class: OutgoingRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 722, sig: None, sts: 15, class: VirginRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 723, sig: None, sts: 15, class: RetryRunner, slice: 1/1)
> >
> Then you SIGTERM'd the first master.
> >
> >Feb 18 18:31:27 2008 (560) ArchRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (560) ArchRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (561) BounceRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (561) BounceRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (562) CommandRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (562) CommandRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (563) IncomingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (563) IncomingRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (564) NewsRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (564) NewsRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (566) VirginRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (566) VirginRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (567) RetryRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (567) RetryRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (1242) OutgoingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (1242) OutgoingRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (559) Master watcher caught SIGTERM. Exiting.
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 562, sig: None, sts: 15, class: CommandRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 566, sig: None, sts: 15, class: VirginRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 563, sig: None, sts: 15, class: IncomingRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 561, sig: None, sts: 15, class: BounceRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 560, sig: None, sts: 15, class: ArchRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 564, sig: None, sts: 15, class: NewsRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 567, sig: None, sts: 15, class: RetryRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 1242, sig: None, sts: 15, class: OutgoingRunner, slice: 1/1)
> >
> And finally restarted.
> >
> >Feb 18 18:31:43 2008 (1710) IncomingRunner qrunner started.
> >Feb 18 18:31:43 2008 (1707) ArchRunner qrunner started.
> >Feb 18 18:31:43 2008 (1712) OutgoingRunner qrunner started.
> >Feb 18 18:31:43 2008 (1713) VirginRunner qrunner started.
> >Feb 18 18:31:43 2008 (1709) CommandRunner qrunner started.
> >Feb 18 18:31:43 2008 (1711) NewsRunner qrunner started.
> >Feb 18 18:31:43 2008 (1714) RetryRunner qrunner started.
> >Feb 18 18:31:43 2008 (1708) BounceRunner qrunner started.
> >
>
> What I am curious about is what's in Mailman's error log relating to
> the first OutgoingRunner (pid 565) dying on its own. I would like to
> see the error and traceback from this.
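
As a side note on reading the qrunner log quoted above: the overlap between the two sets of runners can be spotted mechanically. The following is only a rough sketch, not anything shipped with Mailman. The function name `find_overlapping_runners` and both regular expressions are my own assumptions, derived purely from the line layout in the excerpt, and it assumes each "subprocess exit" record sits on one unwrapped line, as it would in the log file rather than in this mail.

```python
import re
from collections import defaultdict

# Assumed layout, copied from the excerpt quoted above:
#   Feb 18 18:26:53 2008 (560) ArchRunner qrunner started.
EVENT_RE = re.compile(
    r"(?P<stamp>\w{3} +\d+ [\d:]+ \d{4}) \((?P<pid>\d+)\) "
    r"(?P<cls>\w+Runner) qrunner (?P<event>started|exiting)\."
)
# Assumed layout of the master's record (unwrapped, on a single line):
#   ... Master qrunner detected subprocess exit (pid: 565, ..., class: OutgoingRunner, ...)
EXIT_RE = re.compile(
    r"subprocess exit \(pid: (?P<pid>\d+),.*?class: (?P<cls>\w+),"
)


def find_overlapping_runners(lines):
    """Return (timestamp, runner class, live pids) each time a runner
    starts while another runner of the same class is still alive."""
    live = defaultdict(set)   # runner class -> pids believed to be running
    overlaps = []
    for line in lines:
        event = EVENT_RE.search(line)
        if event:
            pid = int(event.group("pid"))
            cls = event.group("cls")
            if event.group("event") == "started":
                live[cls].add(pid)
                if len(live[cls]) > 1:
                    overlaps.append((event.group("stamp"), cls, sorted(live[cls])))
            else:
                live[cls].discard(pid)
            continue
        # A runner that dies on its own never logs "exiting"; only the
        # master's exit record tells us that the pid is gone.
        exited = EXIT_RE.search(line)
        if exited:
            live[exited.group("cls")].discard(int(exited.group("pid")))
    return overlaps
```

Fed the lines of the qrunner log, any entry it returns would point at two masters running side by side, which is what the 18:27:06 batch of "started" lines suggests happened here.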

I did edit the mailman startup script, as it contained the -s flag (I have
removed it now). I am not sure whether it was there by default or whether I
inserted it at some point.

Now for the error log:

Feb 18 19:23:45 2008 mailmanctl(188): No child with pid: 14396
Feb 18 19:23:45 2008 mailmanctl(188): [Errno 3] No such process
Feb 18 19:23:45 2008 mailmanctl(188): Stale pid file removed.
Feb 18 19:25:50 2008 mailmanctl(184): PID unreadable in: /usr/local/mailman/data/master-qrunner.pid
Feb 18 19:25:50 2008 mailmanctl(184): [Errno 2] No such file or directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 19:25:50 2008 mailmanctl(184): Is qrunner even running?
Feb 18 18:28:08 2008 qrunner(565): Traceback (most recent call last):
Feb 18 18:28:08 2008 qrunner(565):   File "/usr/local/mailman/bin/qrunner", line 278, in ?
Feb 18 18:28:08 2008 qrunner(565):     main()
Feb 18 18:28:08 2008 qrunner(565):   File "/usr/local/mailman/bin/qrunner", line 238, in main
Feb 18 18:28:08 2008 qrunner(565):     qrunner.run()
Feb 18 18:28:08 2008 qrunner(565):   File "/usr/local/mailman/Mailman/Queue/Runner.py", line 71, in run
Feb 18 18:28:08 2008 qrunner(565):     filecnt = self._oneloop()
Feb 18 18:28:08 2008 qrunner(565):   File "/usr/local/mailman/Mailman/Queue/Runner.py", line 100, in _oneloop
Feb 18 18:28:08 2008 qrunner(565):     msg, msgdata = self._switchboard.dequeue(filebase)
Feb 18 18:28:08 2008 qrunner(565):   File "/usr/local/mailman/Mailman/Queue/Switchboard.py", line 150, in dequeue
Feb 18 18:28:08 2008 qrunner(565):     fp = open(filename)
Feb 18 18:28:08 2008 qrunner(565): IOError : [Errno 2] No such file or directory: '/usr/local/mailman/qfiles/out/1203355686.956471+f4998fab7c49e7696e310da67e6e976129e32e0b.pck'
Feb 18 18:30:03 2008 mailmanctl(1633): No child with pid: 711
Feb 18 18:30:03 2008 mailmanctl(1633): [Errno 3] No such process
Feb 18 18:30:03 2008 mailmanctl(1633): Stale pid file removed.
Feb 18 18:30:11 2008 mailmanctl(1636): PID unreadable in: /usr/local/mailman/data/master-qrunner.pid
Feb 18 18:30:11 2008 mailmanctl(1636): [Errno 2] No such file or directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 18:30:11 2008 mailmanctl(1636): Is qrunner even running?
Feb 18 18:30:22 2008 mailmanctl(1648): PID unreadable in: /usr/local/mailman/data/master-qrunner.pid
Feb 18 18:30:22 2008 mailmanctl(1648): [Errno 2] No such file or directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 18:30:22 2008 mailmanctl(1648): Is qrunner even running?
Feb 18 18:31:02 2008 mailmanctl(1683): PID unreadable in: /usr/local/mailman/data/master-qrunner.pid
Feb 18 18:31:02 2008 mailmanctl(1683): [Errno 2] No such file or directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 18:31:02 2008 mailmanctl(1683): Is qrunner even running?

Thank you again!

--
Zbigniew Szalbot