On Tue, 26 Feb 2013 05:19:32 -0600 legoktm <legoktm.wikipe...@gmail.com> wrote:
> On Tue, Feb 26, 2013 at 4:38 AM, Marlen Caemmerer <
> marlen.caemme...@wikimedia.de> wrote:
>
> > On Tue, 26 Feb 2013, Johannes Kroll wrote:
> >
> >> While trying to load
> >> http://toolserver.org/~render/stools/tlg,
> >> we got
> >> 500 errors first and then "connection reset". SSH to nightshade took 2
> >> minutes or so to connect. Now web & ssh seems to be working again.
> >>
> > At which time did you try about?
>
> On IRC myself and jem- reported having issues at around 10:10am UTC and it
> recovered around 10:14am UTC.
> As of 11:08am UTC I cannot ssh in, and phe was getting 404s.
> tsbot and tsnag also left the channel at 11:05am UTC after timing out.

Curious: now 'ps aux' on nightshade hangs after displaying some 30 or so
processes. Some lines from strace when it was hanging:

connect(6, {sa_family=AF_INET, sin_port=htons(53), sin_addr=inet_addr("10.24.1.18")}, 16) = 0
poll([{fd=6, events=POLLOUT}], 1, 0) = 1 ([{fd=6, revents=POLLOUT}])
sendto(6, "\3353\1\0\0\1\0\0\0\0\0\0\4ldap\3esi\ntoolserver"..., 41, MSG_NOSIGNAL, NULL, 0) = 41
poll([{fd=6, events=POLLIN|POLLOUT}], 1, 5000) = 1 ([{fd=6, revents=POLLOUT}])
sendto(6, "\23I\1\0\0\1\0\0\0\0\0\0\4ldap\3esi\ntoolserver"..., 41, MSG_NOSIGNAL, NULL, 0) = 41
poll([{fd=6, events=POLLIN}], 1, 4999) = 0 (Timeout)

Port 53 is DNS? So it looks like some DNS query timed out?

Now it seems to be working again. I didn't log the whole strace run, but I
saved the lines that I still had in the terminal buffer... I can send it
if anybody needs it.

_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list:
https://wiki.toolserver.org/view/Mailing_list_etiquette
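
For anyone who wants to check the "port 53 is DNS" guess: the buffer passed to sendto() in the strace lines above can be read as a DNS message. Below is a rough Python sketch that decodes only the part strace actually printed. The function names (parse_header, read_labels) are made up for this illustration, and the buffer is truncated by strace ("..."), so the rest of the queried name is not recovered here. It also assumes plain length-prefixed labels with no name compression, which is what a query normally contains.

```python
import struct

# Visible prefix of the first sendto() buffer, copied from the strace output.
# strace cut the buffer off ("..."), so anything after "toolserver" is unknown.
PAYLOAD_PREFIX = b"\3353\1\0\0\1\0\0\0\0\0\0\4ldap\3esi\ntoolserver"


def parse_header(buf):
    """Decode the fixed 12-byte DNS header (six big-endian 16-bit fields)."""
    ident, flags, qdcount, _ancount, _nscount, _arcount = struct.unpack(
        "!6H", buf[:12]
    )
    return {
        "id": ident,
        "is_query": (flags & 0x8000) == 0,          # QR bit clear means query
        "recursion_desired": bool(flags & 0x0100),  # RD bit
        "questions": qdcount,
    }


def read_labels(buf, offset=12):
    """Read length-prefixed name labels until the data or the name ends.

    Assumption: no compression pointers (length bytes >= 0xC0) are present.
    """
    labels = []
    while offset < len(buf):
        length = buf[offset]
        if length == 0:
            break  # root label: end of the name
        label = buf[offset + 1 : offset + 1 + length]
        if len(label) < length:
            break  # label cut off by the truncation, stop here
        labels.append(label.decode("ascii"))
        offset += 1 + length
    return labels


if __name__ == "__main__":
    print(parse_header(PAYLOAD_PREFIX))
    print(".".join(read_labels(PAYLOAD_PREFIX)) + " (name truncated by strace)")
```

If this reading is right, the header says "standard query, one question, recursion desired" and the name starts with the labels ldap, esi, toolserver. That would fit the theory that 'ps aux' was waiting on a lookup of the LDAP host that never got an answer from 10.24.1.18, but it is only an interpretation of the few lines I saved.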