It's happening about 3-4 times per month for me on 32-bit Gentoo, hosting TF2 servers only. I'm not using -debug in the startup line. I am using the default preempt kernel (not realtime), but I was using the realtime priority scheduler on the srcds processes, and I think that may be the problem. I have now removed the resched.sh script from crontab, because I thought the crash might happen at specific moments when CPU load is very high and the rescheduling script changes the realtime priority of all srcds processes (chrt -f -p 98 processid), but I'm not sure. I now change the realtime priority manually only. The server hasn't crashed for about 2 weeks, but I think it might crash at any time. This has been frustrating for me as well, because I lose all remote access to the server: all processes start to hang one by one and CPU usage is maxed out (all taken by one srcds process) until I lose the connection to the server entirely and have to call my hosting company so they can physically restart it. At first I thought it was caused by a hardware failure, but now that I see others have the same problem, I'm not sure. I hope someone can give some more clues.
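For anyone wanting to check whether their srcds processes are actually running under a realtime policy, here is a minimal Python sketch (Linux only, Python 3.3+). The function names `describe_scheduling` and `set_fifo` are made up for illustration; `set_fifo` is meant to have the same effect as the `chrt -f -p 98 processid` call mentioned above and normally needs root. Whether realtime priority is really the cause of the hangs is not confirmed, but a SCHED_FIFO process that spins can starve normal-priority processes on the same core, so it seems worth ruling out.

```python
import os

# Human-readable names for the Linux scheduling policies exposed by the os module.
POLICY_NAMES = {
    os.SCHED_OTHER: "SCHED_OTHER",  # default time-sharing policy
    os.SCHED_FIFO: "SCHED_FIFO",    # realtime, what `chrt -f` selects
    os.SCHED_RR: "SCHED_RR",        # realtime round-robin
    os.SCHED_BATCH: "SCHED_BATCH",
    os.SCHED_IDLE: "SCHED_IDLE",
}


def describe_scheduling(pid):
    """Return (policy_name, priority) for a process; pid 0 means this process."""
    policy = os.sched_getscheduler(pid)
    priority = os.sched_getparam(pid).sched_priority
    return POLICY_NAMES.get(policy, "policy %d" % policy), priority


def set_fifo(pid, priority):
    """Put a process under SCHED_FIFO at the given priority (usually needs root)."""
    low = os.sched_get_priority_min(os.SCHED_FIFO)
    high = os.sched_get_priority_max(os.SCHED_FIFO)
    if not low <= priority <= high:
        raise ValueError("priority must be between %d and %d" % (low, high))
    os.sched_setscheduler(pid, os.SCHED_FIFO, os.sched_param(priority))


# Example use (hypothetical pid):
#   print(describe_scheduling(12345))
#   set_fifo(12345, 98)
```

Running `describe_scheduling` over the srcds pids before and after a hang would at least show whether the stuck process was at realtime priority at the time.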
I'll give this a shot when I see it happening again.

Btw, my OS is Linux Debian Lenny 64-bit with kernel 2.6.30, default compiled.

Cheers,

-----Original Message-----
From: hlds_linux-boun...@list.valvesoftware.com [mailto:hlds_linux-boun...@list.valvesoftware.com] On Behalf Of Milton Ngan
Sent: Monday, November 30, 2009 12:39 AM
To: Half-Life dedicated Linux server mailing list
Subject: Re: [hlds_linux] Crashing L4D2 fork killing entire machine

If you guys are running the servers with -debug (which doesn't currently do anything useful in -fork mode) or with crash dumps enabled, you should send a SIGABRT (-6) to the process to get it to dump a core file. It will also kill the fork as well. SIGKILL should be used as a last resort if SIGINT (standard kill) doesn't do anything.

With regards to -debug and -fork, I have a Perl script that I run alongside srcds_run to achieve the desired behavior. I need to rewrite this to be more portable so that it can be shipped.

-----Original Message-----
From: hlds_linux-boun...@list.valvesoftware.com [mailto:hlds_linux-boun...@list.valvesoftware.com] On Behalf Of riemers
Sent: Saturday, November 28, 2009 11:49 AM
To: Half-Life dedicated Linux server mailing list
Subject: Re: [hlds_linux] Crashing L4D2 fork killing entire machine

I've seen a process at 100% too. I've also seen a server that was full for about 2 hours on the same map. At that point I checked it out, and it seemed the 4 survivor bots were playing against the infected bots, over and over again. Somehow still pretty funny..

On Sat, 28 Nov 2009 20:13:44 +0100, Saint K. <sai...@specialattack.net> wrote:
> I have had this issue twice up till now, and it's rather disturbing.
>
> What happened is that an L4D2 fork gets into some kind of crashed state
> where it continuously puts 100% CPU load on a single core. I've seen this
> sort of crash perhaps around 10 times so far. 8 times I spotted it
> soon enough to kill -9 <pid> the process, and 2 times I was too late
> and it had been in that state for several hours before it entirely killed
> off the machine. The machine will then only reply to ICMP and nothing else
> (ssh, http, ftp, gameservers etc).
>
> This worries me a bit, because I do not have any reason to assume
> something is broken at the OS or hardware level.
>
> Regards,
>
> Saint K.
>
> _______________________________________________
> To unsubscribe, edit your list preferences, or view the list archives,
> please visit:
> http://list.valvesoftware.com/mailman/listinfo/hlds_linux
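The signal advice in this thread (SIGABRT to get a core file, SIGKILL only as a last resort) can be sketched as a small escalation helper in Python. This is only an illustration under assumptions: the names `pid_alive` and `stop_hung_process`, the default order (SIGABRT, then SIGKILL) and the 5 second grace period are my own choices, not anything shipped with srcds. A core file is only written if core dumps are enabled for the process. Also note that a plain `kill <pid>` sends SIGTERM by default, so add SIGINT or SIGTERM to the front of `steps` if you want to try a polite shutdown first.

```python
import os
import signal
import time


def pid_alive(pid):
    """Best-effort check that a pid still exists (signal 0 sends nothing)."""
    try:
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def stop_hung_process(pid, steps=(signal.SIGABRT, signal.SIGKILL),
                      grace=5.0, alive=pid_alive):
    """Send each signal in turn, waiting up to `grace` seconds after each one.

    Returns the last signal sent before the process went away, or None if it
    was already gone. Raises TimeoutError if it survives every signal.
    """
    last_sent = None
    for sig in steps:
        if not alive(pid):
            return last_sent
        try:
            os.kill(pid, sig)
        except ProcessLookupError:
            return last_sent  # exited between the check and the kill
        last_sent = sig
        deadline = time.monotonic() + grace
        while time.monotonic() < deadline:
            if not alive(pid):
                return last_sent
            time.sleep(0.1)
    raise TimeoutError("pid %d survived all signals" % pid)


# Example use (hypothetical pid of the stuck fork):
#   stop_hung_process(12345)
```

The `alive` parameter is there because a child of the calling process stays visible as a zombie until it is reaped; a supervisor that started the server itself can pass its own check instead of `pid_alive`.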