Hello,

Attached is an email documenting the exact problem I'm experiencing now.
The machine is running 3.4 Stable. It's a fairly busy shell server with all
the standard toys.

In short, ps -a or ps -u hangs with no error and cannot be killed. The
process sits in the pfslck wait state if one has a look at the processes,
and ps -x reports it in a 'D' status:

    D    Marks a process in disk (or other short term, uninterruptible) wait.

There are ps's that are more than two days old now.

Here are some background details on the box. Any ideas on where else to go
scratching would go down well.

####uname -a
FreeBSD unix.za.net 3.4-RELEASE FreeBSD 3.4-RELEASE #1: Wed Feb 23 13:48:07 SAST 2000 [EMAIL PROTECTED]:/usr/kame/freebsd3/sys/compile/UNIX_IPV6 i386

####uptime
9:14AM up 204 days, 21:36, 11 users, load averages: 1.14, 1.19, 1.14

###Dmesg###
Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 3.4-RELEASE #1: Wed Feb 23 13:48:07 SAST 2000
    [EMAIL PROTECTED]:/usr/kame/freebsd3/sys/compile/UNIX_IPV6
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium Pro (199.31-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x619 Stepping = 9
  Features=0xf9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV>
real memory = 50331648 (49152K bytes)
avail memory = 44904448 (43852K bytes)
Preloaded elf kernel "kernel" at 0xc03e7000.
Pentium Pro MTRR support enabled
altq: major number is 96
Probing for devices on PCI bus 0:
chip0: <Intel 82440FX (Natoma) PCI and memory controller> rev 0x02 on pci0.0.0
chip1: <Intel 82371SB PCI to ISA bridge> rev 0x01 on pci0.7.0
ide_pci0: <Intel PIIX3 Bus-master IDE controller> rev 0x00 on pci0.7.1
ahc0: <Adaptec 2940 Ultra SCSI adapter> rev 0x00 int a irq 11 on pci0.16.0
ahc0: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
xl0: <3Com 3c905-TX Fast Etherlink XL> rev 0x00 int a irq 10 on pci0.18.0
xl0: Ethernet address: 00:60:97:82:5a:d7
xl0: autoneg complete, link status good (full-duplex, 100Mbps)
xl0: supplying EUI64: 00:60:97:ff:fe:82:5a:d7
vga0: <S3 Trio graphics accelerator> rev 0x00 int a irq 0 on pci0.20.0
Probing for PnP devices:
Probing for devices on the ISA bus:
sc0 on isa
sc0: VGA color <16 virtual consoles, flags=0x0>
ed0 not found at 0x280
fe0 not found at 0x300
atkbdc0 at 0x60-0x6f on motherboard
atkbd0 irq 1 on isa
psm0 not found
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
wdc0 at 0x1f0-0x1f7 irq 14 on isa
wdc0: unit 0 (wd0): <WDC AC22000L>
wd0: 1907MB (3907008 sectors), 3876 cyls, 16 heads, 63 S/T, 512 B/S
wdc1 not found at 0x170
wt0 not found at 0x300
mcd0 not found at 0x300
matcdc0 not found at 0x230
scd0 not found at 0x230
ppc0 at 0x378 irq 7 flags 0x40 on isa
ppc0: Generic chipset (ECP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
lpt0: <generic printer> on ppbus 0
lpt0: Interrupt-driven port
ppi0: <generic parallel i/o> on ppbus 0
plip0: <PLIP network interface> on ppbus 0
ie0: unknown board_id: f000
ie0 not found at 0x300
ep0 not found at 0x300
ex0 not found
le0 not found at 0x300
lnc0 not found at 0x280
cs0 not found at 0x300
ze0 not found at 0x300
zp0 not found at 0x300
adv0 not found at 0x330
bt0 not found at 0x134
aha0 not found at 0x134
aic0 not found
vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa
npx0 on motherboard
npx0: INT 16 interface
IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to accept, logging limited to 1000 packets/entry by default
IPsec: Initialized Security Association Processing.
DUMMYNET initialized (990504)
IP Filter: initialized. Default = pass all, Logging = enabled
Waiting 2 seconds for SCSI devices to settle
changing root device to wd0s1a
WARNING: / was not properly dismounted
da0 at ahc0 bus 0 target 0 lun 0
da0: <WDIGTL WDE9100 1.30> Fixed Direct Access SCSI-2 device
da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit)
da0: 8683MB (17783204 512 byte sectors: 255H 63S/T 1106C)

###### Top
last pid: 1093; load averages: 1.21, 1.17, 1.14 up 204+21:39:28 09:17:15
176 processes: 1 running, 175 sleeping
CPU states: 4.5% user, 0.0% nice, 7.2% system, 8.0% interrupt, 80.3% idle
Mem: 15M Active, 1616K Inact, 23M Wired, 2924K Cache, 5354K Buf, 564K Free
Swap: 200M Total, 148M Used, 52M Free, 74% Inuse, 40K In, 256K Out

#### df
Filesystem   1K-blocks     Used    Avail Capacity  Mounted on
/dev/wd0s1a      99183    28528    62721    31%    /
/dev/da0s2     2973645  2687112    48642    98%    /home
/dev/da0g      1684111  1266128   283255    82%    /usr
/dev/da0e       496111    87143   369280    19%    /var
/dev/da0s3     2662425  1233749  1215682    50%    /space
procfs               4        4        0   100%    /proc

--
Daniel Schroder (Private email [EMAIL PROTECTED])
Unix users .. South Africa
To : From : date : Address :
Quality is a standard .. not a selling point

Date: Tue, 14 Jan 1997 17:54:26 -0800 (PST)
From: John-Mark Gurney <[EMAIL PROTECTED]>
To: FreeBSD Hackers <[EMAIL PROTECTED]>
Subject: ps stuck in pfslock from clash between programs...
Message-ID: <[EMAIL PROTECTED]>

well.. right now I have a slight problem... any time I try to run ps
against root processes and daemons (i.e. 'ps a' as a user or 'ps x' as
root).. the process will end up in pfslck... and because they are in
pfslck... they get swapped out to disk and I can't do anything more...

I have found out a bit more about my problem... I did a:

    for i in /proc/*/status; do
            cat $i >> /tmp/proc.stat
    done

to get the status of all the processes :)... and here are a few
interesting lines:

sh 14619 6911 6891 0 -1,-1 noflags 853281365,428930 0,0 0,7448 lockrd 0 0 0,0,0,2,3,4,5,20,31
ps 14636 1 14635 0 5,3 noflags 853281755,194442 0,18819 0,56458 lockrd 0 0 0,2,0,2,3,4,5,20,31
ps 14648 1 14648 0 5,1 noflags 853281835,553094 0,20489 0,20489 pfslck 1000 1000 1000,2,1000,0,2,5,7,66,68,85,117,666,888,996,997,1021,10000
ps 14658 1 14658 0 5,1 noflags 853281877,562603 0,8838 0,44190 pfslck 1000 1000 1000,2,1000,0,2,5,7,66,68,85,117,666,888,996,997,1021,10000
ps 14750 1 14749 0 5,2 noflags 853283361,460279 0,7537 0,67833 pfslck 1000 1000 1000,2,1000,0,2,5,7,66,68,85,117,666,888,996,997,1021,10000
ps 14754 1 14754 0 5,2 noflags 853283376,426138 0,30954 0,46431 pfslck 1000 1000 1000,2,1000,0,2,5,7,66,68,85,117,666,888,996,997,1021,10000
ps 14768 1 14768 0 5,0 noflags 853283610,650285 0,7311 0,80430 pfslck 1000 1000 1000,2,1000,0,2,5,7,66,68,85,117,666,888,996,997,1021,10000
ps 14786 6500 14786 0 5,3 noflags 853283849,884193 0,8581 0,34327 pfslck 0 0 0,2,0,2,3,4,5,20,31
ps 14810 1 14810 0 5,5 noflags 853283883,318649 0,37196 0,59514 pfslck 1000 1000 1000,2,1000,0,2,5,7,66,68,85,117,666,888,996,997,1021,10000
ps 15150 1 15150 0 5,3 noflags 853291999,14680 0,7674 0,107448 pfslck 1000 1000 1000,2,1000,0,2,5,7,66,68,85,117,666,888,996,997,1021,10000
ps 15151 15011 15151 0 5,1 noflags 853292005,831667 0,7834 0,47009 pfslck 0 0 0,2,0,2,3,4,5,20,31
ps 15239 15224 15239 0 5,3 noflags 853292336,857854 0,14888 0,74441 pfslck 0 0 0,2,0,2,3,4,5,20,31

now it looks like somehow sh and ps managed to get into a race
condition when they were trying to lock something... and that causes the
rest of the ps's to freeze...

also.. I'm not sure if this has been fixed as I'm running 960801-SNAP...
but if someone would like to figure out what happened and why... I'm
willing to do what someone suggests...

also.. I've tried to kill -KILL all the processes but that doesn't help...

thanks for the advice... and if you need any more info.. just ask...
ttyl..

John-Mark
[EMAIL PROTECTED]
http://resnet.uoregon.edu/~gurney_j/
Modem/FAX: (541) 683-6954 (FreeBSD Box)
Live in Peace, destroy Micro$oft, support free software, run FreeBSD (unix)
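
For anyone repeating the /proc status dump above on a busy box, the stuck processes can be picked out mechanically instead of by eye. This is only a rough sketch: it assumes the wait channel is the 11th space-separated field, as it is in the status lines quoted above, and that command names contain no spaces. The function name stuck_procs is made up for illustration.

```sh
#!/bin/sh
# Read procfs status lines on stdin and print "pid command wchan" for
# every process whose wait channel is pfslck or lockrd.
# Assumption: the wait channel is field 11, as in the lines quoted above.
stuck_procs() {
    awk '$11 == "pfslck" || $11 == "lockrd" { print $2, $1, $11 }'
}

# Possible usage, assuming procfs is mounted on /proc:
#   cat /proc/*/status | stuck_procs
```

Sorting that output by pid gives a rough idea of which process took the lock first, since the oldest entry in lockrd or pfslck is the likeliest holder.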