2.2.16 deadlocking in schedule()
I am running 2.2.16 unpatched on a PII-450 with 512 megs of memory. The
disk controller is a Mylex DAC960. Recently the machine seems to be
locking up under moderate load. I wanted to see what was causing it, and
running setiathome seems to be a pretty good trigger for it. Using
control-scroll lock showed that the seti process, along with any new
processes I tried to start ended up as deadlocked.

Running the default RedHat 6.2 kernel would deadlock in the functions
hard_idle() and cpu_idle(). This was under the same load.

Below is my config

--
David Weis | 10520 New York Ave, Des Moines, IA 50322
[EMAIL PROTECTED] | Voice 515-278-0133 Ext 231
| http://www.perfectionlearning.com/

CONFIG_EXPERIMENTAL=y
CONFIG_M686=y
CONFIG_X86_WP_WORKS_OK=y
CONFIG_X86_INVLPG=y
CONFIG_X86_BSWAP=y
CONFIG_X86_POPAD_OK=y
CONFIG_X86_TSC=y
CONFIG_X86_GOOD_APIC=y
CONFIG_1GB=y
CONFIG_SMP=y
CONFIG_MODULES=y
CONFIG_KMOD=y
CONFIG_NET=y
CONFIG_PCI=y
CONFIG_PCI_GOANY=y
CONFIG_PCI_BIOS=y
CONFIG_PCI_DIRECT=y
CONFIG_PCI_QUIRKS=y
CONFIG_PCI_OLD_PROC=y
CONFIG_X86_IO_APIC=y
CONFIG_X86_LOCAL_APIC=y
CONFIG_SYSVIPC=y
CONFIG_SYSCTL=y
CONFIG_BINFMT_AOUT=y
CONFIG_BINFMT_ELF=y
CONFIG_BINFMT_MISC=y
CONFIG_BLK_DEV_FD=y
CONFIG_BLK_DEV_IDE=y
CONFIG_BLK_DEV_IDEDISK=y
CONFIG_BLK_DEV_IDECD=y
CONFIG_BLK_DEV_IDEPCI=y
CONFIG_BLK_DEV_IDEDMA=y
CONFIG_IDEDMA_AUTO=y
CONFIG_BLK_DEV_DAC960=y
CONFIG_PARIDE_PARPORT=y
CONFIG_PACKET=y
CONFIG_FIREWALL=y
CONFIG_UNIX=y
CONFIG_INET=y
CONFIG_IP_FIREWALL=y
CONFIG_IP_ALIAS=y
CONFIG_SKB_LARGE=y
CONFIG_ATALK=y
CONFIG_SCSI=y
CONFIG_BLK_DEV_SD=y
CONFIG_CHR_DEV_ST=y
CONFIG_BLK_DEV_SR=y
CONFIG_SCSI_MULTI_LUN=y
CONFIG_SCSI_CONSTANTS=y
CONFIG_SCSI_SYM53C8XX=y
CONFIG_SCSI_NCR53C8XX_DEFAULT_TAGS=4
CONFIG_SCSI_NCR53C8XX_MAX_TAGS=32
CONFIG_SCSI_NCR53C8XX_SYNC=20
CONFIG_NETDEVICES=y
CONFIG_NET_ETHERNET=y
CONFIG_NET_EISA=y
CONFIG_EEXPRESS_PRO100=y
CONFIG_ACENIC=y
CONFIG_VT=y
CONFIG_VT_CONSOLE=y
CONFIG_SERIAL=y
CONFIG_UNIX98_PTYS=y
CONFIG_UNIX98_PTY_COUNT=256
CONFIG_MOUSE=y
CONFIG_PSMOUSE=y
CONFIG_82C710_MOUSE=y
CONFIG_AUTOFS_FS=y
CONFIG_ISO9660_FS=y
CONFIG_JOLIET=y
CONFIG_PROC_FS=y
CONFIG_DEVPTS_FS=y
CONFIG_EXT2_FS=y
CONFIG_NFS_FS=y
CONFIG_NFSD=y
CONFIG_SUNRPC=y
CONFIG_LOCKD=y
CONFIG_NLS=y
CONFIG_NLS_DEFAULT="cp437"
CONFIG_VGA_CONSOLE=y

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/
Re: 2.2.16 deadlocking in schedule()
On Wed, 30 Aug 2000, Alan Cox wrote:

> > I am running 2.2.16 unpatched on a PII-450 with 512 megs of memory. The
> > disk controller is a Mylex DAC960. Recently the machine seems to be
> > locking up under moderate load. I wanted to see what was causing it, and
> > running setiathome seems to be a pretty good trigger for it. Using
> > control-scroll lock showed that the seti process, along with any new
> > processes I tried to start ended up as deadlocked.
>
> Please try 2.2.17pre20. This has various vm and other fixes as well as an
> updated DAC960 driver

That fixed it partly. I'm still getting lots of uninterruptible processes,
but it lasted a few hours this time before getting bad. The freeze is
still occurring in schedule().

Any other advice?

dave
Re: 2.2.16 deadlocking in schedule()
On Wed, 30 Aug 2000, Alan Cox wrote:

> > > Please try 2.2.17pre20. This has various vm and other fixes as well as an
> > > updated DAC960 driver
> >
> > That fixed it partly. I'm still getting lots of uninterruptible processes,
> > but it lasted a few hours this time before getting bad. The freeze is
> > still occurring in schedule().
> >
> > Any other advice?
>
> Ok that sounds like it's not directly related. What compiler are you using and
> what drivers are you regularly using. Also where does ps think the processes
> in D state are waiting if you ask it to show the wait queue addr

It's a fresh RedHat 6.2 install with egcs 1.1.2. I have a Bay GA620 gig
ethernet card, the onboard eepro 100, and an unused onboard dual Symbios
SCSI controller. The machine wasn't in X when this occurred, just a few
ssh sessions, some netatalk Apple file sharing, and Oracle 8.1.5.

I checked a ps alx when it was last hanging and it was wait_on something.
I forgot about the n option, so I'll try to crash it again.

dave
Re: 2.2.16 deadlocking in schedule()
On Wed, 30 Aug 2000, Alan Cox wrote:

> > > Please try 2.2.17pre20. This has various vm and other fixes as well as an
> > > updated DAC960 driver
> >
> > That fixed it partly. I'm still getting lots of uninterruptible processes,
> > but it lasted a few hours this time before getting bad. The freeze is
> > still occurring in schedule().
> >
> > Any other advice?
>
> Ok that sounds like it's not directly related. What compiler are you using and
> what drivers are you regularly using. Also where does ps think the processes
> in D state are waiting if you ask it to show the wait queue addr

I think I have found something. I've currently got 4 processes deadlocked
on DAC960_WaitForCommand.

This machine is a VA Linux VAR Server 3000.

dave
Re: 2.2.16 deadlocking in schedule()
On Wed, 30 Aug 2000, Alan Cox wrote:

> > I think I have found something. I've currently got 4 processes deadlocked
> > on DAC960_WaitForCommand.
> >
> > This machine is a VA Linux VAR Server 3000.
>
> If they stay deadlocked there then let Leonard Zubkoff know. He's both the
> DAC960 guru and happens to work for VA so will know the box too 8)

With Leonard's help, I have determined that it is a defective card. After
some amount of time, it ceases to generate interrupts. Now to contact
support...

Thanks
dave
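
A card that stops raising interrupts can sometimes be spotted by comparing two snapshots of /proc/interrupts taken a few seconds apart while I/O is outstanding. The following is only a minimal sketch of that idea, not the procedure Leonard used. The function names are hypothetical, and an unchanged counter is only a hint, since an idle device also raises no interrupts.

```python
def parse_interrupts(text):
    """Return {irq_label: total count summed over all CPUs}.

    `text` is the content of /proc/interrupts; the first non-empty line
    is assumed to be the CPU header (CPU0 CPU1 ...).
    """
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines:
        return {}
    ncpu = len(lines[0].split())
    counts = {}
    for line in lines[1:]:
        if ":" not in line:
            continue
        label, rest = line.split(":", 1)
        total = 0
        # Per-CPU counters come first; stop at the first non-numeric field.
        for field in rest.split()[:ncpu]:
            if not field.isdigit():
                break
            total += int(field)
        counts[label.strip()] = total
    return counts


def stalled_irqs(before, after):
    """Labels that were active earlier (count > 0) but did not advance."""
    old = parse_interrupts(before)
    new = parse_interrupts(after)
    return sorted(irq for irq, n in new.items() if n > 0 and old.get(irq) == n)
```

In practice the two snapshots would come from reading /proc/interrupts twice with a pause in between; any IRQ reported for the disk controller while processes sit in D state is worth a closer look.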
Re: processes hung
On Thu, 31 Aug 2000 [EMAIL PROTECTED] wrote:

> I am testing my scsi driver. I started the test, and down
> the line, the I/O processes (cp/rm) are hung. It looks
> like they are hung on completion of some I/O. How do I
> find out, for which I/O they are waiting ? Is there any
> way to look at the kernel data structures ?

Albert Cahalan sent this one to me when I was having scsi troubles.

  ps -eo fname,tty,pid,stat,pcpu,nwchan,wchan
  ps -eo pid,stat,pcpu,nwchan,wchan=WIDE-WCHAN-COLUMN -o args

dave
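
When many processes are hung, it can help to group the D-state ones by the kernel function they are sleeping in. The sketch below post-processes the output of the second ps command above. It is a hedged illustration: the function names are hypothetical, and it assumes the column order pid, stat, pcpu, nwchan, wchan, args with a single header line.

```python
import subprocess
from collections import defaultdict


def blocked_by_wchan(ps_text):
    """Group uninterruptible (STAT starts with 'D') processes by WCHAN.

    Returns {wchan_name: [(pid, args), ...]}.
    """
    groups = defaultdict(list)
    lines = ps_text.strip().splitlines()
    for line in lines[1:]:  # skip the header line
        fields = line.split(None, 5)
        if len(fields) < 5:
            continue
        pid, stat, wchan = fields[0], fields[1], fields[4]
        args = fields[5] if len(fields) > 5 else ""
        if stat.startswith("D"):
            groups[wchan].append((int(pid), args))
    return dict(groups)


def ps_snapshot():
    """Run the ps command from the message above and return its output."""
    cmd = ["ps", "-eo", "pid,stat,pcpu,nwchan,wchan=WIDE-WCHAN-COLUMN",
           "-o", "args"]
    return subprocess.run(cmd, capture_output=True, text=True,
                          check=True).stdout
```

Calling blocked_by_wchan(ps_snapshot()) on an affected machine gives one entry per wait function, which makes a cluster of processes stuck in a single driver routine easy to see.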
Re: GPL Question
On Fri, 27 Oct 2000, Jason Wohlgemuth wrote:

> Now, if a module is loaded that registers a set of functions that have
> increased functionality compared to the original functions, if that
> module is not based off GPL'd code, must the source code of that module
> be released under the GPL?

It would probably follow the GPL, but it's pretty slimy. I won't buy it.

david

--
David Weis | "Great spirits will always encounter violent
[EMAIL PROTECTED] | opposition from mediocre minds" - Einstein
http://www.sjdjweis.com/ |
changing mac address of eth alias
What would be required to make the MAC address of aliases changeable,
specifically for something like VRRP that shares a MAC address among
machines?

dave

--
Dave Weis          "I believe there are more instances of the abridgement
[EMAIL PROTECTED]  of the freedom of the people by gradual and silent
                   encroachments of those in power than by violent and
                   sudden usurpations." - James Madison
Re: changing mac address of eth alias
On Tue, 23 Jan 2001, Ben Greear wrote:

> David Weis wrote:
> > what would be required to make the mac address of aliases changeable,
> > specifically for something like vrrp that shares a mac address among
> > machines.
>
> Not sure you can do that, but you could use an 802.1Q vlan patch
> and set up two different VLANs. You can now change the MAC
> address on a VLAN with my patch: http://scry.wanfear.com/~greear/vlan.html

I'm looking at your code. In the function vlan_dev_set_multicast_list()
for the 2.4 tree, you enable promiscuity and reception of all multicast
packets. Is this necessary for all cards?

This looks pretty close to what I was looking for, thanks for the pointer.
Do the multicast functions have enough usefulness for things other than
VLAN to be split out separately?

dave
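
For context on the VRRP case raised in this thread: the VRRP specification assigns each virtual router the MAC address 00-00-5E-00-01-{VRID}, which is the address the participating machines would need to answer on. The helper below is only an illustrative sketch with a hypothetical name. It computes that address and does not set it on any interface.

```python
def vrrp_virtual_mac(vrid):
    """Return the VRRP virtual router MAC for a VRID in the range 1..255."""
    if not 1 <= vrid <= 255:
        raise ValueError("VRID must be between 1 and 255")
    # Fixed prefix 00:00:5e:00:01, last octet is the virtual router ID.
    return "00:00:5e:00:01:%02x" % vrid
```

For example, virtual router 10 maps to 00:00:5e:00:01:0a, so every machine backing that router has to be able to receive frames sent to that one address.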