Re: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Linus Torvalds wrote: > On Wed, 13 Dec 2000, Linus Torvalds wrote: > > > > Hint: "ptep_mkdirty()". rather obvious oopsie.. once spotted. > In case you wonder why the bug was so insidious, what this caused was two > separate problems, both of them able to cause SIGSGV's.

Re: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Linus Torvalds wrote: > On Wed, 13 Dec 2000, Mike Galbraith wrote: > > > > Not in my test tree. Same fault, and same trace leading up to it. no > > Ok. > > It definitely looks like a swapoff() problem. > > Have you ever seen the behaviour without running swapoff? No.

RE: Signal 11 - the continuing saga

2000-12-13 Thread Rainer Mager
[EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED]]On Behalf Of Linus Torvalds > Sent: Thursday, December 14, 2000 5:19 AM > To: Mike Galbraith > Cc: Kernel Mailing List > Subject: Re: Signal 11 - the continuing saga > > > On Wed, 13 Dec 2000, Linus Torvalds wrote: > > >

Re: Signal 11 - the continuing saga

2000-12-13 Thread Gérard Roudier
On Wed, 13 Dec 2000, Linus Torvalds wrote: > > > Ehh, I think I found it. > > Hint: "ptep_mkdirty()". > > Oops. > > I'll bet you $5 USD (and these days, that's about a gadzillion Euros) that Poor European Gérard as slim as 1.84 meter - 78 Kg these days. What about old days poor European

Re: Signal 11 - the continuing saga

2000-12-13 Thread Jeff V. Merkey
On Wed, Dec 13, 2000 at 11:35:57AM -0800, Linus Torvalds wrote: > > > Ehh, I think I found it. > > Hint: "ptep_mkdirty()". > > Oops. > > I'll bet you $5 USD (and these days, that's about a gadzillion Euros) that > this explains it. > > Linus Good. Sounds like you guys have a

Re: Signal 11 - the continuing saga

2000-12-13 Thread Linus Torvalds
On Wed, 13 Dec 2000, Linus Torvalds wrote: > > Hint: "ptep_mkdirty()". In case you wonder why the bug was so insidious, what this caused was two separate problems, both of them able to cause SIGSGV's. One: we didn't mark the page table entry dirty like we were supposed to. Two: by making

Re: Signal 11 - the continuing saga

2000-12-13 Thread Linus Torvalds
On Wed, 13 Dec 2000, Mike Galbraith wrote: > > Not in my test tree. Same fault, and same trace leading up to it. no Ok. It definitely looks like a swapoff() problem. Have you ever seen the behaviour without running swapoff? Also, can you re-create it without running swapon() (if it's

Re: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Linus Torvalds wrote: > On Wed, 13 Dec 2000, Linus Torvalds wrote: > > > > Lookin gat "swapoff()", it could easily be something like > > > > - swapoff walks theough the processes, marking the pages dirty > >(correctly) > > - swapoff goes on to the next swap entry,

Re: Signal 11 - the continuing saga

2000-12-13 Thread Linus Torvalds
On Wed, 13 Dec 2000, Linus Torvalds wrote: > > Lookin gat "swapoff()", it could easily be something like > > - swapoff walks theough the processes, marking the pages dirty >(correctly) > - swapoff goes on to the next swap entry, and because it needs memory for >this, the VM layer

Re: Signal 11 - the continuing saga

2000-12-13 Thread Jeff V. Merkey
On Tue, Dec 12, 2000 at 07:17:41PM -0800, Linus Torvalds wrote: > In article <[EMAIL PROTECTED]>, > Jeff V. Merkey <[EMAIL PROTECTED]> wrote: > >On Wed, Dec 13, 2000 at 09:22:55AM +0900, Rainer Mager wrote: > >>I have a tiny bash script that launches a Java swing app. If I run my > >> script

RE: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Rainer Mager wrote: > Mike et al, > > I have no idea what IKD is and I don't know what to do with any results I > might find BUT I'd be happy to do this if it will help. Please pass on the > info with the instructions. Who should I report the results to? IKD is a

RE: Signal 11 - the continuing saga

2000-12-13 Thread Rainer Mager
Mike et al, I have no idea what IKD is and I don't know what to do with any results I might find BUT I'd be happy to do this if it will help. Please pass on the info with the instructions. Who should I report the results to? --Rainer > [mailto:[EMAIL PROTECTED]]On Behalf Of Mike

RE: Signal 11 - the continuing saga

2000-12-13 Thread Rainer Mager
Give that man a cigarit was an env var (not LOCALE but LANG). I'd actually checked this but I didn't think that made a difference in my case. Thanks Linus, now can you fix the larger signal 11 problem? --Rainer > [mailto:[EMAIL PROTECTED]]On Behalf Of Linus Torvalds > I'd guess that the

RE: Signal 11 - the continuing saga

2000-12-13 Thread Rainer Mager
Mike et al, I have no idea what IKD is and I don't know what to do with any results I might find BUT I'd be happy to do this if it will help. Please pass on the info with the instructions. Who should I report the results to? --Rainer [mailto:[EMAIL PROTECTED]]On Behalf Of Mike

RE: Signal 11 - the continuing saga

2000-12-13 Thread Rainer Mager
Give that man a cigarit was an env var (not LOCALE but LANG). I'd actually checked this but I didn't think that made a difference in my case. Thanks Linus, now can you fix the larger signal 11 problem? --Rainer [mailto:[EMAIL PROTECTED]]On Behalf Of Linus Torvalds I'd guess that the

RE: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Rainer Mager wrote: Mike et al, I have no idea what IKD is and I don't know what to do with any results I might find BUT I'd be happy to do this if it will help. Please pass on the info with the instructions. Who should I report the results to? IKD is a

Re: Signal 11 - the continuing saga

2000-12-13 Thread Jeff V. Merkey
On Tue, Dec 12, 2000 at 07:17:41PM -0800, Linus Torvalds wrote: In article [EMAIL PROTECTED], Jeff V. Merkey [EMAIL PROTECTED] wrote: On Wed, Dec 13, 2000 at 09:22:55AM +0900, Rainer Mager wrote: I have a tiny bash script that launches a Java swing app. If I run my script from an xterm

Re: Signal 11 - the continuing saga

2000-12-13 Thread Linus Torvalds
On Wed, 13 Dec 2000, Linus Torvalds wrote: Lookin gat "swapoff()", it could easily be something like - swapoff walks theough the processes, marking the pages dirty (correctly) - swapoff goes on to the next swap entry, and because it needs memory for this, the VM layer will swap

Re: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Linus Torvalds wrote: On Wed, 13 Dec 2000, Linus Torvalds wrote: Lookin gat "swapoff()", it could easily be something like - swapoff walks theough the processes, marking the pages dirty (correctly) - swapoff goes on to the next swap entry, and because it

Re: Signal 11 - the continuing saga

2000-12-13 Thread Linus Torvalds
On Wed, 13 Dec 2000, Mike Galbraith wrote: Not in my test tree. Same fault, and same trace leading up to it. no Ok. It definitely looks like a swapoff() problem. Have you ever seen the behaviour without running swapoff? Also, can you re-create it without running swapon() (if it's

Re: Signal 11 - the continuing saga

2000-12-13 Thread Linus Torvalds
On Wed, 13 Dec 2000, Linus Torvalds wrote: Hint: "ptep_mkdirty()". In case you wonder why the bug was so insidious, what this caused was two separate problems, both of them able to cause SIGSGV's. One: we didn't mark the page table entry dirty like we were supposed to. Two: by making it

Re: Signal 11 - the continuing saga

2000-12-13 Thread Jeff V. Merkey
On Wed, Dec 13, 2000 at 11:35:57AM -0800, Linus Torvalds wrote: Ehh, I think I found it. Hint: "ptep_mkdirty()". Oops. I'll bet you $5 USD (and these days, that's about a gadzillion Euros) that this explains it. Linus Good. Sounds like you guys have a handle on

Re: Signal 11 - the continuing saga

2000-12-13 Thread Gérard Roudier
On Wed, 13 Dec 2000, Linus Torvalds wrote: Ehh, I think I found it. Hint: "ptep_mkdirty()". Oops. I'll bet you $5 USD (and these days, that's about a gadzillion Euros) that Poor European Gérard as slim as 1.84 meter - 78 Kg these days. What about old days poor European Linus

RE: Signal 11 - the continuing saga

2000-12-13 Thread Rainer Mager
PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Linus Torvalds Sent: Thursday, December 14, 2000 5:19 AM To: Mike Galbraith Cc: Kernel Mailing List Subject: Re: Signal 11 - the continuing saga On Wed, 13 Dec 2000, Linus Torvalds wrote: Hint: "ptep_mkdirty()". In case you

Re: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Linus Torvalds wrote: On Wed, 13 Dec 2000, Mike Galbraith wrote: Not in my test tree. Same fault, and same trace leading up to it. no Ok. It definitely looks like a swapoff() problem. Have you ever seen the behaviour without running swapoff? No. Also, can

Re: Signal 11 - the continuing saga

2000-12-13 Thread Mike Galbraith
On Wed, 13 Dec 2000, Linus Torvalds wrote: On Wed, 13 Dec 2000, Linus Torvalds wrote: Hint: "ptep_mkdirty()". g rather obvious oopsie.. once spotted. In case you wonder why the bug was so insidious, what this caused was two separate problems, both of them able to cause SIGSGV's.

RE: Signal 11 - the continuing saga

2000-12-12 Thread Mike Galbraith
On Wed, 13 Dec 2000, Rainer Mager wrote: > Thanks for the info... > > > [mailto:[EMAIL PROTECTED]]On Behalf Of Jeff V. Merkey > > > So, is this related to the larger signal 11 problems? > > > > There's a corruption bug in the page cache somewhere, and it's 100% > > reproducable. Finding it

Re: Signal 11 - the continuing saga

2000-12-12 Thread Linus Torvalds
In article <[EMAIL PROTECTED]>, Jeff V. Merkey <[EMAIL PROTECTED]> wrote: >On Wed, Dec 13, 2000 at 09:22:55AM +0900, Rainer Mager wrote: >> I have a tiny bash script that launches a Java swing app. If I run my >> script from an xterm (or gnome-terminal or whatever) then it starts up fine. >>

RE: Signal 11 - the continuing saga

2000-12-12 Thread Rainer Mager
Thanks for the info... > [mailto:[EMAIL PROTECTED]]On Behalf Of Jeff V. Merkey > > So, is this related to the larger signal 11 problems? > > There's a corruption bug in the page cache somewhere, and it's 100% > reproducable. Finding it will be tough Ok, granted this will be tough but

Re: Signal 11 - the continuing saga

2000-12-12 Thread Jeff V. Merkey
On Wed, Dec 13, 2000 at 09:22:55AM +0900, Rainer Mager wrote: > Hi again, > > Ok, I just upgraded to 2.4.0test12 (although I don't think there was any > work in 12 that directly addresses this signal 11 problem). When compiling > the new kernel I chose to disable AGPGart and RDM as

RE: Signal 11 - the continuing saga

2000-12-12 Thread Rainer Mager
Hi again, Ok, I just upgraded to 2.4.0test12 (although I don't think there was any work in 12 that directly addresses this signal 11 problem). When compiling the new kernel I chose to disable AGPGart and RDM as suggested by [EMAIL PROTECTED] I will report later if this makes any

RE: Signal 11 - the continuing saga

2000-12-12 Thread Rainer Mager
Hi again, Ok, I just upgraded to 2.4.0test12 (although I don't think there was any work in 12 that directly addresses this signal 11 problem). When compiling the new kernel I chose to disable AGPGart and RDM as suggested by [EMAIL PROTECTED] I will report later if this makes any

Re: Signal 11 - the continuing saga

2000-12-12 Thread Jeff V. Merkey
On Wed, Dec 13, 2000 at 09:22:55AM +0900, Rainer Mager wrote: Hi again, Ok, I just upgraded to 2.4.0test12 (although I don't think there was any work in 12 that directly addresses this signal 11 problem). When compiling the new kernel I chose to disable AGPGart and RDM as suggested by

RE: Signal 11 - the continuing saga

2000-12-12 Thread Rainer Mager
Thanks for the info... [mailto:[EMAIL PROTECTED]]On Behalf Of Jeff V. Merkey So, is this related to the larger signal 11 problems? There's a corruption bug in the page cache somewhere, and it's 100% reproducable. Finding it will be tough Ok, granted this will be tough but is

Re: Signal 11 - the continuing saga

2000-12-12 Thread Linus Torvalds
In article [EMAIL PROTECTED], Jeff V. Merkey [EMAIL PROTECTED] wrote: On Wed, Dec 13, 2000 at 09:22:55AM +0900, Rainer Mager wrote: I have a tiny bash script that launches a Java swing app. If I run my script from an xterm (or gnome-terminal or whatever) then it starts up fine. If,

RE: Signal 11 - the continuing saga

2000-12-12 Thread Mike Galbraith
On Wed, 13 Dec 2000, Rainer Mager wrote: Thanks for the info... [mailto:[EMAIL PROTECTED]]On Behalf Of Jeff V. Merkey So, is this related to the larger signal 11 problems? There's a corruption bug in the page cache somewhere, and it's 100% reproducable. Finding it will be