My server gets kernel panic every 7th day

2004-12-18 Thread Daniel Johansson
Hi, i've had my server up for over a year now and it's been rock solid
but for the latest weeks the server has rebooted evert Saturday at
exact 04:19:57 because of a find command. I have no idea why and I've
checked the cron log and I don't think any crontab is runned at that
time. Not as far as I can see from the cron log. Anyway find makes the
server get a kernel panic and it reboots. This is the fourth week in a
row it happens and I've checked the hardware, no problems at all.

I'm running 4.10-RELEASE-p5 so there is nothing newer to update to.

Output from the message log:
Dec 18 04:19:57 donnex /kernel:
Dec 18 04:19:57 donnex /kernel:
Dec 18 04:19:57 donnex /kernel: Fatal trap 12: page fault while in kernel mode
Dec 18 04:19:57 donnex /kernel: fault virtual address   = 0x4
Dec 18 04:19:57 donnex /kernel: fault code  = supervisor read,
page not present
Dec 18 04:19:57 donnex /kernel: instruction pointer = 0x8:0xc01a35a3
Dec 18 04:19:57 donnex /kernel: stack pointer   = 0x10:0xdde38e90
Dec 18 04:19:57 donnex /kernel: frame pointer   = 0x10:0xdde38ea4
Dec 18 04:19:57 donnex /kernel: code segment= base 0x0, limit
0xf, type 0x1b
Dec 18 04:19:57 donnex /kernel: = DPL 0, pres 1, def32 1, gran 1
Dec 18 04:19:57 donnex /kernel: processor eflags= interrupt
enabled, resume, IOPL = 0
Dec 18 04:19:57 donnex /kernel: current process = 16533 (find)
Dec 18 04:19:57 donnex /kernel: interrupt mask  = none
Dec 18 04:19:57 donnex /kernel: trap number = 12
Dec 18 04:19:57 donnex /kernel: panic: page fault
Dec 18 04:19:57 donnex /kernel:
Dec 18 04:19:57 donnex /kernel: syncing disks... 46 9 2 1 1 1 1 1 1 1
1 1 1 1 1 1 1 1 1 1 1 1 1
Dec 18 04:19:57 donnex /kernel: giving up on 1 buffers
Dec 18 04:19:57 donnex /kernel: Uptime: 6d16h7m19s
Dec 18 04:19:57 donnex /kernel: Automatic reboot in 15 seconds - press
a key on the console to abort
Dec 18 04:19:57 donnex /kernel: Rebooting...

Any idea how to find out what causes this and how to find out what is
running perl or how to fix it?
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Tuc
> 
> Hmm, I haven't checked the powersuply but I think it is unlikley that
> it would fail on a find when the box can take very high load and I've
> done a few rebuilds (kernel + world) on the box without any problems
> at all.
> 
We have a place in CA that assembles our systems, and put it on net
and we test them before they are sent.

One time they build it, we did all sorts of tests, but once we
started to build perl on it it would die.

They swapped out everything, and when they got to the power supply,
perl would build.

The next time they built a system, it stopped in the same exact part
of the perl build.  It turned out they went to re-use the same power supply.

So, I guess all I'm saying is.. It can happen. :)

Tuc/TTSG Internet Services, Inc.
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Daniel Johansson
Yeah, it has been rock solid. But if the psu was about to break
wouldn't the panics be a little more random?

As it is right now it get the exact same panic at the exact same time
every week.

On Sat, 18 Dec 2004 12:10:29 -0800, Kris Kennaway <[EMAIL PROTECTED]> wrote:
> On Sat, Dec 18, 2004 at 09:07:09PM +0100, Daniel Johansson wrote:
> > Hmm, I haven't checked the powersuply but I think it is unlikley that
> > it would fail on a find when the box can take very high load and I've
> > done a few rebuilds (kernel + world) on the box without any problems
> > at all.
> 
> Unlikely or not, I've seen it happen, and there's no other likely
> candidate since you said you haven't updated the machine in a year,
> and it's been stable under the same load until now.
> 
> Kris
> 
> 
>
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Daniel Johansson
Hmm okay, sounds like it could be the PSU after all. Thanks. I think I
will try to switch it for another one.

By the way is there any way to find out what and why find is ran at
that time every night to saturday? I would like to find the
script/cron/periodic that run it and see if I can make the kernel
panic happen again.

Then I don't have to wait until next saturday to know if it was the PSU or not.

I've searched in /etc/periodic, it's none of my crons that runs find,
and found some scripts that runs find but I'm not really familiar with
how the periodic scripts works.

I need to find the script that runs find every night to saturday at 04:19:57.


On Sat, 18 Dec 2004 15:12:41 -0500 (EST), Tuc <[EMAIL PROTECTED]> wrote:
> >
> > Hmm, I haven't checked the powersuply but I think it is unlikley that
> > it would fail on a find when the box can take very high load and I've
> > done a few rebuilds (kernel + world) on the box without any problems
> > at all.
> > 
> We have a place in CA that assembles our systems, and put it on net
> and we test them before they are sent.
> 
> One time they build it, we did all sorts of tests, but once we
> started to build perl on it it would die.
> 
> They swapped out everything, and when they got to the power supply,
> perl would build.
> 
> The next time they built a system, it stopped in the same exact part
> of the perl build.  It turned out they went to re-use the same power supply.
> 
> So, I guess all I'm saying is.. It can happen. :)
> 
> Tuc/TTSG Internet Services, Inc.
>
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Kris Kennaway
On Sat, Dec 18, 2004 at 09:12:58PM +0100, Daniel Johansson wrote:
> Yeah, it has been rock solid. But if the psu was about to break
> wouldn't the panics be a little more random?
> 
> As it is right now it get the exact same panic at the exact same time
> every week.

...during a time when the hard disk is placed under extra load,
presumably causing enough additional drain on the power supply to
cause it to fail.  You could probably trigger it yourself by loading
the machine in a similar way.

Kris


pgpfxGC1SrsMw.pgp
Description: PGP signature


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Daniel Johansson
Yeah, I've ran locate.updatedb ten times in a row, rebuilt kernel +
world twice and done a cat /dev/uranom > tmp for some Gigs and nothing
makes it crash :/


On Sat, 18 Dec 2004 12:46:45 -0800, Kris Kennaway <[EMAIL PROTECTED]> wrote:
> On Sat, Dec 18, 2004 at 09:12:58PM +0100, Daniel Johansson wrote:
> > Yeah, it has been rock solid. But if the psu was about to break
> > wouldn't the panics be a little more random?
> >
> > As it is right now it get the exact same panic at the exact same time
> > every week.
> 
> ...during a time when the hard disk is placed under extra load,
> presumably causing enough additional drain on the power supply to
> cause it to fail.  You could probably trigger it yourself by loading
> the machine in a similar way.
> 
> Kris
> 
> 
>
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-19 Thread Greg 'groggy' Lehey
On Saturday, 18 December 2004 at 11:50:02 -0800, Kris Kennaway wrote:
> On Sat, Dec 18, 2004 at 11:57:35AM +0100, Daniel Johansson wrote:
>> Hi, i've had my server up for over a year now and it's been rock solid
>> but for the latest weeks the server has rebooted evert Saturday at
>> exact 04:19:57 because of a find command. I have no idea why and I've
>> checked the cron log and I don't think any crontab is runned at that
>> time. Not as far as I can see from the cron log. Anyway find makes the
>> server get a kernel panic and it reboots. This is the fourth week in a
>> row it happens and I've checked the hardware, no problems at all.
>
> How did you "check the hardware"?  Hardware failure is by far the
> most common cause of "strange panics under abnormal load [such as
> when the weekly cron job runs]".

If this panic occurs repeatedly under certain circumstances, it's
probably not hardware.  Anyway, there's not much point standing
outside and scratching our heads.  We have a facility for analysing
this kind of problem: the processor dump and kernel debugger.

Greg
--
When replying to this message, please copy the original recipients.
If you don't, I may ignore the reply or reply to the original recipients.
For more information, see http://www.lemis.com/questions.html
See complete headers for address and phone numbers.


pgpMAVELc83CQ.pgp
Description: PGP signature


Re: My server gets kernel panic every 7th day

2004-12-19 Thread Greg 'groggy' Lehey
On Sunday, 19 December 2004 at 23:35:18 +0100, Daniel Johansson wrote:
> On Mon, 20 Dec 2004 08:59:19 +1030, Greg 'groggy' Lehey
> <[EMAIL PROTECTED]> wrote:
>> On Saturday, 18 December 2004 at 11:50:02 -0800, Kris Kennaway wrote:
>>> On Sat, Dec 18, 2004 at 11:57:35AM +0100, Daniel Johansson wrote:
 Hi, i've had my server up for over a year now and it's been rock solid
 but for the latest weeks the server has rebooted evert Saturday at
 exact 04:19:57 because of a find command. I have no idea why and I've
 checked the cron log and I don't think any crontab is runned at that
 time. Not as far as I can see from the cron log. Anyway find makes the
 server get a kernel panic and it reboots. This is the fourth week in a
 row it happens and I've checked the hardware, no problems at all.
>>>
>>> How did you "check the hardware"?  Hardware failure is by far the
>>> most common cause of "strange panics under abnormal load [such as
>>> when the weekly cron job runs]".
>>
>> If this panic occurs repeatedly under certain circumstances, it's
>> probably not hardware.  Anyway, there's not much point standing
>> outside and scratching our heads.  We have a facility for analysing
>> this kind of problem: the processor dump and kernel debugger.
>
> Yeah, I want to say thank you for your help. I think I've been able to
> reproduce the kernel panic now, finalay!
>
> On my server I run 3 jails and every night at 04:15 when it runs
> periodic weekly it runs it in 3 jails + the host enviroment. This
> seems to cause the kernel panic, I don't really know why yet. I can
> run periodic weekly separatly in every jail + the host without kernel
> panic but when I run it at the same time on all places it kernel
> panics.

What does the dump backtrace show?

> It can still be the PSU, don't have any other atm to try with. I'll
> do some more testing and see if I can get any more info.

There's no point looking at the hardware until you've looked at the
dump.

Greg
--
See complete headers for address and phone numbers.


pgp4OwzVB4OQj.pgp
Description: PGP signature


Re: My server gets kernel panic every 7th day

2004-12-19 Thread Daniel Johansson
Okay, is this hard to do? I've no idea how to look at the dump or how
to understand the dump. You don't have to be kernel hacker to
understand that?

On Mon, 20 Dec 2004 09:08:01 +1030, Greg 'groggy' Lehey
<[EMAIL PROTECTED]> wrote:
> On Sunday, 19 December 2004 at 23:35:18 +0100, Daniel Johansson wrote:
> > On Mon, 20 Dec 2004 08:59:19 +1030, Greg 'groggy' Lehey
> > <[EMAIL PROTECTED]> wrote:
> >> On Saturday, 18 December 2004 at 11:50:02 -0800, Kris Kennaway wrote:
> >>> On Sat, Dec 18, 2004 at 11:57:35AM +0100, Daniel Johansson wrote:
>  Hi, i've had my server up for over a year now and it's been rock solid
>  but for the latest weeks the server has rebooted evert Saturday at
>  exact 04:19:57 because of a find command. I have no idea why and I've
>  checked the cron log and I don't think any crontab is runned at that
>  time. Not as far as I can see from the cron log. Anyway find makes the
>  server get a kernel panic and it reboots. This is the fourth week in a
>  row it happens and I've checked the hardware, no problems at all.
> >>>
> >>> How did you "check the hardware"?  Hardware failure is by far the
> >>> most common cause of "strange panics under abnormal load [such as
> >>> when the weekly cron job runs]".
> >>
> >> If this panic occurs repeatedly under certain circumstances, it's
> >> probably not hardware.  Anyway, there's not much point standing
> >> outside and scratching our heads.  We have a facility for analysing
> >> this kind of problem: the processor dump and kernel debugger.
> >
> > Yeah, I want to say thank you for your help. I think I've been able to
> > reproduce the kernel panic now, finalay!
> >
> > On my server I run 3 jails and every night at 04:15 when it runs
> > periodic weekly it runs it in 3 jails + the host enviroment. This
> > seems to cause the kernel panic, I don't really know why yet. I can
> > run periodic weekly separatly in every jail + the host without kernel
> > panic but when I run it at the same time on all places it kernel
> > panics.
> 
> What does the dump backtrace show?
> 
> > It can still be the PSU, don't have any other atm to try with. I'll
> > do some more testing and see if I can get any more info.
> 
> There's no point looking at the hardware until you've looked at the
> dump.
> 
> Greg
> --
> See complete headers for address and phone numbers.
> 
> 
>
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


RE: My server gets kernel panic every 7th day

2004-12-19 Thread Ted Mittelstaedt
If I may interject, why should he bother?

Seems to me his next step should be setting up a new server
with a fresh FreeBSD copy on it, duplicating the 3 jails, then
firing off the 4 periodics and seeing if that blows up.

If so that is enough info to file a bug report, and he can
then simply adjust his script so that the periodics aren't all
run at the same time.  If someone else wants to spend the time
researching this - like YOU for example - then great.

>From the Project's point of view, his problems aren't interesting
unless they can be reproduced - and from his point of view all he
wants to do is fix the problem - and he now knows how to do it.
(run the periodic weekly at different times)  Grubbing around in
a dump traceback does not prove that a problem is reproducible.

Ted Mittelstaedt
Author, The FreeBSD Corporate Networker's Guide.

> -Original Message-
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED] Behalf Of Greg 'groggy'
> Lehey
> Sent: Sunday, December 19, 2004 2:38 PM
> To: Daniel Johansson
> Cc: FreeBSD Questions
> Subject: Re: My server gets kernel panic every 7th day
>
>
> On Sunday, 19 December 2004 at 23:35:18 +0100, Daniel Johansson wrote:
> > On Mon, 20 Dec 2004 08:59:19 +1030, Greg 'groggy' Lehey
> > <[EMAIL PROTECTED]> wrote:
> >> On Saturday, 18 December 2004 at 11:50:02 -0800, Kris Kennaway wrote:
> >>> On Sat, Dec 18, 2004 at 11:57:35AM +0100, Daniel Johansson wrote:
> >>>> Hi, i've had my server up for over a year now and it's been
> rock solid
> >>>> but for the latest weeks the server has rebooted evert Saturday at
> >>>> exact 04:19:57 because of a find command. I have no idea why and I've
> >>>> checked the cron log and I don't think any crontab is runned at that
> >>>> time. Not as far as I can see from the cron log. Anyway find
> makes the
> >>>> server get a kernel panic and it reboots. This is the fourth
> week in a
> >>>> row it happens and I've checked the hardware, no problems at all.
> >>>
> >>> How did you "check the hardware"?  Hardware failure is by far the
> >>> most common cause of "strange panics under abnormal load [such as
> >>> when the weekly cron job runs]".
> >>
> >> If this panic occurs repeatedly under certain circumstances, it's
> >> probably not hardware.  Anyway, there's not much point standing
> >> outside and scratching our heads.  We have a facility for analysing
> >> this kind of problem: the processor dump and kernel debugger.
> >
> > Yeah, I want to say thank you for your help. I think I've been able to
> > reproduce the kernel panic now, finalay!
> >
> > On my server I run 3 jails and every night at 04:15 when it runs
> > periodic weekly it runs it in 3 jails + the host enviroment. This
> > seems to cause the kernel panic, I don't really know why yet. I can
> > run periodic weekly separatly in every jail + the host without kernel
> > panic but when I run it at the same time on all places it kernel
> > panics.
>
> What does the dump backtrace show?
>
> > It can still be the PSU, don't have any other atm to try with. I'll
> > do some more testing and see if I can get any more info.
>
> There's no point looking at the hardware until you've looked at the
> dump.
>
> Greg
> --
> See complete headers for address and phone numbers.
>

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-19 Thread Greg 'groggy' Lehey
On Sunday, 19 December 2004 at 23:42:20 +0100, Daniel Johansson wrote:
> On Mon, 20 Dec 2004 09:08:01 +1030, Greg 'groggy' Lehey
> <[EMAIL PROTECTED]> wrote:
>> On Sunday, 19 December 2004 at 23:35:18 +0100, Daniel Johansson wrote:
>>> On Mon, 20 Dec 2004 08:59:19 +1030, Greg 'groggy' Lehey
>>> <[EMAIL PROTECTED]> wrote:
 On Saturday, 18 December 2004 at 11:50:02 -0800, Kris Kennaway wrote:
> On Sat, Dec 18, 2004 at 11:57:35AM +0100, Daniel Johansson wrote:
>> Hi, i've had my server up for over a year now and it's been rock solid
>> but for the latest weeks the server has rebooted evert Saturday at
>> exact 04:19:57 because of a find command. I have no idea why and I've
>> checked the cron log and I don't think any crontab is runned at that
>> time. Not as far as I can see from the cron log. Anyway find makes the
>> server get a kernel panic and it reboots. This is the fourth week in a
>> row it happens and I've checked the hardware, no problems at all.
>
> How did you "check the hardware"?  Hardware failure is by far the
> most common cause of "strange panics under abnormal load [such as
> when the weekly cron job runs]".

 If this panic occurs repeatedly under certain circumstances, it's
 probably not hardware.  Anyway, there's not much point standing
 outside and scratching our heads.  We have a facility for analysing
 this kind of problem: the processor dump and kernel debugger.
>>>
>>> Yeah, I want to say thank you for your help. I think I've been able to
>>> reproduce the kernel panic now, finalay!
>>>
>>> On my server I run 3 jails and every night at 04:15 when it runs
>>> periodic weekly it runs it in 3 jails + the host enviroment. This
>>> seems to cause the kernel panic, I don't really know why yet. I can
>>> run periodic weekly separatly in every jail + the host without kernel
>>> panic but when I run it at the same time on all places it kernel
>>> panics.
>>
>> What does the dump backtrace show?
>>
>>> It can still be the PSU, don't have any other atm to try with. I'll
>>> do some more testing and see if I can get any more info.
>>
>> There's no point looking at the hardware until you've looked at the
>> dump.

I'd appreciate it if you didn't require me to move the text of your
messages to where it fits.

> Okay, is this hard to do? I've no idea how to look at the dump or
> how to understand the dump. You don't have to be kernel hacker to
> understand that?

It's described in the handbook.  Basically:

- Build a kernel with debug symbols (you should be doing this anyway).
  You need the following line in your configuration file:

makeoptions DEBUG=-g# Build kernel with gdb(1) debug symbols

- Make sure that dumps are enabled.  You should have something like
  this in your /etc/rc.conf:

dumpdev=/dev/ad0s2b

  The device name should be the name of your swap partition, and it
  must be at least slightly larger than your main memory.

- Ensure you have a directory /var/crash, and that the file system in
  which it resides has enough space for the dump (a little larger than
  main memory).

- When you get a dump, it will be copied to /var/crash automatically
  on reboot.  Go there and get a backtrace.  You don't say which
  version of FreeBSD you're using, but in general this will do it:

  # cd /var/crash
  # gdb -k /usr/obj/src/sys/GENERIC/kernel.debug vmcore.0
  (gdb) bt
  
The name of the kernel (kernel.debug) depends on how you built your
kernel.  If it's not called GENERIC, the name of the directory will
change accordingly.

That's it in a nutshell.  There's much more detail in chapter 6 of my
debug tutorial, which you can find at
http://www.lemis.com/grog/Papers/Debug-tutorial/tutorial.pdf .

Greg
--
When replying to this message, please copy the original recipients.
If you don't, I may ignore the reply or reply to the original recipients.
For more information, see http://www.lemis.com/questions.html
See complete headers for address and phone numbers.


pgpjij8g1W5Zr.pgp
Description: PGP signature


Re: My server gets kernel panic every 7th day

2004-12-19 Thread Greg 'groggy' Lehey
[Format recovered--see http://www.lemis.com/email/email-format.html]

Please don't quote out of sequence.

On Sunday, 19 December 2004 at 14:44:22 -0800, Ted Mittelstaedt wrote:
> On  Sunday, December 19, 2004 2:38 PM, Greg 'groggy' Lehey wrote:
>> On Sunday, 19 December 2004 at 23:35:18 +0100, Daniel Johansson wrote:
>>> It can still be the PSU, don't have any other atm to try with. I'll
>>> do some more testing and see if I can get any more info.
>>
>> There's no point looking at the hardware until you've looked at the
>> dump.
>
> If I may interject, why should he bother?

So he can find out what the problem is.

> Seems to me his next step should be setting up a new server with a
> fresh FreeBSD copy on it, duplicating the 3 jails, then firing off
> the 4 periodics and seeing if that blows up.

This would take a lot of time and money.  And if it blows up, it
doesn't buy him anything: he still has to find the cause of the
problem.  If it doesn't blow up, he doesn't know whether it's going to
stay that way or maybe reappear in a while when the constellations are
right.  Why should he want to do that?

> If so that is enough info to file a bug report, and he can then
> simply adjust his script so that the periodics aren't all run at the
> same time.

This is a *VERY* bad idea.  If there's a bug, you should fix it, not
hide it.  Working around it is the Microsoft Way, and hidden bugs
often come back to bite you.

> If someone else wants to spend the time researching this - like YOU
> for example - then great.

Why should I want to do it?

> From the Project's point of view, his problems aren't interesting
> unless they can be reproduced - and from his point of view all he
> wants to do is fix the problem - and he now knows how to do it.
> (run the periodic weekly at different times)

Hiding a problem doesn't fix it.

> Grubbing around in a dump traceback does not prove that a problem is
> reproducible.

He's already proved that the problem is reproducible.

Greg
--
When replying to this message, please take care not to mutilate the
original text.  
For more information, see http://www.lemis.com/email.html
See complete headers for address and phone numbers.


pgpxqNo7mdJt0.pgp
Description: PGP signature


Re: My server gets kernel panic every 7th day

2004-12-19 Thread Daniel Johansson
Yeah, thank you. I changed the cronjob file so that they run periodic
weekly on diffrent days. So I hope I won't get any more panics now.

I don't have the time or energy to bebug the kernel atm, maby later
but thanks anyway.

Anyone else uses jails and noticed the same behaviour? I think I'll do
some more testing on a new box when I've got time to see if I can
reproduce it there.


On Sun, 19 Dec 2004 14:44:22 -0800, Ted Mittelstaedt
<[EMAIL PROTECTED]> wrote:
> If I may interject, why should he bother?
> 
> Seems to me his next step should be setting up a new server
> with a fresh FreeBSD copy on it, duplicating the 3 jails, then
> firing off the 4 periodics and seeing if that blows up.
> 
> If so that is enough info to file a bug report, and he can
> then simply adjust his script so that the periodics aren't all
> run at the same time.  If someone else wants to spend the time
> researching this - like YOU for example - then great.
> 
> From the Project's point of view, his problems aren't interesting
> unless they can be reproduced - and from his point of view all he
> wants to do is fix the problem - and he now knows how to do it.
> (run the periodic weekly at different times)  Grubbing around in
> a dump traceback does not prove that a problem is reproducible.
> 
> Ted Mittelstaedt
> Author, The FreeBSD Corporate Networker's Guide.
> 
> > -Original Message-
> > From: [EMAIL PROTECTED]
> > [mailto:[EMAIL PROTECTED] Behalf Of Greg 'groggy'
> > Lehey
> > Sent: Sunday, December 19, 2004 2:38 PM
> > To: Daniel Johansson
> > Cc: FreeBSD Questions
> > Subject: Re: My server gets kernel panic every 7th day
> >
> >
> > On Sunday, 19 December 2004 at 23:35:18 +0100, Daniel Johansson wrote:
> > > On Mon, 20 Dec 2004 08:59:19 +1030, Greg 'groggy' Lehey
> > > <[EMAIL PROTECTED]> wrote:
> > >> On Saturday, 18 December 2004 at 11:50:02 -0800, Kris Kennaway wrote:
> > >>> On Sat, Dec 18, 2004 at 11:57:35AM +0100, Daniel Johansson wrote:
> > >>>> Hi, i've had my server up for over a year now and it's been
> > rock solid
> > >>>> but for the latest weeks the server has rebooted evert Saturday at
> > >>>> exact 04:19:57 because of a find command. I have no idea why and I've
> > >>>> checked the cron log and I don't think any crontab is runned at that
> > >>>> time. Not as far as I can see from the cron log. Anyway find
> > makes the
> > >>>> server get a kernel panic and it reboots. This is the fourth
> > week in a
> > >>>> row it happens and I've checked the hardware, no problems at all.
> > >>>
> > >>> How did you "check the hardware"?  Hardware failure is by far the
> > >>> most common cause of "strange panics under abnormal load [such as
> > >>> when the weekly cron job runs]".
> > >>
> > >> If this panic occurs repeatedly under certain circumstances, it's
> > >> probably not hardware.  Anyway, there's not much point standing
> > >> outside and scratching our heads.  We have a facility for analysing
> > >> this kind of problem: the processor dump and kernel debugger.
> > >
> > > Yeah, I want to say thank you for your help. I think I've been able to
> > > reproduce the kernel panic now, finalay!
> > >
> > > On my server I run 3 jails and every night at 04:15 when it runs
> > > periodic weekly it runs it in 3 jails + the host enviroment. This
> > > seems to cause the kernel panic, I don't really know why yet. I can
> > > run periodic weekly separatly in every jail + the host without kernel
> > > panic but when I run it at the same time on all places it kernel
> > > panics.
> >
> > What does the dump backtrace show?
> >
> > > It can still be the PSU, don't have any other atm to try with. I'll
> > > do some more testing and see if I can get any more info.
> >
> > There's no point looking at the hardware until you've looked at the
> > dump.
> >
> > Greg
> > --
> > See complete headers for address and phone numbers.
> >
> 
>
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Kris Kennaway
On Sat, Dec 18, 2004 at 11:57:35AM +0100, Daniel Johansson wrote:
> Hi, i've had my server up for over a year now and it's been rock solid
> but for the latest weeks the server has rebooted evert Saturday at
> exact 04:19:57 because of a find command. I have no idea why and I've
> checked the cron log and I don't think any crontab is runned at that
> time. Not as far as I can see from the cron log. Anyway find makes the
> server get a kernel panic and it reboots. This is the fourth week in a
> row it happens and I've checked the hardware, no problems at all.

How did you "check the hardware"?  Hardware failure is by far the most
common cause of "strange panics under abnormal load [such as when the
weekly cron job runs]".

Kris


pgpKAtPBg9AGQ.pgp
Description: PGP signature


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Kris Kennaway
On Sat, Dec 18, 2004 at 08:52:47PM +0100, Daniel Johansson wrote:
> I checked the memory with memtest86+ and the disc with powermax,
> maxtors own software. No problems at all with those. Also checked all
> fans and everything was okay.
> 
> So I don't think it is any hw error.

What about power supply?  I've had servers start to flake out in
exactly this way when their PS took a dive and was no longer to keep
up with peak demands.

Kris

P.S. Don't drop the list from the CC if you want others to participate
in helping you


pgpdluVWQ8SqO.pgp
Description: PGP signature


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Daniel Johansson
Hmm, I haven't checked the powersuply but I think it is unlikley that
it would fail on a find when the box can take very high load and I've
done a few rebuilds (kernel + world) on the box without any problems
at all.

On Sat, 18 Dec 2004 12:03:19 -0800, Kris Kennaway <[EMAIL PROTECTED]> wrote:
> On Sat, Dec 18, 2004 at 08:52:47PM +0100, Daniel Johansson wrote:
> > I checked the memory with memtest86+ and the disc with powermax,
> > maxtors own software. No problems at all with those. Also checked all
> > fans and everything was okay.
> >
> > So I don't think it is any hw error.
> 
> What about power supply?  I've had servers start to flake out in
> exactly this way when their PS took a dive and was no longer to keep
> up with peak demands.
> 
> Kris
> 
> P.S. Don't drop the list from the CC if you want others to participate
> in helping you
> 
> 
>
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: My server gets kernel panic every 7th day

2004-12-18 Thread Kris Kennaway
On Sat, Dec 18, 2004 at 09:07:09PM +0100, Daniel Johansson wrote:
> Hmm, I haven't checked the powersuply but I think it is unlikley that
> it would fail on a find when the box can take very high load and I've
> done a few rebuilds (kernel + world) on the box without any problems
> at all.

Unlikely or not, I've seen it happen, and there's no other likely
candidate since you said you haven't updated the machine in a year,
and it's been stable under the same load until now.

Kris


pgpqPYg22UVQE.pgp
Description: PGP signature