[SLUG] System freeze - how to diagnose?

2007-01-21 Thread david
The power light was on, but no-one home. Nothing responding, no ping.
Other machines on the same UPS are running fine.

I rebooted and got a "30 times mounted" forced disk check - no problem
there. Everything came up fine. Nothing in syslog or anywhere else I can
think of.

The only thing notable was that the activity light on a USB webcam I'm
messing with was still on, which makes me suspicious it might have to do
with that. [1] I was testing a CPU-intensive image capture script, and
I left it running overnight, for over 12 hours - overheating perhaps?

Any suggestions where/how to start investigating this? Nothing obvious
on Google that I can find. Ubuntu 6.10, btw.

many thanks...

David.


[1]
[17179602.072000] drivers/media/video/spca5xx/spca5xx-main.c: USB SPCA5XX camera found. Logitech QC Communicate STX
[17179602.372000] usbcore: registered new driver spca5xx
[17179602.372000] drivers/media/video/spca5xx/spca5xx-main.c: spca5xx driver 00.57.08 registered


-- 
SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/
Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html


Re: [SLUG] System freeze - how to diagnose?

2007-01-21 Thread Alex Samad
On Mon, Jan 22, 2007 at 09:13:24AM +1100, david wrote:
> Any suggestions where/how to start investigating this? Nothing obvious
> on Google that I can find. Ubuntu 6.10, btw.

If you have another machine, connect the two with a serial cable and send
your console output over it. Start minicom on the second box and wait
for the first one to crash.
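
A minimal sketch of the receiving side of the serial-console idea above, as an alternative to watching minicom by hand: it copies whatever arrives on the serial line into a timestamped log, so the last kernel messages before a freeze are kept. It assumes the crashing machine was booted with a kernel parameter such as console=ttyS0,115200n8 and that the port on the second box is already set to the same speed (for example with stty). The function name log_console and the file names in the usage note are illustrative only; nothing here comes from the thread itself.

```python
import time


def log_console(stream, out, clock=time.time):
    """Copy lines from a binary stream to a text file, one timestamp per line.

    stream: any binary file-like object (a serial device opened with "rb").
    out:    any text file-like object (a log file opened with "a").
    clock:  returns seconds since the epoch; replaceable for testing.
    """
    for raw in stream:
        # Kernel messages are plain ASCII; replace anything damaged in transit.
        line = raw.decode("ascii", errors="replace").rstrip("\r\n")
        stamp = time.strftime("%Y-%m-%d %H:%M:%S", time.localtime(clock()))
        out.write(f"{stamp} {line}\n")
        # Flush after every line so the log is current if this box is
        # interrupted too.
        out.flush()
```

On the second box this could be driven with something like `with open("/dev/ttyS0", "rb", buffering=0) as port, open("console.log", "a") as log: log_console(port, log)`, where the device path is an assumption about how the cable is attached.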

Also, did you try the Alt+SysRq magic key combo?


Re: [SLUG] System freeze - how to diagnose?

2007-01-21 Thread david
On Mon, 2007-01-22 at 13:46 +1100, Alex Samad wrote:
> if you have another machine, connect the 2 together by serial cable and send
> your console out the serial cable.  Start up minicom on the 2nd box and wait
> for it to crash.
> 

sounds reasonable ;-) sounds like I need to run it overnight again and
try to reproduce the problem.

> Also did you try alt+sysrq magic key combo ?
> 

I hadn't heard of this, but it looks like it only works from the console,
not from a GNOME terminal.




Re: [SLUG] System freeze - how to diagnose?

2007-01-22 Thread Alex Samad
On Mon, Jan 22, 2007 at 02:30:13PM +1100, david wrote:
> I hadn't heard of this, but it looks like it only works from console,
> not from a Gnome terminal.
Yep, you need console access. There is some way of doing it across a serial
connection, but ...

It is worth knowing about; it has saved me a few times.
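
For a machine that is hung but still lets you run commands (over ssh, say), the same SysRq commands can be issued without a keyboard by writing the command key to /proc/sysrq-trigger as root. The sketch below is only an illustration under that assumption: the helper name sysrq and the short key table are made up for this example, and the feature has to be enabled in the kernel (see /proc/sys/kernel/sysrq). On a physical keyboard the equivalent is Alt+SysRq plus the key; the kernel documentation also describes sending a serial BREAK followed by the key on a serial console.

```python
# A few common magic SysRq command keys, as described in the kernel's
# sysrq documentation. The table is deliberately short.
SYSRQ_KEYS = {
    "s": "sync all mounted filesystems",
    "u": "remount all filesystems read-only",
    "b": "reboot immediately, without syncing or unmounting",
    "t": "dump the task list to the console",
    "m": "dump memory information to the console",
}


def sysrq(key, trigger_path="/proc/sysrq-trigger"):
    """Write one command key to the SysRq trigger file (needs root)."""
    if key not in SYSRQ_KEYS:
        raise ValueError(f"unsupported SysRq key: {key!r}")
    with open(trigger_path, "w") as trigger:
        trigger.write(key)
```

A cautious shutdown of a wedged box would be sysrq("s"), then sysrq("u"), then sysrq("b"), with a pause between each so the sync can finish.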



Re: [SLUG] System freeze - how to diagnose?

2007-01-22 Thread david
For the record, this was a disk I/O error, which I discovered by leaving
the job running overnight while logged into a console rather than GNOME.

I'm guessing something is overheating, since it's a job that's very
processor- and hard-drive-intensive. I'm also assuming (!) that it wasn't
logged because the system couldn't write to disk?

David.
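
If the guess above is right and the messages were lost because the local disk could not be written, one workaround is to send log records to another machine as well. The sketch below uses Python's standard logging.handlers.SysLogHandler to send records over UDP. It assumes a syslog daemon on the other machine is configured to accept remote messages; the helper name make_remote_logger and the logger name "capture" are invented for this example.

```python
import logging
import logging.handlers


def make_remote_logger(host, port=514, name="capture"):
    """Return a logger that sends records to a syslog daemon over UDP.

    Call this once per logger name; calling it again adds a second
    handler and every record would then be sent twice.
    """
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    # A (host, port) address makes SysLogHandler use a UDP socket by default.
    handler = logging.handlers.SysLogHandler(address=(host, port))
    handler.setFormatter(logging.Formatter("%(name)s: %(message)s"))
    logger.addHandler(handler)
    return logger
```

The capture script could then call something like log = make_remote_logger("192.168.0.2") once at startup (the address is a placeholder) and log.info(...) after each frame, so the last message received on the other machine shows roughly when things stopped.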


On Mon, 2007-01-22 at 22:22 +1100, Alex Samad wrote:
> yep you need console access, there is some way of doing it across a serial
> connection, but .
> 
> It is worth knowing about, has saved me a few times


Re: [SLUG] System freeze - how to diagnose?

2007-01-22 Thread Alex Samad
On Tue, Jan 23, 2007 at 11:33:42AM +1100, david wrote:
> For the record, this was a disk I/O error, which I discovered by leaving
> the job running overnight while logged into a console rather than GNOME.
> 
> I'm guessing something is overheating, since it's a job that's very
> processor- and hard-drive-intensive. I'm also assuming (!) that it wasn't
> logged because the system couldn't write to disk?
smartmontools (smartctl) is your friend.
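
A rough sketch of how the drive's SMART attributes might be checked from a script, for example to watch the temperature while the capture job runs. It shells out to smartctl -A (part of smartmontools, needs root) and parses the attribute table. The function names and the /dev/sda path in the usage note are assumptions for illustration, and not every drive reports an attribute called Temperature_Celsius.

```python
import subprocess


def parse_attributes(text):
    """Map SMART attribute names to raw values from `smartctl -A` output."""
    attrs = {}
    for line in text.splitlines():
        parts = line.split()
        # Attribute rows start with a numeric ID and have ten or more
        # columns; the raw value is everything from the tenth column on.
        if len(parts) >= 10 and parts[0].isdigit():
            attrs[parts[1]] = " ".join(parts[9:])
    return attrs


def read_attributes(device):
    """Run smartctl on a device and return its parsed attribute table."""
    # smartctl uses its exit status as a bit mask, so a non-zero status
    # does not by itself mean the command failed.
    result = subprocess.run(
        ["smartctl", "-A", device],
        capture_output=True, text=True, check=False,
    )
    return parse_attributes(result.stdout)
```

For example, read_attributes("/dev/sda").get("Temperature_Celsius") could be logged every few minutes, alongside Reallocated_Sector_Ct, to see whether the drive is heating up or remapping sectors before the I/O error appears.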

