Hi,

We are currently deploying Tyan S2882 Dual Opteron Boards, and we have
found the system to be quite unstable. After BIOS updates and kernel
changes we still get random kernel panics when under load.

Anyone has these boards and has found any solution, as I have mailed
other users of this board  who also reported random kernel panics and
an unusual number of hardware problems.

So far we have solved the
- broken BIOS problem with an update to the most recent BIOS.
- Discovered that some power supplies can produce problems
http://www.anandtech.com/mb/showdoc.aspx?i=2608
- FS corruption due to a firmeware problem in a RAID hardware board
- MCE chipkill errors (non-fatal) due to apparent bad RAM

To be solved:
- random kernel panics that take out the logging even when all debug
flags are set in the kernel, as it fails to sync the disc during the
kernel panic.
_______________________________________________
Beowulf mailing list, [email protected]
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to