Re: raid5:md3: read error corrected , followed by , Machine Check Exception: .

2007-07-14 Thread Alan Cox
On Sat, 14 Jul 2007 17:08:27 -0700 (PDT) Mr. James W. Laferriere [EMAIL PROTECTED] wrote: Hello All , I was under the impression that a 'machine check' would be caused by some near to the CPU hardware failure , Not a bad disk ? It indicates a hardware failure Jul 14 23:00:26

Re: Linux: Why software RAID?

2006-08-24 Thread Alan Cox
Ar Iau, 2006-08-24 am 09:07 -0400, ysgrifennodd Adam Kropelin: Jeff Garzik [EMAIL PROTECTED] wrote: with sw RAID of course if the builder is careful to use multiple PCI cards, etc. Sw RAID over your motherboard's onboard controllers leaves you vulnerable. Generally speaking the channels on

Re: Linux: Why software RAID?

2006-08-24 Thread Alan Cox
Ar Iau, 2006-08-24 am 07:31 -0700, ysgrifennodd Marc Perkel: So - the bottom line answer to my question is that unless you are running raid 5 and you have a high powered raid card with cache and battery backup that there is no significant speed increase to use hardware raid. For raid 0 there

Re: [PATCH 000 of 5] md: Introduction

2006-01-18 Thread Alan Cox
On Mer, 2006-01-18 at 09:14 +0100, Sander wrote: If the (harddisk internal) remap succeeded, the OS doesn't see the bad sector at all I believe. True for ATA, in the SCSI case you may be told about the remap having occurred but its a by the way type message not an error proper. If you (the

Re: [PATCH] RAID5 NULL Checking Bug Fixt

2001-05-16 Thread Alan Cox
On Wednesday May 16, [EMAIL PROTECTED] wrote: (more patches to come. They will go to Linus, Alan, and linux-raid only). This is the next one, which actually addresses the NULL Checking Bug. Thanks. As Linus merges I'll switch over to match his tree. Less diff is good 8) - To

Re: Proposed RAID5 design changes.

2001-03-21 Thread Alan Cox
1) Read and write errors should be retried at least once before kicking the drive out of the array. This doesn't seem unreasonable on the face of it. Device level retries are the job of the device level driver 2) On more persistent read errors, the failed block (or whatever unit is

Re: Proposed RAID5 design changes.

2001-03-21 Thread Alan Cox
any data, but under normal default drive setup the sector will not be reallocated. If testing the failing sector is too much effort, a simple overwrite with the corrected data, at worst, improves the chances of the drive firmware being able to reallocate the sector. This works just fine

Re: Proposed RAID5 design changes.

2001-03-21 Thread Alan Cox
Umm. Isn't RAID implemented as the md device? That implies that it is responsible for some kind of error management. Bluntly, the file systems don't declare a file system kaput until they've retried the critical I/O operations. Why should RAID5 be any less tolerant? File systems give up the