Hi Maurice,

If you're running into corruption both in ext3 metadata and in MySQL data, it is certainly not the fault of MySQL, as you're likely aware.

I am hoping they are not related. The problems with MySQL surfaced almost immediately after upgrading to 5.0.x.

It's possible that they are not related. It could even be 5.0-specific and still not be a MySQL bug; i.e., MySQL 5.0 could be doing something that trips the underlying bug and causes it to surface. It's hard to say anything for sure, though. Nonetheless, I generally don't bother worrying about the possibility of MySQL bugs until I'm sure that the OS and hardware are stable.

You can see that there are in fact many bits flipped in each. I would suspect higher-level corruption than

I initially thought this as well, but the explanation on the ext3 mailing list is that it really is just a lone flipped bit in both instances. The other differences are due to fsck padding out the block when it guesses what the correct size is.

Interesting. Can you forward that mail to me personally, or summarize for the list? I'd be interested to read the explanation.
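
In the meantime, just to illustrate the idea (the field and values below
are made up by me, not taken from your dumps): a single flipped bit high
up in a 32-bit size field is enough to change the value by a million or
more, and fsck re-padding the block to match that bogus size would then
account for all of the other differing bits. Something like:

    /* Toy illustration: one flipped bit in a size field. The field and
       values are invented for the example, not from the actual dumps. */
    #include <stdio.h>
    #include <stdint.h>

    int main(void)
    {
        uint32_t size    = 4096;               /* what the inode should say  */
        uint32_t corrupt = size ^ (1u << 20);  /* one bit flipped in transit */

        printf("original: %u\n", size);        /* 4096    */
        printf("corrupt:  %u\n", corrupt);     /* 1052672 */
        return 0;
    }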

Do note that data on e.g. the PCI bus is not protected by any sort of checksum. I've seen this cause corruption problems with PCI risers and RAID cards. Are you using a PCI riser card? Note that LSI does *not* certify their cards to be used on risers if you are custom building a machine.

Yes, there is a riser card. Wouldn't this imply that LSI is saying you can't use a 1U or a 2U box?

Kind of. Presumably you would be buying a vendor-integrated solution where they have certified that the riser card and RAID card are compatible. Presumably. You'll also notice that most vendors are moving to controllers that aren't PCI{,-E,-X} slot based, and instead connect directly to a low-profile integrated slot. This removes a few variables. (And frees up some space.)

It's kind of scary that there is no end-to-end parity implemented anywhere along the whole data path to prevent this. It sort of defeats the point of RAID 6 and ECC.

I agree, it's pretty damn scary. You can read about the story and the ensuing discussion here:

http://jcole.us/blog/archives/2006/09/04/on-1u-cases-pci-risers-and-lsi-megaraid/
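
About the only defense at a higher level is for the application to do its
own end-to-end check: checksum the data before it goes anywhere near the
bus, and verify the checksum after reading it back. A rough sketch of the
idea (using zlib's crc32(); purely an illustration, nothing more):

    /* Sketch: application-level end-to-end verification of a buffer.
       Compile with -lz for zlib's crc32(). */
    #include <stdio.h>
    #include <string.h>
    #include <zlib.h>

    int main(void)
    {
        unsigned char buf[4096];
        memset(buf, 0xAB, sizeof(buf));

        /* checksum taken before the data leaves the application */
        uLong before = crc32(0L, buf, sizeof(buf));

        /* ... write buf out, read it back in ... */

        buf[100] ^= 0x01;   /* simulate one bit flipped along the path */

        uLong after = crc32(0L, buf, sizeof(buf));
        printf("checksums %s\n", before == after ? "match" : "DIFFER");
        return 0;
    }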

How did you determine this was the cause?

By isolating lots of variables. The customer in question had a workload that could reproduce the problem reliably, although never in the same place or at the same time, which made it hard to track down, and never under debug mode (which likely slowed things down enough not to trigger it).

I finally suggested that they isolate the riser card as a variable by plugging the RAID card directly into the slot. Since it was a 1U machine, this required taking the metal frame off the card and leaving the case open (and hanging out into the datacenter aisle). It could then be shown that with the riser, corruption always occurred, and without the riser, it never did.

Obviously, running the machines with cases open and cards plugged in directly was not an option, so the only other possible option was chosen: move to all new hardware with integrated RAID. (HP and their integrated SmartArray/cciss controller was chosen as a vendor in this case.)

You mean a Serial Attached SCSI (SAS) controller, I assume?

No, it's SATA to SCSI.

Interesting. I hadn't heard of such a thing until I just looked it up. But in any case, that adds yet another variable (and a fairly uncommon one) to the mix.

Regards,

Jeremy

--
high performance mysql consulting
www.provenscaling.com
