Re: MySQL Server crashing & database corruption

Samuel Vogel Sun, 07 Oct 2007 13:17:04 -0700

Actually I have an somewhat reproducible test case.
Problem is, it only works on my Server.

When I start MySQL it crashes almost exactly after one minute. Thishappens every single time:


Oct  7 21:41:04 h1314631 mysqld_safe[18591]: started

Oct 7 21:41:04 h1314631 mysqld[18594]: 071007 21:41:04 [Warning] Thesyntax for replication startup options is deprecated and will be removedin MySQL 5.2. Please use 'CHANGE MASTER' instead.Oct 7 21:41:06 h1314631 mysqld[18594]: 071007 21:41:06 InnoDB:Started; log sequence number 0 1353332480Oct 7 21:41:07 h1314631 mysqld[18594]: 071007 21:41:07 [Note]/usr/sbin/mysqld: ready for connections.Oct 7 21:41:07 h1314631 mysqld[18594]: Version:'5.0.45-Debian_1~bpo.1-debug' socket: '/var/run/mysqld/mysqld.sock'port: 3306 Debian etch distributionOct 7 21:41:36 h1314631 mysqld[18594]: 071007 21:41:36 [Warning]Neither --relay-log nor --relay-log-index were used; so replication maybreak when this MySQL server acts as a slave and has his hostnamechanged!! Please use '--relay-log=/var/run/mysqld/mysqld-relay-bin' toavoid this problem.Oct 7 21:41:36 h1314631 mysqld[18594]: 071007 21:41:36 [ERROR] Failedto open the relay log '/var/run/mysqld/mysqld-relay-bin.000045'(relay_log_pos 2575)Oct 7 21:41:36 h1314631 mysqld[18594]: 071007 21:41:36 [ERROR] Couldnot find target log during relay log initializationOct 7 21:42:38 h1314631 mysqld_safe[18826]: Number of processes runningnow: 0

Oct  7 21:42:38 h1314631 mysqld_safe[18832]: restarted

Oct 7 21:42:38 h1314631 mysqld[18836]: 071007 21:42:38 [Warning] Thesyntax for replication startup options is deprecated and will be removedin MySQL 5.2. Please use 'CHANGE MASTER' instead.Oct 7 21:42:38 h1314631 mysqld[18836]: 071007 21:42:38 InnoDB:Database was not shut down normally!

Oct  7 21:42:38 h1314631 mysqld[18836]: InnoDB: Starting crash recovery.

Oct 7 21:42:38 h1314631 mysqld[18836]: InnoDB: Reading tablespaceinformation from the .ibd files...Oct 7 21:51:22 h1314631 mysqld[18836]: InnoDB: Restoring possiblehalf-written data pages from the doublewrite

Oct  7 21:51:22 h1314631 mysqld[18836]: InnoDB: buffer...

Oct 7 21:51:22 h1314631 mysqld[18836]: 071007 21:51:22 InnoDB:Starting log scan based on checkpoint atOct 7 21:51:22 h1314631 mysqld[18836]: InnoDB: log sequence number 01353332894.Oct 7 21:51:22 h1314631 mysqld[18836]: InnoDB: Doing recovery: scannedup to log sequence number 0 1353333356Oct 7 21:51:22 h1314631 mysqld[18836]: 071007 21:51:22 InnoDB:Starting an apply batch of log records to the database...Oct 7 21:51:23 h1314631 mysqld[18836]: InnoDB: Progress in percents: 4041 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 6465 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 8889 90 91 92 93 94 95 96 97 98 99

Oct  7 21:51:23 h1314631 mysqld[18836]: InnoDB: Apply batch completed

Oct 7 21:51:23 h1314631 mysqld[18836]: 071007 21:51:23 InnoDB:Started; log sequence number 0 1353333356Oct 7 21:51:24 h1314631 mysqld[18836]: 071007 21:51:24 [ERROR] Errorreading master configurationOct 7 21:51:24 h1314631 mysqld[18836]: 071007 21:51:24 [ERROR] Failedto initialize the master info structureOct 7 21:51:24 h1314631 mysqld[18836]: 071007 21:51:24 [Note]/usr/sbin/mysqld: ready for connections.Oct 7 21:51:24 h1314631 mysqld[18836]: Version:'5.0.45-Debian_1~bpo.1-debug' socket: '/var/run/mysqld/mysqld.sock'port: 3306 Debian etch distribution


Maybe this helps somebody?

Regards,
Samy


Samuel Vogel schrieb:

I did some more research:
I ran "xfs_check" on the partition and it didn't report any errors atall, so I'm guessing, that the hard drives are ok.
I also recompiled MySQL with "--with-debug".

How would I get some debug output about the crashes now?

Thanks,
Samy

Samuel Vogel schrieb:
Hey guys,
First of all: I have tried to post this Issue on forums.mysql.com,but the Topic doesn't appear in the Forum. When I try to post again,it tells me that it's a duplicate?!?!
Now the real problem:
I have MySQL set up on two Servers with 7000 Users each and about thesame amount of databases.Since two days ago, there is corruption going on in many databases.And MySQL crashes every other hour or so.Last night I updated from 5.0.32 to 5.0.45, but the problem is stillthere.
Here is what I see in syslog:
Oct 7 11:02:53 h1314631 mysqld[32490]: 071007 11:02:53 [ERROR]/usr/sbin/mysqld: Table './[EMAIL PROTECTED]/dzcp_counter_ips' is markedas crashed and should be repairedOct 7 11:02:53 h1314631 mysqld[32490]: 071007 11:02:53 [ERROR]/usr/sbin/mysqld: Table './[EMAIL PROTECTED]/dzcp_counter_ips' is markedas crashed and should be repairedOct 7 11:03:23 h1314631 mysqld_safe[32724]: Number of processesrunning now: 0
Oct  7 11:03:23 h1314631 mysqld_safe[32729]: restarted
Oct 7 11:03:23 h1314631 mysqld[32734]: 071007 11:03:23 InnoDB:Database was not shut down normally!
Oct  7 11:03:23 h1314631 mysqld[32734]: InnoDB: Starting crash recovery.
Oct 7 11:03:23 h1314631 mysqld[32734]: InnoDB: Reading tablespaceinformation from the .ibd files...Oct 7 11:08:36 h1314631 mysqld[32734]: InnoDB: Restoring possiblehalf-written data pages from the doublewrite
Oct  7 11:08:36 h1314631 mysqld[32734]: InnoDB: buffer...
Oct 7 11:08:36 h1314631 mysqld[32734]: 071007 11:08:36 InnoDB:Starting log scan based on checkpoint atOct 7 11:08:36 h1314631 mysqld[32734]: InnoDB: log sequence number 01346871925.Oct 7 11:08:36 h1314631 mysqld[32734]: InnoDB: Doing recovery:scanned up to log sequence number 0 1346871925Oct 7 11:08:36 h1314631 mysqld[32734]: 071007 11:08:36 InnoDB:Started; log sequence number 0 1346871925Oct 7 11:08:38 h1314631 mysqld[32734]: 071007 11:08:38 [Note]/usr/sbin/mysqld: ready for connections.Oct 7 11:08:38 h1314631 mysqld[32734]: Version:'5.0.45-Debian_1~bpo.1' socket: '/var/run/mysqld/mysqld.sock' port:3306 Debian etch distribution
As far as I understand, this means, that the MySQL Server crashed andmysqld_safe noticed that and restarted it.I also see much database corruption, but I somewhat run into achicken & egg problem here. I don't know if the databases corruptionappeared first and led into the crashes or if the crashes led to thecorruption.
How can I investigate further into the problem? I don't think that aperticular query is crashing the system, since all of our users justrun well known apps like phpBB etc.
To clarify my situtaion I have just started a "myisamchk --silent--force --update-state --recover" for all tables that on my system.It gives me, among errors it could repair, to error messages, which Icould find much about with Google or the MySQL docs:
myisamchk: error: 138 when opening MyISAM-table'/data/mysql/.../transcache.MYI'
and

myisamchk: Unknown error 126
myisamchk: error: '/data/mysql/.../smf_membergroups.MYI' doesn't havea correct index definition. You need to recreate it before you can doa repair
What makes me wonder too, is that the database corruption ishappening on both Servers, but the MySQL crashes only appear on oneof them.
I'm running Debian Etch and the MySQL data dir is on an XFSpartition. I have mounted the partition with "noatime".
How would I investigate a potential hard drive error?

Can anybody shade some light on my situation?

Regards,
Samy


--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:    http://lists.mysql.com/[EMAIL PROTECTED]

Re: MySQL Server crashing & database corruption

Reply via email to