I am posting this in the hope it will be useful; I seem 
to have solved the problem. 

The problem occurred with MySQL 3.23.38 on Linux 2.2.18 or 2.4.0.

This is a huge system, all SCSI, with 2GB of RAM, 2 PIII 
1000MHz processors, and 37GB in the volume group 
corresponding to /var/lib/mysql.
There is an AHA29160 SCSI controller.

A script was running continuously, loading data into 
MySQL. Every so often the script performed a mysqldump.
The problem was that at some point, when I tried to 
access the database, I would get a crash with error 
code 127.
REPAIR TABLE or myisamchk -r could fix that, but 
not all data was being recovered.
Similarly, the dumps were corrupted.

This was apparently caused by SCSI timeouts, and the 
solution was to pass the boot parameter
 aic7xxx=seltime:0

in lilo.
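
For readers unfamiliar with lilo, here is a minimal sketch of 
how such a parameter is typically made permanent in 
/etc/lilo.conf. The image path, label, and root device are 
hypothetical placeholders, not taken from this system; only 
the append line reflects the setting described above.

```
# /etc/lilo.conf (fragment; image, label and root are placeholders)
image=/boot/vmlinuz
    label=linux
    root=/dev/sda1
    read-only
    # kernel boot parameter from this post
    append="aic7xxx=seltime:0"
```

After editing the file, /sbin/lilo has to be re-run so the 
change is written to the boot loader. To try the setting 
once before making it permanent, the parameter can also be 
typed at the LILO boot prompt after the image label, for 
example: linux aic7xxx=seltime:0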

Perhaps this would be a useful addition to the FAQs.
S.Alexiou

------------------------------
Here are some excerpts from 
/var/lib/mysql/'hostname'.err

010826 21:59:19  mysqld started
/usr/sbin/mysqld: ready for connections
010827 14:05:37  mysqld started
/usr/sbin/mysqld: ready for connections
010827 17:44:12  Aborted connection 171983 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 20:55:39  Aborted connection 513765 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 20:58:34  Aborted connection 521479 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 21:00:34  Aborted connection 524452 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 21:01:57  Aborted connection 529556 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 21:02:52  Aborted connection 533379 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 21:05:40  Aborted connection 540483 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 21:10:50  Aborted connection 554558 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 21:13:50  Aborted connection 556004 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010827 22:57:44  /usr/sbin/mysqld: Normal shutdown

010827 22:57:45  /usr/sbin/mysqld: Shutdown Complete

010830 18:17:47  mysqld started
/usr/sbin/mysqld: ready for connections
010830 18:45:08  Aborted connection 41658 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010830 18:49:16  Aborted connection 50975 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010830 22:03:58  Aborted connection 528901 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010830 22:18:55  Aborted connection 565805 to db: 'CDR' 
user: 'spiros' host: `localhost' (Got an error reading 
communication packets)
010830 22:42:52  Warning: Found 11 of 17 rows when 
repairing './CDR/CDR_INCOMING_1360_1'
010830 22:56:39  Warning: Found 12 of 16 rows when 
repairing './CDR/CDR_INCOMING_1400_1'
010830 22:58:53  Warning: Found 7 of 12 rows when 
repairing './CDR/CDR_INCOMING_1412_0'
010830 23:00:26  Warning: Found 13 of 27 rows when 
repairing './CDR/CDR_INCOMING_1452_1'
010830 23:01:23  Warning: Found 5 of 18 rows when 
repairing './CDR/CDR_INCOMING_1491_1'
010830 23:02:28  Warning: Found 8 of 12 rows when 
repairing './CDR/CDR_INCOMING_1505_0'
010830 23:03:13  Warning: Found 1 of 6 rows when 
repairing './CDR/CDR_INCOMING_1522_1'
010830 23:09:00  Warning: Found 20 of 27 rows when 
repairing './CDR/CDR_INCOMING_1654_0'
010830 23:17:33  /usr/sbin/mysqld: Normal shutdown

010830 23:17:33  /usr/sbin/mysqld: Shutdown 
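
To get a quick picture of how many rows the repairs lost, 
the warnings above can be totalled with a small script. This 
is only a sketch: the function name summarize_repairs is made 
up for illustration, and the pattern assumes the warning 
wording shown in this excerpt (it also tolerates the line 
wrapping introduced by this mail).

```python
import re

# Matches mysqld error-log warnings such as:
#   Warning: Found 11 of 17 rows when repairing './CDR/CDR_INCOMING_1360_1'
# \s+ also accepts a line break between "when" and "repairing".
REPAIR_RE = re.compile(
    r"Warning: Found (\d+) of (\d+) rows when\s+repairing '([^']+)'"
)


def summarize_repairs(log_text):
    """Return a list of (table, found, expected, lost) tuples."""
    results = []
    for m in REPAIR_RE.finditer(log_text):
        found, expected = int(m.group(1)), int(m.group(2))
        results.append((m.group(3), found, expected, expected - found))
    return results


if __name__ == "__main__":
    import sys

    # Usage (hypothetical): python repairs.py /var/lib/mysql/HOST.err
    with open(sys.argv[1]) as f:
        rows = summarize_repairs(f.read())
    for table, found, expected, lost in rows:
        print(f"{table}: recovered {found} of {expected}, lost {lost}")
    print("total rows lost:", sum(r[3] for r in rows))
```

The script only reads the log file; it does not touch the 
tables themselves.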

----------------------------------------
Some messages may also show up in /var/log/messages

29:36 quality4 PAM-unix2[1040]: session started for 
user root, service su 
change detected on device fd(2,0)
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 37 00 00 08 00 
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 3f 00 00 08 00 
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 47 00 00 08 00 
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 4f 00 00 08 00 
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 57 00 00 08 00 
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 5f 00 00 08 00 
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 67 00 00 08 00 
Aug 25 21:17:07 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 92 cd 6f 00 00 08 00 
Aug 25 21:17:08 quality4 kernel: SCSI host 0 abort (pid 
0) timed out - resetting
Aug 25 21:17:08 quality4 kernel: SCSI bus is being 
reset for host 0 channel 0.
Aug 25 21:17:11 quality4 kernel: (scsi0:0:2:0) 
Synchronous at 160.0 Mbyte/sec, offset 63.
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 2f 00 00 08 00 
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 37 00 00 08 00 
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 3f 00 00 08 00 
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 47 00 00 08 00 
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 4f 00 00 08 00 
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 57 00 00 08 00 
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 5f 00 00 08 00 
Aug 25 21:17:50 quality4 kernel: scsi : aborting 
command due to timeout : pid 0, scsi0, channel 0, id 2, 
lun 0 Write (10) 00 03 96 5c 67 00 00 08 00 
Aug 25 21:17:51 quality4 kernel: SCSI host 0 abort (pid 
0) timed out - resetting
Aug 25 21:17:51 quality4 kernel: SCSI bus is being 
reset for host 0 channel 0.
Aug 25 21:17:54 quality4 kernel: (scsi0:0:2:0) 
Synchronous at 160.0 Mbyte/sec, offset 63.
Aug 25 21:26:06 quality4 kernel: (scsi0:0:1:0) 
Synchronous at 160.0 Mbyte/sec, offset 63.
Aug 25 21:48:13 quality4 -- MARK --
Aug 25 21:59:00 quality4 /USR/SBIN/CRON[1670]: (root) 
CMD ( rm -f /var/spool/cron/lastrun/cron.hourly) 
Aug 25 22:05:00 quality4 /USR/SBIN/CRON[1705]: (root
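
To check whether a system shows the same symptom, the kernel 
messages can be scanned for aborted SCSI commands and counted 
per device. The following is a sketch under the assumption 
that the messages have the wording shown in this excerpt; the 
function name count_timeouts is hypothetical.

```python
import re
from collections import Counter

# Matches kernel messages such as:
#   scsi : aborting command due to timeout : pid 0, scsi0, channel 0, id 2, lun 0 ...
# \s+ also accepts the line breaks introduced by mail wrapping.
TIMEOUT_RE = re.compile(
    r"aborting\s+command due to timeout : pid \d+, "
    r"scsi(\d+), channel (\d+), id (\d+),\s+lun (\d+)"
)


def count_timeouts(log_text):
    """Count aborted commands per (host, channel, id, lun)."""
    counts = Counter()
    for m in TIMEOUT_RE.finditer(log_text):
        counts[tuple(int(g) for g in m.groups())] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage (hypothetical): python scsi_timeouts.py /var/log/messages
    with open(sys.argv[1], errors="replace") as f:
        counts = count_timeouts(f.read())
    for (host, channel, target, lun), n in sorted(counts.items()):
        print(f"scsi{host} channel {channel} id {target} lun {lun}: {n} timeouts")
```

A non-zero count for the disk holding /var/lib/mysql would 
suggest looking at the controller settings before blaming 
MySQL for the table corruption.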


