On Tue, 28 Mar 2000, Danilo Godec wrote:

> Mar 28 12:00:45 mail kernel: (scsi0:0:2:0) Parity error during Message-In phase
> Mar 28 12:00:45 mail kernel: (scsi0:0:2:0) Parity error during Data-In phase.

That's a hardware problem.  A SCSI parity error is reported by the
hardware and simply passed up the chain.  Unless there is something
seriously wrong in the aic7xxx sequencer code, which I doubt, this looks
like a typical cabling and termination issue.

> Mar 28 12:00:45 mail kernel: scsi : aborting command due to timeout : pid 14301024, 
>scsi0, channel 0, id 0, lun 0 Write (10) 00 00 6b 0f 14 00 00 08 00
> 
> Then this (a lot of lines):
> 
> Mar 28 12:00:45 mail kernel: SCSI host 0 abort (pid 14301062) timed out - resetting
> Mar 28 12:00:45 mail kernel: SCSI bus is being reset for host 0 channel 0.

The drive and controller have lost state synchronization.  Once a parity
error has occurred, errors are normal and inevitable, leading up to a bus
reset to attempt to regain synchronization.

> Mar 28 12:00:45 mail kernel: SCSI host 0 reset (pid 14301061) timed out again - 
> Mar 28 12:00:45 mail kernel: probably an unrecoverable SCSI bus or device hang.
> 
> And finally this:
> 
> Mar 28 12:00:45 mail kernel: (scsi0:0:2:0) Successfully completed Domain validation.
> Mar 28 12:00:45 mail kernel: (scsi0:0:2:0) Using asynchronous transfers.
> Mar 28 12:00:45 mail kernel: (scsi0:0:1:0) Synchronous at 80.0 Mbyte/sec, offse 31.
> Mar 28 12:00:45 mail kernel: (scsi0:0:0:0) Using asynchronous transfers.   

Hard to say, but my guess is that your drive has elected to shut down.  I
don't know what devices are on the bus, but the negotiation of aynchronous
transfers is not a good sign and it may indicated one of the lines is
being held in a funny state.  Are you trying to run slow and fast devices
on the same SCSI bus?

> I'm asking if someone with more SCSI experience could diagnose what could
> be the cause of that?

I think you have an electrical issue.

>  19:     358370     359232   IO-APIC-level  aic7xxx, aic7xxx
* * *
> 1400-14be : aic7xxx
> 1800-18be : aic7xxx    

Are you really running two separate aic7xxx controllers?  Do they have the
same firmware revision?

-- Mike

--
-------------------------------------------------------------------------------
Bilow Computer Science, Inc. | http://www.bilow.com/ | Michael S. Bilow
Cranston, RI 02920-5554, USA | [EMAIL PROTECTED]        | President
-------------------------------------------------------------------------------
PGP Public Key fingerprint  =  4B 06 23 FB 3E 24 A5 24  14 B5 A2 14 96 73 B4 B2
PGP Public Key fingerprint  =  A5 13 63 7F E3 9F AB 0A  52 62 49 26 BF 0C 01 AD
-------------------------------------------------------------------------------

Reply via email to