[PATCH-RFC] (was: Re: Sata Sil3512 bug?; Promise SATA300 TX4)

2007-10-27 Thread Alexander Sabourenkov
Hello.

There appears to be a hardware bug in that it chokes on scatterlist
if the last item is larger than 164 bytes.

The patch that follows fixes my problem on 2.6.22.

I can't think of a way to avoid second pass over scatterlist without
duplicating code (ata_qc_prep() and ata_fill_sg() from libata-core.c).


--- a/drivers/ata/sata_promise.c2007-07-09 03:32:17.0 +0400
+++ b/drivers/ata/sata_promise.c2007-10-27 17:20:03.0 +0400
@@ -531,6 +531,80 @@
memcpy(buf+31, cdb, cdb_len);
 }

+/**
+ * pdc_qc_prep - Fill PCI IDE PRD table
+ * @qc: Metadata associated with taskfile to be transferred
+ *
+ * Fill PCI IDE PRD (scatter-gather) table with segments
+ * associated with the current disk command.
+ * Make sure hardware does not choke on it.
+ *
+ * LOCKING:
+ * spin_lock_irqsave(host lock)
+ *
+ */
+static void pdc_qc_prep(struct ata_queued_cmd *qc)
+{
+   struct ata_port *ap = qc-ap;
+   struct scatterlist *sg;
+   unsigned int idx;
+   const u32 SG_COUNT_ASIC_BUG = 41*4;
+
+   if (!(qc-flags  ATA_QCFLAG_DMAMAP))
+   return;
+   
+   WARN_ON(qc-__sg == NULL);
+   WARN_ON(qc-n_elem == 0  qc-pad_len == 0);
+
+   idx = 0;
+   ata_for_each_sg(sg, qc) {
+   u32 addr, offset;
+   u32 sg_len, len;
+
+   /* determine if physical DMA addr spans 64K boundary.
+* Note h/w doesn't support 64-bit, so we unconditionally
+* truncate dma_addr_t to u32.
+*/
+   addr = (u32) sg_dma_address(sg);
+   sg_len = sg_dma_len(sg);
+
+   while (sg_len) {
+   offset = addr  0x;
+   len = sg_len;
+   if ((offset + sg_len)  0x1)
+   len = 0x1 - offset;
+
+   ap-prd[idx].addr = cpu_to_le32(addr);
+   ap-prd[idx].flags_len = cpu_to_le32(len  0x);
+   VPRINTK(PRD[%u] = (0x%X, 0x%X)\n, idx, addr, len);
+
+   idx++;
+   sg_len -= len;
+   addr += len;
+   }
+   }
+
+   if (idx) {
+   u32 len = ap-prd[idx - 1].flags_len;
+   if (len  SG_COUNT_ASIC_BUG) {
+   u32 addr, len;
+
+   VPRINTK(Last PRD split\n);
+   
+   len = le32_to_cpu(ap-prd[idx - 1].flags_len) - 
SG_COUNT_ASIC_BUG;
+   addr = le32_to_cpu(ap-prd[idx - 1].addr);
+   ap-prd[idx - 1].flags_len = cpu_to_le32(len);
+   VPRINTK(PRD[%u] = (0x%X, 0x%X)\n, idx, addr, len);
+   
+   ap-prd[idx].flags_len = cpu_to_le32(SG_COUNT_ASIC_BUG);
+   ap-prd[idx].addr = cpu_to_le32(addr + len);
+   idx++;
+   VPRINTK(PRD[%u] = (0x%X, 0x%X)\n, idx, addr + len, 
SG_COUNT_ASIC_BUG);
+   }
+   ap-prd[idx - 1].flags_len |= cpu_to_le32(ATA_PRD_EOT);
+   }
+}
+
 static void pdc_qc_prep(struct ata_queued_cmd *qc)
 {
struct pdc_port_priv *pp = qc-ap-private_data;
@@ -540,7 +614,7 @@

switch (qc-tf.protocol) {
case ATA_PROT_DMA:
-   ata_qc_prep(qc);
+   pdc_qc_prep(qc);
/* fall through */

case ATA_PROT_NODATA:

-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH-RFC] (was: Re: Sata Sil3512 bug?; Promise SATA300 TX4)

2007-10-27 Thread Mikael Pettersson
On Sat, 27 Oct 2007 17:24:32 +0400, Alexander Sabourenkov wrote:
 There appears to be a hardware bug in that it chokes on scatterlist
 if the last item is larger than 164 bytes.
 
 The patch that follows fixes my problem on 2.6.22.
 
 I can't think of a way to avoid second pass over scatterlist without
 duplicating code (ata_qc_prep() and ata_fill_sg() from libata-core.c).
 
 
 --- a/drivers/ata/sata_promise.c  2007-07-09 03:32:17.0 +0400
 +++ b/drivers/ata/sata_promise.c  2007-10-27 17:20:03.0 +0400

Interesting, but can you please give more background information?
I.e., how did you determine the existence of this bug?

And please cc: the sata_promise maintainer on sata_promise patches.
(Hint: that's me)

And please choose a Subject: that makes it absolutely clear what
the post is about. Sata Sil3512 bug doesn't exactly sound like
something the sata_promise maintainer needs to look at.

Meanwhile I'll check the Promise driver(s) to see if there's
something about SG table formatting restrictions there.

/Mikael
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-20 Thread Alexander Sabourenkov
Hello.

Tejun Heo wrote:
 
 Does the attached patch help?
 

It does somehow force 1.5GB/s mode, and it does change the pattern of
'configured for UDMAxxx' messages that come along with errors, and it
causes the following error:

ata3: exception Emask 0x10 SAct 0x0 SErr 0x0 action 0xb t4
ata3: hotplug_status 0x10
ata3: soft resetting link
ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
ata3.00: configured for UDMA/133
ata3: EH complete

for both drives on TX4 on startup, but read errors are still there.

dmesgs at http://lxnt.info/linux/libata-dev/patch0/

READY
[]


-- 

./lxnt
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-19 Thread Tejun Heo

Alexander Sabourenkov wrote:

Hello.


So, my bet for your second report is your hardware went through
something similar as above.



Thanks for the insight. Let's dismiss it then.

Back to the TX4, I tried libata-dev.git cloned at about 20:00 UTC 19.10,
  no perceived difference - parallel read from two drives causes a lot
of  errors.

dmesgs  with boot and errors are at http://lxnt.info/linux/libata-dev/

I don't know what to try next. Any ideas?



Does the attached patch help?

--
tejun
diff --git a/drivers/ata/libata-core.c b/drivers/ata/libata-core.c
index 68699b3..4c93fee 100644
--- a/drivers/ata/libata-core.c
+++ b/drivers/ata/libata-core.c
@@ -6435,6 +6435,7 @@ int sata_link_init_spd(struct ata_link *link)
spd = (scontrol  4)  0xf;
if (spd)
link-hw_sata_spd_limit = (1  spd) - 1;
+   link-hw_sata_spd_limit = 1;
 
link-sata_spd_limit = link-hw_sata_spd_limit;
 


Re[2]: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-19 Thread MisterE
Hello Alexander,

Friday, October 19, 2007, 11:06:02 PM, you wrote:


 I don't know what to try next. Any ideas?

I'm no kernel hacker, so i'll take a shot.
I assume you have done most already...

* hardware (Tested/without/or used another: motherboard, videocard, memory,
hard drives, power supply, all other hardware)

* tried a more n00b-proof distribution. As far as i know you have all
those flags with gentoo. A mistake is easily made.

* Tested with the latest official drivers (redhat) from the Promise site. And
installing that OS on a disk. I assume they made working drivers, so
it should work with it...

* Does it work correctly with Windows?

This would be the steps i would take to determine the cause of the
problem.

Finally, my 2.6.23 kernel is done. I'm going try to install it now.
Tomorrow the results :)


-- 
Best regards,
 MisterEmailto:[EMAIL PROTECTED]


-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-19 Thread Alexander Sabourenkov
Hello.

 
 So, my bet for your second report is your hardware went through
 something similar as above.
 

Thanks for the insight. Let's dismiss it then.

Back to the TX4, I tried libata-dev.git cloned at about 20:00 UTC 19.10,
  no perceived difference - parallel read from two drives causes a lot
of  errors.

dmesgs  with boot and errors are at http://lxnt.info/linux/libata-dev/

I don't know what to try next. Any ideas?

-- 

./lxnt





-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-18 Thread Alexander Sabourenkov
Hello.


I have done some quick tests with 2.6.23/amd64 and unfortunately, the
very same problem persists.

By the way, 8 in (port_status 0x2008) stands for
PDC_OVERRUN_ERR = (1  19), /* S/G byte count larger
than HD requires */


Does by any chance 'S/G' here somehow relate to 'sg in the 'sg-chaining
work' there is so much talk about on the -kernel mailing list?



In a somewhat parallel development, write errors caused my (other) md
RAID-1 to lose one drive while copying data under 2.6.22
from TX4-attached drives to onboard-VIA-attached ones.

Device: VIA VT6420
00:0f.0 0104: 1106:3149 (rev 80)

Boot:

Oct 17 21:28:25 host sata_via :00:0f.0: version 2.2
Oct 17 21:28:25 host ACPI: PCI Interrupt :00:0f.0[B] - GSI 20
(level, low) - IRQ 17
Oct 17 21:28:25 host sata_via :00:0f.0: routed to hard irq line 10
Oct 17 21:28:25 host scsi4 : sata_via
Oct 17 21:28:25 host scsi5 : sata_via

Oct 17 21:28:25 host ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
Oct 17 21:28:25 host ata6.00: ATA-7: ST3200827AS, 3.AAH, max UDMA/133
Oct 17 21:28:25 host ata6.00: 390721968 sectors, multi 0: LBA48 NCQ
(depth 0/32)
Oct 17 21:28:25 host ata6.00: configured for UDMA/133

... the first two port resets:

Oct 17 23:10:50 host ata6.00: exception Emask 0x0 SAct 0x0 SErr 0x0
action 0x2
Oct 17 23:10:50 host ata6.00: (BMDMA stat 0x4)
Oct 17 23:10:50 host ata6.00: cmd ca/00:08:e7:30:00/00:00:00:00:00/e0
tag 0 cdb 0x0 data 4096 out
Oct 17 23:10:50 host res 51/84:08:e7:30:00/00:00:00:00:00/e0 Emask 0x10
(ATA bus error)
Oct 17 23:10:50 host ata6: soft resetting port
Oct 17 23:10:50 host ata6.00: configured for UDMA/133
Oct 17 23:10:50 host ata6: EH complete
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] 390721968 512-byte hardware
sectors (200050 MB)
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] Write Protect is off
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] Mode Sense: 00 3a 00 00
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] Write cache: enabled, read cache:
enabled, doesn't support DPO or FUA
Oct 17 23:10:50 host ata6.00: exception Emask 0x0 SAct 0x0 SErr 0x0
action 0x2
Oct 17 23:10:50 host ata6.00: (BMDMA stat 0x5)
Oct 17 23:10:50 host ata6.00: cmd ca/00:f8:4f:31:00/00:00:00:00:00/e0
tag 0 cdb 0x0 data 126976 out
Oct 17 23:10:50 host res 51/84:f8:4f:31:00/00:00:00:00:00/e0 Emask 0x10
(ATA bus error)
Oct 17 23:10:50 host ata6: soft resetting port
Oct 17 23:10:50 host ata6.00: configured for UDMA/133
Oct 17 23:10:50 host ata6: EH complete
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] 390721968 512-byte hardware
sectors (200050 MB)
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] Write Protect is off
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] Mode Sense: 00 3a 00 00
Oct 17 23:10:50 host sd 5:0:0:0: [sdd] Write cache: enabled, read cache:
enabled, doesn't support DPO or FUA

... and multiple unsuccessful port resets follow:

Oct 17 23:11:57 host ata6.00: exception Emask 0x0 SAct 0x0 SErr 0x0
action 0x2 frozen
Oct 17 23:11:57 host ata6.00: cmd 25/00:08:7f:bf:28/00:00:16:00:00/e0
tag 0 cdb 0x0 data 4096 in
Oct 17 23:11:57 host res 40/00:f8:4f:31:00/00:00:00:00:00/e0 Emask 0x4
(timeout)
Oct 17 23:12:02 host ata6: port is slow to respond, please be patient
(Status 0xd0)
Oct 17 23:12:07 host ata6: soft resetting port
Oct 17 23:12:37 host ata6.00: qc timeout (cmd 0xec)
Oct 17 23:12:37 host ata6.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Oct 17 23:12:37 host ata6.00: revalidation failed (errno=-5)
Oct 17 23:12:37 host ata6: failed to recover some devices, retrying in 5
secs
Oct 17 23:12:47 host ata6: port is slow to respond, please be patient
(Status 0xd0)
Oct 17 23:12:52 host ata6: soft resetting port
Oct 17 23:13:22 host ata6.00: qc timeout (cmd 0xec)
Oct 17 23:13:22 host ata6.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Oct 17 23:13:22 host ata6.00: revalidation failed (errno=-5)
Oct 17 23:13:22 host ata6.00: limiting speed to UDMA/133:PIO3
Oct 17 23:13:22 host ata6: failed to recover some devices, retrying in 5
secs
Oct 17 23:13:32 host ata6: port is slow to respond, please be patient
(Status 0xd0)
Oct 17 23:13:37 host ata6: soft resetting port
Oct 17 23:14:08 host ata6.00: qc timeout (cmd 0xec)
Oct 17 23:14:08 host ata6.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Oct 17 23:14:08 host ata6.00: revalidation failed (errno=-5)
Oct 17 23:14:08 host ata6.00: disabled
Oct 17 23:14:08 host ata6: EH complete
Oct 17 23:14:08 host sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET
driverbyte=DRIVER_OK,SUGGEST_OK
Oct 17 23:14:08 host end_request: I/O error, dev sdd, sector 371769215
Oct 17 23:14:08 host raid1: sdd1: rescheduling sector 371769152
Oct 17 23:14:08 host sd 5:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET
driverbyte=DRIVER_OK,SUGGEST_OK
Oct 17 23:14:08 host end_request: I/O error, dev sdd, sector 390379327
Oct 17 23:14:08 host md: super_written gets error=-5, uptodate=0
Oct 17 23:14:08 host raid1: Disk failure on sdd1, disabling device.

I'm unable to reproduce this on 2.6.23, so this is of historic interest
only.

-- 

./lxnt
-
To 

Re[2]: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-17 Thread MisterE
Hello,

Wednesday, October 3, 2007, 10:31:17 AM, you wrote:

 Mikael Pettersson wrote:


 I'm thinking of replacing both 3512 controllers with a Promise SATA300
 TX4. Do you know if there are problems with this device?
 
 (please don't top-post)
 
 There are no known data-corruption issues with Promise SATA cards.
 However, some of them, especially the 2nd generation SATA300 TX4,
 are known to trigger intermittent error interrupts (that are dealt
 with but may cause a speed reduction) in some systems. We're still
 scratching our heads on that issue.
 

 But see this thread:

 http://marc.info/?l=linux-idem=119122463403033w=2
 http://www.spinics.net/lists/linux-ide/msg14868.html

 Personally I would not recommend Promise SATA300 TX4 at the moment.


After all the problems i had with the sweex 3512 cards i returned them
to the shop and decided to buy a Sata300 TX4 (because the shop nearby
had one. Unfortunately the shops in the region don't have Highpoints)

Things looked promising when i inserted the card in both Intel D815EEA
motherboards. No problems detecting the hard drives (unlike with the 3512 
cards).
With the 3512 i had LOTS of error messages and corrupt data when writing to it.
Using a separate videocard, instead of the onboard one, seemed to reduce the 
amount of errors.

But after some heavy reading/writing with the promise i got 2 errors. (see log 
file).
But i did'nt find any corrupt files. I can not reproduce the error.
I'm not sure if these are the intermittent error interrupts Mikael
Pettersson mentioned?

ps: as you can i see i got at the boot some errors from the boot disk
(hda). I not sure what is wrong with it. Sometimes it produce these
errors. Used a non-destructive read-write test with badblocks but no
bad sectors found. I don't know if this could influence the sata controller.

Alexander Sabourenkov can you please tell me where i can find the
Bug is fixed in 2.6.23.1: sata_promise: port is slow to respond,
reset failed thread you mentioned?

I also see that the driver is now at version 2.10. Is there something
really critical changed? I've tried testing with Debian stable
(2.6.18-4-686; sata_promise: 1.04) and with Debian Unstable
(2.6.22-2-686; sata_promise: 2.07). 2.6.23 is not in the repositories
yet.

So basically the question is this. Can i trust the SATA300 TX4 or
should i buy a Highpoint RocketRAID 1640/1740?. I can order such device
online but i need to be sure that it works correctly :(


-- 
Best regards,
 MisterEmailto:[EMAIL PROTECTED]


-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re[2]: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-17 Thread MisterE
Hello Alexander,

Wednesday, October 17, 2007, 2:54:25 PM, you wrote:

 Log file got lost. Please post relevant parts inline.

Sorry, i totally forgot to include them.
I can not reproduce the errors. Last times hda did not give errors. So i'm
not sure if it is related to each other. (in the thread you mentioned
that you can't explain the fixing of problem from Peter Favrholdt, so
maybe it has indeed something to do with the libata)

ct 16 14:10:59 fileserver kernel: hda: dma_intr: status=0x51 { DriveReady 
SeekComplete Error }
Oct 16 14:10:59 fileserver kernel: hda: dma_intr: error=0x84 { DriveStatusError 
BadCRC }
Oct 16 14:10:59 fileserver kernel: ide: failed opcode was: unknown
Oct 16 14:12:49 fileserver kernel: kjournald starting.  Commit interval 5 
seconds
Oct 16 14:12:49 fileserver kernel: EXT3 FS on sda1, internal journal
Oct 16 14:12:49 fileserver kernel: EXT3-fs: mounted filesystem with ordered 
data mode.
Oct 16 14:13:34 fileserver kernel: hda: dma_intr: status=0x51 { DriveReady 
SeekComplete Error }
Oct 16 14:13:34 fileserver kernel: hda: dma_intr: error=0x84 { DriveStatusError 
BadCRC }
Oct 16 14:13:34 fileserver kernel: ide: failed opcode was: unknown
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: status=0x51 { DriveReady 
SeekComplete Error }
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: error=0x84 { DriveStatusError 
BadCRC }
Oct 16 14:17:21 fileserver kernel: ide: failed opcode was: unknown
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: status=0x51 { DriveReady 
SeekComplete Error }
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: error=0x84 { DriveStatusError 
BadCRC }
Oct 16 14:17:21 fileserver kernel: ide: failed opcode was: unknown
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: status=0x51 { DriveReady 
SeekComplete Error }
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: error=0x84 { DriveStatusError 
BadCRC }
Oct 16 14:17:21 fileserver kernel: ide: failed opcode was: unknown
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: status=0x51 { DriveReady 
SeekComplete Error }
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: error=0x84 { DriveStatusError 
BadCRC }
Oct 16 14:17:21 fileserver kernel: ide: failed opcode was: unknown
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: status=0x51 { DriveReady 
SeekComplete Error }
Oct 16 14:17:21 fileserver kernel: hda: dma_intr: error=0x84 { DriveStatusError 
BadCRC }
Oct 16 14:17:21 fileserver kernel: ide: failed opcode was: unknown
Oct 16 14:17:21 fileserver kernel: hdb: DMA disabled
Oct 16 14:17:21 fileserver kernel: ide0: reset: success
Oct 16 14:32:51 fileserver kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 
0x0 action 0x2
Oct 16 14:32:51 fileserver kernel: ata1.00: (port_status 0x2008)
Oct 16 14:32:51 fileserver kernel: ata1.00: cmd 
c8/00:00:77:f6:6c/00:00:00:00:00/e4 tag 0 cdb 0x0 data 131072 in
Oct 16 14:32:51 fileserver kernel:  res 
50/00:00:76:f7:6c/00:00:00:00:00/e4 Emask 0x2 (HSM violation)
Oct 16 14:32:51 fileserver kernel: ata1: soft resetting port
Oct 16 14:32:51 fileserver kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 
SControl 300)
Oct 16 14:32:51 fileserver kernel: ata1.00: configured for UDMA/133
Oct 16 14:32:51 fileserver kernel: ata1: EH complete
Oct 16 14:32:51 fileserver kernel: sd 0:0:0:0: [sda] 976773168 512-byte 
hardware sectors (500108 MB)
Oct 16 14:32:51 fileserver kernel: sd 0:0:0:0: [sda] Write Protect is off
Oct 16 14:32:51 fileserver kernel: sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
Oct 16 14:32:51 fileserver kernel: sd 0:0:0:0: [sda] Write cache: enabled, read 
cache: enabled, doesn't support DPO or FUA
Oct 16 14:44:09 fileserver kernel: sd 0:0:0:0: Attached scsi generic sg0 type 0
Oct 16 14:48:48 fileserver kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 
0x0 action 0x2
Oct 16 14:48:48 fileserver kernel: ata1.00: (port_status 0x2008)
Oct 16 14:48:48 fileserver kernel: ata1.00: cmd 
25/00:00:3f:d0:26/00:01:23:00:00/e0 tag 0 cdb 0x0 data 131072 in
Oct 16 14:48:48 fileserver kernel:  res 
50/00:00:3e:d1:26/00:00:23:00:00/e0 Emask 0x2 (HSM violation)
Oct 16 14:48:48 fileserver kernel: ata1: soft resetting port
Oct 16 14:48:49 fileserver kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 
SControl 300)
Oct 16 14:48:49 fileserver kernel: ata1.00: configured for UDMA/133
Oct 16 14:48:49 fileserver kernel: ata1: EH complete
Oct 16 14:48:49 fileserver kernel: sd 0:0:0:0: [sda] 976773168 512-byte 
hardware sectors (500108 MB)
Oct 16 14:48:49 fileserver kernel: sd 0:0:0:0: [sda] Write Protect is off
Oct 16 14:48:49 fileserver kernel: sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
Oct 16 14:48:49 fileserver kernel: sd 0:0:0:0: [sda] Write cache: enabled, read 
cache: enabled, doesn't support DPO or FUA



 Since you have the hardware, do the tests and decide for yourself.

 I'd try copying one (big, preferably over 160G ) disk onto another (with
 dd) for a start,
 while waiting for answers on mailing lists.


I can order that 1740 online, but returning something is always more

Re: Sata Sil3512 bug?; Promise SATA300 TX4

2007-10-17 Thread Peter Favrholdt

Hi,

MisterE wrote:

Tonight i will try the Asus motherboard with 1 drive and much I/O. And
i will create a new array which takes 7 hours. But how often/hours do
you need to try something to prove it does not fail :P


On one box I had problems with the SATA300 TX4 using 2.6.21 through 
2.6.22 (different versions). I have 4x500GB Seagate ES SATA drives 
connected. The system would run fine, but when put to a stress - i.e. 
loaded on all sata ports one or two ports would fail - one after the 
other. I have _always_ been able to make it fail doing:


dd if=/dev/sda of=/dev/null bs=1M 
dd if=/dev/sdb of=/dev/null bs=1M 
dd if=/dev/sdc of=/dev/null bs=1M 
dd if=/dev/sdd of=/dev/null bs=1M 

The ports would freeze before running long - e.g. in less than an hour.

This can be done without even starting the array (mdadm). Therefore no 
data corruption will happen.


The above issue was fixed by updating to vanilla 2.6.23.1.

Until then I have been running with 2.6.21-rc2 with a Mikael Petterson 
patch to force the SATA to 1.5Gbps (this could possibly be accomplished 
by jumpers on the drives as well - but I didn't try that).


I have another system (Dell PE1800 = different from the above) running 
24x7 using vanilla linux 2.6.19.5. This system has been running without 
hickups for more than a year (current uptime 135 days).



Hope this helps,

Best regards,

Peter
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-10-17 Thread Tejun Heo
MisterE wrote:
 Oct 13 13:01:26 fileserver kernel: ata4.00: exception Emask 0x0 SAct 0x0 SErr 
 0x240 action 0x0
 Oct 13 13:01:26 fileserver kernel: ata4.00: (BMDMA2 stat 0x650001)
 Oct 13 13:01:26 fileserver kernel: ata4.00: cmd 
 ca/00:f8:47:e1:5e/00:00:00:00:00/e4 tag 0 cdb 0x0 data 126976 out
 Oct 13 13:01:26 fileserver kernel:  res 
 51/04:98:a7:e1:5e/00:00:00:00:00/e4 Emask 0x1 (device error)
 Oct 13 13:01:26 fileserver kernel: ata4.00: configured for UDMA/100
 Oct 13 13:01:26 fileserver kernel: ata4: EH complete
 Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] 976773168 512-byte 
 hardware sectors (500108 MB)
 Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] Write Protect is off
 Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
 Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] Write cache: enabled, 
 read cache: enabled, doesn't support DPO or FUA
 
 I'm not sure if it will result in corrupt data? But i don't trust it
 anymore.

That looks like a data transmission error.  When you're transferring
massive amount of data, things like that can happen and it won't cause
data corruption.

 You people advise me to not buy the Promise SATA300 TX4 controller and
 this Sweex PU102 (3512) seems to have problems. Not much choices left
 except the really expensive solutions.
 
 Is it really so hard to build a controller without problems?!? :(

It seems so. :-(

-- 
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-10-15 Thread Alexander Sabourenkov

MisterE wrote:

Hello,

Alexander, does these problems with the Promise SATA300 TX4 happen to
everyone?



Most probably not, as I think it would have been fixed much faster then.

I was waiting for a) release of 2.6.23, and b) me completing the move to 
another flat

to retest all the latest developments in mainline and libata-dev.

With a) done and b) almost done, I'll retest and report any issues quite 
soon.


Besides, there is a report of TX4 and 2.6.23 not showing problems that 
were there with 2.6.22,
( see Bug is fixed in 2.6.23.1: sata_promise: port is slow to respond, 
reset failed thread).




The only alternatives are
using soft-raid products as normal controllers. Does anyone have experiences
with the following products?
* Highpoint RocketRAID 1640 (150 MB/s)
* Highpoint RocketRAID 1740 (300 MB/s)
* Adaptec 1210SA



For any kind of non-hobby task I'd skip trying to build a disk array to 
buying a SATA-SCSI/SATA-iSCSI box.
While I had many mind-boggling issues with various combinations of SATA 
HDDs, onboard and standalone

controllers, Promise and Infortrend disk arrays worked quite reliably.

--

./lxnt
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re[2]: Sata Sil3512 bug?

2007-10-14 Thread MisterE
Hello,

Alexander, does these problems with the Promise SATA300 TX4 happen to
everyone?

The only alternatives are
using soft-raid products as normal controllers. Does anyone have experiences
with the following products?
* Highpoint RocketRAID 1640 (150 MB/s)
* Highpoint RocketRAID 1740 (300 MB/s)
* Adaptec 1210SA


Wednesday, October 3, 2007, 10:31:17 AM, you wrote:

 Mikael Pettersson wrote:


 I'm thinking of replacing both 3512 controllers with a Promise SATA300
 TX4. Do you know if there are problems with this device?
 
 (please don't top-post)
 
 There are no known data-corruption issues with Promise SATA cards.
 However, some of them, especially the 2nd generation SATA300 TX4,
 are known to trigger intermittent error interrupts (that are dealt
 with but may cause a speed reduction) in some systems. We're still
 scratching our heads on that issue.
 

 But see this thread:

 http://marc.info/?l=linux-idem=119122463403033w=2
 http://www.spinics.net/lists/linux-ide/msg14868.html

 Personally I would not recommend Promise SATA300 TX4 at the moment.




-- 
Best regards,
 MisterEmailto:[EMAIL PROTECTED]


-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re[2]: Sata Sil3512 bug?

2007-10-13 Thread MisterE
Hello Tejun,

unfortunately. I today tried to build the array (4x500GB, so it is still 
building) and i got again a error:

Oct 13 12:17:56 fileserver kernel: raid5: device sdc1 operational as raid disk 2
Oct 13 12:17:56 fileserver kernel: raid5: device sdb1 operational as raid disk 1
Oct 13 12:17:56 fileserver kernel: raid5: device sda1 operational as raid disk 0
Oct 13 12:17:56 fileserver kernel: raid5: allocated 4204kB for md0
Oct 13 12:17:56 fileserver kernel: raid5: raid level 5 set md0 active with 3 
out of 4 devices, algorithm 2
Oct 13 12:17:56 fileserver kernel: RAID5 conf printout:
Oct 13 12:17:56 fileserver kernel:  --- rd:4 wd:3
Oct 13 12:17:56 fileserver kernel:  disk 0, o:1, dev:sda1
Oct 13 12:17:56 fileserver kernel:  disk 1, o:1, dev:sdb1
Oct 13 12:17:56 fileserver kernel:  disk 2, o:1, dev:sdc1
Oct 13 12:18:24 fileserver kernel: RAID5 conf printout:
Oct 13 12:18:24 fileserver kernel:  --- rd:4 wd:3
Oct 13 12:18:24 fileserver kernel:  disk 0, o:1, dev:sda1
Oct 13 12:18:24 fileserver kernel:  disk 1, o:1, dev:sdb1
Oct 13 12:18:24 fileserver kernel:  disk 2, o:1, dev:sdc1
Oct 13 12:18:24 fileserver kernel:  disk 3, o:1, dev:sdd1
Oct 13 12:18:24 fileserver kernel: md: recovery of RAID array md0
Oct 13 12:18:24 fileserver kernel: md: minimum _guaranteed_  speed: 1000 
KB/sec/disk.
Oct 13 12:18:24 fileserver kernel: md: using maximum available idle IO 
bandwidth (but not more than 20 KB/sec) for recovery.
Oct 13 12:18:24 fileserver kernel: md: using 128k window, over a total of 
488383936 blocks.
Oct 13 13:01:26 fileserver kernel: ata4.00: exception Emask 0x0 SAct 0x0 SErr 
0x240 action 0x0
Oct 13 13:01:26 fileserver kernel: ata4.00: (BMDMA2 stat 0x650001)
Oct 13 13:01:26 fileserver kernel: ata4.00: cmd 
ca/00:f8:47:e1:5e/00:00:00:00:00/e4 tag 0 cdb 0x0 data 126976 out
Oct 13 13:01:26 fileserver kernel:  res 
51/04:98:a7:e1:5e/00:00:00:00:00/e4 Emask 0x1 (device error)
Oct 13 13:01:26 fileserver kernel: ata4.00: configured for UDMA/100
Oct 13 13:01:26 fileserver kernel: ata4: EH complete
Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] 976773168 512-byte 
hardware sectors (500108 MB)
Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] Write Protect is off
Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
Oct 13 13:01:26 fileserver kernel: sd 3:0:0:0: [sdd] Write cache: enabled, read 
cache: enabled, doesn't support DPO or FUA


I'm not sure if it will result in corrupt data? But i don't trust it
anymore.
You people advise me to not buy the Promise SATA300 TX4 controller and
this Sweex PU102 (3512) seems to have problems. Not much choices left
except the really expensive solutions.

Is it really so hard to build a controller without problems?!? :(


Thursday, October 4, 2007, 3:27:18 AM, you wrote:

 Hello,

 MisterE wrote:
 I build another setup with almost the same hardware.
 This motherboard had already the latest bios.
 I notice that the computer does almost never find the hard drive
 although the controller is found every time (with lspci).

 What do you mean by almost never?  Does it find the harddisk
 sometimes?  Also, please post kernel boot log after disk detection
 failure.  lspci result just indicates only that the PCI device is present.

 So i get no
 drive (sda) assigned. I don't always see the bios screen from the
 controller at startup. And in the past it showed the hard drive.
 So i could not experiment with this motherboard.

 Can you re-seat the controller or move it to another slot and see
 whether things change?

 After that i installed Windows XP and used the orginal (sweex)
 drivers with the first motherboard. This also makes the data corrupt.
 So it seems not to be an linux problem. So there is something wrong with
 the motherboard or the 3512 controller.
 
 After that i plugged both hard drives (ide with windows and sata disk)
 to the Asus board. No data corruption. So the hard disks are'nt the
 problem either.

 Hmmm... It's relieving to know that the problem isn't caused by sata_sil
 but I don't have much idea than it seems like something goes wrong on
 the PCI bus.  :-(

 I'm thinking of replacing both 3512 controllers with a Promise SATA300
 TX4. Do you know if there are problems with this device?

 I see occasional bug reports on sata_promise but AFAIK there haven't
 been any data corruption report.  Mikael knows much better about promise
 controllers.

 Thanks.




-- 
Best regards,
 MisterEmailto:[EMAIL PROTECTED]

kern.log
Description: Binary data


Re: Re[2]: Sata Sil3512 bug?

2007-10-03 Thread Mikael Pettersson
On Tue, 2 Oct 2007 21:20:23 +0200, MisterE wrote:
 I build another setup with almost the same hardware.
 This motherboard had already the latest bios.
 I notice that the computer does almost never find the hard drive
 although the controller is found every time (with lspci). So i get no
 drive (sda) assigned. I don't always see the bios screen from the
 controller at startup. And in the past it showed the hard drive.
 So i could not experiment with this motherboard.
 
 After that i installed Windows XP and used the orginal (sweex)
 drivers with the first motherboard. This also makes the data corrupt.
 So it seems not to be an linux problem. So there is something wrong with
 the motherboard or the 3512 controller.
 
 After that i plugged both hard drives (ide with windows and sata disk)
 to the Asus board. No data corruption. So the hard disks are'nt the
 problem either.
 
 I'm thinking of replacing both 3512 controllers with a Promise SATA300
 TX4. Do you know if there are problems with this device?

(please don't top-post)

There are no known data-corruption issues with Promise SATA cards.
However, some of them, especially the 2nd generation SATA300 TX4,
are known to trigger intermittent error interrupts (that are dealt
with but may cause a speed reduction) in some systems. We're still
scratching our heads on that issue.

/Mikael

 Friday, September 28, 2007, 6:55:47 PM, you wrote:
 
  Alan Cox wrote:
  sda1 are corrupted (2 to 4 blocks missing). Copying that data back to
  Windows and it give the same results in Quickpar. So reading does not
  have problems. The data written to hda1 is correct.
  
  We've got a whole pile of reports like this with the 3512 and almost
  always Nvidia chipset, plus reports of BIOS updates fixing it. That you
  see something similar on intel boards is a bit worrying.
 
  Multiple sil3112/3512 + nvidia chipset problem doesn't usually involve
  device errors or timeouts.  It usually corrupts data silently.  And,
  yeah, data corruption on intel board is really disturbing.
 
  MisterE, do you have any processor powersaving mechanism enabled?  If
  so, can you disable all and see whether that changes anything?
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-10-03 Thread Alexander Sabourenkov

Mikael Pettersson wrote:



I'm thinking of replacing both 3512 controllers with a Promise SATA300
TX4. Do you know if there are problems with this device?


(please don't top-post)

There are no known data-corruption issues with Promise SATA cards.
However, some of them, especially the 2nd generation SATA300 TX4,
are known to trigger intermittent error interrupts (that are dealt
with but may cause a speed reduction) in some systems. We're still
scratching our heads on that issue.



But see this thread:

http://marc.info/?l=linux-idem=119122463403033w=2
http://www.spinics.net/lists/linux-ide/msg14868.html

Personally I would not recommend Promise SATA300 TX4 at the moment.

--

./lxnt


-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re[2]: Sata Sil3512 bug?

2007-10-03 Thread MisterE
Hello Alexander,

Wednesday, October 3, 2007, 10:31:17 AM, you wrote:

 Mikael Pettersson wrote:


 I'm thinking of replacing both 3512 controllers with a Promise SATA300
 TX4. Do you know if there are problems with this device?
 
 (please don't top-post)
 
 There are no known data-corruption issues with Promise SATA cards.
 However, some of them, especially the 2nd generation SATA300 TX4,
 are known to trigger intermittent error interrupts (that are dealt
 with but may cause a speed reduction) in some systems. We're still
 scratching our heads on that issue.
 

 But see this thread:

 http://marc.info/?l=linux-idem=119122463403033w=2
 http://www.spinics.net/lists/linux-ide/msg14868.html

 Personally I would not recommend Promise SATA300 TX4 at the moment.


That is not hopefull. Highpoint does not have sata controllers (Except
softraid controllers). Other (real raid controllers) brands are too
expensive or/and does not have a PCI interface.
Maybe i should keep those 3512 cards? How are the user experiences
with these controllers (except nvidia boards)? Because i don't really
trust the intel boards so using the Asus would be an option.


-- 
Best regards,
 MisterEmailto:[EMAIL PROTECTED]


-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-10-03 Thread Alan Cox
 That is not hopefull. Highpoint does not have sata controllers (Except
 softraid controllers). Other (real raid controllers) brands are too

There are pretty much no real RAID controllers in the ATA world except
the very high end pricy ones.

-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-10-03 Thread Richard Scobie
 There are pretty much no real RAID controllers in the ATA world 
except the very high end pricy ones.


Can anyone comment on the reliability or otherwise of Marvell 885X6081 
controllers?


Supermicro do a reasonably priced non-RAID 8 drive SATA card using it:

http://www.supermicro.com/products/accessories/addon/AoC-SAT2-MV8.cfm

Regards,

Richard
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-10-03 Thread Tejun Heo
Hello,

MisterE wrote:
 I build another setup with almost the same hardware.
 This motherboard had already the latest bios.
 I notice that the computer does almost never find the hard drive
 although the controller is found every time (with lspci).

What do you mean by almost never?  Does it find the harddisk
sometimes?  Also, please post kernel boot log after disk detection
failure.  lspci result just indicates only that the PCI device is present.

 So i get no
 drive (sda) assigned. I don't always see the bios screen from the
 controller at startup. And in the past it showed the hard drive.
 So i could not experiment with this motherboard.

Can you re-seat the controller or move it to another slot and see
whether things change?

 After that i installed Windows XP and used the orginal (sweex)
 drivers with the first motherboard. This also makes the data corrupt.
 So it seems not to be an linux problem. So there is something wrong with
 the motherboard or the 3512 controller.
 
 After that i plugged both hard drives (ide with windows and sata disk)
 to the Asus board. No data corruption. So the hard disks are'nt the
 problem either.

Hmmm... It's relieving to know that the problem isn't caused by sata_sil
but I don't have much idea than it seems like something goes wrong on
the PCI bus.  :-(

 I'm thinking of replacing both 3512 controllers with a Promise SATA300
 TX4. Do you know if there are problems with this device?

I see occasional bug reports on sata_promise but AFAIK there haven't
been any data corruption report.  Mikael knows much better about promise
controllers.

Thanks.

-- 
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re[2]: Sata Sil3512 bug?

2007-10-02 Thread MisterE
Hello Tejun,

I build another setup with almost the same hardware.
This motherboard had already the latest bios.
I notice that the computer does almost never find the hard drive
although the controller is found every time (with lspci). So i get no
drive (sda) assigned. I don't always see the bios screen from the
controller at startup. And in the past it showed the hard drive.
So i could not experiment with this motherboard.

After that i installed Windows XP and used the orginal (sweex)
drivers with the first motherboard. This also makes the data corrupt.
So it seems not to be an linux problem. So there is something wrong with
the motherboard or the 3512 controller.

After that i plugged both hard drives (ide with windows and sata disk)
to the Asus board. No data corruption. So the hard disks are'nt the
problem either.

I'm thinking of replacing both 3512 controllers with a Promise SATA300
TX4. Do you know if there are problems with this device?


Friday, September 28, 2007, 6:55:47 PM, you wrote:

 Alan Cox wrote:
 sda1 are corrupted (2 to 4 blocks missing). Copying that data back to
 Windows and it give the same results in Quickpar. So reading does not
 have problems. The data written to hda1 is correct.
 
 We've got a whole pile of reports like this with the 3512 and almost
 always Nvidia chipset, plus reports of BIOS updates fixing it. That you
 see something similar on intel boards is a bit worrying.

 Multiple sil3112/3512 + nvidia chipset problem doesn't usually involve
 device errors or timeouts.  It usually corrupts data silently.  And,
 yeah, data corruption on intel board is really disturbing.

 MisterE, do you have any processor powersaving mechanism enabled?  If
 so, can you disable all and see whether that changes anything?




-- 
Best regards,
 MisterEmailto:[EMAIL PROTECTED]


-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re[2]: Sata Sil3512 bug?

2007-09-28 Thread MisterE
Hello Tejun,

I've tried with 2 sata cables. Cables which don't give problems with
the Asus motherboard.
I also now use a new decent power supply (corsair vx450) for the
intel. Before it, i used an Fortron supply. But it had no sata
connectors so i used converter adapters. But replacing the supply
did'nt solve it.

btw. If i copy/delete from the Western Digital to my Windows pc (via winscp
or samba) i don't get error messages. Only when i write to it.
The data itself seems to get corrupted; tested with par2. uploaded
with winscp (windows) to the western digital (sda1). Uploaded the same data
to hda1. Tested the data on both drives with par2verify. Most files on
sda1 are corrupted (2 to 4 blocks missing). Copying that data back to
Windows and it give the same results in Quickpar. So reading does not
have problems. The data written to hda1 is correct.


I now inserted a videocard, instead of using the onboard card. And at
first hand it looked solved, but some (much less) errors still happens.
I did not change anything in the bios. The card:
02:00.0 VGA compatible controller: nVidia Corporation NV10 [GeForce 256 SDR] 
(rev 10)
How can a videocard influence this?

I wanted to build a fileserver, using the intel motherboard. Bacause
of the fact that it has a onboard videocard. I can use the asus if i
have to. But if it is a bug it can better be solved :)

I have some other similar motherboards like the intel laying around.
With and without onboard videocard. I also have a second D815EEA
(probably still has an older BIOS, i have updated the current one because
when i inserted my second 128MB memory it did not gave any picture on
the onboard video card). So i can do some experiments if i have to.

Hope, this helps...

Friday, September 28, 2007, 2:25:05 PM, you wrote:

 Hello,

 MisterE wrote:
 I've tried the controller in another motherboard, the ASUS CUSL2 (with 
 similar specs)
 and i don't have any problems. Can you help? I've included some logs
 with may be of use.

 Did you use the same cable on both machines?  Also, does the problem go
 away if you power the hard drive from the power supply of the other machine?




-- 
Best regards,
 MisterEmailto:[EMAIL PROTECTED]


-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-09-28 Thread Tejun Heo
Alan Cox wrote:
 sda1 are corrupted (2 to 4 blocks missing). Copying that data back to
 Windows and it give the same results in Quickpar. So reading does not
 have problems. The data written to hda1 is correct.
 
 We've got a whole pile of reports like this with the 3512 and almost
 always Nvidia chipset, plus reports of BIOS updates fixing it. That you
 see something similar on intel boards is a bit worrying.

Multiple sil3112/3512 + nvidia chipset problem doesn't usually involve
device errors or timeouts.  It usually corrupts data silently.  And,
yeah, data corruption on intel board is really disturbing.

MisterE, do you have any processor powersaving mechanism enabled?  If
so, can you disable all and see whether that changes anything?

-- 
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-09-28 Thread Alan Cox
 sda1 are corrupted (2 to 4 blocks missing). Copying that data back to
 Windows and it give the same results in Quickpar. So reading does not
 have problems. The data written to hda1 is correct.

We've got a whole pile of reports like this with the 3512 and almost
always Nvidia chipset, plus reports of BIOS updates fixing it. That you
see something similar on intel boards is a bit worrying.

Alan
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Sata Sil3512 bug?

2007-09-28 Thread Tejun Heo
Hello,

MisterE wrote:
 I've tried the controller in another motherboard, the ASUS CUSL2 (with 
 similar specs)
 and i don't have any problems. Can you help? I've included some logs
 with may be of use.

Did you use the same cable on both machines?  Also, does the problem go
away if you power the hard drive from the power supply of the other machine?

-- 
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Sata Sil3512 bug?

2007-09-27 Thread MisterE
Hello,

First off, i'm quite new to linux. I don't know the official way's to
report bugs. I'm not even sure that the bug is sata driver related. I
hope you can do some suggestions.

I recently bought 2 Sweex Sata controllers (without raid). This device
contains the Sil3512 chip.
I connected it to my D815EEA motherboard with a samsung hard drive.
When i mounted it and connected to it with WinScp or samba i got
hangs; a couple of seconds the filetransfer stopped.
The logs in var and the screen are spooled with errors like in samsung 
error.txt.

I now have bought some Western Digitals drives. I get similar problems
(wd error.txt), but nog hangs.

I've tried the controller in another motherboard, the ASUS CUSL2 (with similar 
specs)
and i don't have any problems. Can you help? I've included some logs
with may be of use.

btw: i use Debian unstable. I use the same hd with the OS (IDE drive)
on both systems, so we can exclude a faulty OS.
  

-- 
Best regards,
 MisterE  mailto:[EMAIL PROTECTED]Sep 24 17:42:43 fileserver kernel: NTFS driver 2.1.28 [Flags: R/W MODULE].
Sep 24 17:42:43 fileserver kernel: NTFS volume version 3.1.
Sep 24 17:44:05 fileserver kernel: NTFS volume version 3.1.
Sep 24 17:44:36 fileserver kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 
0x0 action 0x0
Sep 24 17:44:36 fileserver kernel: ata2.00: (BMDMA2 stat 0x6c0009)
Sep 24 17:44:36 fileserver kernel: ata2.00: cmd 
c8/00:0c:a8:01:bd/00:00:00:00:00/e1 tag 0 cdb 0x0 data 6144 in
Sep 24 17:44:36 fileserver kernel: res 51/04:00:b3:01:bd/00:00:00:00:00/e1 
Emask 0x1 (device error)
Sep 24 17:44:36 fileserver kernel: ata2.00: configured for UDMA/100
Sep 24 17:44:36 fileserver kernel: ata2: EH complete
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] 488397168 512-byte 
hardware sectors (250059 MB)
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] Write Protect is off
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] Mode Sense: 00 3a 00 00
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] Write cache: enabled, read 
cache: enabled, doesn't support DPO or FUA
Sep 24 17:44:36 fileserver kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 
0x0 action 0x0
Sep 24 17:44:36 fileserver kernel: ata2.00: (BMDMA2 stat 0x6d0009)
Sep 24 17:44:36 fileserver kernel: ata2.00: cmd 
c8/00:07:38:19:bd/00:00:00:00:00/e1 tag 0 cdb 0x0 data 3584 in
Sep 24 17:44:36 fileserver kernel: res 51/04:00:3e:19:bd/00:00:00:00:00/e1 
Emask 0x1 (device error)
Sep 24 17:44:36 fileserver kernel: ata2.00: configured for UDMA/100
Sep 24 17:44:36 fileserver kernel: ata2: EH complete
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] 488397168 512-byte 
hardware sectors (250059 MB)
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] Write Protect is off
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] Mode Sense: 00 3a 00 00
Sep 24 17:44:36 fileserver kernel: sd 1:0:0:0: [sda] Write cache: enabled, read 
cache: enabled, doesn't support DPO or FUA
Sep 24 17:45:06 fileserver kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 
0x0 action 0x2 frozen
Sep 24 17:45:06 fileserver kernel: ata2.00: cmd 
c8/00:80:3f:21:bd/00:00:00:00:00/e1 tag 0 cdb 0x0 data 65536 in
Sep 24 17:45:06 fileserver kernel: res 40/00:00:3e:19:bd/00:00:00:00:00/e1 
Emask 0x4 (timeout)
Sep 24 17:45:07 fileserver kernel: ata2: soft resetting port
Sep 24 17:45:07 fileserver kernel: ata2: SATA link up 1.5 Gbps (SStatus 113 
SControl 310)
Sep 24 17:45:07 fileserver kernel: ata2.00: configured for UDMA/100
Sep 24 17:45:07 fileserver kernel: ata2: EH complete
Sep 24 17:45:07 fileserver kernel: sd 1:0:0:0: [sda] 488397168 512-byte 
hardware sectors (250059 MB)
Sep 24 17:45:07 fileserver kernel: sd 1:0:0:0: [sda] Write Protect is off
Sep 24 17:45:07 fileserver kernel: sd 1:0:0:0: [sda] Mode Sense: 00 3a 00 00
Sep 24 17:45:07 fileserver kernel: sd 1:0:0:0: [sda] Write cache: enabled, read 
cache: enabled, doesn't support DPO or FUA
Sep 24 17:45:37 fileserver kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 
0x0 action 0x2 frozen
Sep 24 17:45:37 fileserver kernel: ata2.00: cmd 
c8/00:80:3f:24:bd/00:00:00:00:00/e1 tag 0 cdb 0x0 data 65536 in
Sep 24 17:45:37 fileserver kernel: res 40/00:00:3e:19:bd/00:00:00:00:00/e1 
Emask 0x4 (timeout)
Sep 24 17:45:37 fileserver kernel: ata2: soft resetting port
Sep 24 17:45:37 fileserver kernel: ata2: SATA link up 1.5 Gbps (SStatus 113 
SControl 310)
Sep 24 17:45:37 fileserver kernel: ata2.00: configured for UDMA/100
Sep 24 17:45:37 fileserver kernel: ata2: EH complete
Sep 24 17:45:37 fileserver kernel: sd 1:0:0:0: [sda] 488397168 512-byte 
hardware sectors (250059 MB)
Sep 24 17:45:37 fileserver kernel: sd 1:0:0:0: [sda] Write Protect is off
Sep 24 17:45:37 fileserver kernel: sd 1:0:0:0: [sda] Mode Sense: 00 3a 00 00
Sep 24 17:45:37 fileserver kernel: sd 1:0:0:0: [sda] Write cache: enabled, read 
cache: enabled, doesn't support DPO or FUASep 25 14:09:57 fileserver kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 
0x240 action 0x0
Sep 25