Re: sata_sil data corruption

2005-02-21 Thread Doug McLain
Jeff Garzik wrote:
Been there done that, all on this list actually.  Bios upgrade, bios 
reset, new cable, different drive, correct functionality in windows, and 
now the drive and cable ahave found their home in my other PC, a kt600 
board using the sata_via driver, where it works flawlessly.  I'm not 
even interested in using the sata interface on this machine anymore, but 
I still try to make an effort whenever possible to contribute byt 
reporting bugs.  So many people with so many different hardware 
combinations, sometimes it takes the right combination of 
hardware/software to reveal a bug.  That being said, if you want to 
investigate further, I can provide any information you want. If you want 
to ignore it, thats fine too.  Evidence strongly leans towards a driver 
bug though.

I also noticed that I get oops now sometimes even with no drive 
attatched at all

What is a blacklist entry?  A combination of drive/controller that is 
determined to be incompatable together? I would assume for a drive to 
make it to a blacklists it would have to be incompatable regardless of 
OS right?  The drive in question is a WD2000JD

Doug
In this case, the bug _reports_ are hard to find.
Each case with sata_sil is either solved with a BIOS update, a
blacklist entry, or new cables.  Just read through bugzilla.kernel.org.
Jeff

-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
http://nostar.net/
-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_sil data corruption

2005-02-21 Thread Prakash Punnoor
Jeff Garzik schrieb:
On Mon, Feb 21, 2005 at 11:25:55AM +, Doug McLain wrote:
Jeff Garzik wrote:
Doug McLain wrote:

The sata_sil driver is without a doubt, totally hosed.  I, along with

"without a doubt" being defined, of course, as "it works for a lot of
people."
  Jeff


Thats like saying "turn up the radio" when your car makes a funny noise,
or "if a tree falls in the woods and nobody is there to hear it, does it
make a sound?"
It's tempting and comforting to pick the good ones as an example, and
some bugs are hard enough to find, let alone fix.  In the end though, if
one is broke, it's still broke, isn't it?

In this case, the bug _reports_ are hard to find.
Each case with sata_sil is either solved with a BIOS update, a
blacklist entry, or new cables.  Just read through bugzilla.kernel.org.
I personally think the driver is OK (works for me like a charm with my 
Samsung
drive), but as I reported a few times, SiI will corrupt data if ext-p2p
discard time setting in bios is set to too low values.
So, Jeff, don't you think adding a quirk to the kernel would prevent such
reports. (I am damn sure, this reports are cause of the setting I am talking
about.) Perhaps you could ask SiI how to do this in the kernel...
--
Prakash Punnoor
formerly known as Prakash K. Cheemplavam


signature.asc
Description: OpenPGP digital signature


Re: sata_sil data corruption

2005-02-21 Thread Jeff Garzik
On Mon, Feb 21, 2005 at 11:25:55AM +, Doug McLain wrote:
> Jeff Garzik wrote:
> >Doug McLain wrote:
> >
> >>The sata_sil driver is without a doubt, totally hosed.  I, along with 
> >
> >
> >"without a doubt" being defined, of course, as "it works for a lot of 
> >people."
> >
> >Jeff
> >
> >
> >
> >
> Thats like saying "turn up the radio" when your car makes a funny noise, 
> or "if a tree falls in the woods and nobody is there to hear it, does it 
> make a sound?"
> 
> It's tempting and comforting to pick the good ones as an example, and 
> some bugs are hard enough to find, let alone fix.  In the end though, if 
> one is broke, it's still broke, isn't it?

In this case, the bug _reports_ are hard to find.

Each case with sata_sil is either solved with a BIOS update, a
blacklist entry, or new cables.  Just read through bugzilla.kernel.org.

Jeff



-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_sil data corruption

2005-02-21 Thread Doug McLain
Jeff Garzik wrote:
Doug McLain wrote:
The sata_sil driver is without a doubt, totally hosed.  I, along with 

"without a doubt" being defined, of course, as "it works for a lot of 
people."

Jeff


Thats like saying "turn up the radio" when your car makes a funny noise, 
or "if a tree falls in the woods and nobody is there to hear it, does it 
make a sound?"

It's tempting and comforting to pick the good ones as an example, and 
some bugs are hard enough to find, let alone fix.  In the end though, if 
one is broke, it's still broke, isn't it?

--
http://nostar.net/
-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_sil data corruption

2005-02-21 Thread Jeff Garzik
Doug McLain wrote:
The sata_sil driver is without a doubt, totally hosed.  I, along with 
"without a doubt" being defined, of course, as "it works for a lot of 
people."

Jeff

-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_sil data corruption

2005-02-21 Thread Doug McLain
The sata_sil driver is without a doubt, totally hosed.  I, along with 
some others I've found on this list and google, are also getting kernel 
oops and hard freezes while loading this driver, with confirmed good 
hardware.

PALFFY Daniel wrote:
On Sat, 19 Feb 2005, Jeff Garzik wrote:

PALFFY Daniel wrote:
Hi,
I'm trying to set up a machine with a si3112a controller (lspci: 1095:3112
(rev 01) Subsystem: 1095:6112, cheap PCI card) and a ST3200822AS Rev:
3.01 disk and I see continuous silent data corruption while reading the
disk. Writing seems to be ok. I have 2.6.10-ac1 built with conservative
options (UP, no PREEMPT, 8k stack, no regparm). After seeing problems I
tried to blacklist my drive to do MOD15, but it didn't help.
Finally I did
   unsigned long long i = 0;
   while (write(1, &i, sizeof(i)) != -1) i++;
to the only primary partition while running sata_sil. Reading it back with
   unsigned long long i=0, j;
   while (read(0, &j, sizeof(i)) == sizeof(i)) {
   if (j != i) fprintf(stderr, "diff at %llx: read: %llx\n", i, j);
   i++;
   }
gives similar results, but always different:
diff at 5efff: read: 5ef24
diff at 7: read: 7ff08
diff at 8: read: 8ff51
diff at 97fff: read: 97f00
diff at a: read: aff00
diff at b: read: bffac
diff at d: read: dff00
diff at efffe: read: eff00
diff at e: read: eff00
diff at fffbf: read: f
and so on.
Reading back the same data with the ide siimage driver worked for at least
500MB without corrupted data, but dma doesn't work with that driver, this
is logged on about the first read attempt:
hde: dma_intr: bad DMA status (dma_stat=76)
hde: dma_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hde: DMA disabled
ide2: reset phy, status=0x0113, siimage_reset
ide2: reset: success
The machine is an old Compaq Prosignia 200, with a p2 300 and
:00:00.0 Host bridge: Intel Corp. 440FX - 82441FX PMC [Natoma] (rev 02)
chipset. Relevant parts from dmesg:
sata_sil:
libata version 1.10 loaded.
sata_sil version 0.8
ata1: SATA max UDMA/100 cmd 0xC886A080 ctl 0xC886A08A bmdma 0xC886A000 irq 10
ata2: SATA max UDMA/100 cmd 0xC886A0C0 ctl 0xC886A0CA bmdma 0xC886A008 irq 10
ata1: dev 0 cfg 49:2f00 82:346b 83:7d01 84:4003 85:3469 86:3c01 87:4003 88:207f
ata1: dev 0 ATA, max UDMA/133, 390721968 sectors: lba48
ata1(0): applying Seagate errata fix
ata1: dev 0 configured for UDMA/100
scsi1 : sata_sil
ata2: no device found (phy stat )
scsi2 : sata_sil
 Vendor: ATA   Model: ST3200822AS   Rev: 3.01
 Type:   Direct-Access  ANSI SCSI revision: 05
SCSI device sdb: 390721968 512-byte hdwr sectors (200050 MB)
SCSI device sdb: drive cache: write back
siimage:
SiI3112 Serial ATA: IDE controller at PCI slot :00:0a.0
SiI3112 Serial ATA: chipset revision 1
SiI3112 Serial ATA: 100% native mode on irq 10
   ide2: MMIO-DMA , BIOS settings: hde:DMA, hdf:DMA
   ide3: MMIO-DMA , BIOS settings: hdg:pio, hdh:pio
Probing IDE interface ide2...
hde: ST3200822AS, ATA DISK drive
hde: applying pessimistic Seagate errata fix
ide2 at 0xc886a080-0xc886a087,0xc886a08a on irq 10
hde: max request size: 7KiB
hde: 390721968 sectors (200049 MB) w/8192KiB Cache, CHS=24321/255/63
hde: cache flushes supported
hde:<3>hde: dma_intr: bad DMA status (dma_stat=76)
hde: dma_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hde1
Probing IDE interface ide3...
hdg: no response (status = 0xfe)
hde: dma_intr: bad DMA status (dma_stat=76)
hde: dma_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown

Hi,

Don't use --two-- drivers for the same hardware.
Can you re-test with siimage disabled?

Of course it was disabled while testing sata_sil and vice-versa. I've just
tried that driver to test if the hardware was faulty. Since then, I've
tested the same disk/controller combo in a BX based PentiumII system, and
it worked perfectly fine, but in the Compaq machine it still failed with
2.6.10-ac10. So I think it might be some low-level hardware
incompatibility...
--
Daniel
...and Linux for all.
-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
http://nostar.net/
-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_sil data corruption

2005-02-21 Thread PALFFY Daniel
On Sat, 19 Feb 2005, Jeff Garzik wrote:

> PALFFY Daniel wrote:
> > Hi,
> >
> > I'm trying to set up a machine with a si3112a controller (lspci: 1095:3112
> > (rev 01) Subsystem: 1095:6112, cheap PCI card) and a ST3200822AS Rev:
> > 3.01 disk and I see continuous silent data corruption while reading the
> > disk. Writing seems to be ok. I have 2.6.10-ac1 built with conservative
> > options (UP, no PREEMPT, 8k stack, no regparm). After seeing problems I
> > tried to blacklist my drive to do MOD15, but it didn't help.
> >
> > Finally I did
> >
> > unsigned long long i = 0;
> > while (write(1, &i, sizeof(i)) != -1) i++;
> >
> > to the only primary partition while running sata_sil. Reading it back with
> >
> > unsigned long long i=0, j;
> > while (read(0, &j, sizeof(i)) == sizeof(i)) {
> > if (j != i) fprintf(stderr, "diff at %llx: read: %llx\n", i, j);
> > i++;
> > }
> >
> > gives similar results, but always different:
> > diff at 5efff: read: 5ef24
> > diff at 7: read: 7ff08
> > diff at 8: read: 8ff51
> > diff at 97fff: read: 97f00
> > diff at a: read: aff00
> > diff at b: read: bffac
> > diff at d: read: dff00
> > diff at efffe: read: eff00
> > diff at e: read: eff00
> > diff at fffbf: read: f
> > and so on.
> >
> > Reading back the same data with the ide siimage driver worked for at least
> > 500MB without corrupted data, but dma doesn't work with that driver, this
> > is logged on about the first read attempt:
> >
> > hde: dma_intr: bad DMA status (dma_stat=76)
> > hde: dma_intr: status=0x50 { DriveReady SeekComplete }
> >
> > ide: failed opcode was: unknown
> > hde: DMA disabled
> > ide2: reset phy, status=0x0113, siimage_reset
> > ide2: reset: success
> >
> > The machine is an old Compaq Prosignia 200, with a p2 300 and
> > :00:00.0 Host bridge: Intel Corp. 440FX - 82441FX PMC [Natoma] (rev 02)
> > chipset. Relevant parts from dmesg:
> >
> > sata_sil:
> >
> > libata version 1.10 loaded.
> > sata_sil version 0.8
> > ata1: SATA max UDMA/100 cmd 0xC886A080 ctl 0xC886A08A bmdma 0xC886A000 irq 
> > 10
> > ata2: SATA max UDMA/100 cmd 0xC886A0C0 ctl 0xC886A0CA bmdma 0xC886A008 irq 
> > 10
> > ata1: dev 0 cfg 49:2f00 82:346b 83:7d01 84:4003 85:3469 86:3c01 87:4003 
> > 88:207f
> > ata1: dev 0 ATA, max UDMA/133, 390721968 sectors: lba48
> > ata1(0): applying Seagate errata fix
> > ata1: dev 0 configured for UDMA/100
> > scsi1 : sata_sil
> > ata2: no device found (phy stat )
> > scsi2 : sata_sil
> >   Vendor: ATA   Model: ST3200822AS   Rev: 3.01
> >   Type:   Direct-Access  ANSI SCSI revision: 05
> > SCSI device sdb: 390721968 512-byte hdwr sectors (200050 MB)
> > SCSI device sdb: drive cache: write back
> >
> > siimage:
> >
> > SiI3112 Serial ATA: IDE controller at PCI slot :00:0a.0
> > SiI3112 Serial ATA: chipset revision 1
> > SiI3112 Serial ATA: 100% native mode on irq 10
> > ide2: MMIO-DMA , BIOS settings: hde:DMA, hdf:DMA
> > ide3: MMIO-DMA , BIOS settings: hdg:pio, hdh:pio
> > Probing IDE interface ide2...
> > hde: ST3200822AS, ATA DISK drive
> > hde: applying pessimistic Seagate errata fix
> > ide2 at 0xc886a080-0xc886a087,0xc886a08a on irq 10
> > hde: max request size: 7KiB
> > hde: 390721968 sectors (200049 MB) w/8192KiB Cache, CHS=24321/255/63
> > hde: cache flushes supported
> >  hde:<3>hde: dma_intr: bad DMA status (dma_stat=76)
> > hde: dma_intr: status=0x50 { DriveReady SeekComplete }
> >
> > ide: failed opcode was: unknown
> >  hde1
> > Probing IDE interface ide3...
> > hdg: no response (status = 0xfe)
> > hde: dma_intr: bad DMA status (dma_stat=76)
> > hde: dma_intr: status=0x50 { DriveReady SeekComplete }
> >
> > ide: failed opcode was: unknown

Hi,

> Don't use --two-- drivers for the same hardware.
>
> Can you re-test with siimage disabled?

Of course it was disabled while testing sata_sil and vice-versa. I've just
tried that driver to test if the hardware was faulty. Since then, I've
tested the same disk/controller combo in a BX based PentiumII system, and
it worked perfectly fine, but in the Compaq machine it still failed with
2.6.10-ac10. So I think it might be some low-level hardware
incompatibility...

--
Daniel
...and Linux for all.

-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_sil data corruption

2005-02-19 Thread Jeff Garzik
PALFFY Daniel wrote:
Hi,
I'm trying to set up a machine with a si3112a controller (lspci: 1095:3112
(rev 01) Subsystem: 1095:6112, cheap PCI card) and a ST3200822AS Rev:
3.01 disk and I see continuous silent data corruption while reading the
disk. Writing seems to be ok. I have 2.6.10-ac1 built with conservative
options (UP, no PREEMPT, 8k stack, no regparm). After seeing problems I
tried to blacklist my drive to do MOD15, but it didn't help.
Finally I did
unsigned long long i = 0;
while (write(1, &i, sizeof(i)) != -1) i++;
to the only primary partition while running sata_sil. Reading it back with
unsigned long long i=0, j;
while (read(0, &j, sizeof(i)) == sizeof(i)) {
if (j != i) fprintf(stderr, "diff at %llx: read: %llx\n", i, j);
i++;
}
gives similar results, but always different:
diff at 5efff: read: 5ef24
diff at 7: read: 7ff08
diff at 8: read: 8ff51
diff at 97fff: read: 97f00
diff at a: read: aff00
diff at b: read: bffac
diff at d: read: dff00
diff at efffe: read: eff00
diff at e: read: eff00
diff at fffbf: read: f
and so on.
Reading back the same data with the ide siimage driver worked for at least
500MB without corrupted data, but dma doesn't work with that driver, this
is logged on about the first read attempt:
hde: dma_intr: bad DMA status (dma_stat=76)
hde: dma_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
hde: DMA disabled
ide2: reset phy, status=0x0113, siimage_reset
ide2: reset: success
The machine is an old Compaq Prosignia 200, with a p2 300 and
:00:00.0 Host bridge: Intel Corp. 440FX - 82441FX PMC [Natoma] (rev 02)
chipset. Relevant parts from dmesg:
sata_sil:
libata version 1.10 loaded.
sata_sil version 0.8
ata1: SATA max UDMA/100 cmd 0xC886A080 ctl 0xC886A08A bmdma 0xC886A000 irq 10
ata2: SATA max UDMA/100 cmd 0xC886A0C0 ctl 0xC886A0CA bmdma 0xC886A008 irq 10
ata1: dev 0 cfg 49:2f00 82:346b 83:7d01 84:4003 85:3469 86:3c01 87:4003 88:207f
ata1: dev 0 ATA, max UDMA/133, 390721968 sectors: lba48
ata1(0): applying Seagate errata fix
ata1: dev 0 configured for UDMA/100
scsi1 : sata_sil
ata2: no device found (phy stat )
scsi2 : sata_sil
  Vendor: ATA   Model: ST3200822AS   Rev: 3.01
  Type:   Direct-Access  ANSI SCSI revision: 05
SCSI device sdb: 390721968 512-byte hdwr sectors (200050 MB)
SCSI device sdb: drive cache: write back
siimage:
SiI3112 Serial ATA: IDE controller at PCI slot :00:0a.0
SiI3112 Serial ATA: chipset revision 1
SiI3112 Serial ATA: 100% native mode on irq 10
ide2: MMIO-DMA , BIOS settings: hde:DMA, hdf:DMA
ide3: MMIO-DMA , BIOS settings: hdg:pio, hdh:pio
Probing IDE interface ide2...
hde: ST3200822AS, ATA DISK drive
hde: applying pessimistic Seagate errata fix
ide2 at 0xc886a080-0xc886a087,0xc886a08a on irq 10
hde: max request size: 7KiB
hde: 390721968 sectors (200049 MB) w/8192KiB Cache, CHS=24321/255/63
hde: cache flushes supported
 hde:<3>hde: dma_intr: bad DMA status (dma_stat=76)
hde: dma_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
 hde1
Probing IDE interface ide3...
hdg: no response (status = 0xfe)
hde: dma_intr: bad DMA status (dma_stat=76)
hde: dma_intr: status=0x50 { DriveReady SeekComplete }
ide: failed opcode was: unknown
Don't use --two-- drivers for the same hardware.
Can you re-test with siimage disabled?
Jeff
-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html