Re: sata_promise ata exceptions (2.6.20.6)

2007-04-16 Thread Phil Dibowitz
Given that the last one was a hardware issue, I bought a new controller.
Despite my bad luck, given my price-range promise still seemed to be the one
with the most good reports, so I went with that. I was going to go with a
sil, but I couldn't find one..

Anyway, things are MUCH better now... but about once a week, I get:

ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata2.00: (port_status 0x1000)
ata2.00: cmd c8/00:80:9a:71:d0/00:00:00:00:00/ea tag 0 cdb 0x0 data 65536 in
 res 40/00:00:06:4f:c2/00:00:00:00:00/00 Emask 0x24 (host bus error)
ata2: soft resetting port
ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
ata2.00: configured for UDMA/133
ata2: EH complete
SCSI device sda: 312581808 512-byte hdwr sectors (160042 MB)
sda: Write Protect is off
sda: Mode Sense: 00 3a 00 00
SCSI device sda: write cache: enabled, read cache: enabled, doesn't support
DPO or FUA


It's the same port_status and Emask/SAct/SErr/action each time... only the
cmd/res and data change (obviously those would change)...

Can anyone tell me what that means?

-- 
Phil Dibowitz [EMAIL PROTECTED]
Open Source software and tech docsInsanity Palace of Metallica
http://www.phildev.net/   http://www.ipom.com/

Never write it in C if you can do it in 'awk';
 Never do it in 'awk' if 'sed' can handle it;
 Never use 'sed' when 'tr' can do the job;
 Never invoke 'tr' when 'cat' is sufficient;
 Avoid using 'cat' whenever possible -- Taylor's Laws of Programming




signature.asc
Description: OpenPGP digital signature


Re: sata_promise ata exceptions (2.6.20.6)

2007-04-16 Thread Tomi Orava

Hi,

 Given that the last one was a hardware issue, I bought a new controller.
 Despite my bad luck, given my price-range promise still seemed to be the
 one
 with the most good reports, so I went with that. I was going to go with a
 sil, but I couldn't find one..

Unfortunately, I don't have a solution/fix for you, but out of curiosity,
what hard-disks you have problems with ? Also what is the exact model
of your SATA-controller ?

 Anyway, things are MUCH better now... but about once a week, I get:

 ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
 ata2.00: (port_status 0x1000)
 ata2.00: cmd c8/00:80:9a:71:d0/00:00:00:00:00/ea tag 0 cdb 0x0 data 65536
 in
  res 40/00:00:06:4f:c2/00:00:00:00:00/00 Emask 0x24 (host bus
 error)
 ata2: soft resetting port
 ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
 ata2.00: configured for UDMA/133
 ata2: EH complete
 SCSI device sda: 312581808 512-byte hdwr sectors (160042 MB)
 sda: Write Protect is off
 sda: Mode Sense: 00 3a 00 00
 SCSI device sda: write cache: enabled, read cache: enabled, doesn't
 support
 DPO or FUA

Regards,
Tomi Orava

-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_promise ata exceptions (2.6.20.6)

2007-04-16 Thread Mikael Pettersson
On Sun, 15 Apr 2007 23:55:31 -0700, Phil Dibowitz wrote:
 Given that the last one was a hardware issue, I bought a new controller.
 Despite my bad luck, given my price-range promise still seemed to be the =
 one
 with the most good reports, so I went with that. I was going to go with a=
 
 sil, but I couldn't find one..
 
 Anyway, things are MUCH better now... but about once a week, I get:
 
 ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
 ata2.00: (port_status 0x1000)
 ata2.00: cmd c8/00:80:9a:71:d0/00:00:00:00:00/ea tag 0 cdb 0x0 data 65536=
  in
  res 40/00:00:06:4f:c2/00:00:00:00:00/00 Emask 0x24 (host bus err=
 or)
 ata2: soft resetting port
 ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
 ata2.00: configured for UDMA/133
 ata2: EH complete
 SCSI device sda: 312581808 512-byte hdwr sectors (160042 MB)
 sda: Write Protect is off
 sda: Mode Sense: 00 3a 00 00
 SCSI device sda: write cache: enabled, read cache: enabled, doesn't suppo=
 rt
 DPO or FUA
 
 
 It's the same port_status and Emask/SAct/SErr/action each time... only th=
 e
 cmd/res and data change (obviously those would change)...
 
 Can anyone tell me what that means?

port_status 0x1000 is host bus timeout, which the manual
defines as the host bus being busy for more than 256 clock cycles
during an ATA I/O transfer.

I have no idea what would cause this error, and I've never
seen it myself.

As long as libata recovers and doesn't downgrade your transfer
speed it shouldn't pose too much of a problem.

/Mikael
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: sata_promise ata exceptions (2.6.20.6)

2007-04-16 Thread Phil Dibowitz
Tomi Orava wrote:
 Hi,
 
 Given that the last one was a hardware issue, I bought a new controller.
 Despite my bad luck, given my price-range promise still seemed to be the
 one
 with the most good reports, so I went with that. I was going to go with a
 sil, but I couldn't find one..
 
 Unfortunately, I don't have a solution/fix for you, but out of curiosity,
 what hard-disks you have problems with ? Also what is the exact model
 of your SATA-controller ?

Whoops, forgot that bit. Sorry!

02:0c.0 Mass storage controller: Promise Technology, Inc. PDC40718 (SATA 300
TX4) (rev 02)
Subsystem: Promise Technology, Inc. PDC40718 (SATA 300 TX4)
Flags: bus master, 66MHz, medium devsel, latency 72, IRQ 17
I/O ports at d480 [size=128]
I/O ports at d000 [size=256]
Memory at feafd000 (32-bit, non-prefetchable) [size=4K]
Memory at fea8 (32-bit, non-prefetchable) [size=128K]
Expansion ROM at 5002 [disabled] [size=32K]
Capabilities: [60] Power Management version 2

My disks are WD WD1600JS 160GB drives (2 of them in linux software RAID 1).

-- 
Phil Dibowitz [EMAIL PROTECTED]
Open Source software and tech docsInsanity Palace of Metallica
http://www.phildev.net/   http://www.ipom.com/

Never write it in C if you can do it in 'awk';
 Never do it in 'awk' if 'sed' can handle it;
 Never use 'sed' when 'tr' can do the job;
 Never invoke 'tr' when 'cat' is sufficient;
 Avoid using 'cat' whenever possible -- Taylor's Laws of Programming




signature.asc
Description: OpenPGP digital signature


Re: [BUG] 2.6.21-rc7 hpt366 driver broken

2007-04-16 Thread Adrian Bunk
[ Cc's added, full bug report was in http://lkml.org/lkml/2007/4/16/18 ]

On Mon, Apr 16, 2007 at 04:38:22AM -0700, Mike Mattie wrote:
 On Sun, 15 Apr 2007 22:48:46 -0700
 Mike Mattie [EMAIL PROTECTED] wrote:
 
  Hello,
  
  I am testing the 2.6.21-rc7 kernel release. The IDE hpt366 driver is
  crashing hanging the boot. I have basically the same config as
  2.6.20.7 which works fine (except for netconsole mentioned in a
  previous mail).
  
  here is the hand-copied info:
  
  * unable to handle paging request , null deref
  * EIP @ init_chipset_hpt366
  
 
  I am running a git-bisect to see if I can resolve it to a commit.
 
 This was identified as the first broken commit:
 
 commit 7b73ee05d0acb926923d43d78b61add776ea4bb1
 Author: Sergei Shtylyov [EMAIL PROTECTED]
 Date:   Wed Feb 7 18:18:16 2007 +0100
 
 hpt366: init code rewrite
 
 Reverting is conflicted so it will be a bit longer before I pin-point any 
 other build-breaks.

Thanks for your report.

Can you use a digital camera for taking a photograph of the crash?

cu
Adrian

--

   Is there not promise of rain? Ling Tan asked suddenly out
of the darkness. There had been need of rain for many days.
   Only a promise, Lao Er said.
   Pearl S. Buck - Dragon Seed

-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen on SATA hd

2007-04-16 Thread Rodrigo Severo

On 4/15/07, Rodrigo Severo [EMAIL PROTECTED] wrote:


I have this cheap SATA addon card with an ULI chipset.

A have a linear raid over 3 disks one PATA, hda and two SATAs, sda and sdb.

When copying data to the raid device I get log messages like the following:

Apr 15 10:54:20 [kernel] ata1.00: exception Emask 0x0 SAct 0x0 SErr
0x0 action 0x2 frozen
Apr 15 10:54:20 [kernel] ata1.00: cmd
35/00:00:df:90:0e/00:04:0b:00:00/e0 tag 0 cdb 0x0 data 524288 out
Apr 15 10:54:20 [kernel]  res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 15 10:54:27 [kernel] ata1: port is slow to respond, please be
patient (Status 0xd0)
Apr 15 10:54:48 [kernel] ata1: port failed to respond (30 secs, Status 0xd0)
Apr 15 10:54:48 [kernel] ata1: soft resetting port
Apr 15 10:54:48 [kernel] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
Apr 15 10:54:48 [kernel] ATA: abnormal status 0xD0 on port 0x9007
- Last output repeated 6 times -
Apr 15 10:54:48 [kernel] ata1.00: configured for UDMA/133
Apr 15 10:54:48 [kernel] ata1: EH complete
Apr 15 10:54:48 [kernel] SCSI device sda: 488397168 512-byte hdwr
sectors (250059 MB)
Apr 15 10:54:48 [kernel] sda: Write Protect is off
Apr 15 10:54:48 [kernel] SCSI device sda: write cache: enabled, read
cache: enabled, doesn't support DPO or FUA

All references I found to this kind of message where related to some
driver issue and not hardware problems. Can this be the case?


In search for a solution for this issue I bought a new HD to replace
the one presenting the problem above.

Now I have a SAMSUNG HD400LJ in place of the previous ST3250820AS. The
problem remains the same.

When I put a Maxtor 6L300S0 to perform the same hole as the Samsung or
the Seagate, I couldn't trigger the exception. I bet this happened
because Maxtor is immune to this issue and the Samsung (which seems to
be affected) was barely used.

I'm really out of ideas where to look for an explanation or fix. I
don't think I'm dealing with a hardware issue.

What should be my next step?


Regards,

Rodrigo Severo
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen on SATA hd

2007-04-16 Thread Rodrigo Severo

On 4/16/07, Rodrigo Severo [EMAIL PROTECTED] wrote:

On 4/15/07, Rodrigo Severo [EMAIL PROTECTED] wrote:

 I have this cheap SATA addon card with an ULI chipset.

 A have a linear raid over 3 disks one PATA, hda and two SATAs, sda and sdb.

 When copying data to the raid device I get log messages like the following:

 Apr 15 10:54:20 [kernel] ata1.00: exception Emask 0x0 SAct 0x0 SErr
 0x0 action 0x2 frozen
 Apr 15 10:54:20 [kernel] ata1.00: cmd
 35/00:00:df:90:0e/00:04:0b:00:00/e0 tag 0 cdb 0x0 data 524288 out
 Apr 15 10:54:20 [kernel]  res
 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
 Apr 15 10:54:27 [kernel] ata1: port is slow to respond, please be
 patient (Status 0xd0)
 Apr 15 10:54:48 [kernel] ata1: port failed to respond (30 secs, Status 0xd0)
 Apr 15 10:54:48 [kernel] ata1: soft resetting port
 Apr 15 10:54:48 [kernel] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 
300)
 Apr 15 10:54:48 [kernel] ATA: abnormal status 0xD0 on port 0x9007
 - Last output repeated 6 times -
 Apr 15 10:54:48 [kernel] ata1.00: configured for UDMA/133
 Apr 15 10:54:48 [kernel] ata1: EH complete
 Apr 15 10:54:48 [kernel] SCSI device sda: 488397168 512-byte hdwr
 sectors (250059 MB)
 Apr 15 10:54:48 [kernel] sda: Write Protect is off
 Apr 15 10:54:48 [kernel] SCSI device sda: write cache: enabled, read
 cache: enabled, doesn't support DPO or FUA

 All references I found to this kind of message where related to some
 driver issue and not hardware problems. Can this be the case?

In search for a solution for this issue I bought a new HD to replace
the one presenting the problem above.

Now I have a SAMSUNG HD400LJ in place of the previous ST3250820AS. The
problem remains the same.

When I put a Maxtor 6L300S0 to perform the same hole as the Samsung or
the Seagate, I couldn't trigger the exception. I bet this happened
because Maxtor is immune to this issue and the Samsung (which seems to
be affected) was barely used.


Now with kernel 2.6.21-rc6, addon board reattached and cables changed
I still get messages like the following every 10 minutes or so under
high load:

Apr 16 13:03:27 [kernel] ata1.00: exception Emask 0x0 SAct 0x0 SErr
0x0 action 0x2 frozen
Apr 16 13:03:27 [kernel] ata1.00: cmd
35/00:00:91:fd:06/00:04:0b:00:00/e0 tag 0 cdb 0x0 data 524288 out
Apr 16 13:03:27 [kernel]  res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Apr 16 13:03:35 [kernel] ata1: port is slow to respond, please be
patient (Status 0xd0)
Apr 16 13:03:58 [kernel] ata1: port failed to respond (30 secs, Status 0xd0)
Apr 16 13:03:58 [kernel] ata1: soft resetting port
Apr 16 13:03:58 [kernel] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
Apr 16 13:03:58 [kernel] ata1.00: configured for UDMA/133
Apr 16 13:03:58 [kernel] ata1: EH complete
Apr 16 13:03:58 [kernel] SCSI device sda: 781422768 512-byte hdwr
sectors (400088 MB)
Apr 16 13:03:58 [kernel] sda: Write Protect is off
Apr 16 13:03:58 [kernel] SCSI device sda: write cache: enabled, read
cache: enabled, doesn't support DPO or FUA

Is this expected behavior?


Rodrigo Severo
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[RFC/PATCH -mm] add pci_try_set_mwi

2007-04-16 Thread Randy Dunlap
On Thu, 5 Apr 2007 14:20:01 -0700 Andrew Morton wrote:

 hm. Well, what to do?
 
 How about we prevail upon Randy to:
 
 - rename pci_set_mwi() to pci_try_set_mwi()
 
 - make it return 0 on success, 1 if the try failed
 
 - make it return -EFOO on error (presently unimplemented)
 
 - review callers
 
 - remove __must_check
 
 ?



From: Randy Dunlap [EMAIL PROTECTED]

As suggested by Andrew, add pci_try_set_mwi(), which does not require
return-value checking.

- add pci_try_set_mwi() without __must_check
- make it return 0 on success, errno if the try failed or error
- review callers

Signed-off-by: Randy Dunlap [EMAIL PROTECTED]
---
 Documentation/pci.txt  |5 +++-
 drivers/ata/pata_cs5530.c  |2 -
 drivers/ide/pci/cs5530.c   |2 -
 drivers/net/cassini.c  |4 +--
 drivers/net/starfire.c |2 -
 drivers/net/tulip/tulip_core.c |2 -
 drivers/net/wireless/mac80211/p54/prism54pci.c |2 -
 drivers/net/wireless/prism54/islpci_hotplug.c  |3 --
 drivers/pci/pci.c  |   28 +
 drivers/scsi/lpfc/lpfc_init.c  |5 
 drivers/usb/gadget/net2280.c   |2 -
 include/linux/pci.h|1 
 12 files changed, 39 insertions(+), 19 deletions(-)

--- linux-2.6.21-rc6-mm1.orig/include/linux/pci.h
+++ linux-2.6.21-rc6-mm1/include/linux/pci.h
@@ -547,6 +547,7 @@ void pci_set_master(struct pci_dev *dev)
 int pci_set_pcie_reset_state(struct pci_dev *dev, enum pcie_reset_state state);
 #define HAVE_PCI_SET_MWI
 int __must_check pci_set_mwi(struct pci_dev *dev);
+int pci_try_set_mwi(struct pci_dev *dev);
 void pci_clear_mwi(struct pci_dev *dev);
 void pci_intx(struct pci_dev *dev, int enable);
 void pci_msi_off(struct pci_dev *dev);
--- linux-2.6.21-rc6-mm1.orig/drivers/pci/pci.c
+++ linux-2.6.21-rc6-mm1/drivers/pci/pci.c
@@ -1162,6 +1162,11 @@ int pci_set_mwi(struct pci_dev *dev)
return 0;
 }
 
+int pci_try_set_mwi(struct pci_dev *dev)
+{
+   return 0;
+}
+
 void pci_clear_mwi(struct pci_dev *dev)
 {
 }
@@ -1218,9 +1223,7 @@ pci_set_cacheline_size(struct pci_dev *d
  * pci_set_mwi - enables memory-write-invalidate PCI transaction
  * @dev: the PCI device for which MWI is enabled
  *
- * Enables the Memory-Write-Invalidate transaction in %PCI_COMMAND,
- * and then calls @pcibios_set_mwi to do the needed arch specific
- * operations or a generic mwi-prep function.
+ * Enables the Memory-Write-Invalidate transaction in %PCI_COMMAND.
  *
  * RETURNS: An appropriate -ERRNO error value on error, or zero for success.
  */
@@ -1236,7 +1239,8 @@ pci_set_mwi(struct pci_dev *dev)
 
pci_read_config_word(dev, PCI_COMMAND, cmd);
if (! (cmd  PCI_COMMAND_INVALIDATE)) {
-   pr_debug(PCI: Enabling Mem-Wr-Inval for device %s\n, 
pci_name(dev));
+   pr_debug(PCI: Enabling Mem-Wr-Inval for device %s\n,
+   pci_name(dev));
cmd |= PCI_COMMAND_INVALIDATE;
pci_write_config_word(dev, PCI_COMMAND, cmd);
}
@@ -1245,6 +1249,21 @@ pci_set_mwi(struct pci_dev *dev)
 }
 
 /**
+ * pci_try_set_mwi - enables memory-write-invalidate PCI transaction
+ * @dev: the PCI device for which MWI is enabled
+ *
+ * Enables the Memory-Write-Invalidate transaction in %PCI_COMMAND.
+ * Callers are not required to check the return value.
+ *
+ * RETURNS: An appropriate -ERRNO error value on error, or zero for success.
+ */
+int pci_try_set_mwi(struct pci_dev *dev)
+{
+   int rc = pci_set_mwi(dev);
+   return rc;
+}
+
+/**
  * pci_clear_mwi - disables Memory-Write-Invalidate for device dev
  * @dev: the PCI device to disable
  *
@@ -1418,6 +1437,7 @@ EXPORT_SYMBOL(pci_release_selected_regio
 EXPORT_SYMBOL(pci_request_selected_regions);
 EXPORT_SYMBOL(pci_set_master);
 EXPORT_SYMBOL(pci_set_mwi);
+EXPORT_SYMBOL(pci_try_set_mwi);
 EXPORT_SYMBOL(pci_clear_mwi);
 EXPORT_SYMBOL_GPL(pci_intx);
 EXPORT_SYMBOL(pci_set_dma_mask);
--- linux-2.6.21-rc6-mm1.orig/drivers/ata/pata_cs5530.c
+++ linux-2.6.21-rc6-mm1/drivers/ata/pata_cs5530.c
@@ -270,7 +270,7 @@ static int cs5530_init_chip(void)
}
 
pci_set_master(cs5530_0);
-   pci_set_mwi(cs5530_0);
+   pci_try_set_mwi(cs5530_0);
 
/*
 * Set PCI CacheLineSize to 16-bytes:
--- linux-2.6.21-rc6-mm1.orig/drivers/ide/pci/cs5530.c
+++ linux-2.6.21-rc6-mm1/drivers/ide/pci/cs5530.c
@@ -236,7 +236,7 @@ static unsigned int __devinit init_chips
 */
 
pci_set_master(cs5530_0);
-   pci_set_mwi(cs5530_0);
+   pci_try_set_mwi(cs5530_0);
 
/*
 * Set PCI CacheLineSize to 16-bytes:
--- linux-2.6.21-rc6-mm1.orig/drivers/usb/gadget/net2280.c
+++ linux-2.6.21-rc6-mm1/drivers/usb/gadget/net2280.c
@@ -2964,7 +2964,7 @@ static int net2280_probe (struct pci_dev
, dev-pci-pcimstctl);
  

Re: [PATCH 0/4] bidi support: block layer bidirectional io.

2007-04-16 Thread Douglas Gilbert
Boaz Harrosh wrote:
 Following are 4 (large) patches for support of bidirectional
 block I/O in kernel. (not including SCSI-ml or iSCSI)
 
 The submitted work is against linux-2.6-block tree as of
 2007/04/15, and will only cleanly apply in succession.
 
 The patches are based on the RFC I sent 3 months ago. They only
 cover the block layer at this point. I suggest they get included
 in Morton's tree until they reach the kernel so they can get
 compiled on all architectures/platforms. There is still a chance
 that architectures I did not compile were not fully converted.
 (FWIW, my search for use of struct request members failed to find
 them). If you find such a case, please send me the file
 name and I will fix it ASAP.
 
 Patches summary:
 1. [PATCH 1/4] bidi support: request dma_data_direction
   - Convert REQ_RW bit flag to a dma_data_direction member like in 
 SCSI-ml use.
   - removed rq_data_dir() and added other APIs for querying request's 
 direction.
   - fix usage of rq_data_dir() and peeking at req-cmd_flags  REQ_RW to 
 using
 new api.
   - clean-up bad usage of DMA_BIDIRECTIONAL and bzero of none-queue 
 requests,
 to use the new blk_rq_init_unqueued_req()
 
 2. [PATCH 2/4] bidi support: fix req-cmd == INT cases
   - Digging into all these old drivers, I have found traces of past life
 where request-cmd was the command type. This patch fixes some of 
 these
 places. All drivers touched by this patch are clear indication of 
 drivers
 that were not used for a while. Should we removed them from Kernel? 
 These Are:
   drivers/acorn/block/fd1772.c, drivers/acorn/block/mfmhd.c,
   drivers/block/nbd.c, drivers/cdrom/aztcd.c, 
 drivers/cdrom/cm206.c
   drivers/cdrom/gscd.c, drivers/cdrom/mcdx.c, 
 drivers/cdrom/optcd.c
   drivers/cdrom/sjcd.c, drivers/ide/legacy/hd.c, 
 drivers/block/amiflop.c
 
 2. [PATCH 3/4] bidi support: request_io_part
   - extract io related fields in struct request into struct 
 request_io_part
 in preparation to full bidi support.
   - new rq_uni() API to access the sub-structure. (Please read below 
 comment
 on why an API and not open code the access)
   - Convert All users to new API.
 
 3. [PATCH 4/4] bidi support: bidirectional block layer
   - add one more request_io_part member for bidi support in struct 
 request.
   - add block layer API functions for mapping and accessing bidi data 
 buffers
 and for ending a block request as a whole (end_that_request_block())
 
 
 Developer comments:
 
 patch 1/4: Borrow from struct scsi_cmnd use of enum dma_data_direction. 
 Further work (in
 progress) is the removal of the corresponding member from struct scsi_cmnd 
 and converting
 all users to directly access rq_dma_dir(sc-req).
 
 patch 3/4: The reasons for introducing the rq_uni(req) API rather than 
 directly accessing
 req-uni are:
 
 * WARN(!bidi_dir(req)) is life saving when developing bidi enabled paths.  
 Once we, bidi
   users, start to push bidi requests down the kernel paths, we immediately 
 get warned of
   paths we did not anticipate. Otherwise, they will be very hard to find, and 
 will hurt
   kernel stability.
 
 * A cleaner and saner future implementation could be in/out members rather 
 than
   uni/bidi_read.  This way the dma_direction member can deprecated and the 
 uni sub-
   structure can be maintained using a pointer in struct req.
   With this API we are free to change the implementation in the future without
   touching any users of the API. We can also experiment with what's best. 
 Also, with the
   API it is much easier to convert uni-directional drivers for bidi (look in
   ll_rw_block.c in patch 4/4).
 
 * Note, that internal uses inside the block layer access req-uni directly, 
 as they will
   need to be changed if the implementation of req-{uni, bidi_read} changes.

Boaz,
Recently I have been looking at things from the perspective
of a SAS target and thinking about bidi commands. Taking
XDWRITEREAD(10) in sbc3r09.pdf (section 5.44) as an example,
with DISABLE_WRITE=0, the device server in the target should
do the following:
  a) decode the cdb **
  b) read from storage [lba, transfer_length]
  c) fetch data_out from initiator [transfer_length] ***
  d) XOR data from (b) and (c) and place result in (z)
  e) write the data from (c) to storage [lba, transfer_length]
  f) send (z) in data_in to initiator [transfer_length]
  g) send SCSI completion status to initiator

Logically a) must occur first and g) last. The b) to f)
sequence could be repeated (perhaps) by the device server
subdividing the transfer_length (i.e. it may not be
reasonable for the OS to assume that the data_out transfer
will be complete before there is any data_in transfer).
With this command (and with most other bidi commands 

Driver update for CentOS 4.4 2.6.9.-42.0.10

2007-04-16 Thread aps21121
PS 
Sorry forgot to send a subject with my last email

A little more info:
For the default CentOS4.4 kernel (2.6.9.-42.0.10) kept up2date using yum
from dmesg: libata version is  1.20

for latest kernel.org 
libata version is 2.00

so to reword questions:
1)  How would I use the latest CentOS 4.4 (2.6.9.x kernel)
but also add the ;atest fixes in libata version 2.0

2) Any chance CentOS will include the latest libata 2.0 in normal kernel updates
so I could just run yum update kernel*
   instead of compiling a  custom kernel

Thanks again for your time,
Al Smith



 -- Original message --
From: [EMAIL PROTECTED]
 Dear Mr Garzik,
 
 Thanks so much for all the time(unpaid?) 
 you've put in developing drivers
 
 My problems specifically concerns a Promise ESATA 300 TX2plus
 which is bundled with the Seagate 300GB eSATA drive 
 -easily available via CompUSA and major reatailers.
 (If Seagate doesn't hire you - they are crazy!)
 
 I'm running CentOS4.4 with latest kernel updates
 currently 2.6.9-42.0.10EL
 
 I've seen other people with same timeout problems
 but after compiling the latest kernel 2.6.20.7,
 I get no more errors, so I know you've been busy fixing it
 -but also get other errors (not concerning the drive or you're fixes)
 
 Problem is how to I apply whatever fixes you've made
 to the latest CentOS4.4 2.6.9-42.0.10 kernel
 since I can't run my custom 2.6.20.7 kernel yet
 
 1) Will CentOS(RedHat - equiv)  be upgrading their kernel to
include you're fixes in the current 2.6.9-x kernel?
 
 2)Can you give me a link or howto on using the latest
  CentOS 4.4 kernel but also adding whatever fixes you've made
 
 Thanks,
 Al Smith

-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Loud pop coming from hard drive on reboot

2007-04-16 Thread Chuck Ebbert
Jan Engelhardt wrote:
 On Apr 15 2007 12:53, Henrique de Moraes Holschuh wrote:
 On Sat, 14 Apr 2007, Pavel Machek wrote:
 How common are notebooks that cut power to disks during reboot?
 Assuming it also does this when running Windows, I'd report it as a grave
 bug to the vendor and demand it to be fixed, or the machine to be exchanged
 with another model that doesn't have this defect.
 
 Given that it does not happen on Windows (IIRC Chuck's post),
 then just what is Windows [not] doing that Linux does?

It looks like there are two problems here:

(1) Some notebooks power off and back on when restarting.
Both Linux and other OS handle that badly because they
assume power is not interrupted on reboot. The noise
emitted is relatively loud.

(2) Linux (alone) gives a very muted pop on shutdown. This could
be from bad interaction with the shutdown command, or some
other reason (drive not given enough time to shut down?)
The noise is not very loud, maybe the head did not have to
move very far?

-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


RE: [1/2] 2.6.21-rc7: known regressions

2007-04-16 Thread Brandeburg, Jesse
Adrian Bunk wrote:
 Subject: laptops with e1000: lockups
 References :
 https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229603 
 Submitter  : Dave Jones [EMAIL PROTECTED]
 Handled-By : Jesse Brandeburg [EMAIL PROTECTED]
 Status : problem is being debugged

this is being actively debugged, here is what we have so far:
o v2.6.20: crashes during boot, unless noacpi and nousb bootparams used
o v2.6.21-rc6: some userspace issue, crashes just after root mount
without init=/bin/bash
o v2.6.2X: serial console in docking station spews goo at all speeds
with console=ttyS0,n8 . work continues on this, as we don't know if
there are kernel panic messages during the hard lock.
o fedora 7 test kernel 2948: boots okay, have been using this as only
truly working kernel on this machine.

one reproduction of the problem was had with scp -l 5000 file remote
when linked at 100Mb/Full.  Tried probably 20 other times same test with
no repro, ugh.

Otherwise, slogging through continues.  We are actively working on this
in case it *is* an e1000 issue.  Right now the repro is so unlikely we
could hardly tell if we fixed it.

Jesse

PS .config available on request
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [1/2] 2.6.21-rc7: known regressions

2007-04-16 Thread Dave Jones
On Mon, Apr 16, 2007 at 05:14:40PM -0700, Brandeburg, Jesse wrote:
  Adrian Bunk wrote:
   Subject: laptops with e1000: lockups
   References :
   https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229603 
   Submitter  : Dave Jones [EMAIL PROTECTED]
   Handled-By : Jesse Brandeburg [EMAIL PROTECTED]
   Status : problem is being debugged
  
  this is being actively debugged, here is what we have so far:
  o v2.6.20: crashes during boot, unless noacpi and nousb bootparams used
  o v2.6.21-rc6: some userspace issue, crashes just after root mount
  without init=/bin/bash
  o v2.6.2X: serial console in docking station spews goo at all speeds
  with console=ttyS0,n8 . work continues on this, as we don't know if
  there are kernel panic messages during the hard lock.
  o fedora 7 test kernel 2948: boots okay, have been using this as only
  truly working kernel on this machine.
  
  one reproduction of the problem was had with scp -l 5000 file remote
  when linked at 100Mb/Full.  Tried probably 20 other times same test with
  no repro, ugh.
  
  Otherwise, slogging through continues.  We are actively working on this
  in case it *is* an e1000 issue.  Right now the repro is so unlikely we
  could hardly tell if we fixed it.

FWIW, I can reproduce this pretty much ondemand, on 100M through
the ethernet port on a netgear wireless AP.
A number of our Fedora7 testers are also able to easily reproduce this.
To isolate e1000, for tomorrows test build I've reverted e1000 to
the same code that was in 2.6.20.  If that works out without causing
hangs, I'll try and narrow down further which of the dozen csets
is responsible.

Dave

-- 
http://www.codemonkey.org.uk
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [1/2] 2.6.21-rc7: known regressions

2007-04-16 Thread Jeff Garzik

Dave Jones wrote:

On Mon, Apr 16, 2007 at 05:14:40PM -0700, Brandeburg, Jesse wrote:
  Adrian Bunk wrote:
   Subject: laptops with e1000: lockups
   References :
   https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229603 
   Submitter  : Dave Jones [EMAIL PROTECTED]

   Handled-By : Jesse Brandeburg [EMAIL PROTECTED]
   Status : problem is being debugged
  
  this is being actively debugged, here is what we have so far:

  o v2.6.20: crashes during boot, unless noacpi and nousb bootparams used
  o v2.6.21-rc6: some userspace issue, crashes just after root mount
  without init=/bin/bash
  o v2.6.2X: serial console in docking station spews goo at all speeds
  with console=ttyS0,n8 . work continues on this, as we don't know if
  there are kernel panic messages during the hard lock.
  o fedora 7 test kernel 2948: boots okay, have been using this as only
  truly working kernel on this machine.
  
  one reproduction of the problem was had with scp -l 5000 file remote

  when linked at 100Mb/Full.  Tried probably 20 other times same test with
  no repro, ugh.
  
  Otherwise, slogging through continues.  We are actively working on this

  in case it *is* an e1000 issue.  Right now the repro is so unlikely we
  could hardly tell if we fixed it.

FWIW, I can reproduce this pretty much ondemand, on 100M through
the ethernet port on a netgear wireless AP.
A number of our Fedora7 testers are also able to easily reproduce this.
To isolate e1000, for tomorrows test build I've reverted e1000 to
the same code that was in 2.6.20.  If that works out without causing
hangs, I'll try and narrow down further which of the dozen csets
is responsible.


Also, there are e1000 fixes in -mm.  At the time (rc2? rc3?) I felt it 
was best to get that into -mm for testing, rather than fast-tracking it 
to 2.6.21.


But hey, see if they help.

It's the netdev-2.6.git#e1000-fixes branch, if you are git-ified.  I was 
going to push them first thing 2.6.22.


Jeff



-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


2.6.21-rc6-mm1 ATA HPT37x regression

2007-04-16 Thread John Stoffel

Hi Jeff and crew,

I was just testing out 2.6.21-rc6-mm1 to test some Cyclades patches
and I noticed that my HPT302 (rev1) controller with a pair of 120gb WD
disks are not longer detected and I get the following in the dmesg
logs:

[  148.121490] hpt37x: DPLL did not stabilize.

Where before, under 2.6.21-rc6 I got the following:

[  173.749349] pata_hpt37x: BIOS has not set timing clocks.
[  173.752949] hpt37x: HPT302: Bus clock 33MHz.
[  173.754409] ACPI: PCI Interrupt :03:06.0[A] - GSI 18 (level,
low) - IRQ
 18
[  173.758403] ata5: PATA max UDMA/133 cmd 0x0001ecf8 ctl 0x0001ecf2
bmdma 0x000
1e800 irq 18
[  173.761396] ata6: PATA max UDMA/133 cmd 0x0001ece0 ctl 0x0001ecda
bmdma 0x000
1e808 irq 18
[  173.764319] scsi6 : pata_hpt37x
[  173.928997] ATA: abnormal status 0x78 on port 0x0001ecff
[  173.930511] scsi7 : pata_hpt37x
[  174.094906] ATA: abnormal status 0x8 on port 0x0001ece7


Here's my lspci infomation on the board, it's an addon.  My apologies
for the crappy word wrapping, xterms inside screen, etc. 

  03:06.0 RAID bus controller: Triones Technologies, Inc. HPT302/302N
  (rev 01)
  Subsystem: Triones Technologies, Inc. Unknown device 0001
  Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop-
  ParErr- Step
  ping- SERR+ FastB2B-
  Status: Cap+ 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium
  TAbort- TAbort
  - MAbort- SERR- PERR-
  Latency: 120 (2000ns min, 2000ns max)
  Interrupt: pin A routed to IRQ 11
  Region 0: I/O ports at ecf8 [size=8]
  Region 1: I/O ports at ecf0 [size=4]
  Region 2: I/O ports at ece0 [size=8]
  Region 3: I/O ports at ecd8 [size=4]
  Region 4: I/O ports at e800 [size=256]
  Expansion ROM at f900 [disabled] [size=128K]
  Capabilities: [60] Power Management version 2
  Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA
  PME(D0-,D1-,D2-,D3hot
  -,D3cold-)
  Status: D0 PME-Enable- DSel=0 DScale=0 PME-

  03:0a.0 SCSI storage controller: Adaptec AHA-2940U2/U2W / 7890/7891
  Subsystem: Dell Unknown device 0087
  Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop-
  ParErr- Step
  ping- SERR+ FastB2B-
  Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium
  TAbort- TAbort
  - MAbort- SERR- PERR-
  Latency: 64 (9750ns min, 6250ns max), Cache Line Size: 32
  bytes
  Interrupt: pin A routed to IRQ 18
  BIST result: 00
  Region 0: I/O ports at e400 [disabled] [size=256]
  Region 1: Memory at f8fff000 (64-bit, non-prefetchable)
  [size=4K]
  Expansion ROM at f100 [disabled] [size=128K]
  Capabilities: [dc] Power Management version 1
  Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
  PME(D0-,D1-,D2-,D3hot
  -,D3cold-)
  Status: D0 PME-Enable- DSel=0 DScale=0 PME-



I'm moving to 2.6.21-rc7 now, but I'm willing to test out patches as
needed.  I suspect it's the move from the 0.6.0 driver to the 0.6.5
version, and the changes in the hpt_clock hpt37x_timings_* arrays, but
god knows which ones are affecting me.

Thanks,
John
-
To unsubscribe from this list: send the line unsubscribe linux-ide in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html