Re: [CentOS] usb irq problem

2008-09-24 Thread nate
partha chowdhury wrote:

> i want to know one thing  - the hardware failing logic you earlier spoke
> of  - is that disabled or removed in latest 2.6.26.5 kernel because i
> experimented with custom compiling that kernel and did not receive any
> error message whatsoever.

Download the kernel source and take a peek. I had never heard of
that option until you mentioned it

I would suggest something like
grep -nri noirqdebug *

from within the extracted kernel source tree.

If you don't see any matches then it's probably not there.

nate

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-24 Thread partha chowdhury

MHR wrote:

On Tue, Sep 23, 2008 at 8:02 PM, partha chowdhury <[EMAIL PROTECTED]> wrote:
  

well i managed to fix the problem after an intensive search through the
forum and adding the noirqdebug option to the kernel line.




Are you /sure/ this fixes the problem?  Your last fix didn't work out
so well, so I'm just curious, not criticizing

mhr
___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos
  
well it is running close for a day now and the message has not appeared 
yet and all the usb drives are working as usual. so far so good ,keeping 
fingers crossed !

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-24 Thread partha chowdhury

On Wed, 2008-09-24 at 07:09 -0700, nate wrote:

> 
> I wouldn't think the system is about to fail if it's just going
> slower. If there are specific error messages that point to it's
> failing then maybe. Errors quoted earlier just seem like bad
> hardware(perhaps poorly designed or built, rather than hardware
> that is physically failing).

well i myself have assembled this box. did i do something wrong because
it is running fine for last 10 months without any hiccups and other than
this minor issue no problem at all.
> 
> Only thing I can suggest is to just verify that the drive is
> detected as USB 2.0 via lsusb -v
> 
> e.g.
>  Bus 004 Device 020: ID 1058:0702 Western Digital Technologies, Inc.
> Device Descriptor:
>   bLength18
>   bDescriptorType 1
>   bcdUSB   2.00
> [..]
>   iManufacturer   1 Western Digital
>   iProduct2 External HDD
> 
> I believe the 2.00 indicates USB 2.0, I see several other
> devices on my USB that are marked as 1.x

here is my output 

Bus 002 Device 002: ID 05e3:0702 Genesys Logic, Inc. USB 2.0 IDE Adapter
Device Descriptor:
  bLength18
  bDescriptorType 1
  bcdUSB   2.00
  bDeviceClass0 (Defined at Interface level)
  bDeviceSubClass 0 
  bDeviceProtocol 0 
  bMaxPacketSize064
  idVendor   0x05e3 Genesys Logic, Inc.
  idProduct  0x0702 USB 2.0 IDE Adapter
  bcdDevice0.33
  iManufacturer   0 
  iProduct1 USB TO IDE
  iSerial 0 
  bNumConfigurations  1
  Configuration Descriptor:
bLength 9
bDescriptorType 2
wTotalLength   32
bNumInterfaces  1
bConfigurationValue 1
iConfiguration  0 
bmAttributes 0xc0
  Self Powered
MaxPower   96mA
Interface Descriptor:
  bLength 9
  bDescriptorType 4
  bInterfaceNumber0
  bAlternateSetting   0
  bNumEndpoints   2
  bInterfaceClass 8 Mass Storage
  bInterfaceSubClass  6 SCSI
  bInterfaceProtocol 80 Bulk (Zip)
  iInterface  0 
  Endpoint Descriptor:
bLength 7
bDescriptorType 5
bEndpointAddress 0x81  EP 1 IN
bmAttributes2
  Transfer TypeBulk
  Synch Type   None
  Usage Type   Data
wMaxPacketSize 0x0200  1x 512 bytes
bInterval   1
  Endpoint Descriptor:
bLength 7
bDescriptorType 5
bEndpointAddress 0x02  EP 2 OUT
bmAttributes2
  Transfer TypeBulk
  Synch Type   None
  Usage Type   Data
wMaxPacketSize 0x0200  1x 512 bytes
bInterval   1

you can see it is detected as usb2.0.

> If you configured your system's kernel to ignore the irq errors
> as the other poster did(I think your a different poster..didn't
> check), you really should remove that option and enable the
> checking again, and try a PCI USB expansion card instead and
> see if that helps.

actually it was i who sent it. i sent it through gmail so it did not
insert my name just my email id. usually i send through evolution. i did
not know this issue. apology for the unintentional mixups.

i want to know one thing  - the hardware failing logic you earlier spoke
of  - is that disabled or removed in latest 2.6.26.5 kernel because i
experimented with custom compiling that kernel and did not receive any
error message whatsoever.


___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-24 Thread nate
kira laucas wrote:

> now that you have mentioned it, i have noticed recently that my desktop
> motherboard usb port has gone slower. i mean previously i used to get 28-30
> MB/s transfer speed with my external usb drive. but now the max i get is
> 10MB/s . i have tested the external drive on my friend's laptop and to my
> surprise it transferred with 25MB/s ! is it any indication of any
> potentially disastrous hardware failure issue ?

I wouldn't think the system is about to fail if it's just going
slower. If there are specific error messages that point to it's
failing then maybe. Errors quoted earlier just seem like bad
hardware(perhaps poorly designed or built, rather than hardware
that is physically failing).

Only thing I can suggest is to just verify that the drive is
detected as USB 2.0 via lsusb -v

e.g.
 Bus 004 Device 020: ID 1058:0702 Western Digital Technologies, Inc.
Device Descriptor:
  bLength18
  bDescriptorType 1
  bcdUSB   2.00
[..]
  iManufacturer   1 Western Digital
  iProduct2 External HDD

I believe the 2.00 indicates USB 2.0, I see several other
devices on my USB that are marked as 1.x

If the device is bus powered, make sure it is getting enough
power, some of my bus powered disks I have to use a USB Y cable
to plug the drives into two ports simultaneously(one for
power+data, the other for power only).

If you configured your system's kernel to ignore the irq errors
as the other poster did(I think your a different poster..didn't
check), you really should remove that option and enable the
checking again, and try a PCI USB expansion card instead and
see if that helps.

nate



___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-24 Thread kira laucas
On Wed, Sep 24, 2008 at 5:39 PM, nate <[EMAIL PROTECTED]> wrote:

>
>
>
> http://www.linuxtopia.org/online_books/linux_kernel/kernel_configuration/re18.html
>
> By default, the kernel attempts to detect and disable unhandled
> interrupt sources because they can cause problems with the
> responsiveness of the rest of the kernel if left unchecked.
> This option will disable this logic.
>
> just for curiosity, is this option removed  in the latest 2.6.26.5 kernel
> ? because i experimented with compiling a custom kernel and did not ever
> receive the message . anyway i am running centos without any problem now and
> i am glad about it.
>
> Replace the hardware, get better quality stuff. Since this
> is USB, get a PCI USB expansion board see if that helps.
> About a year ago I bought a USB 2.0 PCI card for one of
> my older systems, was about $20 I think.
>
> now that you have mentioned it, i have noticed recently that my desktop
motherboard usb port has gone slower. i mean previously i used to get 28-30
MB/s transfer speed with my external usb drive. but now the max i get is
10MB/s . i have tested the external drive on my friend's laptop and to my
surprise it transferred with 25MB/s ! is it any indication of any
potentially disastrous hardware failure issue ?

for information my hardware is :
00:00.0 RAM memory: nVidia Corporation MCP67 Memory Controller (rev a2)
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0
Capabilities: [44] HyperTransport: Slave or Primary Interface
Capabilities: [dc] HyperTransport: MSI Mapping

00:01.0 ISA bridge: nVidia Corporation MCP67 ISA Bridge (rev a2)
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0
I/O ports at 0900 [size=256]

00:01.1 SMBus: nVidia Corporation MCP67 SMBus (rev a2)
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: 66MHz, fast devsel, IRQ 10
I/O ports at dc00 [size=64]
I/O ports at 0600 [size=64]
I/O ports at 0700 [size=64]
Capabilities: [44] Power Management version 2

00:02.0 USB Controller: nVidia Corporation MCP67 OHCI USB 1.1 Controller
(rev a2) (prog-if 10 [OHCI])
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0, IRQ 58
Memory at feaff000 (32-bit, non-prefetchable) [size=4K]
Capabilities: [44] Power Management version 2

00:02.1 USB Controller: nVidia Corporation MCP67 EHCI USB 2.0 Controller
(rev a2) (prog-if 20 [EHCI])
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0, IRQ 233
Memory at feafec00 (32-bit, non-prefetchable) [size=256]
Capabilities: [44] Debug port
Capabilities: [80] Power Management version 2

00:04.0 USB Controller: nVidia Corporation MCP67 OHCI USB 1.1 Controller
(rev a2) (prog-if 10 [OHCI])
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0, IRQ 66
Memory at feafd000 (32-bit, non-prefetchable) [size=4K]
Capabilities: [44] Power Management version 2

00:04.1 USB Controller: nVidia Corporation MCP67 EHCI USB 2.0 Controller
(rev a2) (prog-if 20 [EHCI])
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0, IRQ 50
Memory at feafe800 (32-bit, non-prefetchable) [size=256]
Capabilities: [44] Debug port
Capabilities: [80] Power Management version 2

00:08.0 PCI bridge: nVidia Corporation MCP67 PCI Bridge (rev a2) (prog-if 01
[Subtractive decode])
Flags: bus master, 66MHz, fast devsel, latency 0
Bus: primary=00, secondary=01, subordinate=01, sec-latency=64
I/O behind bridge: e000-efff
Memory behind bridge: feb0-febf
Capabilities: [b8] #0d []
Capabilities: [8c] HyperTransport: MSI Mapping

00:09.0 IDE interface: nVidia Corporation MCP67 AHCI Controller (rev a2)
(prog-if 85 [Master SecO PriO])
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0, IRQ 233
I/O ports at d480 [size=8]
I/O ports at d400 [size=4]
I/O ports at d080 [size=8]
I/O ports at d000 [size=4]
I/O ports at cc00 [size=16]
Memory at feafa000 (32-bit, non-prefetchable) [size=8K]
Capabilities: [44] Power Management version 2
Capabilities: [8c] #12 [0010]

00:0a.0 Ethernet controller: nVidia Corporation MCP67 Ethernet (rev a2)
Subsystem: ASUSTeK Computer Inc. Unknown device 82b3
Flags: bus master, 66MHz, fast devsel, latency 0, IRQ 90
Memory at feafc000 (32-bit, non-prefetchable) [size=4K]
I/O ports at c880 [size=8]
Memory at feafe400 (32-bit, non-prefetchable) [size=256]
Memory at feafe000 (32-bit, non-prefetchable) [size=16]
Capabilities: [44] Power Management version 2
Capabilities: [50] Message Signalled Interrupts: 64bit+ Queue=0/3
Enable+
Capabilities: [6c] HyperTransport: MSI 

Re: [CentOS] usb irq problem

2008-09-24 Thread nate
MHR wrote:
> On Tue, Sep 23, 2008 at 8:02 PM, partha chowdhury <[EMAIL PROTECTED]>
> wrote:
>>
>> well i managed to fix the problem after an intensive search through the
>> forum and adding the noirqdebug option to the kernel line.
>>
>
> Are you /sure/ this fixes the problem?  Your last fix didn't work out
> so well, so I'm just curious, not criticizing

>From what I've read I'm pretty confident it won't fix the
problem it only masks it

http://www.linuxtopia.org/online_books/linux_kernel/kernel_configuration/re18.html

By default, the kernel attempts to detect and disable unhandled
interrupt sources because they can cause problems with the
responsiveness of the rest of the kernel if left unchecked.
This option will disable this logic.

--

So it sounds like linux is saying the hardware is faulty and
is disabling it pro-actively before bad things can happen,
disabling the code that detects bad hardware and recovers
from it is just asking for trouble IMO.

Replace the hardware, get better quality stuff. Since this
is USB, get a PCI USB expansion board see if that helps.
About a year ago I bought a USB 2.0 PCI card for one of
my older systems, was about $20 I think.

nate



___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-23 Thread MHR
On Tue, Sep 23, 2008 at 8:02 PM, partha chowdhury <[EMAIL PROTECTED]> wrote:
>
> well i managed to fix the problem after an intensive search through the
> forum and adding the noirqdebug option to the kernel line.
>

Are you /sure/ this fixes the problem?  Your last fix didn't work out
so well, so I'm just curious, not criticizing

mhr
___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-23 Thread partha chowdhury

On Mon, 2008-09-22 at 22:31 -0700, nate wrote:
> partha chowdhury wrote:
> 
> > can someone tell me if its a centos/rhel bug and if it is how and where
> > i should file a bug report ?
> 
> I'd say it's a hardware problem rather than a software bug
> at this point.
> 
> nate
> 
> ___
> CentOS mailing list
> CentOS@centos.org
> http://lists.centos.org/mailman/listinfo/centos

well i managed to fix the problem after an intensive search through the
forum and adding the noirqdebug option to the kernel line.

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-23 Thread nate
partha chowdhury wrote:

> can someone tell me if its a centos/rhel bug and if it is how and where
> i should file a bug report ?

I'd say it's a hardware problem rather than a software bug
at this point.

nate

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-22 Thread partha chowdhury

On Tue, 2008-09-23 at 00:27 +0530, partha chowdhury wrote:
> On Mon, 2008-09-22 at 11:43 +0300, Pasi Kärkkäinen wrote:
> > I'm seeing this same problem on IBM HS21-8853 blades.
> > 
> > It _seems_ like a bug in IRQ routing in BIOS/motherboard.. ACPI issue? 
> > 
> > With IBM blades I'm able to fix the problem by generating the initrd image
> > with "--without-usb" switch.. this delays the USB module 
> > initialization/loading,
> > somehow fixing the problem.. giving different IRQ to USB controller.
> > 
> > -- Pasi
> > ___
> 
> i just tried your fix and it solved the problem. now the usb drive does
> not disappear any more. Thank you 

well I spoke too soon .Now the error message i am getting :

Sep 23 00:37:46 station2 kernel:  [] __report_bad_irq
+0x2b/0x69
Sep 23 00:37:46 station2 kernel:  [] note_interrupt
+0x1af/0x1e8
Sep 23 00:37:46 station2 kernel:  [] usb_hcd_irq+0x23/0x50
Sep 23 00:37:46 station2 kernel:  [] handle_IRQ_event
+0x23/0x49
Sep 23 00:37:46 station2 kernel:  [] __do_IRQ+0x9b/0xd6
Sep 23 00:37:46 station2 kernel:  [] do_IRQ+0x93/0xae
Sep 23 00:37:46 station2 kernel:  [] common_interrupt
+0x1a/0x20
Sep 23 00:37:46 station2 kernel:  [] default_idle+0x0/0x59
Sep 23 00:37:46 station2 kernel:  [] default_idle+0x31/0x59
Sep 23 00:37:46 station2 kernel:  [] cpu_idle+0x9f/0xb9
Sep 23 00:37:46 station2 kernel:  [] start_kernel+0x379/0x380
Sep 23 00:37:46 station2 kernel:  ===
Sep 23 00:37:46 station2 kernel: handlers:
Sep 23 00:37:46 station2 kernel: [] (usb_hcd_irq+0x0/0x50)
Sep 23 00:37:46 station2 kernel: Disabling IRQ #58

and the same thing happens i.e i am unable to remount the usb drive.

can someone tell me if its a centos/rhel bug and if it is how and where
i should file a bug report ?

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-22 Thread partha chowdhury

On Mon, 2008-09-22 at 11:43 +0300, Pasi Kärkkäinen wrote:
> I'm seeing this same problem on IBM HS21-8853 blades.
> 
> It _seems_ like a bug in IRQ routing in BIOS/motherboard.. ACPI issue? 
> 
> With IBM blades I'm able to fix the problem by generating the initrd image
> with "--without-usb" switch.. this delays the USB module 
> initialization/loading,
> somehow fixing the problem.. giving different IRQ to USB controller.
> 
> -- Pasi
> ___

i just tried your fix and it solved the problem. now the usb drive does
not disappear any more. Thank you !

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-22 Thread Pasi Kärkkäinen
On Fri, Sep 19, 2008 at 06:58:21PM +0530, partha chowdhury wrote:
> i add the "irqpoll" option to kernel line. after the system booted i log
> into X with tty1 also login as root with the command " tail
> -f /var/log/messages" for monitoring . After 10 to 20 minutes i got the
> same message again but this time immediately the system froze up !
> 
> i tried it 5 times now and each time is the same result.i also noticed
> that the system was behaving very slowly till the message came up like
> usually folder copy operation - i usually get 70-90MB/s according to
> vmstat 1  comand, but with irqpoll  the maximum i noticed was like
> 10MB/s.
> 
> Am i doing something wrong ?
> 

I'm seeing this same problem on IBM HS21-8853 blades.

It _seems_ like a bug in IRQ routing in BIOS/motherboard.. ACPI issue? 

With IBM blades I'm able to fix the problem by generating the initrd image
with "--without-usb" switch.. this delays the USB module initialization/loading,
somehow fixing the problem.. giving different IRQ to USB controller.

-- Pasi
___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-19 Thread partha chowdhury
i add the "irqpoll" option to kernel line. after the system booted i log
into X with tty1 also login as root with the command " tail
-f /var/log/messages" for monitoring . After 10 to 20 minutes i got the
same message again but this time immediately the system froze up !

i tried it 5 times now and each time is the same result.i also noticed
that the system was behaving very slowly till the message came up like
usually folder copy operation - i usually get 70-90MB/s according to
vmstat 1  comand, but with irqpoll  the maximum i noticed was like
10MB/s.

Am i doing something wrong ?

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] usb irq problem

2008-09-19 Thread Ralph Angenendt
partha chowdhury wrote:
>> Sep 19 13:13:02 station2 kernel: irq 50: nobody cared (try booting with the 
>> "irqpoll" option)

> and the usb drive does not show up in nautilus any more. Then i have
> to reboot and after logging into x the drive shows up .but again after
> some time the same thing happens.
> 
> please help.

What happens when you do what the error message above asks you to do?

When grub comes up, hit a key, then hit "a" to append something to the
kernel command line and then add irqpoll to the end of the kernel
command line.

If that works, edit /etc/grub.conf and add "irqpoll" to the end of the
kernel command line there.

Ralph


pgpF4jl2zDBTl.pgp
Description: PGP signature
___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


[CentOS] usb irq problem

2008-09-19 Thread partha chowdhury
hallo to all .

This is my first post to this list so please bare with me if i have made
any mistake .

My problem is :

after every boot i receive the following message in terminal :


> Sep 19 13:13:02 station2 kernel: irq 50: nobody cared (try booting with the 
> "irqpoll" option)
> Sep 19 13:13:02 station2 kernel:  [] __report_bad_irq+0x2b/0x69
> Sep 19 13:13:02 station2 kernel:  [] note_interrupt+0x1af/0x1e8
> Sep 19 13:13:02 station2 kernel:  [] usb_hcd_irq+0x23/0x50
> Sep 19 13:13:02 station2 kernel:  [] handle_IRQ_event+0x23/0x49
> Sep 19 13:13:02 station2 kernel:  [] __do_IRQ+0x9b/0xd6
> Sep 19 13:13:02 station2 kernel:  [] do_IRQ+0x93/0xae
> Sep 19 13:13:02 station2 kernel:  [] common_interrupt+0x1a/0x20
> Sep 19 13:13:02 station2 kernel:  ===
> Sep 19 13:13:02 station2 kernel: handlers:
> Sep 19 13:13:02 station2 kernel: [] (usb_hcd_irq+0x0/0x50)
> Sep 19 13:13:02 station2 kernel: Disabling IRQ #5


And immediately after it my external usb drive vanishes from the nautilus file 
browser and i get messages like this in /var/log/messages :


> usb 2-2: device not accepting address 3, error -110
> Sep 19 14:15:09 station2 kernel: usb 2-2: new high speed USB device using 
> ehci_hcd and address 4
> Sep 19 14:15:21 station2 kernel: usb 2-2: device not accepting address 4, 
> error -110
> Sep 19 14:15:21 station2 kernel: usb 2-2: new high speed USB device using 
> ehci_hcd and address 5
> Sep 19 14:15:31 station2 kernel: usb 2-2: device not accepting address 5, 
> error -110
> Sep 19 14:15:31 station2 kernel: usb 2-2: new high speed USB device using 
> ehci_hcd and address 6
> Sep 19 14:15:42 station2 kernel: usb 2-2: device not accepting address 6, 
> error -110
> Sep 19 14:16:48 station2 kernel: usb 2-4: new high speed USB device using 
> ehci_hcd and address 7
> Sep 19 14:17:00 station2 kernel: usb 2-4: device not accepting address 7, 
> error -110
> Sep 19 14:17:00 station2 kernel: usb 2-4: new high speed USB device using 
> ehci_hcd and address 8
> Sep 19 14:17:11 station2 kernel: usb 2-4: device not accepting address 8, 
> error -110
> 

and the usb drive does not show up in nautilus any more. Then i have to reboot 
and after logging into x the drive shows up .but again after some time the same 
thing happens.

please help.

i don't know if i have given enough information.so please tell me if any more 
info is needed.

Thanks.

___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos