Re: 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. Hi Tushar, Seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset

Re: 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset Values ---

RE: 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Sunday, November 18, 2012 9:38 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Uni

Re: 82571EB: Detected Hardware Unit Hang

2012-11-18 Thread Joe Jin
On 11/16/12 04:26, Dave, Tushar N wrote: >> Would you please help to fine the offset of max payload size in eeprom? >> I'd like to have a try to modify it by ethtool. > > It is defined using bit 8 of word 0x1A. > Bit value 0 = 128B , bit value 1 = 256B Hi Tushar, I checked one of my server which

RE: 82571EB: Detected Hardware Unit Hang

2012-11-15 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, November 14, 2012 4:33 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware

Re: 82571EB: Detected Hardware Unit Hang

2012-11-14 Thread Joe Jin
.kernel.org; Mary Mcgrath >> Subject: Re: 82571EB: Detected Hardware Unit Hang >> >> On 11/09/12 04:35, Dave, Tushar N wrote: >>> All devices in path from root complex to 82571, should have *same* max >> payload size otherwise it can cause hang. >>> Can you d

RE: 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, November 13, 2012 6:48 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Uni

RE: 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Dave, Tushar N
>-Original Message- >From: Li Yu [mailto:raise.s...@gmail.com] >Sent: Tuesday, November 13, 2012 7:37 PM >To: Dave, Tushar N >Cc: Joe Jin; e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Ha

Re: 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Li Yu
于 2012年11月09日 04:35, Dave, Tushar N 写道: -Original Message- From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] On Behalf Of Joe Jin Sent: Wednesday, November 07, 2012 10:25 PM To: e1000-de...@lists.sf.net Cc: net...@vger.kernel.org; linux-kernel@vger.kernel.org; Mary

Re: 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Joe Jin
On 11/09/12 04:35, Dave, Tushar N wrote: > All devices in path from root complex to 82571, should have *same* max > payload size otherwise it can cause hang. > Can you double check this? Hi Tushar, Checked with hardware vendor and they said no way to modify the max payload size from BIOS, can

Re: 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Joe Jin
On 11/09/12 04:35, Dave, Tushar N wrote: > Are you sure this is not similar issue as before that you reported. > i.e. Tushar, Thanks for your quick response, I'll check with customer if they can modify the Max payload size from BIOS, this time issue hit on HP's server. Thanks again, Joe > On

RE: 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Dave, Tushar N
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Wednesday, November 07, 2012 10:25 PM >To: e1000-de...@lists.sf.net >Cc: net...@vger.kernel.org; linux-kernel@vger.kernel.org; Mary Mcgrath >Subject: 82571EB: Detected

Re: 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Jon Mason
On Mon, Jul 16, 2012 at 9:08 AM, Henrique de Moraes Holschuh wrote: > On Mon, 16 Jul 2012, Ben Hutchings wrote: >> On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: >> > On Sun, 15 Jul 2012, Dave, Tushar N wrote: >> > > Somehow setting max payload to 256 from BIOS does not set

Re: 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Jon Mason
On Mon, Jul 16, 2012 at 8:47 AM, Ben Hutchings wrote: > On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: >> On Sun, 15 Jul 2012, Dave, Tushar N wrote: >> > Somehow setting max payload to 256 from BIOS does not set this value for >> > all devices. I believe this is a BIOS bug.

Re: 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Henrique de Moraes Holschuh
On Mon, 16 Jul 2012, Ben Hutchings wrote: > On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: > > On Sun, 15 Jul 2012, Dave, Tushar N wrote: > > > Somehow setting max payload to 256 from BIOS does not set this value for > > > all devices. I believe this is a BIOS bug. > > > >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Ben Hutchings
On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: > On Sun, 15 Jul 2012, Dave, Tushar N wrote: > > Somehow setting max payload to 256 from BIOS does not set this value for > > all devices. I believe this is a BIOS bug. > > And preferably, Linux should complain about it. Since

Re: 82571EB: Detected Hardware Unit Hang

2012-07-15 Thread Henrique de Moraes Holschuh
On Sun, 15 Jul 2012, Dave, Tushar N wrote: > Somehow setting max payload to 256 from BIOS does not set this value for all > devices. I believe this is a BIOS bug. And preferably, Linux should complain about it. Since we know it is going to cause problems, and since we know it does happen, we sho

Re: 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Joe Jin
>> To: Dave, Tushar N >>>> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >>>> ker...@vger.kernel.org >>>> Subject: Re: 82571EB: Detected Hardware Unit Hang >>>> >>> Thanks for sending full dmesg log. I am still investigating. I t

RE: 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 9:34 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

RE: 82571EB: Detected Hardware Unit Hang

2012-07-12 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 4:46 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > Thank

RE: 82571EB: Detected Hardware Unit Hang

2012-07-12 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 12:11 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
Sent: Wednesday, July 11, 2012 7:58 PM >>>>> To: Dave, Tushar N >>>>> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >>>>> ker...@vger.kernel.org >>>>> Subject: Re: 82571EB: Detected Hardware Unit Hang >>>>> >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
>> To: Dave, Tushar N >>>> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >>>> ker...@vger.kernel.org >>>> Subject: Re: 82571EB: Detected Hardware Unit Hang >>>> >>>> On 07/12/12 10:52, Dave, Tushar N wrote: >>&g

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 8:13 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
r.kernel.org >> Subject: Re: 82571EB: Detected Hardware Unit Hang >> >> On 07/12/12 10:52, Dave, Tushar N wrote: >>> What is the exact error messages in BIOS log? >> >> Error message from BIOS event log: >> 07/12/12 05:54:00 >>PCI Express Non-Fatal

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 7:58 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 10:52, Dave, Tushar N wrote: > What is the exact error messages in BIOS log? Error message from BIOS event log: 07/12/12 05:54:00 PCI Express Non-Fatal Error Thanks, Joe -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord..

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 7:23 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 02:51, Dave, Tushar N wrote: > > Joe, > > I see couple of errors in lspci output. > Device capability status register shows UnCorrectable PCIe error. This means > there is certainly something went wrong. The only way to recover from > Uncorrectable errors is reset. > > Dev

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 10:03 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:50, Dave, Tushar N wrote: > Device status and AER sections show some errors that looks little suspicious > to me but I'm not too sure. I will get back tomorrow. > Thanks a lot, Tushar! Joe -- Oracle Joe Jin | Software Development Senior Manager | +8610.

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 12:39 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
>> To: Dave, Tushar N >>>> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >>>> ker...@vger.kernel.org >>>> Subject: Re: 82571EB: Detected Hardware Unit Hang >>>> >>>> On 07/11/12 12:05, Dave, Tushar N wrote: >>>>>

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 12:18 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
r.kernel.org >> Subject: Re: 82571EB: Detected Hardware Unit Hang >> >> On 07/11/12 12:05, Dave, Tushar N wrote: >>> When you said you had this issue with RHEL5 and RHEL6 drivers, have you >> install RHEl5/6 kernel and reproduced it? If so I think I should install >&g

RE: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 10:03 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 12:05, Dave, Tushar N wrote: > When you said you had this issue with RHEL5 and RHEL6 drivers, have you > install RHEl5/6 kernel and reproduced it? If so I think I should install > RHEL6 and try reproduce it locally! > Yes I reproduced this on both RHEL5 and RHEL6. So far I tried to

RE: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 8:29 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 11:22, Dave, Tushar N wrote: > Thanks for info. I see that hang occurs right when HW processing first TX > descriptor with TSO. > Would you be able to reproduce issue with TSO off? Disable TSO by 'ethtool > -K ethx tso off' > Let all debug enabled as it is, that will help us debug f

RE: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 5:35 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang >

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
et...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Detected Hardware Unit Hang >> >> When I debug the driver I found before Detected HW hang, driver unable to >> clean and reclaim the resources: >> >> 1457 while ((eop_de

RE: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Dave, Tushar N >Sent: Tuesday, July 10, 2012 12:02 PM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Dave, Tushar N >Subject: RE: 82571EB: Detected Hardware Unit Hang > >>---

RE: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Tuesday, July 10, 2012 12:40 AM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subjec

RE: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Wyborny, Carolyn
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Tuesday, July 10, 2012 12:40 AM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subjec

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
When I debug the driver I found before Detected HW hang, driver unable to clean and reclaim the resources: 1457 while ((eop_desc->upper.data & cpu_to_le32(E1000_TXD_STAT_DD)) && <== at here upper.data always is 0x300 1458(count < tx_ring->count)) { <--- snip ---> 148

Re: 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Joe Jin
On 07/09/12 17:21, Eric Dumazet wrote: > On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: >> Hi list, >> >> I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing >> scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy >> a big file (>500M) from another se

Re: 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Eric Dumazet
On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: > Hi list, > > I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing > scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy > a big file (>500M) from another server will hit it at once. > > Would you ple