Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Yijing Wang
(LAD) >> Intel Corporation >> todd.fujin...@intel.com >> (503) 712-4565 >> >> >> -Original Message- >> From: Ethan Zhao [mailto:ethan.ker...@gmail.com] >> Sent: Wednesday, November 28, 2012 7:10 PM >> To: Fujinaka, Todd >> C

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin
>> LAN Access Division (LAD) >>> Intel Corporation >>> todd.fujin...@intel.com >>> (503) 712-4565 >>> >>> >>> -Original Message- >>> From: Ethan Zhao [mailto:ethan.ker...@gmail.com] >>> Sent: Wednesday, Nov

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin
th; net...@vger.kernel.org; > e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci > Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang > > Joe, > Possibly your customer is running a kernel without source code on a > platform whose vendor wouldn't like to f

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-29 Thread Fujinaka, Todd
x-ker...@vger.kernel.org; linux-pci Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang Joe, Possibly your customer is running a kernel without source code on a platform whose vendor wouldn't like to fix BIOS issue( Is that a HP/Dell server ?). Anyway, to see if is a payloa

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Ethan Zhao
om] > Sent: Wednesday, November 28, 2012 12:31 AM > To: Ben Hutchings > Cc: Fujinaka, Todd; Mary Mcgrath; net...@vger.kernel.org; > e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci > Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang > > On 11/28/12

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Fujinaka, Todd
al Message- From: Joe Jin [mailto:joe@oracle.com] Sent: Wednesday, November 28, 2012 12:31 AM To: Ben Hutchings Cc: Fujinaka, Todd; Mary Mcgrath; net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci Subject: Re: [E1000-devel] 82571EB: Detected Hardware Uni

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Joe Jin
On 11/28/12 02:10, Ben Hutchings wrote: > On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: >> Forgive me if I'm being too repetitious as I think some of this has >> been mentioned in the past. >> >> We (and by we I mean the Ethernet part and driver) can only change the >> advertised availab

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-27 Thread Ben Hutchings
On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: > Forgive me if I'm being too repetitious as I think some of this has > been mentioned in the past. > > We (and by we I mean the Ethernet part and driver) can only change the > advertised availability of a larger MaxPayloadSize. The size is

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-27 Thread Fujinaka, Todd
ay, November 27, 2012 10:11 AM To: Fujinaka, Todd; Mary Mcgrath Cc: Joe Jin; net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; linux-pci Subject: RE: [E1000-devel] 82571EB: Detected Hardware Unit Hang On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: > Forgi

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-27 Thread Fujinaka, Todd
2-4565 -Original Message- From: Mary Mcgrath [mailto:mary.mcgr...@oracle.com] Sent: Monday, November 26, 2012 6:07 PM To: Joe Jin Cc: net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org Subject: Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang Joe Thank yo

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Mary Mcgrath
in, thank you. Regards Mary -Original Message- From: Joe Jin Sent: Monday, November 26, 2012 8:00 PM To: Fujinaka, Todd Cc: Dave, Tushar N; net...@vger.kernel.org; e1000-de...@lists.sf.net; linux-ker...@vger.kernel.org; Mary Mcgrath Subject: Re: [E1000-devel] 82571EB: Detected Hardware

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Joe Jin
On 11/27/12 00:23, Fujinaka, Todd wrote: > If you look at the previous section, DevCap, you'll see that it's > correctly advertising 256 bytes but the system is negotiating 128 for > the link to the Ethernet controller. Things on the "other" side of the > link are controlled outside of the e1000 dr

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Fujinaka, Todd
On Tue, 20 Nov 2012, Joe Jin wrote: > On 11/20/12 16:59, Dave, Tushar N wrote: >> Have you power off the system completely after modifying eeprom? If not >> please do so. > > Hi Tushar, > > Seems not works for me, would you please help to check what is wrong of my > operations? ... > # lspci -

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset Values ---

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin
On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. Hi Tushar, Seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Sunday, November 18, 2012 9:38 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 11/16/12 0

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-18 Thread Joe Jin
On 11/16/12 04:26, Dave, Tushar N wrote: >> Would you please help to fine the offset of max payload size in eeprom? >> I'd like to have a try to modify it by ethtool. > > It is defined using bit 8 of word 0x1A. > Bit value 0 = 128B , bit value 1 = 256B Hi Tushar, I checked one of my server which

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-15 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, November 14, 2012 4:33 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 11/14/1

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-14 Thread Joe Jin
On 11/14/12 11:45, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Tuesday, November 13, 2012 6:48 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org; Mary Mcgrath >> Subject: R

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Dave, Tushar N
>-Original Message- >From: Li Yu [mailto:raise.s...@gmail.com] >Sent: Tuesday, November 13, 2012 7:37 PM >To: Dave, Tushar N >Cc: Joe Jin; e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >于 2

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, November 13, 2012 6:48 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Mary Mcgrath >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 11/09/12

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Li Yu
于 2012年11月09日 04:35, Dave, Tushar N 写道: >> -Original Message- >> From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >> On Behalf Of Joe Jin >> Sent: Wednesday, November 07, 2012 10:25 PM >> To: e1000-de...@lists.sf.net >> Cc: net...@vger.kernel.org; linux-ker...@vger.k

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Joe Jin
On 11/09/12 04:35, Dave, Tushar N wrote: > All devices in path from root complex to 82571, should have *same* max > payload size otherwise it can cause hang. > Can you double check this? Hi Tushar, Checked with hardware vendor and they said no way to modify the max payload size from BIOS, can

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Joe Jin
On 11/09/12 04:35, Dave, Tushar N wrote: > Are you sure this is not similar issue as before that you reported. > i.e. Tushar, Thanks for your quick response, I'll check with customer if they can modify the Max payload size from BIOS, this time issue hit on HP's server. Thanks again, Joe > On

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Dave, Tushar N
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Wednesday, November 07, 2012 10:25 PM >To: e1000-de...@lists.sf.net >Cc: net...@vger.kernel.org; linux-ker...@vger.kernel.org; Mary Mcgrath >Subject: 82571EB: Detected

[E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Joe Jin
Hi list, IHAC reported "82571EB Detected Hardware Unit Hang" on HP ProLiant DL360 G6, and have to reboot the server to recover: e1000e :06:00.1: eth3: Detected Hardware Unit Hang: TDH <1a> TDT <1a> next_to_use <1a> next_to_clean<18> b

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-29 Thread Dave, Tushar N
@lists.sourceforge.net Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang This is the output: ~$ sudo ethtool -S eth1 | grep tx_timeout_count tx_timeout_count: 0 ~$ I will try new driver, but this is a production server. I don't have any actual problems with the nic, but I do

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-29 Thread Andrew Peng
I would suggest try latest e1000e driver > > *From:* Andrew Peng [mailto:peng...@gmail.com] > *Sent:* Friday, August 24, 2012 10:29 AM > > *To:* Dave, Tushar N > *Cc:* e1000-devel@lists.sourceforge.net > *Subject:* Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Nikolay Popov
Hi, Dave! Ok, I have set msglevel as you requested, let's wait for some logs Also, about versions - we using 1.11.3-NAPI on both 3.3.6 and 3.5.2 hosts. We was enforced to do that because with default kernel driver (at least 2.0.0 at 3.5.2) we see some misterious drops and delays (~1-2%, and delay

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Dave, Tushar N
>-Original Message- >From: Nikolay Popov [mailto:niko...@popoff.net.ua] >Sent: Tuesday, August 28, 2012 9:00 PM >To: Dave, Tushar N >Cc: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >29.08.2012 6:29, Dave, Tus

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Nikolay Popov
29.08.2012 6:29, Dave, Tushar N wrote: > Have you tried disabling tso (ethtool -K tso off)? I also tried recompiling driver with DISABLE_PM, disabling gro and other offload types, boot kernel with acpi_aspm=off, increase ring buffers to 4096, playing around flow control - nothing helped. Regards

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-28 Thread Nikolay Popov
29.08.2012 6:29, Dave, Tushar N пишет: > Thanks for the info. > For both, 82571 and 80003ES2LAN, I see UnsuppReq+ and UncorrErr+ in lspci > (DevSta: CorrErr- UncorrErr+ FatalErr- UnsuppReq+ AuxPwr+ TransPend+) > > Have you tried disabling tso (ethtool -K tso off)? Yes, this doesn't help > Was thi

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-27 Thread Dave, Tushar N
>-Original Message- >From: Nikolay Popov [mailto:niko...@popoff.net.ua] >Sent: Saturday, August 25, 2012 1:29 AM >To: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Hi, All > >It seems that I'm getting

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-25 Thread Nikolay Popov
Hi, All It seems that I'm getting same problems with 3.5.2 kernel - 80003ES2LAN onboard NIC is going to reset from time to time under load Aug 25 10:27:53 bras2 kernel: [134612.808590] e1000e :05:00.0: eth2: Detected Hardware Unit Hang: Aug 25 10:27:53 bras2 kernel: [134612.808590] TDH A

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-24 Thread Dave, Tushar N
ethx | grep tx_timeout_count’ -Tushar PS: I would suggest try latest e1000e driver From: Andrew Peng [mailto:peng...@gmail.com] Sent: Friday, August 24, 2012 10:29 AM To: Dave, Tushar N Cc: e1000-devel@lists.sourceforge.net Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang Hi, in

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-08-24 Thread Andrew Peng
s mentioned by Flavio). > > -Tushar > > >-Original Message- > >From: Flavio Leitner [mailto:f...@redhat.com] > >Sent: Thursday, July 19, 2012 6:39 PM > >To: Andrew Peng > >Cc: Dave, Tushar N; e1000-devel@lists.sourceforge.net > >Subject: Re: [E1000-devel

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Dave, Tushar N
N; e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >On Thu, 19 Jul 2012 20:17:14 -0500 >Andrew Peng wrote: > >> Flavio; >> >> I am using the stock kernel driver with the stock Debian Squeeze kernel. >> > >

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Flavio Leitner
ool ethx' > > > > -Tushar > > > > > > > >>-Original Message- > >>From: Andrew Peng [mailto:peng...@gmail.com] > >>Sent: Thursday, July 19, 2012 4:42 PM > >>To: Dave, Tushar N > >>Cc: e1000-devel

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Andrew Peng
the log. > Please confirm that msglvl is set correctly by running 'ethtool ethx' > > -Tushar > > > >>-Original Message- >>From: Andrew Peng [mailto:peng...@gmail.com] >>Sent: Thursday, July 19, 2012 4:42 PM >>To: Dave, Tushar N >>Cc: e1

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Dave, Tushar N
ndrew Peng [mailto:peng...@gmail.com] >Sent: Thursday, July 19, 2012 4:42 PM >To: Dave, Tushar N >Cc: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Attached is the dmesg output. Please let me know if this looks right. >The

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Flavio Leitner
Please enable TSO back. > > Then run "ethtool -s ethx msglvl 0x2c01". This will enable debug code that > > logs HW ring data (into dmesg log) when Tx hang occurs. When issue occur > > next time please send me the full dmesg log. > > > > -Tushar > > > >>

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-19 Thread Andrew Peng
gs HW ring data (into dmesg log) when Tx hang occurs. When issue occur next > time please send me the full dmesg log. > > -Tushar > >>-Original Message- >>From: Andrew Peng [mailto:peng...@gmail.com] >>Sent: Wednesday, July 18, 2012 6:24 AM >>To: e1000-dev

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-18 Thread Dave, Tushar N
ge- >From: Andrew Peng [mailto:peng...@gmail.com] >Sent: Wednesday, July 18, 2012 6:24 AM >To: e1000-devel@lists.sourceforge.net >Subject: Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Thus far disabling TSO via ethtool has seemed to work - can anyone explain >the te

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-18 Thread Andrew Peng
Tushar N > wrote: >>>-Original Message- >>>From: Andrew Peng [mailto:peng...@gmail.com] >>>Sent: Wednesday, July 11, 2012 8:50 AM >>>To: e1000-devel@lists.sourceforge.net >>>Subject: [E1000-devel] 82571EB - Detected Hardware Unit Hang &g

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Jon Mason
On Mon, Jul 16, 2012 at 9:08 AM, Henrique de Moraes Holschuh wrote: > On Mon, 16 Jul 2012, Ben Hutchings wrote: >> On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: >> > On Sun, 15 Jul 2012, Dave, Tushar N wrote: >> > > Somehow setting max payload to 256 from BIOS does not set

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Jon Mason
On Mon, Jul 16, 2012 at 8:47 AM, Ben Hutchings wrote: > On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: >> On Sun, 15 Jul 2012, Dave, Tushar N wrote: >> > Somehow setting max payload to 256 from BIOS does not set this value for >> > all devices. I believe this is a BIOS bug.

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-16 Thread Andrew Peng
Tushar N wrote: >>-Original Message- >>From: Andrew Peng [mailto:peng...@gmail.com] >>Sent: Wednesday, July 11, 2012 8:50 AM >>To: e1000-devel@lists.sourceforge.net >>Subject: [E1000-devel] 82571EB - Detected Hardware Unit Hang >> >>Folks, I've been ge

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Henrique de Moraes Holschuh
On Mon, 16 Jul 2012, Ben Hutchings wrote: > On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: > > On Sun, 15 Jul 2012, Dave, Tushar N wrote: > > > Somehow setting max payload to 256 from BIOS does not set this value for > > > all devices. I believe this is a BIOS bug. > > > >

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-16 Thread Ben Hutchings
On Sun, 2012-07-15 at 10:35 -0300, Henrique de Moraes Holschuh wrote: > On Sun, 15 Jul 2012, Dave, Tushar N wrote: > > Somehow setting max payload to 256 from BIOS does not set this value for > > all devices. I believe this is a BIOS bug. > > And preferably, Linux should complain about it. Since

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-15 Thread Henrique de Moraes Holschuh
On Sun, 15 Jul 2012, Dave, Tushar N wrote: > Somehow setting max payload to 256 from BIOS does not set this value for all > devices. I believe this is a BIOS bug. And preferably, Linux should complain about it. Since we know it is going to cause problems, and since we know it does happen, we sho

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Joe Jin
On 07/15/12 11:42, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Thursday, July 12, 2012 9:34 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Detec

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 9:34 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/13/12 12:10, Dave, Tush

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-12 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 4:46 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > Thanks for sending full dmesg

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-12 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Thursday, July 12, 2012 12:11 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 14:41, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>On 07/12/12 13:57, Dave, Tushar N wrote: >>> -Original Message- >>> From: Joe Jin [mailto:joe@oracle.com] >>> Sent: Wednesday, July 11, 2012 8:13 PM >>> To: Dave, Tushar N >>> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >>> ker...@vger.kernel.org >>> Subject: Re: 82571

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 13:57, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 8:13 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Dete

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 8:13 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 11:07, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 11:07, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 7:58 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Dete

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 7:58 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 10:52, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 10:52, Dave, Tushar N wrote: > What is the exact error messages in BIOS log? Error message from BIOS event log: 07/12/12 05:54:00 PCI Express Non-Fatal Error Thanks, Joe -- Live Security Virtual Conferenc

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 7:23 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/12/12 02:51, Dave, Tus

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/12/12 02:51, Dave, Tushar N wrote: > > Joe, > > I see couple of errors in lspci output. > Device capability status register shows UnCorrectable PCIe error. This means > there is certainly something went wrong. The only way to recover from > Uncorrectable errors is reset. > > Dev

Re: [E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Andrew Peng [mailto:peng...@gmail.com] >Sent: Wednesday, July 11, 2012 8:50 AM >To: e1000-devel@lists.sourceforge.net >Subject: [E1000-devel] 82571EB - Detected Hardware Unit Hang > >Folks, I've been getting some strange error messages i

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 10:03 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 12:05, Dave, Tush

[E1000-devel] 82571EB - Detected Hardware Unit Hang

2012-07-11 Thread Andrew Peng
Folks, I've been getting some strange error messages in my home server / router that I've been having trouble debugging. I'm decently proficient in Linux, but I fear I'm in over my head with this one. The hardware is a HP N40L Microserver - here are the hardware details - http://n40l.wikia.com/wik

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:50, Dave, Tushar N wrote: > Device status and AER sections show some errors that looks little suspicious > to me but I'm not too sure. I will get back tomorrow. > Thanks a lot, Tushar! Joe -- Oracle Joe Jin | Software Development Senior Manager | +8610.

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 12:39 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 15:37, Dave, Tu

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:37, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 12:18 AM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Det

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Wednesday, July 11, 2012 12:18 AM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 15:11, Dave, Tu

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin
On 07/11/12 15:11, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Tuesday, July 10, 2012 10:03 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger.kernel.org >> Subject: Re: 82571EB: Detec

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 10:03 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 12:05, Dave, Tush

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 12:05, Dave, Tushar N wrote: > When you said you had this issue with RHEL5 and RHEL6 drivers, have you > install RHEl5/6 kernel and reproduced it? If so I think I should install > RHEL6 and try reproduce it locally! > Yes I reproduced this on both RHEL5 and RHEL6. So far I tried to

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 8:29 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 11:22, Dave, Tusha

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 11:22, Dave, Tushar N wrote: > Thanks for info. I see that hang occurs right when HW processing first TX > descriptor with TSO. > Would you be able to reproduce issue with TSO off? Disable TSO by 'ethtool > -K ethx tso off' > Let all debug enabled as it is, that will help us debug f

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Joe Jin [mailto:joe@oracle.com] >Sent: Tuesday, July 10, 2012 5:35 PM >To: Dave, Tushar N >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardware Unit Hang > >On 07/11/12 03:02, Dave, Tush

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
On 07/11/12 03:02, Dave, Tushar N wrote: >> -Original Message- >> From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >> On Behalf Of Joe Jin >> Sent: Tuesday, July 10, 2012 12:40 AM >> To: Joe Jin >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker..

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: Dave, Tushar N >Sent: Tuesday, July 10, 2012 12:02 PM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org; Dave, Tushar N >Subject: RE: 82571EB: Detected Hardware Unit Hang > >>-Original Message- >>From: netd

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Dave, Tushar N
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Tuesday, July 10, 2012 12:40 AM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hardw

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Wyborny, Carolyn
>-Original Message- >From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >On Behalf Of Joe Jin >Sent: Tuesday, July 10, 2012 12:40 AM >To: Joe Jin >Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >ker...@vger.kernel.org >Subject: Re: 82571EB: Detected Hard

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin
When I debug the driver I found before Detected HW hang, driver unable to clean and reclaim the resources: 1457 while ((eop_desc->upper.data & cpu_to_le32(E1000_TXD_STAT_DD)) && <== at here upper.data always is 0x300 1458(count < tx_ring->count)) { <--- snip ---> 148

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Joe Jin
On 07/09/12 17:21, Eric Dumazet wrote: > On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: >> Hi list, >> >> I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing >> scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy >> a big file (>500M) from another se

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Eric Dumazet
On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: > Hi list, > > I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing > scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy > a big file (>500M) from another server will hit it at once. > > Would you ple

[E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Joe Jin
Hi list, I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy a big file (>500M) from another server will hit it at once. Would you please help on this? device info: # lspci -s 05:00.0 05:00.0 Ethe

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-11-03 Thread Michael Wang
On 11/03/2011 09:39 PM, Flavio Leitner wrote: > (moving the discussion back to the list) > > Hi, > > I am sorry, I didn't receive your patch as we discussed in private > and ended up writing one patch myself which essentially does the > same thing. > > The patch is available at: > https://bugzilla.

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-11-03 Thread Flavio Leitner
(moving the discussion back to the list) Hi, I am sorry, I didn't receive your patch as we discussed in private and ended up writing one patch myself which essentially does the same thing. The patch is available at: https://bugzilla.redhat.com/show_bug.cgi?id=746272#c13 It schedules a workqueue

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-26 Thread Michael Wang
Hi, Flavio, Jesse I have send out the patch, which I hope can do some help. Because this is my first time to send a patch, I am sorry if I have done some silly thing. And please tell me if there are some problem about it. Thanks & Best regards, Michael Wang

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-25 Thread Michael Wang
On 10/25/2011 11:57 PM, Jesse Brandeburg wrote: > On Mon, 24 Oct 2011 23:29:34 -0700 > Michael Wang wrote: >> May be you can just search macro >> "E1000_TXDCTL_DMA_BURST_ENABLE" >> in "drivers/net/e1000e/e1000.h", change it to: >> >> #define E1000_TXDCTL_DMA_BURST_ENABLE \ >> (E1000_TXDCTL_GRAN |

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-25 Thread Jesse Brandeburg
On Mon, 24 Oct 2011 23:29:34 -0700 Michael Wang wrote: > May be you can just search macro > "E1000_TXDCTL_DMA_BURST_ENABLE" > in "drivers/net/e1000e/e1000.h", change it to: > > #define E1000_TXDCTL_DMA_BURST_ENABLE \ > (E1000_TXDCTL_GRAN | /* set descriptor granularity */ \ > E1000_TXDCTL_COUNT_D

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-24 Thread Michael Wang
On 10/25/2011 12:26 AM, Flavio Leitner wrote: > On Mon, 24 Oct 2011 16:26:28 +0800 > Michael Wang wrote: > >> On 10/21/2011 10:03 PM, Flavio Leitner wrote: >>> On Fri, 21 Oct 2011 14:15:12 +0800 >>> Michael Wang wrote: >>> On 10/19/2011 08:16 PM, Flavio Leitner wrote: > On Wed, 19 Oct 2

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-24 Thread Flavio Leitner
On Mon, 24 Oct 2011 16:26:28 +0800 Michael Wang wrote: > On 10/21/2011 10:03 PM, Flavio Leitner wrote: > > On Fri, 21 Oct 2011 14:15:12 +0800 > > Michael Wang wrote: > > > >> On 10/19/2011 08:16 PM, Flavio Leitner wrote: > >>> On Wed, 19 Oct 2011 12:49:48 +0800 > >>> wangyun wrote: > >>> > >>>

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-24 Thread Michael Wang
On 10/21/2011 10:03 PM, Flavio Leitner wrote: > On Fri, 21 Oct 2011 14:15:12 +0800 > Michael Wang wrote: > >> On 10/19/2011 08:16 PM, Flavio Leitner wrote: >>> On Wed, 19 Oct 2011 12:49:48 +0800 >>> wangyun wrote: >>> Hi, Flavio I am new to join the community, work on e1000e drive

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-21 Thread Flavio Leitner
On Fri, 21 Oct 2011 14:15:12 +0800 Michael Wang wrote: > On 10/19/2011 08:16 PM, Flavio Leitner wrote: > > On Wed, 19 Oct 2011 12:49:48 +0800 > > wangyun wrote: > > > >> Hi, Flavio > >> > >> I am new to join the community, work on e1000e driver currently, > >> And I found a thing strange in this

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-20 Thread Michael Wang
On 10/19/2011 08:16 PM, Flavio Leitner wrote: > On Wed, 19 Oct 2011 12:49:48 +0800 > wangyun wrote: > >> Hi, Flavio >> >> I am new to join the community, work on e1000e driver currently, >> And I found a thing strange in this issue, please check below. >> >> Thanks, >> Michael Wang >> >> On 10/18/

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-19 Thread Flavio Leitner
On Wed, 19 Oct 2011 12:49:48 +0800 wangyun wrote: > Hi, Flavio > > I am new to join the community, work on e1000e driver currently, > And I found a thing strange in this issue, please check below. > > Thanks, > Michael Wang > > On 10/18/2011 10:42 PM, Flavio Leitner wrote: > > On Mon, 17 Oct 2

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-18 Thread wangyun
Hi, Flavio I am new to join the community, work on e1000e driver currently, And I found a thing strange in this issue, please check below. Thanks, Michael Wang On 10/18/2011 10:42 PM, Flavio Leitner wrote: > On Mon, 17 Oct 2011 11:48:22 -0700 > Jesse Brandeburg wrote: > >> On Fri, 14 Oct 2011 1

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-18 Thread Flavio Leitner
On Mon, 17 Oct 2011 11:48:22 -0700 Jesse Brandeburg wrote: > On Fri, 14 Oct 2011 10:04:26 -0700 > Flavio Leitner wrote: > > > > > Hi, > > > > I got few reports so far that 82571EB models are having the > > "Detected Hardware Unit Hang" issue after upgrading the kernel. > > > > Further debugg

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-17 Thread Jesse Brandeburg
On Fri, 14 Oct 2011 10:04:26 -0700 Flavio Leitner wrote: > > Hi, > > I got few reports so far that 82571EB models are having the > "Detected Hardware Unit Hang" issue after upgrading the kernel. > > Further debugging with an instrumented kernel revealed that the > socket buffer time stamp matc

[E1000-devel] 82571EB: Detected Hardware Unit Hang

2011-10-14 Thread Flavio Leitner
Hi, I got few reports so far that 82571EB models are having the "Detected Hardware Unit Hang" issue after upgrading the kernel. Further debugging with an instrumented kernel revealed that the socket buffer time stamp matches with the last time e1000_xmit_frame() was called. Also that the time st