Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-28 Thread Thomas Voegtle

On Sat, 28 Feb 2015, Jiang Liu wrote:


On 2015/2/28 6:24, Rafael J. Wysocki wrote:

On Friday, February 27, 2015 03:50:32 PM Thomas Voegtle wrote:


Hi,

I have the same problem with a Asrock Q1900B-ITX mainboard with
a Intel Celeron J1900 onboard.

I did a bisect and ended up with:

593669c2ac0fe18baee04a3cd5539a148aa48574 is the first bad commit

commit 593669c2ac0fe18baee04a3cd5539a148aa48574
Author: Jiang Liu 
Date:   Thu Feb 5 13:44:46 2015 +0800

 x86/PCI/ACPI: Use common ACPI resource interfaces to simplify
implementation


I can revert this quite big commit on current git head (4f671fe) with no
problems and then everything is fine again.


Thanks for nailing this one!

It really wasn't supposed to make any functional difference, though, so there
must be some subtle mistake that escaped everyone in it.

I'll have a look at that and hopefully Jiang Liu will be able to help in the
meantime too.

Hi all,
Sorry for slow response, just return from Chinese New Holidays:)
Hi Thomas,
Could you please help to provide the dmesgs before and after the
revert?



Attached.

Thanks,

Thomas[0.00] Linux version 4.0.0-rc1-celeron-00037-gec3360c (thomas@maggie) 
(gcc version 4.8.1 20130909 [gcc-4_8-branch revision 202388] (SUSE Linux) ) 
#218 SMP Sat Feb 28 10:54:22 CET 2015
[0.00] Command line: root=/dev/sda3 console=ttyS0,115200N8 panic=30
[0.00] e820: BIOS-provided physical RAM map:
[0.00] BIOS-e820: [mem 0x-0x0009c7ff] usable
[0.00] BIOS-e820: [mem 0x0009c800-0x0009] reserved
[0.00] BIOS-e820: [mem 0x000e-0x000f] reserved
[0.00] BIOS-e820: [mem 0x0010-0x1fff] usable
[0.00] BIOS-e820: [mem 0x2000-0x200f] reserved
[0.00] BIOS-e820: [mem 0x2010-0x79578fff] usable
[0.00] BIOS-e820: [mem 0x79579000-0x795a8fff] reserved
[0.00] BIOS-e820: [mem 0x795a9000-0x795fcfff] usable
[0.00] BIOS-e820: [mem 0x795fd000-0x7970dfff] ACPI NVS
[0.00] BIOS-e820: [mem 0x7970e000-0x79a7afff] reserved
[0.00] BIOS-e820: [mem 0x79a7b000-0x79a7bfff] usable
[0.00] BIOS-e820: [mem 0x79a7c000-0x79abdfff] reserved
[0.00] BIOS-e820: [mem 0x79abe000-0x79c2bfff] usable
[0.00] BIOS-e820: [mem 0x79c2c000-0x79ff9fff] reserved
[0.00] BIOS-e820: [mem 0x79ffa000-0x79ff] usable
[0.00] BIOS-e820: [mem 0xe00f8000-0xe00f8fff] reserved
[0.00] BIOS-e820: [mem 0xfed01000-0xfed01fff] reserved
[0.00] BIOS-e820: [mem 0xffb0-0x] reserved
[0.00] NX (Execute Disable) protection: active
[0.00] SMBIOS 2.8 present.
[0.00] DMI: To Be Filled By O.E.M. To Be Filled By O.E.M./Q1900B-ITX, 
BIOS P1.70 11/04/2014
[0.00] e820: update [mem 0x-0x0fff] usable ==> reserved
[0.00] e820: remove [mem 0x000a-0x000f] usable
[0.00] e820: last_pfn = 0x7a000 max_arch_pfn = 0x4
[0.00] MTRR default type: uncachable
[0.00] MTRR fixed ranges enabled:
[0.00]   0-9 write-back
[0.00]   A-B uncachable
[0.00]   C-E7FFF write-through
[0.00]   E8000-F write-protect
[0.00] MTRR variable ranges enabled:
[0.00]   0 base 0 mask F8000 write-back
[0.00]   1 base 07A80 mask FFF80 uncachable
[0.00]   2 base 07B00 mask FFF00 uncachable
[0.00]   3 base 07C00 mask FFC00 uncachable
[0.00]   4 disabled
[0.00]   5 disabled
[0.00]   6 disabled
[0.00]   7 disabled
[0.00] PAT configuration [0-7]: WB  WC  UC- UC  WB  WC  UC- UC  
[0.00] Scanning 1 areas for low memory corruption
[0.00] Base memory trampoline at [88096000] 96000 size 24576
[0.00] init_memory_mapping: [mem 0x-0x000f]
[0.00]  [mem 0x-0x000f] page 4k
[0.00] BRK [0x01a0f000, 0x01a0] PGTABLE
[0.00] BRK [0x01a1, 0x01a10fff] PGTABLE
[0.00] BRK [0x01a11000, 0x01a11fff] PGTABLE
[0.00] init_memory_mapping: [mem 0x7920-0x793f]
[0.00]  [mem 0x7920-0x793f] page 2M
[0.00] BRK [0x01a12000, 0x01a12fff] PGTABLE
[0.00] init_memory_mapping: [mem 0x6000-0x791f]
[0.00]  [mem 0x6000-0x791f] page 2M
[0.00] init_memory_mapping: [mem 0x4000-0x5fff]
[0.00]  [mem 0x4000-0x5fff] page 2M
[0.00] init_memory_mapping: [mem 0x0010-0x1fff]
[0.00]  [mem 0x0010-0x001f] page 4k
[0.00]  [mem 0x0020-0x1fff] page 2M
[0.00] init_memory_mapping: [mem 0x2010

Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-28 Thread Jiang Liu
On 2015/2/28 16:36, Marcel Holtmann wrote:
> Hi Jiang,
> 
 I have the same problem with a Asrock Q1900B-ITX mainboard with 
 a Intel Celeron J1900 onboard.

 I did a bisect and ended up with:

 593669c2ac0fe18baee04a3cd5539a148aa48574 is the first bad commit

 commit 593669c2ac0fe18baee04a3cd5539a148aa48574
 Author: Jiang Liu 
 Date:   Thu Feb 5 13:44:46 2015 +0800

 x86/PCI/ACPI: Use common ACPI resource interfaces to simplify 
 implementation


 I can revert this quite big commit on current git head (4f671fe) with no 
 problems and then everything is fine again.
>>>
>>> Thanks for nailing this one!
>>>
>>> It really wasn't supposed to make any functional difference, though, so 
>>> there
>>> must be some subtle mistake that escaped everyone in it.
>>>
>>> I'll have a look at that and hopefully Jiang Liu will be able to help in the
>>> meantime too.
>> Hi all,
>>  Sorry for slow response, just return from Chinese New Holidays:)
>> Hi Thomas,
>>  Could you please help to provide the dmesgs before and after the
>> revert?
> 
> just grab a Minnowboard Max and test this by yourself. It has the same 
> problem. The MAC address is ff:ff:ff:ff:ff:ff with this patch. Once I 
> reverted it, the MAC address is correctly read again.
Hi Marcel,
I have no access to any Minnowboard Max board, so couldn't test
it by myself. Could you please help to forward the two dmesg files you
have sent to Bjorn in the original email?
Thanks!
Gerry

> 
> Regards
> 
> Marcel
> 
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-28 Thread Jiang Liu
On 2015/2/25 15:10, Bjorn Helgaas wrote:
> [+cc Jiang, Thomas, Lv, Rafael, linux-pci, linux-acpi]
> 
> On Tue, Feb 24, 2015 at 9:47 PM, Dave Airlie  wrote:
>> Hey,
>>
>> just booted an old AMD rs780 box with new kernel and my ethernet
>> failed to come up!
>>
>> Looks like the MAC addr is all ff's because the PCI bridge windows are
>> messed up.
>>
>> I've attached two dmesg one from a 3.19.0-rc6 I had on it, and one
>> failing from the 4.0.0-rc1 time.
>>
>> b24e2bdde4af656bb0679a101265ebb8f8735d3c is latest Linus commit in
>> that tree (I have some radeon patches on top).
>>
>> motherboard is a Gigabyte GA-MA78GM-S2H, lspci also attached.
> 
> Here's the dmesg diff that looks relevant to me:
> 
> -pci_bus :00: root bus resource [io  0x-0x0cf7]
> -pci_bus :00: root bus resource [io  0x0d00-0x]
> -pci_bus :00: root bus resource [mem 0x000a-0x000b]
> -pci_bus :00: root bus resource [mem 0x000c-0x000d]
> -pci_bus :00: root bus resource [mem 0xfed4-0xfed44fff]
> -pci_bus :00: root bus resource [mem 0x7ff0-0xfebf]
> +pci_bus :00: root bus resource [io  0x0cf8-0x0cff]
> +pci_bus :00: root bus resource [io  0x-0x0cf7 window]
> +pci_bus :00: root bus resource [io  0x0d00-0x window]
> +pci_bus :00: root bus resource [mem 0x000a-0x000b window]
> +pci_bus :00: root bus resource [mem 0x000c-0x000d window]
> +pci_bus :00: root bus resource [mem 0xfed4-0xfed44fff window]
> 
> What's interesting is:
> 
> * v3.19 ignored [io  0x0cf8-0x0cff], but v4.0 includes it.  I think
> it's wrong to include it because that's the configuration space
> address/data registers, so it's consumed by the host bridge and not
> produced on the downstream side.
Hi Bjorn,
We should ignore resources occupied by host bridge itself.
Will work out a patch to fix this issue.

> 
> * v3.19 includes [mem 0x7ff0-0xfebf], but v4.0 does not.  This
> is what's screwing up the devices.
I need more information to figure out why resource [mem
0x7ff0-0xfebf] is ignored by new ACPI parser.
Could you please help to forward the original dmesg attachments?
Regards!
Gerry

> 
> I think all the windows should be marked as ACPI_PRODUCER in _CRS
> since the space is "produced" on the downstream side of the bridge.
> The [io  0x0cf8-0x0cff] region should probably be marked
> ACPI_CONSUMER, and maybe that accounts for why v3.19 ignores it.  But
> I haven't found the code that does that yet.
> 
> I suspect this is all related to the ACPI resource parsing rework.  I
> looked through that briefly, but no issues jumped out at me, so this
> is just a heads-up in case it is obvious to you guys.
> 
> Dave, it'd be useful if you could collect an acpidump so we can look
> at the _CRS data in more detail.
> 
> Bjorn
> 
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-28 Thread Marcel Holtmann
Hi Jiang,

>>> I have the same problem with a Asrock Q1900B-ITX mainboard with 
>>> a Intel Celeron J1900 onboard.
>>> 
>>> I did a bisect and ended up with:
>>> 
>>> 593669c2ac0fe18baee04a3cd5539a148aa48574 is the first bad commit
>>> 
>>> commit 593669c2ac0fe18baee04a3cd5539a148aa48574
>>> Author: Jiang Liu 
>>> Date:   Thu Feb 5 13:44:46 2015 +0800
>>> 
>>> x86/PCI/ACPI: Use common ACPI resource interfaces to simplify 
>>> implementation
>>> 
>>> 
>>> I can revert this quite big commit on current git head (4f671fe) with no 
>>> problems and then everything is fine again.
>> 
>> Thanks for nailing this one!
>> 
>> It really wasn't supposed to make any functional difference, though, so there
>> must be some subtle mistake that escaped everyone in it.
>> 
>> I'll have a look at that and hopefully Jiang Liu will be able to help in the
>> meantime too.
> Hi all,
>   Sorry for slow response, just return from Chinese New Holidays:)
> Hi Thomas,
>   Could you please help to provide the dmesgs before and after the
> revert?

just grab a Minnowboard Max and test this by yourself. It has the same problem. 
The MAC address is ff:ff:ff:ff:ff:ff with this patch. Once I reverted it, the 
MAC address is correctly read again.

Regards

Marcel

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-28 Thread Jiang Liu
On 2015/2/28 6:24, Rafael J. Wysocki wrote:
> On Friday, February 27, 2015 03:50:32 PM Thomas Voegtle wrote:
>>
>> Hi,
>>
>> I have the same problem with a Asrock Q1900B-ITX mainboard with 
>> a Intel Celeron J1900 onboard.
>>
>> I did a bisect and ended up with:
>>
>> 593669c2ac0fe18baee04a3cd5539a148aa48574 is the first bad commit
>>
>> commit 593669c2ac0fe18baee04a3cd5539a148aa48574
>> Author: Jiang Liu 
>> Date:   Thu Feb 5 13:44:46 2015 +0800
>>
>>  x86/PCI/ACPI: Use common ACPI resource interfaces to simplify 
>> implementation
>>
>>
>> I can revert this quite big commit on current git head (4f671fe) with no 
>> problems and then everything is fine again.
> 
> Thanks for nailing this one!
> 
> It really wasn't supposed to make any functional difference, though, so there
> must be some subtle mistake that escaped everyone in it.
> 
> I'll have a look at that and hopefully Jiang Liu will be able to help in the
> meantime too.
Hi all,
Sorry for slow response, just return from Chinese New Holidays:)
Hi Thomas,
Could you please help to provide the dmesgs before and after the
revert?
Thanks!
Gerry

> 
> 
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-27 Thread Rafael J. Wysocki
On Friday, February 27, 2015 03:50:32 PM Thomas Voegtle wrote:
> 
> Hi,
> 
> I have the same problem with a Asrock Q1900B-ITX mainboard with 
> a Intel Celeron J1900 onboard.
> 
> I did a bisect and ended up with:
> 
> 593669c2ac0fe18baee04a3cd5539a148aa48574 is the first bad commit
> 
> commit 593669c2ac0fe18baee04a3cd5539a148aa48574
> Author: Jiang Liu 
> Date:   Thu Feb 5 13:44:46 2015 +0800
> 
>  x86/PCI/ACPI: Use common ACPI resource interfaces to simplify 
> implementation
> 
> 
> I can revert this quite big commit on current git head (4f671fe) with no 
> problems and then everything is fine again.

Thanks for nailing this one!

It really wasn't supposed to make any functional difference, though, so there
must be some subtle mistake that escaped everyone in it.

I'll have a look at that and hopefully Jiang Liu will be able to help in the
meantime too.


-- 
I speak only for myself.
Rafael J. Wysocki, Intel Open Source Technology Center.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-27 Thread Thomas Voegtle


Hi,

I have the same problem with a Asrock Q1900B-ITX mainboard with 
a Intel Celeron J1900 onboard.


I did a bisect and ended up with:

593669c2ac0fe18baee04a3cd5539a148aa48574 is the first bad commit

commit 593669c2ac0fe18baee04a3cd5539a148aa48574
Author: Jiang Liu 
Date:   Thu Feb 5 13:44:46 2015 +0800

x86/PCI/ACPI: Use common ACPI resource interfaces to simplify 
implementation



I can revert this quite big commit on current git head (4f671fe) with no 
problems and then everything is fine again.




  Thomas

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-25 Thread Rafael J. Wysocki
On Wednesday, February 25, 2015 06:03:56 PM Dave Airlie wrote:
> > * v3.19 ignored [io  0x0cf8-0x0cff], but v4.0 includes it.  I think
> > it's wrong to include it because that's the configuration space
> > address/data registers, so it's consumed by the host bridge and not
> > produced on the downstream side.
> >
> > * v3.19 includes [mem 0x7ff0-0xfebf], but v4.0 does not.  This
> > is what's screwing up the devices.
> >
> > I think all the windows should be marked as ACPI_PRODUCER in _CRS
> > since the space is "produced" on the downstream side of the bridge.
> > The [io  0x0cf8-0x0cff] region should probably be marked
> > ACPI_CONSUMER, and maybe that accounts for why v3.19 ignores it.  But
> > I haven't found the code that does that yet.
> >
> > I suspect this is all related to the ACPI resource parsing rework.  I
> > looked through that briefly, but no issues jumped out at me, so this
> > is just a heads-up in case it is obvious to you guys.
> >
> > Dave, it'd be useful if you could collect an acpidump so we can look
> > at the _CRS data in more detail.
> 
> acpidump fails here with a /dev/mem warning in the kernel,
> 
> now I'm not near the machine again until next week most likely, so I
> only have ssh for now,
> and the kernel it is running for which I don't have the source anymore!
> 
> is there any of tables from /sys I can grab instead?

/sys/firmware/acpi/tables/DSDT
/sys/firmware/acpi/tables/SSDT*

Also I'm wondering if reverting commit 2ea3d266bab3 (ACPI: Translate
resource into master side address for bridge window resources) makes
any difference?

Rafael

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-25 Thread Dave Airlie
> * v3.19 ignored [io  0x0cf8-0x0cff], but v4.0 includes it.  I think
> it's wrong to include it because that's the configuration space
> address/data registers, so it's consumed by the host bridge and not
> produced on the downstream side.
>
> * v3.19 includes [mem 0x7ff0-0xfebf], but v4.0 does not.  This
> is what's screwing up the devices.
>
> I think all the windows should be marked as ACPI_PRODUCER in _CRS
> since the space is "produced" on the downstream side of the bridge.
> The [io  0x0cf8-0x0cff] region should probably be marked
> ACPI_CONSUMER, and maybe that accounts for why v3.19 ignores it.  But
> I haven't found the code that does that yet.
>
> I suspect this is all related to the ACPI resource parsing rework.  I
> looked through that briefly, but no issues jumped out at me, so this
> is just a heads-up in case it is obvious to you guys.
>
> Dave, it'd be useful if you could collect an acpidump so we can look
> at the _CRS data in more detail.

acpidump fails here with a /dev/mem warning in the kernel,

now I'm not near the machine again until next week most likely, so I
only have ssh for now,
and the kernel it is running for which I don't have the source anymore!

is there any of tables from /sys I can grab instead?

Dave.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-24 Thread Bjorn Helgaas
[+cc Jiang, Thomas, Lv, Rafael, linux-pci, linux-acpi]

On Tue, Feb 24, 2015 at 9:47 PM, Dave Airlie  wrote:
> Hey,
>
> just booted an old AMD rs780 box with new kernel and my ethernet
> failed to come up!
>
> Looks like the MAC addr is all ff's because the PCI bridge windows are
> messed up.
>
> I've attached two dmesg one from a 3.19.0-rc6 I had on it, and one
> failing from the 4.0.0-rc1 time.
>
> b24e2bdde4af656bb0679a101265ebb8f8735d3c is latest Linus commit in
> that tree (I have some radeon patches on top).
>
> motherboard is a Gigabyte GA-MA78GM-S2H, lspci also attached.

Here's the dmesg diff that looks relevant to me:

-pci_bus :00: root bus resource [io  0x-0x0cf7]
-pci_bus :00: root bus resource [io  0x0d00-0x]
-pci_bus :00: root bus resource [mem 0x000a-0x000b]
-pci_bus :00: root bus resource [mem 0x000c-0x000d]
-pci_bus :00: root bus resource [mem 0xfed4-0xfed44fff]
-pci_bus :00: root bus resource [mem 0x7ff0-0xfebf]
+pci_bus :00: root bus resource [io  0x0cf8-0x0cff]
+pci_bus :00: root bus resource [io  0x-0x0cf7 window]
+pci_bus :00: root bus resource [io  0x0d00-0x window]
+pci_bus :00: root bus resource [mem 0x000a-0x000b window]
+pci_bus :00: root bus resource [mem 0x000c-0x000d window]
+pci_bus :00: root bus resource [mem 0xfed4-0xfed44fff window]

What's interesting is:

* v3.19 ignored [io  0x0cf8-0x0cff], but v4.0 includes it.  I think
it's wrong to include it because that's the configuration space
address/data registers, so it's consumed by the host bridge and not
produced on the downstream side.

* v3.19 includes [mem 0x7ff0-0xfebf], but v4.0 does not.  This
is what's screwing up the devices.

I think all the windows should be marked as ACPI_PRODUCER in _CRS
since the space is "produced" on the downstream side of the bridge.
The [io  0x0cf8-0x0cff] region should probably be marked
ACPI_CONSUMER, and maybe that accounts for why v3.19 ignores it.  But
I haven't found the code that does that yet.

I suspect this is all related to the ACPI resource parsing rework.  I
looked through that briefly, but no issues jumped out at me, so this
is just a heads-up in case it is obvious to you guys.

Dave, it'd be useful if you could collect an acpidump so we can look
at the _CRS data in more detail.

Bjorn
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: regression in 4.0.0-rc1 with r8169 ethernet

2015-02-24 Thread Bjorn Helgaas
[+cc linux-pci]

On Tue, Feb 24, 2015 at 9:47 PM, Dave Airlie  wrote:
> Hey,
>
> just booted an old AMD rs780 box with new kernel and my ethernet
> failed to come up!
>
> Looks like the MAC addr is all ff's because the PCI bridge windows are
> messed up.
>
> I've attached two dmesg one from a 3.19.0-rc6 I had on it, and one
> failing from the 4.0.0-rc1 time.
>
> b24e2bdde4af656bb0679a101265ebb8f8735d3c is latest Linus commit in
> that tree (I have some radeon patches on top).
>
> motherboard is a Gigabyte GA-MA78GM-S2H, lspci also attached.

Hi Dave,

Looking, thanks for the report and sorry for the inconvenience.

Bjorn
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/