Re: [PATCH 1/3] PCI/ASPM: Use the path max in L1 ASPM latency check

2020-12-17 Thread Ian Kumlien
On Thu, Dec 17, 2020 at 12:21 AM Bjorn Helgaas wrote: > > On Wed, Dec 16, 2020 at 12:20:53PM +0100, Ian Kumlien wrote: > > On Wed, Dec 16, 2020 at 1:08 AM Bjorn Helgaas wrote: > > > On Tue, Dec 15, 2020 at 02:09:12PM +0100, Ian Kumlien wrote: > > > > On Tue, Dec

Re: [PATCH 1/3] PCI/ASPM: Use the path max in L1 ASPM latency check

2020-12-16 Thread Ian Kumlien
On Wed, Dec 16, 2020 at 1:08 AM Bjorn Helgaas wrote: > > On Tue, Dec 15, 2020 at 02:09:12PM +0100, Ian Kumlien wrote: > > On Tue, Dec 15, 2020 at 1:40 AM Bjorn Helgaas wrote: > > > > > > On Mon, Dec 14, 2020 at 11:56:31PM +0100, Ian Kumlien wrote: > > >

Re: [PATCH 1/3] PCI/ASPM: Use the path max in L1 ASPM latency check

2020-12-15 Thread Ian Kumlien
On Tue, Dec 15, 2020 at 1:40 AM Bjorn Helgaas wrote: > > On Mon, Dec 14, 2020 at 11:56:31PM +0100, Ian Kumlien wrote: > > On Mon, Dec 14, 2020 at 8:19 PM Bjorn Helgaas wrote: > > > > If you're interested, you could probably unload the Realtek drivers, > >

Re: [PATCH 1/3] PCI/ASPM: Use the path max in L1 ASPM latency check

2020-12-14 Thread Ian Kumlien
On Mon, Dec 14, 2020 at 8:19 PM Bjorn Helgaas wrote: > > On Mon, Dec 14, 2020 at 04:47:32PM +0100, Ian Kumlien wrote: > > On Mon, Dec 14, 2020 at 3:02 PM Bjorn Helgaas wrote: > > > On Mon, Dec 14, 2020 at 10:14:18AM +0100, Ian Kumlien wrote: > > > > On Mon, Dec

Re: [PATCH 1/3] PCI/ASPM: Use the path max in L1 ASPM latency check

2020-12-14 Thread Ian Kumlien
On Mon, Dec 14, 2020 at 3:02 PM Bjorn Helgaas wrote: > > On Mon, Dec 14, 2020 at 10:14:18AM +0100, Ian Kumlien wrote: > > On Mon, Dec 14, 2020 at 6:44 AM Bjorn Helgaas wrote: > > > > > > [+cc Jesse, Tony, David, Jakub, Heiner, lists in case there's an ASPM > &g

Re: [PATCH 1/3] PCI/ASPM: Use the path max in L1 ASPM latency check

2020-12-14 Thread Ian Kumlien
ce, but AFAICT the devices in that path give us > no reason to disable L1. If I understand the spec correctly, the > Realtek device should not be relevant to the I211 path.] > > On Sun, Dec 13, 2020 at 10:39:53PM +0100, Ian Kumlien wrote: > > On Sun, Dec 13, 2020 at 12:47 AM Bjorn

Re: [PATCH 2/2] PCI: vmd: Enable ASPM for mobile platforms

2020-10-09 Thread Ian Kumlien
On Thu, Oct 8, 2020 at 6:19 AM Kai-Heng Feng wrote: > > > > > On Oct 7, 2020, at 21:30, Bjorn Helgaas wrote: > > > > On Wed, Oct 07, 2020 at 12:26:19PM +0800, Kai-Heng Feng wrote: > >>> On Oct 6, 2020, at 03:19, Bjorn Helgaas wrote: > >>> On Tue, Oct 06, 2020 at 02:40:32AM +0800, Kai-Heng Feng

Re: [PATCH 2/2] PCI: vmd: Enable ASPM for mobile platforms

2020-10-07 Thread Ian Kumlien
On Wed, Oct 7, 2020 at 3:30 PM Bjorn Helgaas wrote: > > On Wed, Oct 07, 2020 at 12:26:19PM +0800, Kai-Heng Feng wrote: > > > On Oct 6, 2020, at 03:19, Bjorn Helgaas wrote: > > > On Tue, Oct 06, 2020 at 02:40:32AM +0800, Kai-Heng Feng wrote: > > >>> On Oct 3, 2020, at 06:18, Bjorn Helgaas wrote:

Re: [PATCH 2/2] PCI: vmd: Enable ASPM for mobile platforms

2020-10-07 Thread Ian Kumlien
On Wed, Oct 7, 2020 at 6:26 AM Kai-Heng Feng wrote: > > > > > On Oct 6, 2020, at 03:19, Bjorn Helgaas wrote: > > > > [+cc Ian, who's also working on an ASPM issue] > > > > On Tue, Oct 06, 2020 at 02:40:32AM +0800, Kai-Heng Feng wrote: > >>> On Oct 3, 2020, at 06:18, Bjorn Helgaas wrote: > >>>

Re: [PATCH v4 01/12] IB/hfi1: Check if pcie_capability_read_*() reads ~0

2020-08-03 Thread Ian Kumlien
amp; ASPM_STATE_L1) && - (latency + l1_switch_latency > acceptable->l1)) - link->aspm_capable &= ~ASPM_STATE_L1; - l1_switch_latency += 1000; + if (link->aspm_capable & ASPM_STATE_L1) { + l

Re: [PATCH 4.20 000/111] 4.20.4-stable review

2019-01-21 Thread Ian Kumlien
On Mon, Jan 21, 2019 at 7:48 PM David Miller wrote: > > From: Ian Kumlien > Date: Mon, 21 Jan 2019 16:38:11 +0100 > > > Hi David, > > > > could we have your blessing to add the following patch to -stable for > > 4.20.4: > > commit 41d1c8839e5f8cb781cc63

Re: [PATCH 4.20 000/111] 4.20.4-stable review

2019-01-21 Thread Ian Kumlien
id S. Miller --- Also, do you keep your -stable queue somewhere where I can see it? It feels like I'm stepping on toes, adding overhead and such, which is not what i want... On Mon, Jan 21, 2019 at 4:10 PM Greg KH wrote: > On Mon, Jan 21, 2019 at 03:56:24PM +0100, Ian Kumlien wrote: >

Re: [PATCH 4.20 000/111] 4.20.4-stable review

2019-01-21 Thread Ian Kumlien
On Mon, Jan 21, 2019 at 3:47 PM Greg KH wrote: > > On Mon, Jan 21, 2019 at 03:44:24PM +0100, Ian Kumlien wrote: > > Hi, > > > > IMHO you are missing: 41d1c8839e5f8cb781cc635f12791decee8271b7 > > > > Which should be marked for stable, it fixes: > > h

Re: [PATCH 4.20 000/111] 4.20.4-stable review

2019-01-21 Thread Ian Kumlien
Hi, IMHO you are missing: 41d1c8839e5f8cb781cc635f12791decee8271b7 Which should be marked for stable, it fixes: https://bugzilla.kernel.org/show_bug.cgi?id=202235

Re: [BUG] moving fq back to clock monotonic breaks my setup

2019-01-11 Thread Ian Kumlien
On Fri, Jan 11, 2019 at 10:35 AM Eric Dumazet wrote: > On Thu, Jan 10, 2019 at 12:55 AM Paolo Abeni wrote: > > On Thu, 2019-01-10 at 09:25 +0100, Ian Kumlien wrote: > > > > > This works, and so does: > > > https://marc.info/?l=linux-netdev=154696956604748=2

Re: [BUG] moving fq back to clock monotonic breaks my setup

2019-01-10 Thread Ian Kumlien
On Thu, Jan 10, 2019 at 6:53 AM Eric Dumazet wrote: > On Wed, Jan 9, 2019 at 4:48 PM Ian Kumlien wrote: > > > > Hi, > > > > Just been trough ~5+ hours of bisecting and eventually actually found > > the culprit =) > > > > commit fb420d5d91c1

[BUG] moving fq back to clock monotonic breaks my setup

2019-01-09 Thread Ian Kumlien
Hi, Just been trough ~5+ hours of bisecting and eventually actually found the culprit =) commit fb420d5d91c1274d5966917725e71f27ed092a85 (refs/bisect/bad) Author: Eric Dumazet Date: Fri Sep 28 10:28:44 2018 -0700 tcp/fq: move back to CLOCK_MONOTONIC [--8<--] So this might be because my

Re: [BUG] v4.20 - bridge not getting DHCP responses? (works in 4.19.13)

2019-01-09 Thread Ian Kumlien
Confirmed, sending a new mail with summary etc On Thu, Jan 10, 2019 at 1:16 AM Ian Kumlien wrote: > > On Wed, Jan 9, 2019 at 12:17 AM Ian Kumlien wrote: > > On Wed, Jan 9, 2019, 00:09 Florian Fainelli > [--8<---] > > >> > when looking at "git log v4.

Re: [BUG] v4.20 - bridge not getting DHCP responses? (works in 4.19.13)

2019-01-09 Thread Ian Kumlien
On Wed, Jan 9, 2019 at 12:17 AM Ian Kumlien wrote: > On Wed, Jan 9, 2019, 00:09 Florian Fainelli > > when looking at "git log v4.19...v4.20 >> > drivers/net/ethernet/intel/ixgbe/" nothing else really stands out... >> > The machine is also running NAT for my ho

Re: [BUG] v4.20 - bridge not getting DHCP responses? (works in 4.19.13)

2019-01-08 Thread Ian Kumlien
On Tue, Jan 8, 2019 at 11:51 PM Stephen Hemminger wrote: > On Tue, 8 Jan 2019 23:10:04 +0100 > Ian Kumlien wrote: > > On Sun, Jan 6, 2019 at 11:21 PM Ian Kumlien wrote: > > > > > > [Sorry for the repost, screwed up the netdev address...] > > > > > &g

Re: [BUG] v4.20 - bridge not getting DHCP responses? (works in 4.19.13)

2019-01-08 Thread Ian Kumlien
On Sun, Jan 6, 2019 at 11:21 PM Ian Kumlien wrote: > > [Sorry for the repost, screwed up the netdev address...] > > Hi, > > Switching from 4.19.x -> 4.20 resulted in DHCP not working for my VM:s. > > My firewall (which also runs the dhcpd) runs VM:s and it doe

Fwd: [BUG] v4.20 - bridge not getting DHCP responses? (works in 4.19.13)

2019-01-06 Thread Ian Kumlien
[Sorry for the repost, screwed up the netdev address...] Hi, Switching from 4.19.x -> 4.20 resulted in DHCP not working for my VM:s. My firewall (which also runs the dhcpd) runs VM:s and it does this by having physical interfaces attached to bridges - which the VM:s in turn attach to. Since

[BUG] v4.20 - bridge not getting DHCP responses? (works in 4.19.13)

2019-01-06 Thread Ian Kumlien
Hi, Switching from 4.19.x -> 4.20 resulted in DHCP not working for my VM:s. My firewall (which also runs the dhcpd) runs VM:s and it does this by having physical interfaces attached to bridges - which the VM:s in turn attach to. Since 4.20 the VM:s can't use DHCP, it's odd since the requests

Re: pcc_cpufreq: high LA

2018-12-03 Thread Ian Kumlien
No response? Should pcc_cpufreq be assumed as broken since it actually kills machines? Should I submit a patch that removes it? On Tue, Nov 20, 2018 at 3:05 PM Ian Kumlien wrote: > > Hi, > > We've had this happen a few times now, pcc_cpufreq is loaded and the > machine

Re: pcc_cpufreq: high LA

2018-12-03 Thread Ian Kumlien
No response? Should pcc_cpufreq be assumed as broken since it actually kills machines? Should I submit a patch that removes it? On Tue, Nov 20, 2018 at 3:05 PM Ian Kumlien wrote: > > Hi, > > We've had this happen a few times now, pcc_cpufreq is loaded and the > machine

pcc_cpufreq: high LA

2018-11-20 Thread Ian Kumlien
Hi, We've had this happen a few times now, pcc_cpufreq is loaded and the machine has a LA of 33 with kworkers consuming *all CPU* We have had this happen before, looking at it has been pushed to the leaky-stack^tm in my mind and... 32 cores: processor : 31 vendor_id : GenuineIntel cpu family :

pcc_cpufreq: high LA

2018-11-20 Thread Ian Kumlien
Hi, We've had this happen a few times now, pcc_cpufreq is loaded and the machine has a LA of 33 with kworkers consuming *all CPU* We have had this happen before, looking at it has been pushed to the leaky-stack^tm in my mind and... 32 cores: processor : 31 vendor_id : GenuineIntel cpu family :

[nfsd] page allocation failure -> kernel oops, even local fs hangs.

2018-07-04 Thread Ian Kumlien
Hi, I just had this happen a little while ago, got different weird deadlocks but this one actually generated a oops.. This is basic operation, a machine with 16 gb memory mainly doing NFS traffic. I'm currently playing with RDMA for this, which is why mlx4 is included. When the crash occurs,

[nfsd] page allocation failure -> kernel oops, even local fs hangs.

2018-07-04 Thread Ian Kumlien
Hi, I just had this happen a little while ago, got different weird deadlocks but this one actually generated a oops.. This is basic operation, a machine with 16 gb memory mainly doing NFS traffic. I'm currently playing with RDMA for this, which is why mlx4 is included. When the crash occurs,

[4.11.0 Kernel oops] propagate_one+0xa1/0x200

2017-06-26 Thread Ian Kumlien
Hi, While restarting a docker container today we had four machines deadlock with the following oops If this is related to propagate_one, then nothing has changed in the later kernels and this could still be a issue I'm still trying to investigate what happened This is the OCR-corrected

[4.11.0 Kernel oops] propagate_one+0xa1/0x200

2017-06-26 Thread Ian Kumlien
Hi, While restarting a docker container today we had four machines deadlock with the following oops If this is related to propagate_one, then nothing has changed in the later kernels and this could still be a issue I'm still trying to investigate what happened This is the OCR-corrected

[PATCH] Update pptp handling to avoid null pointer deref. v2

2017-01-02 Thread Ian Kumlien
80 Fixes: ab10dccb1160 ("rps: Inspect PPTP encapsulated by GRE to get flow hash") Signed-off-by: Ian Kumlien <ian.kuml...@gmail.com> --- net/core/flow_dissector.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/net/core/flow_dissector.c b/net/core/flow_disse

[PATCH] Update pptp handling to avoid null pointer deref. v2

2017-01-02 Thread Ian Kumlien
80 Fixes: ab10dccb1160 ("rps: Inspect PPTP encapsulated by GRE to get flow hash") Signed-off-by: Ian Kumlien --- net/core/flow_dissector.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/net/core/flow_dissector.c b/net/core/flow_dissector.c index c6d8207ffa7e..

[PATCH] Update pptp handling to avoid null pointer deref. v2

2017-01-02 Thread Ian Kumlien
80 Fixes: ab10dccb1160 ("rps: Inspect PPTP encapsulated by GRE to get flow hash") Signed-off-by: Ian Kumlien <ian.kuml...@gmail.com> --- net/core/flow_dissector.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/net/core/flow_dissector.c b/net/core/flow_disse

[PATCH] Update pptp handling to avoid null pointer deref. v2

2017-01-02 Thread Ian Kumlien
80 Fixes: ab10dccb1160 ("rps: Inspect PPTP encapsulated by GRE to get flow hash") Signed-off-by: Ian Kumlien --- net/core/flow_dissector.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/net/core/flow_dissector.c b/net/core/flow_dissector.c index c6d8207ffa7e..

Re: [PATCH] Update pptp handling to avoid null pointer deref.

2017-01-02 Thread Ian Kumlien
On Mon, Jan 2, 2017 at 5:07 AM, David Miller <da...@davemloft.net> wrote: > From: Ian Kumlien <ian.kuml...@gmail.com> > Date: Mon, 2 Jan 2017 00:19:36 +0100 > >> __skb_flow_dissect can be called with a skb or a data packet, either >> can be NULL. A

Re: [PATCH] Update pptp handling to avoid null pointer deref.

2017-01-02 Thread Ian Kumlien
On Mon, Jan 2, 2017 at 5:07 AM, David Miller wrote: > From: Ian Kumlien > Date: Mon, 2 Jan 2017 00:19:36 +0100 > >> __skb_flow_dissect can be called with a skb or a data packet, either >> can be NULL. All calls seems to have been moved to __skb_header_pointer >> exc

[PATCH] Update pptp handling to avoid null pointer deref.

2017-01-01 Thread Ian Kumlien
00080 --- Signed-off-by: Ian Kumlien <ian.kuml...@gmail.com> --- net/core/flow_dissector.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/net/core/flow_dissector.c b/net/core/flow_dissector.c index c6d8207ffa7e..32e4e0158846 100644 --- a/net/core/flow_dissector.c

[PATCH] Update pptp handling to avoid null pointer deref.

2017-01-01 Thread Ian Kumlien
00080 --- Signed-off-by: Ian Kumlien --- net/core/flow_dissector.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/net/core/flow_dissector.c b/net/core/flow_dissector.c index c6d8207ffa7e..32e4e0158846 100644 --- a/net/core/flow_dissector.c +++ b/net/core/flow_dissector.c

Re: [BUG] 4.9 - kernel oops when pptp connection is established and the kernel doesn't have pptp modules compiled

2017-01-01 Thread Ian Kumlien
f + offset, +sizeof(_ppp_hdr), +data, hlen, _ppp_hdr); if (!ppp_hdr) goto out_bad; Will send a patch with signed off by n' all. On Sun, Jan 1, 2017 at 6:31 PM, Ian Kumlien <ian.kuml

Re: [BUG] 4.9 - kernel oops when pptp connection is established and the kernel doesn't have pptp modules compiled

2017-01-01 Thread Ian Kumlien
f + offset, +sizeof(_ppp_hdr), +data, hlen, _ppp_hdr); if (!ppp_hdr) goto out_bad; Will send a patch with signed off by n' all. On Sun, Jan 1, 2017 at 6:31 PM, Ian Kumlien wrote: >

Re: [BUG] 4.9 - kernel oops when pptp connection is established and the kernel doesn't have pptp modules compiled

2017-01-01 Thread Ian Kumlien
On Fri, Dec 30, 2016 at 11:48 PM, Ian Kumlien <ian.kuml...@gmail.com> wrote: > Hi, > > Been fighting with "crash" to get it to help me to analyze my crash > dumps... This is the output from vmcore-dmesg. > > This is 100% reproducible... > > Config tha

Re: [BUG] 4.9 - kernel oops when pptp connection is established and the kernel doesn't have pptp modules compiled

2017-01-01 Thread Ian Kumlien
On Fri, Dec 30, 2016 at 11:48 PM, Ian Kumlien wrote: > Hi, > > Been fighting with "crash" to get it to help me to analyze my crash > dumps... This is the output from vmcore-dmesg. > > This is 100% reproducible... > > Config that lets the connec

[BUG] 4.9 - kernel oops when pptp connection is established and the kernel doesn't have pptp modules compiled

2016-12-30 Thread Ian Kumlien
Hi, Been fighting with "crash" to get it to help me to analyze my crash dumps... This is the output from vmcore-dmesg. This is 100% reproducible... Config that lets the connection trough but crashes the kernel: # CONFIG_NF_CONNTRACK_PPTP is not set # CONFIG_NF_NAT_PPTP is not set CONFIG_PPTP=y

[BUG] 4.9 - kernel oops when pptp connection is established and the kernel doesn't have pptp modules compiled

2016-12-30 Thread Ian Kumlien
Hi, Been fighting with "crash" to get it to help me to analyze my crash dumps... This is the output from vmcore-dmesg. This is 100% reproducible... Config that lets the connection trough but crashes the kernel: # CONFIG_NF_CONNTRACK_PPTP is not set # CONFIG_NF_NAT_PPTP is not set CONFIG_PPTP=y

Re: [BUG][4.5-rc5] rcu_shed self-detected stall on CPU - directly after network goes online.

2016-02-24 Thread Ian Kumlien
On 22 February 2016 at 01:38, Ian Kumlien <ian.kuml...@gmail.com> wrote: > Hi, > > When i tried to upgrade my, soon to be, firewall to 4.5-rc5 to do some > testing - it deadlocked almost instantly. After bisect, the offending patch seems to be: b16c29191dc89bd877af99a7b04ce486

Re: [BUG][4.5-rc5] rcu_shed self-detected stall on CPU - directly after network goes online.

2016-02-24 Thread Ian Kumlien
On 22 February 2016 at 01:38, Ian Kumlien wrote: > Hi, > > When i tried to upgrade my, soon to be, firewall to 4.5-rc5 to do some > testing - it deadlocked almost instantly. After bisect, the offending patch seems to be: b16c29191dc89bd877af99a7b04ce4866728a3e0 It looks like some

[BUG][4.5-rc5] rcu_shed self-detected stall on CPU - directly after network goes online.

2016-02-21 Thread Ian Kumlien
Hi, When i tried to upgrade my, soon to be, firewall to 4.5-rc5 to do some testing - it deadlocked almost instantly. In the photo, i started writing "root" and it keeps repeating it, like it's in a while loop. https://goo.gl/photos/yGhNSogJjeb2VJyu5 Trying to get better information - as in any

[BUG][4.5-rc5] rcu_shed self-detected stall on CPU - directly after network goes online.

2016-02-21 Thread Ian Kumlien
Hi, When i tried to upgrade my, soon to be, firewall to 4.5-rc5 to do some testing - it deadlocked almost instantly. In the photo, i started writing "root" and it keeps repeating it, like it's in a while loop. https://goo.gl/photos/yGhNSogJjeb2VJyu5 Trying to get better information - as in any

Re: [BUG] linux 4.3 - intel graphics card driver keeps oopsing

2015-11-04 Thread Ian Kumlien
After a few hours the machine started to spontaneously reboot. It's back to running 4.2.5 - which works fine. (I don't know if these messages are actually reaching lkml since it's not showing up in the index on marc.info) On 4 November 2015 at 10:24, Ian Kumlien wrote: > Hi, > > This

[BUG] linux 4.3 - intel graphics card driver keeps oopsing

2015-11-04 Thread Ian Kumlien
Hi, This happened when i booted linux 4.3 on a old intel machine. Please note, it's named before "twilight" was known and it's name is inspired by "twilight zone" since it's actually a firewall ;) PID USER PR NIVIRTRESSHR S %CPU %MEM TIME+ COMMAND 1666 root 20 0

Re: [BUG] linux 4.3 - intel graphics card driver keeps oopsing

2015-11-04 Thread Ian Kumlien
After a few hours the machine started to spontaneously reboot. It's back to running 4.2.5 - which works fine. (I don't know if these messages are actually reaching lkml since it's not showing up in the index on marc.info) On 4 November 2015 at 10:24, Ian Kumlien <ian.kuml...@gmail.com>

[BUG] linux 4.3 - intel graphics card driver keeps oopsing

2015-11-04 Thread Ian Kumlien
Hi, This happened when i booted linux 4.3 on a old intel machine. Please note, it's named before "twilight" was known and it's name is inspired by "twilight zone" since it's actually a firewall ;) PID USER PR NIVIRTRESSHR S %CPU %MEM TIME+ COMMAND 1666 root 20 0

[igb] AER timeout - resend.

2015-02-23 Thread Ian Kumlien
Sending this to both netdev and kernel since i don't know if it's the driver or the pcie AER that does something odd - the machine was stable before 3.19 and PCIE AER. Everything started out like i first sent to linux nics () intel: -- And today i had some issues and wondered why things was

[igb] AER timeout - resend.

2015-02-23 Thread Ian Kumlien
Sending this to both netdev and kernel since i don't know if it's the driver or the pcie AER that does something odd - the machine was stable before 3.19 and PCIE AER. Everything started out like i first sent to linux nics () intel: -- And today i had some issues and wondered why things was

Re: 3.18-rc regression: drm/nouveau: use shared fences for readable objects

2014-11-28 Thread Ian Kumlien
Hi, Sorry to but in like this but I'm suffering from the same kind of deadlocks with nouveau... The really odd thing is that i could boot some -rc6+ kernel without problems but it hung while playing video and then it refused to start properly again. Anyway, to quote Maarten: Ok that most

Re: 3.18-rc regression: drm/nouveau: use shared fences for readable objects

2014-11-28 Thread Ian Kumlien
Hi, Sorry to but in like this but I'm suffering from the same kind of deadlocks with nouveau... The really odd thing is that i could boot some -rc6+ kernel without problems but it hung while playing video and then it refused to start properly again. Anyway, to quote Maarten: Ok that most

[Q] threading and scope_process

2014-08-29 Thread Ian Kumlien
Hi, I have to admit that this is perhaps not the best time to write this, late in the evening after work but... =) A friend of mine recently started to develop things on Linux and got a bit confused since Linux lacks PTHREAD_SCOPE_PROCESS which just about all other OS:es seems to implement.

[Q] threading and scope_process

2014-08-29 Thread Ian Kumlien
Hi, I have to admit that this is perhaps not the best time to write this, late in the evening after work but... =) A friend of mine recently started to develop things on Linux and got a bit confused since Linux lacks PTHREAD_SCOPE_PROCESS which just about all other OS:es seems to implement.

Re: [Intel-gfx] [BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-08-01 Thread Ian Kumlien
On Fri, Aug 1, 2014 at 3:16 PM, Imre Deak wrote: [--8<--] > Ok, I see the trace of suspend/resume now, but the bug has vanished.. I > can't see the WARN backtrace in your original report, nor the debug > message from the above fix, that would indicate that it had fixed > anything ("VDD left on

Re: [Intel-gfx] [BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-08-01 Thread Ian Kumlien
On Fri, Aug 1, 2014 at 3:16 PM, Imre Deak imre.d...@intel.com wrote: [--8--] Ok, I see the trace of suspend/resume now, but the bug has vanished.. I can't see the WARN backtrace in your original report, nor the debug message from the above fix, that would indicate that it had fixed anything

Re: [Intel-gfx] [BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-25 Thread Ian Kumlien
On fre, 2014-07-25 at 12:28 +0300, Imre Deak wrote: > On Thu, 2014-07-24 at 01:33 +0200, Ian Kumlien wrote: > > Try four, now including CC lists for the intel driver... > > Could you give a try to the 2 patches at: > https://patchwork.kernel.org/patch/4437061/ > > If t

Re: [Intel-gfx] [BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-25 Thread Ian Kumlien
On fre, 2014-07-25 at 12:28 +0300, Imre Deak wrote: On Thu, 2014-07-24 at 01:33 +0200, Ian Kumlien wrote: Try four, now including CC lists for the intel driver... Could you give a try to the 2 patches at: https://patchwork.kernel.org/patch/4437061/ If these don't fix the issue, could you

[BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-23 Thread Ian Kumlien
rep indicated that there might be two more missing...) commit 423d01e192504764d5cb59effe5da68b035a0d41 Author: Ian Kumlien Date: Tue Jul 22 21:10:50 2014 +0200 Add intel_display_power_get before _put When closing the lid when using 3.16-rc6 on my laptop i got three "WA

[BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-23 Thread Ian Kumlien
] [81997e6c] ret_from_fork+0x7c/0xb0 [ 9064.008878] [810eca60] ? kthread_create_on_node+0x170/0x170 [ 9064.008880] ---[ end trace 17f9738f20aec288 ]--- This fixes the cases I found for my usecase: closing and opening the lid. Signed-off-by: Ian Kumlien

[BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-22 Thread Ian Kumlien
..) commit 423d01e192504764d5cb59effe5da68b035a0d41 Author: Ian Kumlien Date: Tue Jul 22 21:10:50 2014 +0200 Add intel_display_power_get before _put When closing the lid when using 3.16-rc6 on my laptop i got three "WARN_ON" back traces. Example: [ 9064.0088

[BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-22 Thread Ian Kumlien
...) commit 423d01e192504764d5cb59effe5da68b035a0d41 Author: Ian Kumlien Date: Tue Jul 22 21:10:50 2014 +0200 Add intel_display_power_get before _put When closing the lid when using 3.16-rc6 on my laptop i got three "WARN_ON" back traces. Example: [ 9064.008821] WARNING: CPU: 3 PI

Re: [RFC] 3.16-rc6 -- fs/direct-io.c:1011 from and to uninitialized.

2014-07-22 Thread Ian Kumlien
On tis, 2014-07-22 at 12:20 -0700, Randy Dunlap wrote: > On 07/22/2014 12:03 PM, Ian Kumlien wrote: > > This is a resend, try two... > > > > And while the fix is simple, something along the lines of: > > diff --git a/fs/direct-io.c b/fs/direct-io.c > > index 980

Re: [RFC] 3.16-rc6 -- fs/direct-io.c:1011 from and to uninitialized.

2014-07-22 Thread Ian Kumlien
On tis, 2014-07-22 at 21:12 +0200, Richard Weinberger wrote: > On Tue, Jul 22, 2014 at 9:03 PM, Ian Kumlien wrote: > > This is a resend, try two... > > Please see "[PATCH v3] direct-io: fix uninitialized warning in > do_direct_IO()". That looks like a better approac

[RFC] 3.16-rc6 -- fs/direct-io.c:1011 from and to uninitialized.

2014-07-22 Thread Ian Kumlien
if (IS_ERR(page)) { ret = PTR_ERR(page); --- I however don't know if it's in the correct C standard, it compiles fine though... (or if this is more gcc speific) commit f94d05ce10d869c418d3271bd028fc33bfd25e6f Author: Ian Kumlien Date: Tue Jul 22 20:57:50 2014 +0200

[RFC] 3.16-rc6 -- fs/direct-io.c:1011 from and to uninitialized.

2014-07-22 Thread Ian Kumlien
(page)) { ret = PTR_ERR(page); --- I however don't know if it's in the correct C standard, it compiles fine though... (or if this is more gcc speific) commit f94d05ce10d869c418d3271bd028fc33bfd25e6f Author: Ian Kumlien ian.kuml...@gmail.com Date: Tue Jul 22 20:57:50 2014

Re: [RFC] 3.16-rc6 -- fs/direct-io.c:1011 from and to uninitialized.

2014-07-22 Thread Ian Kumlien
On tis, 2014-07-22 at 21:12 +0200, Richard Weinberger wrote: On Tue, Jul 22, 2014 at 9:03 PM, Ian Kumlien ian.kuml...@gmail.com wrote: This is a resend, try two... Please see [PATCH v3] direct-io: fix uninitialized warning in do_direct_IO(). That looks like a better approach, couldn't

Re: [RFC] 3.16-rc6 -- fs/direct-io.c:1011 from and to uninitialized.

2014-07-22 Thread Ian Kumlien
On tis, 2014-07-22 at 12:20 -0700, Randy Dunlap wrote: On 07/22/2014 12:03 PM, Ian Kumlien wrote: This is a resend, try two... And while the fix is simple, something along the lines of: diff --git a/fs/direct-io.c b/fs/direct-io.c index 98040ba..64a8286 100644 --- a/fs/direct-io.c

[BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-22 Thread Ian Kumlien
? Should it be done differently? (This handles the close and open lid usecase, i don't know if there are others, a grep indicated that there might be two more missing...) commit 423d01e192504764d5cb59effe5da68b035a0d41 Author: Ian Kumlien ian.kuml...@gmail.com Date: Tue Jul 22 21:10:50 2014 +0200

[BUG?] 3.16-rc6 ... at drivers/gpu/drm/i915/intel_pm.c:5997 intel_display_power_put+0x12d/0x160()

2014-07-22 Thread Ian Kumlien
is: Is this the correct approach? Should it be done differently? (This handles the close and open lid usecase, i don't know if there are others, a grep indicated that there might be two more missing...) commit 423d01e192504764d5cb59effe5da68b035a0d41 Author: Ian Kumlien ian.kuml...@gmail.com Date: Tue Jul 22

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-22 Thread Ian Kumlien
On ons, 2014-01-22 at 11:52 +1100, NeilBrown wrote: > On Mon, 20 Jan 2014 19:27:18 +0100 Ian Kumlien wrote: > > I have been unable to trip this up, so this was it! > > > > Tested-by: Ian Kumlien > > > > I hope this hits stable ASAP ;) > > I've push

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-22 Thread Ian Kumlien
On ons, 2014-01-22 at 11:52 +1100, NeilBrown wrote: On Mon, 20 Jan 2014 19:27:18 +0100 Ian Kumlien ian.kuml...@gmail.com wrote: I have been unable to trip this up, so this was it! Tested-by: Ian Kumlien ian.kuml...@gmail.com I hope this hits stable ASAP ;) I've push it out into my

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-20 Thread Ian Kumlien
t; The important aspect of the patch is that it moves the "atomic_inc" of > "sh->count" back under the protection of ->device_lock in the case when some > other thread might be using the same 'sh'. I have been unable to trip this up, so this was it! Tested-by:

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-20 Thread Ian Kumlien
On mån, 2014-01-20 at 14:37 +1100, NeilBrown wrote: > On Mon, 20 Jan 2014 01:49:17 +0100 Ian Kumlien wrote: > > > On mån, 2014-01-20 at 11:38 +1100, NeilBrown wrote: > > > On Sun, 19 Jan 2014 23:00:23 +0100 Ian Kumlien > > > wrote: > > > >

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-20 Thread Ian Kumlien
is that it moves the atomic_inc of sh-count back under the protection of -device_lock in the case when some other thread might be using the same 'sh'. I have been unable to trip this up, so this was it! Tested-by: Ian Kumlien ian.kuml...@gmail.com I hope this hits stable ASAP ;) Thanks, NeilBrown

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-20 Thread Ian Kumlien
On mån, 2014-01-20 at 14:37 +1100, NeilBrown wrote: On Mon, 20 Jan 2014 01:49:17 +0100 Ian Kumlien ian.kuml...@gmail.com wrote: On mån, 2014-01-20 at 11:38 +1100, NeilBrown wrote: On Sun, 19 Jan 2014 23:00:23 +0100 Ian Kumlien ian.kuml...@gmail.com wrote: Ok, so third try

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-19 Thread Ian Kumlien
On mån, 2014-01-20 at 11:38 +1100, NeilBrown wrote: > On Sun, 19 Jan 2014 23:00:23 +0100 Ian Kumlien wrote: > > > Ok, so third try to actually email this... > > --- > > > > Hi, > > > > I started testing 3.13-rc8 on another machine since t

[BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-19 Thread Ian Kumlien
Ok, so third try to actually email this... --- Hi, I started testing 3.13-rc8 on another machine since the first one seemed to be working fine... One spontaneous reboot later i'm not so sure ;) Right now i captured a kernel oops in the raid code it seems... (Also attached to avoid mangling)

[BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-19 Thread Ian Kumlien
Ok, so third try to actually email this... --- Hi, I started testing 3.13-rc8 on another machine since the first one seemed to be working fine... One spontaneous reboot later i'm not so sure ;) Right now i captured a kernel oops in the raid code it seems... (Also attached to avoid mangling)

Re: [BUG] at drivers/md/raid5.c:291! kernel 3.13-rc8

2014-01-19 Thread Ian Kumlien
On mån, 2014-01-20 at 11:38 +1100, NeilBrown wrote: On Sun, 19 Jan 2014 23:00:23 +0100 Ian Kumlien ian.kuml...@gmail.com wrote: Ok, so third try to actually email this... --- Hi, I started testing 3.13-rc8 on another machine since the first one seemed to be working fine

Re: [OOPS][3.12] BUG: unable to handle kernel NULL pointer dereference at 0000000c

2013-11-15 Thread Ian Kumlien
On Fri, Nov 15, 2013 at 05:44:26PM -0500, David Miller wrote: > From: Bjorn Helgaas > Date: Fri, 15 Nov 2013 15:29:53 -0700 > > > [+cc David, Eric, Alex, netdev] > > > > Alex reported a similar issue at > > http://marc.info/?l=linux-netdev=138355719901790=4 > > Fixed by: > > commit

[OOPS][3.12] BUG: unable to handle kernel NULL pointer dereference at 0000000c

2013-11-15 Thread Ian Kumlien
Hi, After a lot of wondering i finally tracked down the bug that was hitting me since 3.12-rc7. Since this is a firewall I haven't actually noticed it all the time. But when i saw that it rebooted too often, i enabled netconsole and this is the output: BUG: unable to handle kernel NULL pointer

[OOPS][3.12] BUG: unable to handle kernel NULL pointer dereference at 0000000c

2013-11-15 Thread Ian Kumlien
Hi, After a lot of wondering i finally tracked down the bug that was hitting me since 3.12-rc7. Since this is a firewall I haven't actually noticed it all the time. But when i saw that it rebooted too often, i enabled netconsole and this is the output: BUG: unable to handle kernel NULL pointer

Re: [OOPS][3.12] BUG: unable to handle kernel NULL pointer dereference at 0000000c

2013-11-15 Thread Ian Kumlien
On Fri, Nov 15, 2013 at 05:44:26PM -0500, David Miller wrote: From: Bjorn Helgaas bhelg...@google.com Date: Fri, 15 Nov 2013 15:29:53 -0700 [+cc David, Eric, Alex, netdev] Alex reported a similar issue at http://marc.info/?l=linux-netdevm=138355719901790w=4 Fixed by: commit

atl1c: networking bug - increasing counters with no data.

2013-01-25 Thread Ian Kumlien
Hi all, I have a atl1c networking card in my laptop - it's not connected since i use wifi instead. After a while of uptime it seems like it goes haywire, tools are currently reporting data transfers of ~9GB/s ifconfig eth0 eth0: flags=4099 mtu 1500 ether 48:5b:39:5e:b0:4b txqueuelen

atl1c: networking bug - increasing counters with no data.

2013-01-25 Thread Ian Kumlien
Hi all, I have a atl1c networking card in my laptop - it's not connected since i use wifi instead. After a while of uptime it seems like it goes haywire, tools are currently reporting data transfers of ~9GB/s ifconfig eth0 eth0: flags=4099UP,BROADCAST,MULTICAST mtu 1500 ether

Re: [BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-29 Thread Ian Kumlien
On tor, 2012-11-29 at 20:25 +0100, Borislav Petkov wrote: > On Thu, Nov 29, 2012 at 05:24:03PM +0100, Ian Kumlien wrote: > > > Can you do: > > > > > > make arch/x86/kernel/ptrace.lst > > > > > > and send me that file, privately is f

Re: [BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-29 Thread Ian Kumlien
On Thu, Nov 29, 2012 at 03:22:20PM +0100, Borislav Petkov wrote: > On Thu, Nov 29, 2012 at 01:27:08PM +0100, Ian Kumlien wrote: > > I think that chrome does traceing all the time as a part of it's > > sandbox - this is most likely chrome monitoring flash... > > Ah, ok. [

Re: [BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-29 Thread Ian Kumlien
On Thu, Nov 29, 2012 at 10:26:21AM +0100, Borislav Petkov wrote: > On Thu, Nov 29, 2012 at 12:20:02AM +0100, Ian Kumlien wrote: > > Hi, > > > > Due to unexplained dns problems, I'll be using google plus to post the > > photo of the bug output. > > &

Re: [BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-29 Thread Ian Kumlien
On Thu, Nov 29, 2012 at 10:26:21AM +0100, Borislav Petkov wrote: On Thu, Nov 29, 2012 at 12:20:02AM +0100, Ian Kumlien wrote: Hi, Due to unexplained dns problems, I'll be using google plus to post the photo of the bug output. https://plus.google.com/photos/110698868656495230656

Re: [BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-29 Thread Ian Kumlien
On Thu, Nov 29, 2012 at 03:22:20PM +0100, Borislav Petkov wrote: On Thu, Nov 29, 2012 at 01:27:08PM +0100, Ian Kumlien wrote: I think that chrome does traceing all the time as a part of it's sandbox - this is most likely chrome monitoring flash... Ah, ok. [ --8--8-- ] Right, so I can

Re: [BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-29 Thread Ian Kumlien
On tor, 2012-11-29 at 20:25 +0100, Borislav Petkov wrote: On Thu, Nov 29, 2012 at 05:24:03PM +0100, Ian Kumlien wrote: Can you do: make arch/x86/kernel/ptrace.lst and send me that file, privately is fine too. Done, =) Ok, thanks. Here it is: 8100b627: 83

[BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-28 Thread Ian Kumlien
these things handled ;) -- Ian Kumlien -- http://demius.net || http://pomac.netswarm.net signature.asc Description: This is a digitally signed message part

[BUG] NULL pointer dereference in 3.7-rc7 (syscall_trace_enter)

2012-11-28 Thread Ian Kumlien
these things handled ;) -- Ian Kumlien -- http://demius.net || http://pomac.netswarm.net signature.asc Description: This is a digitally signed message part

[SKY2] Problems (2.6.24-rc3-git1)

2007-11-27 Thread Ian Kumlien
From lkml: On Tue, 27 Nov 2007 23:40:22 +0100 Ian Kumlien <[EMAIL PROTECTED]> wrote: > [Repost, no reply has been recived] > > Hi, > > A little while ago, something went horribly wrong. > > I could still use my mouse and the desktop was still alive more or > less.

  1   2   >