Re: Need help in bug in isolate_migratepages_range

2014-02-03 Thread Holger Kiehl
On Mon, 3 Feb 2014, David Rientjes wrote: On Mon, 3 Feb 2014, Vlastimil Babka wrote: It seems to come from balloon_page_movable() and its test page_count(page) == 1. Hmm, I think it might be because compound_head() == NULL here. Holger, this looks like a race condition when allocating a co

Re: Need help in bug in isolate_migratepages_range

2014-02-03 Thread David Rientjes
On Mon, 3 Feb 2014, Vlastimil Babka wrote: > It seems to come from balloon_page_movable() and its test page_count(page) == > 1. > Hmm, I think it might be because compound_head() == NULL here. Holger, this looks like a race condition when allocating a compound page, did you only see it once o

Re: Need help in bug in isolate_migratepages_range

2014-02-03 Thread Holger Kiehl
On Mon, 3 Feb 2014, Michal Hocko wrote: On Mon 03-02-14 14:29:22, Holger Kiehl wrote: I have attached it. Please, tell me if you do not get the attachment. I hoped it would help me to get a closer compiled code to yours but I am probably using too different gcc. I have an old gcc, it is 4.4

Re: Need help in bug in isolate_migratepages_range

2014-02-03 Thread Vlastimil Babka
On 02/03/2014 05:20 PM, Michal Hocko wrote: On Mon 03-02-14 14:29:22, Holger Kiehl wrote: I have attached it. Please, tell me if you do not get the attachment. I hoped it would help me to get a closer compiled code to yours but I am probably using too different gcc. Anyway I've tried to check

Re: Need help in bug in isolate_migratepages_range

2014-02-03 Thread Michal Hocko
On Mon 03-02-14 14:29:22, Holger Kiehl wrote: > I have attached it. Please, tell me if you do not get the attachment. I hoped it would help me to get a closer compiled code to yours but I am probably using too different gcc. Anyway I've tried to check whether I can hook on something and it seems t

Re: Need help in bug in isolate_migratepages_range

2014-02-03 Thread Michal Hocko
[CCing linux-mm] Does this ring bells? I haven't checked very deeply but it doesn't seem to be fixed since 3.12. Hoolger, could you post your config, please? On Fri 31-01-14 21:12:27, Holger Kiehl wrote: > Hello, > > today one of our system got a kernel bug message. It kept on running > but mor

Need help in bug in isolate_migratepages_range

2014-01-31 Thread Holger Kiehl
Hello, today one of our system got a kernel bug message. It kept on running but more and more process begin to be stuck in D state (eg. a simple w command would never return) and I eventually had to reboot. Here the full message: Jan 31 13:07:43 asterix kernel: BUG: unable to handle kernel NU