Re: [PATCH 0/5] prevent OOM triggered by TTM

Thomas Hellstrom Wed, 07 Feb 2018 10:08:42 -0800

Hi,

On 02/07/2018 02:22 PM, Christian König wrote:

Understood, but why is that?
Well because customers requested it :)
What we try to do here is having a parameter which says when less thanx megabytes of memory are left then fail the allocation.
This is basically to prevent buggy applications which try to allocateas much memory as possible until they receive an -ENOMEM from runninginto the OOM killer.


OK. Understood.

That's true, but with VRAM, TTM overcommits swap space which may leadto ugly memory allocation failures at hibernate time.
Yeah, that is exactly the reason why I said that Roger should disablethe limit during suspend swap out :)

Well that was really in the context of the swapping implementationrather in the context of this change so it was a little off-topic. Evenif disabling this limit, TTM can overcommit. But looking at the swappingimplementation is a different issue.


/Thomas

Regards,
Christian.

Am 07.02.2018 um 14:17 schrieb Thomas Hellstrom:
Hi, Roger.

On 02/07/2018 09:25 AM, He, Roger wrote:
Why should TTM be different in that aspect? It would be good toknow your reasoning WRT this?
Now, in TTM struct ttm_bo_device it already has member no_retry toindicate your option.If you prefer no OOM triggered by TTM, set it as true. The defaultis false to keep original behavior.
AMD prefers return value of no memory rather than OOM for now.
Understood, but why is that? I mean just because TTM doesn't invokethe OOM killer, that doesn't mean that the process will, the nextmillisecond, page in a number of pages and invoke it? So thismechanism would be pretty susceptible to races?
One thing I looked at at one point was to have TTM do theswapping itself instead of handing it off to the shmem system. Thatway we could pre-allocate swap entries for all swappable (BO)memory, making sure that we wouldn't run out of swap space when,
I prefer current mechanism of swap out. At the beginning the swappedpages stay in system memory by shmem until OS move to status withhigh memory pressure, that has an obvious advantage. For example, ifthe BO is swapped out into shmem, but not really be flushed intoswap disk. When validate it and swap in it at this moment, theoverhead is small compared to swap in from disk.
But that is true for a page handed off to the swap-cache as well. Itwon't be immediately flushed to disc, only when the swap cache isshrunk.
In addition, No need swap space reservation for TTM pages whenallocation since swap disk is shared not only for TTM exclusive.
That's true, but with VRAM, TTM overcommits swap space which may leadto ugly memory allocation failures at hibernate time.
So again we provide a flag no_retry in struct ttm_bo_device to letdriver set according to its request.
Thanks,
Thomas
Thanks
Roger(Hongbo.He)

-----Original Message-----
From: Thomas Hellstrom [mailto:tho...@shipmail.org]
Sent: Wednesday, February 07, 2018 2:43 PM
To: He, Roger <hongbo...@amd.com>; amd-...@lists.freedesktop.org;dri-devel@lists.freedesktop.org
Cc: Koenig, Christian <christian.koe...@amd.com>
Subject: Re: [PATCH 0/5] prevent OOM triggered by TTM

Hi, Roger,

On 02/06/2018 10:04 AM, Roger He wrote:
currently ttm code has no any allocation limit. So it allows pages
allocatation unlimited until OOM. Because if swap space is full of
swapped pages and then system memory will be filled up with ttm pages.
and then any memory allocation request will trigger OOM.
I'm a bit curious, isn't this the way things are supposed to work ona linux system?If all memory resources are used up, the OOM killer will kill themost memory hungry (perhaps rogue) process rather than processesbeing nice and try to find out themselves whether allocations willsucceed?Why should TTM be different in that aspect? It would be good to knowyour reasoning WRT this?
Admittedly, graphics process OOM memory accounting doesn't work verywell, due to not all BOs not being CPU mapped, but it looks likethere is recent work towards fixing this?
One thing I looked at at one point was to have TTM do the swappingitself instead of handing it off to the shmem system. That way wecould pre-allocate swap entries for all swappable (BO) memory,making sure that we wouldn't run out of swap space when, forexample, hibernating and that would also limit the pinnednon-swappable memory (from TTM driver kernel allocations forexample) to half the system memory resources.
Thanks,
Thomas


_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

Re: [PATCH 0/5] prevent OOM triggered by TTM

Reply via email to