Re: [Qemu-devel] linux guests and ksm performance
On 02/28/2012 03:20 PM, Peter Lieven wrote: > On 28.02.2012 14:16, Avi Kivity wrote: >> On 02/24/2012 08:41 AM, Stefan Hajnoczi wrote: I dont think that it is cpu intense. All user pages are zeroed anyway, but at allocation time it shouldnt be a big difference in terms of cpu power. >>> It's easy to find a scenario where eagerly zeroing pages is wasteful. >>> Imagine a process that uses all of physical memory. Once it >>> terminates the system is going to run processes that only use a small >>> set of pages. It's pointless zeroing all those pages if we're not >>> going to use them anymore. >> In the long term, we will use them, except if the guest is completely >> idle. >> >> The scenario in which zeroing is expensive is when the page is refilled >> through DMA. In that case the zeroing was wasted. This is a pretty >> common scenario in pagecache intensive workloads. >> > Avi, what do you think of the proposal to give the guest vm a hint > that the host is running ksm? In that case the administrator > has already chosen that saving physical memory is more important > than performance to him? It makes some sense. Perhaps through the balloon device, a flag that indicates that voluntary ballooning will be gratefully accepted. -- error compiling committee.c: too many arguments to function
Re: [Qemu-devel] linux guests and ksm performance
On 28.02.2012 14:16, Avi Kivity wrote: On 02/24/2012 08:41 AM, Stefan Hajnoczi wrote: I dont think that it is cpu intense. All user pages are zeroed anyway, but at allocation time it shouldnt be a big difference in terms of cpu power. It's easy to find a scenario where eagerly zeroing pages is wasteful. Imagine a process that uses all of physical memory. Once it terminates the system is going to run processes that only use a small set of pages. It's pointless zeroing all those pages if we're not going to use them anymore. In the long term, we will use them, except if the guest is completely idle. The scenario in which zeroing is expensive is when the page is refilled through DMA. In that case the zeroing was wasted. This is a pretty common scenario in pagecache intensive workloads. Avi, what do you think of the proposal to give the guest vm a hint that the host is running ksm? In that case the administrator has already chosen that saving physical memory is more important than performance to him? Peter
Re: [Qemu-devel] linux guests and ksm performance
On 02/24/2012 08:41 AM, Stefan Hajnoczi wrote: > > > > I dont think that it is cpu intense. All user pages are zeroed anyway, but > > at allocation time it shouldnt be a big difference in terms of cpu power. > > It's easy to find a scenario where eagerly zeroing pages is wasteful. > Imagine a process that uses all of physical memory. Once it > terminates the system is going to run processes that only use a small > set of pages. It's pointless zeroing all those pages if we're not > going to use them anymore. In the long term, we will use them, except if the guest is completely idle. The scenario in which zeroing is expensive is when the page is refilled through DMA. In that case the zeroing was wasted. This is a pretty common scenario in pagecache intensive workloads. -- error compiling committee.c: too many arguments to function
Re: [Qemu-devel] linux guests and ksm performance
On 02/23/2012 06:42 PM, Stefan Hajnoczi wrote: > On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: > > However, in a virtual machine I have not observed the above slow down to > > that extend > > while the benefit of zero after free in a virtualisation environment is > > obvious: > > > > 1) zero pages can easily be merged by ksm or other technique. > > 2) zero (dup) pages are a lot faster to transfer in case of migration. > > The other approach is a memory page "discard" mechanism - which > obviously requires more code changes than zeroing freed pages. > > The advantage is that we don't take the brute-force and CPU intensive > approach of zeroing pages. It would be like a fine-grained ballooning > feature. > > I hope someone will follow up saying this has already been done or > prototyped :). It already exists - that's the balloon code. Right now it's host driven, but maybe we can modify it to allow the guest to initiate balloon inflations. -- error compiling committee.c: too many arguments to function
Re: [Qemu-devel] linux guests and ksm performance
On 28.02.2012 13:05, Stefan Hajnoczi wrote: On Tue, Feb 28, 2012 at 11:46 AM, Peter Lieven wrote: On 24.02.2012 08:23, Stefan Hajnoczi wrote: On Fri, Feb 24, 2012 at 6:53 AM, Stefan Hajnoczi wrote: On Fri, Feb 24, 2012 at 6:41 AM, Stefan Hajnoczi wrote: On Thu, Feb 23, 2012 at 7:08 PM, peter.lie...@gmail.com wrote: Stefan Hajnoczischrieb: On Thu, Feb 23, 2012 at 3:40 PM, Peter Lievenwrote: However, in a virtual machine I have not observed the above slow down to that extend while the benefit of zero after free in a virtualisation environment is obvious: 1) zero pages can easily be merged by ksm or other technique. 2) zero (dup) pages are a lot faster to transfer in case of migration. The other approach is a memory page "discard" mechanism - which obviously requires more code changes than zeroing freed pages. The advantage is that we don't take the brute-force and CPU intensive approach of zeroing pages. It would be like a fine-grained ballooning feature. I dont think that it is cpu intense. All user pages are zeroed anyway, but at allocation time it shouldnt be a big difference in terms of cpu power. It's easy to find a scenario where eagerly zeroing pages is wasteful. Imagine a process that uses all of physical memory. Once it terminates the system is going to run processes that only use a small set of pages. It's pointless zeroing all those pages if we're not going to use them anymore. Perhaps the middle path is to zero pages but do it after a grace timeout. I wonder if this helps eliminate the 2-3% slowdown you noticed when compiling. Gah, it's too early in the morning. I don't think this timer actually makes sense. do you think it makes then sense to make a patchset/proposal to notice a guest kernel about the presense of ksm in the host and switch to zero after free? I think your idea is interesting - whether or not people are happy with it will depend on the performance impact. It seems reasonable to me. could you support/help me in implementing and publishing this approach? Peter
Re: [Qemu-devel] linux guests and ksm performance
On 24.02.2012 08:23, Stefan Hajnoczi wrote: On Fri, Feb 24, 2012 at 6:53 AM, Stefan Hajnoczi wrote: On Fri, Feb 24, 2012 at 6:41 AM, Stefan Hajnoczi wrote: On Thu, Feb 23, 2012 at 7:08 PM, peter.lie...@gmail.com wrote: Stefan Hajnoczi schrieb: On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: However, in a virtual machine I have not observed the above slow down to that extend while the benefit of zero after free in a virtualisation environment is obvious: 1) zero pages can easily be merged by ksm or other technique. 2) zero (dup) pages are a lot faster to transfer in case of migration. The other approach is a memory page "discard" mechanism - which obviously requires more code changes than zeroing freed pages. The advantage is that we don't take the brute-force and CPU intensive approach of zeroing pages. It would be like a fine-grained ballooning feature. I dont think that it is cpu intense. All user pages are zeroed anyway, but at allocation time it shouldnt be a big difference in terms of cpu power. It's easy to find a scenario where eagerly zeroing pages is wasteful. Imagine a process that uses all of physical memory. Once it terminates the system is going to run processes that only use a small set of pages. It's pointless zeroing all those pages if we're not going to use them anymore. Perhaps the middle path is to zero pages but do it after a grace timeout. I wonder if this helps eliminate the 2-3% slowdown you noticed when compiling. Gah, it's too early in the morning. I don't think this timer actually makes sense. do you think it makes then sense to make a patchset/proposal to notice a guest kernel about the presense of ksm in the host and switch to zero after free? peter Stefan
Re: [Qemu-devel] linux guests and ksm performance
Am 24.02.2012 um 08:23 schrieb Stefan Hajnoczi: > On Fri, Feb 24, 2012 at 6:53 AM, Stefan Hajnoczi wrote: >> On Fri, Feb 24, 2012 at 6:41 AM, Stefan Hajnoczi wrote: >>> On Thu, Feb 23, 2012 at 7:08 PM, peter.lie...@gmail.com >>> wrote: Stefan Hajnoczi schrieb: > On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: >> However, in a virtual machine I have not observed the above slow down > to >> that extend >> while the benefit of zero after free in a virtualisation environment > is >> obvious: >> >> 1) zero pages can easily be merged by ksm or other technique. >> 2) zero (dup) pages are a lot faster to transfer in case of > migration. > > The other approach is a memory page "discard" mechanism - which > obviously requires more code changes than zeroing freed pages. > > The advantage is that we don't take the brute-force and CPU intensive > approach of zeroing pages. It would be like a fine-grained ballooning > feature. > I dont think that it is cpu intense. All user pages are zeroed anyway, but at allocation time it shouldnt be a big difference in terms of cpu power. >>> >>> It's easy to find a scenario where eagerly zeroing pages is wasteful. >>> Imagine a process that uses all of physical memory. Once it >>> terminates the system is going to run processes that only use a small >>> set of pages. It's pointless zeroing all those pages if we're not >>> going to use them anymore. >> >> Perhaps the middle path is to zero pages but do it after a grace >> timeout. I wonder if this helps eliminate the 2-3% slowdown you >> noticed when compiling. > > Gah, it's too early in the morning. I don't think this timer actually > makes sense. ok, that would be the idea of an ansynchronous page zeroing in the guest. i also think this is to complicated. maybe the other idea is too simple: is it possible to give the guest a hint that ksm is enabled on the host (lets say in a way like its done with kvmclock). if ksm is enabled on the host the administrator has already made the decision that performance is not so important and he/she is eager to save physical memory. what if and only if this flag is set switch from zero on allocate to zero after free. i think the whole thing is less than 10-20 lines of code. and its code that has been proven to be working well in grsecurity for ages. this might introduce a little (2-3%) overhead, but only if there is a lot of non GFP_FREE memory is allocated, but its definitely faster than swapping. of course, it has to be garanteed that this code does not slow down normal systems due to additionales branches (would it be enough to mark the if statements as unlikely?) peter peter > > Stefan
Re: [Qemu-devel] linux guests and ksm performance
On Fri, Feb 24, 2012 at 6:53 AM, Stefan Hajnoczi wrote: > On Fri, Feb 24, 2012 at 6:41 AM, Stefan Hajnoczi wrote: >> On Thu, Feb 23, 2012 at 7:08 PM, peter.lie...@gmail.com wrote: >>> Stefan Hajnoczi schrieb: >>> On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: > However, in a virtual machine I have not observed the above slow down to > that extend > while the benefit of zero after free in a virtualisation environment is > obvious: > > 1) zero pages can easily be merged by ksm or other technique. > 2) zero (dup) pages are a lot faster to transfer in case of migration. The other approach is a memory page "discard" mechanism - which obviously requires more code changes than zeroing freed pages. The advantage is that we don't take the brute-force and CPU intensive approach of zeroing pages. It would be like a fine-grained ballooning feature. >>> >>> I dont think that it is cpu intense. All user pages are zeroed anyway, but >>> at allocation time it shouldnt be a big difference in terms of cpu power. >> >> It's easy to find a scenario where eagerly zeroing pages is wasteful. >> Imagine a process that uses all of physical memory. Once it >> terminates the system is going to run processes that only use a small >> set of pages. It's pointless zeroing all those pages if we're not >> going to use them anymore. > > Perhaps the middle path is to zero pages but do it after a grace > timeout. I wonder if this helps eliminate the 2-3% slowdown you > noticed when compiling. Gah, it's too early in the morning. I don't think this timer actually makes sense. Stefan
Re: [Qemu-devel] linux guests and ksm performance
On Thu, Feb 23, 2012 at 04:42:54PM +, Stefan Hajnoczi wrote: > On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: > > However, in a virtual machine I have not observed the above slow down to > > that extend > > while the benefit of zero after free in a virtualisation environment is > > obvious: > > > > 1) zero pages can easily be merged by ksm or other technique. > > 2) zero (dup) pages are a lot faster to transfer in case of migration. > > The other approach is a memory page "discard" mechanism - which > obviously requires more code changes than zeroing freed pages. > > The advantage is that we don't take the brute-force and CPU intensive > approach of zeroing pages. It would be like a fine-grained ballooning > feature. > > I hope someone will follow up saying this has already been done or > prototyped :). > That was attempted. It is called "page hinting", but AFAIK due to complex locking issue attempt was abandoned. -- Gleb.
Re: [Qemu-devel] linux guests and ksm performance
On Fri, Feb 24, 2012 at 6:41 AM, Stefan Hajnoczi wrote: > On Thu, Feb 23, 2012 at 7:08 PM, peter.lie...@gmail.com wrote: >> >> >> >> >> Stefan Hajnoczi schrieb: >> >>>On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: However, in a virtual machine I have not observed the above slow down >>>to that extend while the benefit of zero after free in a virtualisation environment >>>is obvious: 1) zero pages can easily be merged by ksm or other technique. 2) zero (dup) pages are a lot faster to transfer in case of >>>migration. >>> >>>The other approach is a memory page "discard" mechanism - which >>>obviously requires more code changes than zeroing freed pages. >>> >>>The advantage is that we don't take the brute-force and CPU intensive >>>approach of zeroing pages. It would be like a fine-grained ballooning >>>feature. >>> >> >> I dont think that it is cpu intense. All user pages are zeroed anyway, but >> at allocation time it shouldnt be a big difference in terms of cpu power. > > It's easy to find a scenario where eagerly zeroing pages is wasteful. > Imagine a process that uses all of physical memory. Once it > terminates the system is going to run processes that only use a small > set of pages. It's pointless zeroing all those pages if we're not > going to use them anymore. Perhaps the middle path is to zero pages but do it after a grace timeout. I wonder if this helps eliminate the 2-3% slowdown you noticed when compiling. This requires no special host<->guest interfaces for discarding pages. Stefan
Re: [Qemu-devel] linux guests and ksm performance
On Thu, Feb 23, 2012 at 7:08 PM, peter.lie...@gmail.com wrote: > > > > > Stefan Hajnoczi schrieb: > >>On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: >>> However, in a virtual machine I have not observed the above slow down >>to >>> that extend >>> while the benefit of zero after free in a virtualisation environment >>is >>> obvious: >>> >>> 1) zero pages can easily be merged by ksm or other technique. >>> 2) zero (dup) pages are a lot faster to transfer in case of >>migration. >> >>The other approach is a memory page "discard" mechanism - which >>obviously requires more code changes than zeroing freed pages. >> >>The advantage is that we don't take the brute-force and CPU intensive >>approach of zeroing pages. It would be like a fine-grained ballooning >>feature. >> > > I dont think that it is cpu intense. All user pages are zeroed anyway, but at > allocation time it shouldnt be a big difference in terms of cpu power. It's easy to find a scenario where eagerly zeroing pages is wasteful. Imagine a process that uses all of physical memory. Once it terminates the system is going to run processes that only use a small set of pages. It's pointless zeroing all those pages if we're not going to use them anymore. Stefan
Re: [Qemu-devel] linux guests and ksm performance
Stefan Hajnoczi schrieb: >On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: >> However, in a virtual machine I have not observed the above slow down >to >> that extend >> while the benefit of zero after free in a virtualisation environment >is >> obvious: >> >> 1) zero pages can easily be merged by ksm or other technique. >> 2) zero (dup) pages are a lot faster to transfer in case of >migration. > >The other approach is a memory page "discard" mechanism - which >obviously requires more code changes than zeroing freed pages. > >The advantage is that we don't take the brute-force and CPU intensive >approach of zeroing pages. It would be like a fine-grained ballooning >feature. > I dont think that it is cpu intense. All user pages are zeroed anyway, but at allocation time it shouldnt be a big difference in terms of cpu power. >I hope someone will follow up saying this has already been done or >prototyped :). > >Stefan -- Diese Nachricht wurde von meinem Android-Mobiltelefon mit K-9 Mail gesendet.
Re: [Qemu-devel] linux guests and ksm performance
On Thu, Feb 23, 2012 at 11:42 AM, Stefan Hajnoczi wrote: > The other approach is a memory page "discard" mechanism - which > obviously requires more code changes than zeroing freed pages. > > The advantage is that we don't take the brute-force and CPU intensive > approach of zeroing pages. It would be like a fine-grained ballooning > feature. (disclaimer: i don't know the code, i'm just guessing) does KVM emulate the MMU? if so, is there any 'unmap page' primitive? -- Javier
Re: [Qemu-devel] linux guests and ksm performance
On Thu, Feb 23, 2012 at 3:40 PM, Peter Lieven wrote: > However, in a virtual machine I have not observed the above slow down to > that extend > while the benefit of zero after free in a virtualisation environment is > obvious: > > 1) zero pages can easily be merged by ksm or other technique. > 2) zero (dup) pages are a lot faster to transfer in case of migration. The other approach is a memory page "discard" mechanism - which obviously requires more code changes than zeroing freed pages. The advantage is that we don't take the brute-force and CPU intensive approach of zeroing pages. It would be like a fine-grained ballooning feature. I hope someone will follow up saying this has already been done or prototyped :). Stefan