Re: [Caml-list] Memory fragmentation

2012-04-24 Thread SerP
I replaced call of 'malloc' in 'caml_alloc_for_heap' function to mmap, and
'free' in 'caml_free_for_heap' to unmap. Unused memory is freed after
Gc.compaction.

On Sat, Apr 21, 2012 at 11:26 AM, ri...@happyleptic.org wrote:

 There is a commit dating from a few days ago that might
 be related to your problem (commit msg was :
 PR#5389: compaction sometimes leaves a very large heap)


 --
 Caml-list mailing list.  Subscription management and archives:
 https://sympa-roc.inria.fr/wws/info/caml-list
 Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
 Bug reports: http://caml.inria.fr/bin/caml-bugs



-- 
Caml-list mailing list.  Subscription management and archives:
https://sympa-roc.inria.fr/wws/info/caml-list
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
Bug reports: http://caml.inria.fr/bin/caml-bugs



[Caml-list] Memory fragmentation

2012-04-20 Thread SerP
Hi!
We have developped a daemon on ocaml using the lwt lib(libev). It processes
about 800 requests per second, but it increases to 2 Gb in memory for a
hour of work. We have studied the problem for a long time, using mtrace,
mallinfo and other tools, and we tried to change GC params. We found out
that setting up GC.minor_heap_size=10Mb а major_heap_increment=2Мb will
make the daemon grow slower. mallinfо shows that the memory (1,5G) is
allocated using mmap (blkhd field in mallinfo struct) and arena size -
about 20M In this case, first calls of GC.compact compress the daemon to
200 Mb (live -110Mb) , and mallinfo show decreasing of total size of memory
allocated by mmap. The daemon keeps growing, but it allocates the memory to
arena (using brk), and calls of GC.compact leads to decrease of heap_bytes
to 200M, but arena size (1.5 Gb) does not decrease, just doing less
uordblks and more fordblks (fileds I.e., after first calls of GC.compact,
the daemon starts using brk much more than mmap, and cant memory given back
to the system.

If Gc.allocation_policy is first fit, memory usage is stable, and it's grow
very little, but worked very slowly.

Any suggestions are very welcome. Thanks

-- 
Caml-list mailing list.  Subscription management and archives:
https://sympa-roc.inria.fr/wws/info/caml-list
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
Bug reports: http://caml.inria.fr/bin/caml-bugs



Re: [Caml-list] Memory fragmentation

2012-04-20 Thread Török Edwin
On 04/20/2012 06:12 PM, SerP wrote:
 Hi!
 We have developped a daemon on ocaml using the lwt lib(libev). It processes 
 about 800 requests per second, but it increases to 2 Gb in memory for a hour 
 of work. We have studied the problem for a long
 time, using mtrace, mallinfo and other tools,
 and we tried to change GC params. We found out that setting up 
 GC.minor_heap_size=10Mb а major_heap_increment=2Мb will make the daemon grow 
 slower.
 mallinfо shows that the memory (1,5G) is allocated using mmap (blkhd field in 
 mallinfo struct) and arena size - about 20M In this case, first calls of 
 GC.compact compress the daemon to 200 Mb (live
 -110Mb) , and mallinfo show decreasing of total size of memory allocated by 
 mmap. The daemon keeps growing, but it allocates the memory to arena (using 
 brk), and calls of GC.compact leads to decrease
 of heap_bytes to 200M, but arena size (1.5 Gb) does not decrease, just doing 
 less uordblks and more fordblks (fileds I.e., after first calls of 
 GC.compact, the daemon starts using brk much more than
 mmap, and cant memory given back to the system.

Sounds like a fragmentation problem in glibc's heap that causes increased VIRT 
usage.
IIRC glibc has a default threshold of 128k for deciding to use brk() vs mmap(), 
and OCaml's default major_heap_increment (124k) is just below that,
which would explain the behaviour you see, and also explain why using a larger 
major_heap_increment (2M) improves the situation:
 brk() cannot free holes, while mmap arenas can.

You can use malloc_stats() to check whether OCaml gives back all memory to 
glibc or not, and you can try to use
an alternative malloc implementation (like jemalloc) that might cope better 
with fragmentation.

However this doesn't explain why memory keeps growing after a Gc.compact, 
unless calls to malloc
are mixed between the OCaml runtime, Lwt and libev. If that is the case then 
maybe it would help
to try and allocate memory in the OCaml's runtime by using mmap() directly.

Is your application completely single-threaded (since you're using lwt), or do 
you also use multiple threads?
I think that glibc caches mmap-ed areas even if you free them (i.e. they're 
still mapped into your process
but they might be changed via mprotect() to PROT_NONE), which is especially a 
problem if you use multiple threads.

Best regards,
--Edwin

-- 
Caml-list mailing list.  Subscription management and archives:
https://sympa-roc.inria.fr/wws/info/caml-list
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
Bug reports: http://caml.inria.fr/bin/caml-bugs



Re: [Caml-list] Memory fragmentation

2012-04-20 Thread ygrek
On Fri, 20 Apr 2012 19:12:12 +0400
SerP serp...@gmail.com wrote:

 Hi!
 We have developped a daemon on ocaml using the lwt lib(libev). It processes
 about 800 requests per second, but it increases to 2 Gb in memory for a
 hour of work. We have studied the problem for a long time, using mtrace,
 mallinfo and other tools, and we tried to change GC params. We found out
 that setting up GC.minor_heap_size=10Mb а major_heap_increment=2Мb will
 make the daemon grow slower. mallinfо shows that the memory (1,5G) is
 allocated using mmap (blkhd field in mallinfo struct) and arena size -
 about 20M In this case, first calls of GC.compact compress the daemon to
 200 Mb (live -110Mb) , and mallinfo show decreasing of total size of memory
 allocated by mmap. The daemon keeps growing, but it allocates the memory to
 arena (using brk), and calls of GC.compact leads to decrease of heap_bytes
 to 200M, but arena size (1.5 Gb) does not decrease, just doing less
 uordblks and more fordblks (fileds I.e., after first calls of GC.compact,
 the daemon starts using brk much more than mmap, and cant memory given back
 to the system.

Alternative malloc implementation may well be worth a try. I've seen the cases
when glibc was constantly growing RSS while tcmalloc was on stable level (and 
10x less), 
guessing due to fragmentation.
Maybe related, see PR#5389 for ocaml compaction related issues.
Also have a look at http://ygrek.org.ua/p/code/mlvalues.py for the way to peep
into ocaml heap at runtime.

-- 
 ygrek
 http://ygrek.org.ua

-- 
Caml-list mailing list.  Subscription management and archives:
https://sympa-roc.inria.fr/wws/info/caml-list
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
Bug reports: http://caml.inria.fr/bin/caml-bugs