Re: [Podofo-users] Boost-based pool allocator

Dominik Seichter Tue, 20 Nov 2007 07:56:26 -0800

Hi,

Am Tuesday 20 November 2007 schrieb Craig Ringer:
> Dominik Seichter wrote:
> > Regarding the pool memory which has to be free'd explicitly with
> > boost::singleton_pool<boost::fast_pool_allocator_tag,
> > sizeof(TRefCountedBuffer)>::release_memory(). Maybe it is time to add
> > podofo_library_init() and podofo_library_free() method (or something
> > similar) which can be used by applications to clear any allocated memory
> > in pools from PoDoFo.
>
> I tend to agree with that. I need to look into whether it's safe to then
> continue using the singleton allocators (say, after another
> podofo_library_init()) though, or whether that closes things down for
> the life of the process.
>
> > I just played a bit with ParserTest and callgrind. For the PDFReference I
> > get
>
> A side note: Are you familiar with the `massif' tool of Valgrind? Very
> impressive. It generates a mapping of memory allocation hotspots and a
> nice graph showing proportionally how much memory was allocated in the
> various places.


I definitly have to try massif. I sounds very interesting! I am currently 
working on a Java profiling project and we are basing our software on HProf 
(a profiler example ship with every VM by Sun) and I really miss valgrind. 
Even if valgrind is slow, the output is far superior to hprof and there so 
many more tools for valgrind.

>
> > 3 647 953 calls to new from PdfVariant::operator=
> > 1363133   calls to new from  
> > __gnu_cxx_new_allocator<PdfObject)::allocate 2285072   calls to new from 
> >  std::string::_Rep_S_create
> > 1453340   calls to new from
> > __gnu_cxx_new_allocator<std::_Rb_tree_node<std::pair<PdfName,PdfObject*>>
> >::allocate (i.e. PdfDictionary::AddKey)
> > 893477    calls to new from PdfDictionary::operator=
> >
> > new is the function called most after malloc and free, but I guess this
> > is because new uses malloc internally.
>
> Well, and because PoDoFo uses malloc directly in a quite a lot of places.
>
> BTW, while I haven't confirmed this in detail the libstdc++ docs say
> that it actually caches memory internally, so not every call to operator
> new() necessarily results in an underlying memory allocation from the C
> library. If that's the case, it might well explain why we're not seeing
> much of a benefit from the Boost memory pools - though I'd expect them
> to at least be faster (by virtue of their specialization) than the
> method new() is using. libstdc++'s allocation behavior has changed many
> times though, so who knows how up to date the docs are.
I will try to find a paper describing libstdc++ memory management. Maybe this 
gives us more insight.

>
> > I see that this is a very special test and very different from creating a
> > PdfFile or working with streams. But I think we coulg gain more from a
> > PdfVariant and PdfObject allocator. Maybe also from a pool for
> > std::_Rb_tree_nodes.
>
> One of the very nice things about the boost pool library is how easy
> that is to do.
>
> If you have:
>
> std::map<K,V>
>
> you can write:
>
> #if defined(HAVE_BOOST)
> typedef boost::fast_pool_alloc<std::map<K,V>::value_type> MapAllocator;
> #else
> typedef std::allocator<std::map<k,V>::value_type> MapAllocator;
> #endif
> std::map<K,V,MapAllocator>
>
> This works, for example, in PdfDictionary.
>
> It's even easier for single value containers (like PdfArray). All that
> changes for bulk-allocating containers is that you use pool_alloc rather
> than fast_pool_alloc, and nothing at all changes for individually
> allocating containers like std::list .
Great. Really simple!

>
> I suspect, though, that gcc's libstdc++ is doing such a good job with
> the existing allocator functions that we're not going to see as much
> benefit as I initially thought.
>
> I'm particularly confused about the lack of benefit for the two
> *incredibly* common classes TRefCountedBuffer and PdfReference.
> Especially PdfReference, which is tiny and allocated in huge numbers,
> should benefit from using the pool allocator - yet in tests I see little
> change.
I think PdfReference is most of the time allocated as part of a PdfObject (ok 
sometimes also as a value to a key in PdfDictionary) and it is not allocated 
on the heap there but as part of the larger PdfObject. That 
PdfRefCountedBuffer does not give us benefits is also a surprise for me.

best regards,
        Dom

>
> I need to do some testing on Windows, too, but that's harder due to the
> fact that I don't have the same sort of tools available.

Are there any (besides Purify which is commercial AFAIK)?

best regards,
        Dom

>
> --
> Craig Ringer



-- 
**********************************************************************
Dominik Seichter - [EMAIL PROTECTED]
KRename  - http://www.krename.net  - Powerful batch renamer for KDE
KBarcode - http://www.kbarcode.net - Barcode and label printing
PoDoFo - http://podofo.sf.net - PDF generation and parsing library
SchafKopf - http://schafkopf.berlios.de - Schafkopf, a card game,  for KDE
Alan - http://alan.sf.net - A Turing Machine in Java
**********************************************************************

signature.asc
Description: This is a digitally signed message part.

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/

_______________________________________________
Podofo-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/podofo-users

Re: [Podofo-users] Boost-based pool allocator

Reply via email to