Re: Significant GC performance penalty

Paulo Pinto Fri, 14 Dec 2012 11:30:31 -0800

On Friday, 14 December 2012 at 18:27:29 UTC, Rob T wrote:

I created a D library wrapper for sqlite3 that uses adynamically constructed result list for returned records from aSELECT statement. It works in a similar way to a C++ versionthat I wrote a while back.
The D code is D code, not a cloned up version of my earlier C++code, so it makes use of many of the features of D, and one ofthem is the garbage collector.
When running comparison tests between the C++ version and the Dversion, both compiled using performance optimization flags,the C++ version runs 3x faster than the D version which wasvery unexpected. If anything I was hoping for a performanceboost out of D or at least the same performance levels.
I remembered reading about people having performance problemswith the GC, so I tried a quick fix, which was to disable theGC before the SELECT is run and re-enable afterwards. Theresult of doing that was a 3x performance boost, making the DMDcompiled version run almost as fast as the C++ version. The DMDcompiled version is now only 2 seconds slower on my stress testruns of a SELECT that returns 200,000+ records with 14 fields.Not too bad! I may get identical performance if I compile usinggdc, but that will have to wait until it is updated to 2.061.
Fixing this was a major relief since the code is expected to beused in a commercial setting. I'm wondering though, why the GCcauses such a large penalty, and what negative effect if any ifthere will be when disabling the GC temporarily. I know thatmemory won't be reclaimed until the GC is re-enabled, but isthere anything else to worry about?
I feel it's worth commenting on my experience as feed back forthe D developers and anyone else starting off with D.
Coming from C++ I *really* did not like having the GC, it mademe very nervous, but now that I'm used to having it, I've cometo like having it up to a point. It really does change the wayyou think and code. However as I've discovered, you still haveto always be thinking about memory management issues becausethe GC can eat up a huge performance penalty under certainsituations. I also NEED to know that I can always go fullmanual where necessary. There's no way I would want to give upthat kind of control.
The trade off with having a GC seems to be that by default, C++apps will perform considerably faster than equivalent D appsout-of-the-box, simply because the manual memory management isfine tuned by the programmer as the development proceeds. WithD, when you simply let the GC take care of business, then youare not necessarily fine tuning as you go along, and when youdo not take the resulting performance hit into consideration itmeans that your apps will likely perform poorly compared to aC++ equivalent. However, building the equivalent app in D is amuch more pleasant experience in terms of the programmingproductivity gain. The code is simpler to deal with, andthere's less to worry about with pointers and other memorymanagement issues.
What I have not yet had the opportunity to explore, is using Din full manual memory management mode. My understanding is thatif I take that route, then I cannot use certain parts of thestd lib, and will also loose a few of the nice features of Dthat make it fun to work with. I'm not fully clear though onwhat to expect, so if there's any detailed information to lookat, it would be a big help.
I wonder what can be done to allow a programmer to go fullymanual, while not loosing any of the nice features of D?
Also, I think everyone agrees we really need a better GC, and Iwonder once we do get a better GC, what kind of overallimprovements we can expect to see?
Thanks for listening.

--rt

Having lots of experience in GC enabled languages, even forsystems programming (Oberon & Active Oberon).


I think there a few issues to consider:

- D's GC still has a lot of room to improve, so some of theissues you have found might eventually get improved;

- Having GC support, does not mean to do call new like crazy, onestill needs to think how to code in a GC friendly way;


- Make proper use of weak references in case they are available;

- GC enabled languages runtimes usually offer ways to peak intothe runtime, somehow, and allow the developer to understand howGC is working and what might be improved;

The goodness of having a GC is to have a safer way to managememory across multiple modules, specially when ownership is notclear.

Even in C++ I seldom do manual memory management nowadays, ifworking on new codebases. Of course, others will have a differentexperience.


Other than that, thanks for sharing your experience.

--
Paulo

Re: Significant GC performance penalty

Reply via email to