Re: stack arenas using alloca

Michael Clark via Gcc Wed, 14 Aug 2024 11:24:28 -0700

Hi Folks,

*sending again with Thunderbird because Apple Mail munged the message*.

I wanted to share a seed of an idea I have been ruminating on for a while, and that is being able to return alloca memory from a function.

I think it’s trivially possible by hacking the epilogue to unlink the frame pointer but not pop the stack. Stack traces should work fine, and it just looks like the parent function called an alloca that covers the fixed child frame and the alloca that it returned.

I tend to use alloca quite a lot in my programming style because I like to avoid the heap. I have a style where I run a loop with a size pass, then call alloca, then run the loop again without suppressing writes. I would like to be able to return, initially, one alloca to a parent function, but more than one should also be possible.
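For readers unfamiliar with the size-pass pattern described above, here is a minimal sketch of how it might look in C. The helper names `join_ints` and `join_demo` are hypothetical and only serve as illustration, and `<alloca.h>` is assumed to be available (as it is with glibc and on macOS). Because the alloca block dies when `join_demo` returns, the sketch has to copy the result into a caller-supplied buffer, which is exactly the step that returning alloca memory would remove.

```c
#include <alloca.h>   /* assumption: glibc or macOS style alloca header */
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: joins n ints with commas.
 * When buf is NULL, writes are suppressed and only the length
 * (excluding the terminating NUL) is computed: the "size pass". */
static size_t join_ints(char *buf, const int *v, size_t n)
{
    size_t len = 0;
    for (size_t i = 0; i < n; i++) {
        char tmp[16];  /* enough for a comma, a 32-bit int, and NUL */
        int k = snprintf(tmp, sizeof tmp, i ? ",%d" : "%d", v[i]);
        if (buf)
            memcpy(buf + len, tmp, (size_t)k);
        len += (size_t)k;
    }
    if (buf)
        buf[len] = '\0';
    return len;
}

/* Hypothetical driver: size pass, alloca, then write pass.
 * The result is copied to out because the alloca block does not
 * outlive this function. Returns the string length. */
static size_t join_demo(char *out, size_t cap)
{
    const int v[] = { 1, 22, 333 };
    size_t n = sizeof v / sizeof v[0];

    size_t len = join_ints(NULL, v, n);   /* pass 1: measure only   */
    char *s = alloca(len + 1);            /* stack allocation       */
    join_ints(s, v, n);                   /* pass 2: actually write */

    if (len + 1 <= cap)
        memcpy(out, s, len + 1);
    else if (cap)
        out[0] = '\0';
    return len;
}
```

Calling `join_demo(out, sizeof out)` with a 32-byte `out` leaves the joined string in `out` and returns its length.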

I haven't thought a lot about the mechanics of how one might signal to the compiler to suppress popping the stack, but I imagined one could return a pointer or structure with _Stack qualifiers to signal that stack memory from the child frame is being returned.

This is just a quick note as there is a deeper conversation about compacting stack arenas that would be possible if one had a runtime *and* compile time reflection API for frame metadata, where the frame is expressed as an anonymous C structure perhaps accessible via a function on a reference to a function. If you had frame metadata, you could emit one or many memmove calls. Trivially, one just needs the fixed frame length to recover the fixed frame stack memory, and with full frame reflection, it would be possible to defer to a runtime routine in the case two or more stack pointers are returned, assuming the compiler tracks sizes of alloca calls. Ideally, the reflection API would also be constexpr-able, meaning the runtime compaction could be inlined. This won't work for types that have pointers unless one has a full single-thread compacting GC that uses type metadata, but I tend to use indices. But unlike heap compaction, it doesn't have to worry about races.
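To make the single-block compaction case concrete, here is a rough sketch that simulates it in portable C. It does not touch the real machine stack: `struct sim_stack`, `sim_push`, `sim_compact`, and `sim_demo` are all hypothetical names, and the "stack" is just a byte array with a downward-moving offset. The only inputs the compaction step needs are the fixed frame length and the size of the returned block, which matches the trivial case described above.

```c
#include <stddef.h>
#include <string.h>

enum { SIM_CAP = 256 };

/* Simulated downward-growing stack; sp is an offset into mem. */
struct sim_stack {
    unsigned char mem[SIM_CAP];
    size_t sp;
};

/* Reserve n bytes below the current stack pointer, as a fixed
 * frame or an alloca would. No overflow checks: sketch only. */
static unsigned char *sim_push(struct sim_stack *s, size_t n)
{
    s->sp -= n;
    return s->mem + s->sp;
}

/* Slide the returned block (alloca_len bytes at sp) up over the
 * dead fixed frame (fixed_len bytes directly above it), then
 * release the fixed frame's space. Returns the block's new address. */
static unsigned char *sim_compact(struct sim_stack *s,
                                  size_t fixed_len, size_t alloca_len)
{
    unsigned char *src = s->mem + s->sp;
    unsigned char *dst = src + fixed_len;
    memmove(dst, src, alloca_len);   /* memmove: ranges may overlap */
    s->sp += fixed_len;
    return dst;
}

/* Hypothetical demo: a "child" with a 48-byte fixed frame returns a
 * 6-byte alloca block. Copies the block to out and returns the
 * number of bytes reclaimed by compaction. */
static size_t sim_demo(char *out, size_t cap)
{
    struct sim_stack s;
    s.sp = SIM_CAP;
    size_t parent_sp = s.sp;

    unsigned char *frame = sim_push(&s, 48);   /* child's fixed frame */
    memset(frame, 0xAA, 48);
    char *msg = (char *)sim_push(&s, 6);       /* child's alloca      */
    memcpy(msg, "hello", 6);

    /* Child "returns" without popping: sp still covers both. */
    size_t before = parent_sp - s.sp;
    msg = (char *)sim_compact(&s, 48, 6);
    size_t after = parent_sp - s.sp;

    if (cap >= 6)
        memcpy(out, msg, 6);
    return before - after;
}
```

Note that `msg` has to be reassigned after `sim_compact`, since the block moved. That is the same reason the scheme breaks for data containing pointers into the block, while indices remain valid.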

The benefits of stack arenas, and of compacting stack arenas as a subtype, are many. Firstly, the memory has no thread aliasing issues, unlike heap memory, meaning analysis is much simpler and there are no pause issues like there are for GC. Secondly, it would be quite a neat way to constexpr variable-length arrays or a _List type because, unlike malloc, it’s much easier to reason about.

I have a relatively complete C reflection API that could be used as a starting point for C reflection. It currently works as an LLVM plugin. I am not sure how I would do this in GCC. A Python parser came to mind but it would be better if it somehow ran inside the compiler. It would also be great to have access to the frame of a function as an anonymous structure. It’s only runtime at present and it could benefit from a for comprehension because looping on variable length lists is a little clumsy. If I could return alloca memory in a structure that would largely solve this problem.


- https://github.com/michaeljclark/crefl

Regards,
Michael
