Re: More useful structured concurrency stack traces

Alan Bateman Tue, 09 Jul 2024 12:11:49 -0700

Probably best to bring this to loom-dev as there have been someexploration into but where we decided not to expose any APIs at this time.


-Alan


On 09/07/2024 19:50, Louis Wasserman wrote:

My understanding of the structured concurrency APIs now in preview isthat when a subtask is forked, exceptions thrown in that stack tracewill have stack traces going up to the beginning of that subtask, note.g. up the structured concurrency task tree. (My tests suggest thisis the case for simple virtual threads without structuredconcurrency.) Most concurrency frameworks on the JVM that I’veencountered share the property that stack traces for exceptions don’ttrace through the entire causal chain – and, not unrelatedly, thatdevelopers struggle to debug concurrent applications, especially withstack traces from production and not full debuggers attached.
In some cases, like chained CompletableFutures, this seems necessaryto ensure that executing what amounts to a loop does not result instack traces that grow linearly with the number of chained futures. But when structured concurrency is involved, it seems more plausibleto me that the most useful possible stack traces would go up the treeof tasks – that is, whenever a task was forked, the stack trace wouldlook roughly as if it were a normal/sequential/direct invocation ofthe task. This could conceivably cause stack overflows where theydidn’t happen before, but only for code that violates the expectationswe have around normal sequential code: you can’t recurse unboundedly;use iteration instead.
I’m curious if there are ways we could make the upcoming structuredconcurrency APIs give those stack traces all the way up the tree, orprovide hooks to enable you to do that yourself. Last year’s JVMLStalk on Continuations Under the Covers demonstrated how stacks wereredesigned in ways that frequently and efficiently snapshot the stackitself – not just the trace, but the thing that includes all thevariables in use. There’s a linked list of StackChunks, and all butmaybe the top of the stack has those elements frozen, etc, and the topof the stack gets frozen when the thread is yielded. Withoutcertainty about how stack traces are managed in the JVM today, I wouldimagine you could possibly do something similar – you’d add a way tocheaply snapshot a reference to the current stack trace that can betraversed later. If you’re willing to hold on to all the referencescurrently on the stack – which might be acceptable for the structuredconcurrency case in particular, where you might be able to assumeyou’ll return to the parent task and its stack at some point – youmight be able to do this by simply wrapping the existing StackChunks. Then, each `fork` or `StructuredTaskScope` creation might snapshot thecurrent call stack, and you’d stitch together the stack traceslater…somewhere. That part is a little more open ended: would you adda new variant of `fillInStackTrace`? Would it only apply toexceptions that bubbled up to the task scope? Or would we be addingnew semantics to what happens when you throw an exception or walk thestack in general? The most plausible vision I have at this point isan API that spawns a virtual thread which receives a stack trace ofsome sort – or perhaps snapshots the current stack trace – andprepends that trace to all stack traces within the virtual thread’sexecution.
I suppose this is doable today if you’re willing to pay theperformance cost of explicitly getting the current stack trace everytime you fork a task or start a scope. That is kind of antitheticalto the point of virtual threads – making forking tasks very efficient– but it’s something you might be willing to turn on during testing.
Right now, my inspiration for this question is attempting to improvethe stack trace situation with Kotlin coroutines, where Googleproduction apps have complained about the difficulty of debugging withthe current stack traces. But this is something I'd expect to applyequally well to all JVM languages: the ability to snapshot and stringtogether stack trace causal chains like this in production couldsignificantly improve the experience of debugging concurrent code.
--
Louis Wasserman

Re: More useful structured concurrency stack traces

Reply via email to