Tpt opened a new pull request, #18184: URL: https://github.com/apache/datafusion/pull/18184
- introduce a new "result table" intermediate table storing all already emitted results
- use existing physical operators to deduplicate the output of both the static and recursive terms, and to remove already emitted results from the recursive term's output
- add a simple test of a transitive closure on a cyclic graph

This is very naive and slow. It would be much better to build the result table as a hash table inside `RecursiveQueryExec` and use it for deduplication, removing the need for any extra operator. I guess it will require significant coding because we need an incremental hash map (build and probe steps are interleaved).

* Closes #18140
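
To illustrate the hash-table alternative mentioned above, here is a minimal standalone sketch of a deduplicating recursive evaluation in which probing and inserting into the result table are interleaved. It is not DataFusion code and does not use any DataFusion API: `recursive_union` and `transitive_closure` are hypothetical names, rows are plain tuples rather than record batches, and a `std::collections::HashSet` stands in for the incremental hash map.

```rust
use std::collections::HashSet;
use std::hash::Hash;

/// Hypothetical helper (not a DataFusion API): evaluates a recursive query
/// with deduplicating (UNION-like) semantics.
/// `seed` plays the role of the static term, and `step` plays the role of
/// the recursive term applied to the rows newly emitted by the previous
/// iteration.
fn recursive_union<T, F>(seed: Vec<T>, mut step: F) -> Vec<T>
where
    T: Clone + Eq + Hash,
    F: FnMut(&[T]) -> Vec<T>,
{
    // The "result table": every row emitted so far.
    let mut seen: HashSet<T> = HashSet::new();
    let mut output: Vec<T> = Vec::new();

    // Deduplicate the static term. `HashSet::insert` returns true only when
    // the row was not already present, so probe and build happen in one call.
    let mut work: Vec<T> = Vec::new();
    for row in seed {
        if seen.insert(row.clone()) {
            work.push(row);
        }
    }

    // Iterate until the recursive term produces no new rows.
    while !work.is_empty() {
        output.extend(work.iter().cloned());
        let mut next: Vec<T> = Vec::new();
        for row in step(&work) {
            // Keep only rows that have never been emitted before.
            if seen.insert(row.clone()) {
                next.push(row);
            }
        }
        work = next;
    }
    output
}

/// Transitive closure of a directed graph given as an edge list.
/// Terminates on cyclic graphs because already emitted pairs are dropped.
fn transitive_closure(edges: &[(u32, u32)]) -> Vec<(u32, u32)> {
    recursive_union(edges.to_vec(), |delta| {
        let mut out = Vec::new();
        // Join the newly found pairs with the edge list: (a, b) + (b, d) => (a, d).
        for &(a, b) in delta {
            for &(c, d) in edges {
                if b == c {
                    out.push((a, d));
                }
            }
        }
        out
    })
}
```

On the cycle `1 -> 2 -> 3 -> 1`, calling `transitive_closure(&[(1, 2), (2, 3), (3, 1)])` stops once every pair has been emitted once, whereas evaluation without deduplication would loop forever. A real implementation inside `RecursiveQueryExec` would presumably have to work on batches of rows rather than individual tuples, which is where the extra coding effort would likely go.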
