Tpt opened a new pull request, #18184:
URL: https://github.com/apache/datafusion/pull/18184

   - Introduce a new intermediate "result table" that stores all results 
emitted so far.
   - Use existing physical operators to deduplicate the output of both the 
static and recursive terms, and to remove already emitted results from the 
recursive term's output.
   - Add a simple test computing a transitive closure on a cyclic graph.
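
   As a rough illustration of the evaluation strategy in the list above, here is a minimal standalone sketch in plain Rust. It is not DataFusion code: `distinct`, `anti_join`, `recursive_term` and `transitive_closure` are hypothetical stand-ins for the physical operators and the recursive query loop, and rows are simplified to `(u32, u32)` pairs.

```rust
use std::collections::HashSet;

// A row of the hypothetical edge/path table: (start, end).
type Row = (u32, u32);

/// Stand-in for a "distinct" operator: drops duplicates, keeps first-seen order.
fn distinct(rows: Vec<Row>) -> Vec<Row> {
    let mut seen = HashSet::new();
    rows.into_iter().filter(|r| seen.insert(*r)).collect()
}

/// Stand-in for an anti-join: keeps the rows of `left` that are absent from `right`.
fn anti_join(left: Vec<Row>, right: &[Row]) -> Vec<Row> {
    let right: HashSet<Row> = right.iter().copied().collect();
    left.into_iter().filter(|r| !right.contains(r)).collect()
}

/// Recursive term: extend every path in the working table by one edge.
fn recursive_term(working: &[Row], edges: &[Row]) -> Vec<Row> {
    let mut out = Vec::new();
    for &(start, mid) in working {
        for &(from, to) in edges {
            if mid == from {
                out.push((start, to));
            }
        }
    }
    out
}

fn transitive_closure(edges: &[Row]) -> Vec<Row> {
    // Static term, deduplicated.
    let mut working = distinct(edges.to_vec());
    // The "result table": every row emitted so far.
    let mut result_table = working.clone();
    while !working.is_empty() {
        // Deduplicate the recursive term, then drop rows already emitted.
        let candidates = distinct(recursive_term(&working, edges));
        working = anti_join(candidates, &result_table);
        result_table.extend_from_slice(&working);
    }
    result_table
}

fn main() {
    // A 3-cycle: without the anti-join step this loop would never terminate.
    let edges = [(1, 2), (2, 3), (3, 1)];
    let mut closure = transitive_closure(&edges);
    closure.sort();
    println!("{:?}", closure);
}
```

   Note that `anti_join` rebuilds its hash set from the whole result table on every iteration, which is one reason this style of evaluation is slow.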
   
   This approach is very naive and slow. It would be much better to build the 
result table as a hash table inside `RecursiveQueryExec` and use it for 
deduplication, removing the need for any extra operator. I expect this will 
require significant coding, because it needs an incremental hash map (build and 
probe steps are interleaved). 
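
   To make "build and probe steps are interleaved" concrete, here is a small sketch of the idea using the standard library's `HashSet`. `ResultTable` and `emit_new` are hypothetical names chosen for illustration. They are not part of DataFusion, and a real implementation inside `RecursiveQueryExec` would operate on Arrow record batches rather than on `Vec`s.

```rust
use std::collections::HashSet;
use std::hash::Hash;

/// Hypothetical result table whose build and probe steps are interleaved.
struct ResultTable<T> {
    seen: HashSet<T>,
}

impl<T: Eq + Hash + Clone> ResultTable<T> {
    fn new() -> Self {
        Self { seen: HashSet::new() }
    }

    /// Probes and builds in a single pass: `HashSet::insert` returns `true`
    /// only for rows not seen before, so the filter keeps exactly the new
    /// rows while recording them for later batches.
    fn emit_new(&mut self, batch: Vec<T>) -> Vec<T> {
        batch
            .into_iter()
            .filter(|row| self.seen.insert(row.clone()))
            .collect()
    }
}

fn main() {
    let mut table = ResultTable::new();
    // The first batch contains an internal duplicate.
    let first = table.emit_new(vec![(1, 2), (2, 3), (1, 2)]);
    // The second batch repeats a row that was already emitted.
    let second = table.emit_new(vec![(2, 3), (1, 3)]);
    println!("{:?} {:?}", first, second);
}
```

   Because a single structure handles both deduplication within a batch and removal of already emitted rows, no separate distinct or anti-join step is needed in this sketch.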
   
   * Closes #18140


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

