SarahAsad23 opened a new pull request, #5102: URL: https://github.com/apache/texera/pull/5102
Overview This PR introduces a framework for comparing different versions of a workflow execution to help identify where executions begin to diverge. The goal of this project is to improve workflow reproducibility and debugging by making it easier to detect differences across workflow versions, operator outputs, and intermediate states. The system allows workflow executions to be run side-by-side and analyzed for inconsistencies throughout execution rather than only comparing final outputs. Motivation As workflows evolve over time, even small changes to operators, dependencies, configurations, or execution environments can lead to unexpected differences in results. Currently, it can be difficult to determine exactly where two workflow executions stop behaving the same way. This project explores the idea of execution-level differencing for workflows by introducing tooling to: Compare executions across workflow versions Detect divergence points between executions Surface intermediate operator differences Support reproducibility analysis and debugging Features Added support for comparing multiple workflow executions Introduced execution differencing logic for detecting mismatched states Added operator-level comparison support Added infrastructure for tracking divergence points during execution -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
