GitHub user aglinxinyuan edited a discussion: Support Batch Execution Mode

I would like to introduce the idea of supporting multiple runtime execution 
modes that users can choose from based on the requirements of their use case 
and the characteristics of their jobs.

The current (and default) execution behavior of our engine is what we call 
pipelined, or STREAMING, execution mode. In this mode, each operator performs 
continuous, incremental processing as data flows through the pipeline.

In addition, we plan to support a batch-style execution mode, referred to as 
BATCH execution mode. This mode executes jobs in a manner more reminiscent of 
traditional batch processing. We intend to enable this mode via a configuration 
flag.
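To make the idea concrete, here is a minimal sketch of what such a configuration flag could look like. The names `ExecutionMode` and `WorkflowConfig` are illustrative only and are not part of the existing Texera API:

```python
# Hypothetical sketch of a mode-selection flag; names are illustrative,
# not actual Texera configuration keys.
from dataclasses import dataclass
from enum import Enum


class ExecutionMode(Enum):
    STREAMING = "streaming"  # default: pipelined, incremental processing
    BATCH = "batch"          # staged execution over bounded inputs


@dataclass
class WorkflowConfig:
    execution_mode: ExecutionMode = ExecutionMode.STREAMING


# A user opts into batch-style execution for a bounded job:
config = WorkflowConfig(execution_mode=ExecutionMode.BATCH)
print(config.execution_mode.value)  # batch
```

Keeping the flag as an enum (rather than a boolean) leaves room for additional modes later without breaking existing configurations.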

Our unified approach to stream and batch processing ensures that applications 
executed over bounded inputs will produce the same final results regardless of 
the selected execution mode. It is important to clarify what “final” means in 
this context: a job running in STREAMING mode may emit incremental updates over 
time (for example, upserts to a database), whereas a job running in BATCH mode 
produces a single result upon completion. Although the two modes take 
different paths, the final result is the same once all incremental updates 
have been applied. Enabling BATCH execution 
allows the engine to apply additional optimizations that are only possible when 
operators know that their inputs are bounded. 
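The "same final result" guarantee can be sketched with a toy count aggregation run both ways over the same bounded input. This is illustrative Python, not Texera code: STREAMING emits an upsert per input record, BATCH emits once on completion, and both converge to the same final counts.

```python
# Illustrative sketch (not Texera code): one aggregation, two modes.
from collections import Counter

records = ["a", "b", "a", "c", "a"]  # a bounded input


def run_streaming(records):
    """Incremental processing: emit an upsert after every record."""
    counts = Counter()
    updates = []
    for r in records:
        counts[r] += 1
        updates.append((r, counts[r]))  # e.g. an upsert to a database
    return updates, dict(counts)


def run_batch(records):
    """Batch processing: emit a single result upon completion."""
    return dict(Counter(records))


streaming_updates, streaming_final = run_streaming(records)
batch_final = run_batch(records)

# STREAMING produced one update per record along the way...
print(streaming_updates)  # [('a', 1), ('b', 1), ('a', 2), ('c', 1), ('a', 3)]
# ...but the final outcome matches BATCH exactly.
assert streaming_final == batch_final  # {'a': 3, 'b': 1, 'c': 1}
```

The intermediate upserts are the only observable difference; a consumer that applies them in order ends up with the same state the batch run produces at once.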

Below is an example illustrating the differences between these execution 
modes:

![Streaming](https://github.com/user-attachments/assets/77f5cbe3-fd5d-474b-b1a4-bd8af7602aff)
![Batch](https://github.com/user-attachments/assets/5f4f847d-b2e7-4d54-9629-b1bcf0df1f07)



GitHub link: https://github.com/apache/texera/discussions/4149
