Re: [PR] [docs] Update the deployment introduction for Flink agents [flink-agents]

via GitHub Fri, 17 Oct 2025 23:46:23 -0700


xintongsong commented on code in PR #237:
URL: https://github.com/apache/flink-agents/pull/237#discussion_r2409820564



##########
docs/content/docs/operations/deployment.md:
##########
@@ -22,20 +22,142 @@ specific language governing permissions and limitations
 under the License.
 -->
 
+## Overview
+
+We provide a total of three ways to run the job: Local Run with Test Data, 
Local Run with Flink MiniCluster, and Run in Flink Cluster. The detailed 
differences are shown in the table below:
+
+| Deployment Mode                     | Language Support       | Typical Use 
Case                                           |
+|-------------------------------------|------------------------|------------------------------------------------------------|
+| Local Run with Test Data            | Only Python            | Validate the 
internal logic of the Agent.                  |
+| Local Run with Flink MiniCluster    | Python & Java          | Verify 
upstream/downstream connectivity and schema correctness. |
+| Run in Flink Cluster                | Python & Java          | Large-scale 
data and AI processing in production environments. |
+
 ## Local Run with Test Data
 
-{{< hint warning >}}
-**TODO**: How to run with test data with LocalExecutorEnvironment.
-{{< /hint >}}
+After completing the [installation of flink-agents]({{< ref 
"docs/get-started/installation" >}}) and building your [ReAct Agent]({{< ref 
"docs/development/react_agent" >}}) or [workflow agent]({{< ref 
"docs/development/workflow_agent" >}}), you can test and execute your agent 
locally using a simple Python script. This allows you to validate logic without 
requiring a Flink cluster.
+
+### Key Features
+
+- **No Flink Required**: Local execution is ideal for development and testing.
+- **Test Data Simulation**: Easily inject mock inputs for validation.
+- **IDE Compatibility**: Run directly in your preferred development 
environment.
+
+### Example for Local Run with Test Data
+
+```python
+from flink_agents.api.execution_environment import AgentsExecutionEnvironment
+from my_module.agents import MyAgent  # Replace with your actual agent path
+
+if __name__ == "__main__":
+    # 1. Initialize environment
+    env = AgentsExecutionEnvironment.get_execution_environment()
+    
+    # 2. Prepare test data
+    input_data = [
+        {"key": "0001", "value": "Calculate the sum of 1 and 2."},
+        {"key": "0002", "value": "Tell me a joke about cats."}
+    ]
+    
+    # 3. Create agent instance
+    agent = MyAgent()
+    
+    # 4. Build pipeline
+    output_data = env.from_list(input_data) \
+                     .apply(agent) \
+                     .to_list()
+    
+    # 5. Execute and show results
+    env.execute()
+    
+    print("\nExecution Results:")
+    for record in output_data:
+        for key, value in record.items():
+            print(f"{key}: {value}")
+
+```
+
+#### Input Data Format
+
+The input data should be a list of dictionaries `List[Dict[str, Any]]` with 
the following structure:
+
+```python
+[
+    {
+       # Optional field: Input key
+        "key": "key_1",
+        
+        # Required field: Input content
+        # This becomes the `input` field in InputEvent
+        "value": "Calculate the sum of 1 and 2.",
+    },
+    ...
+]
+```
+
+#### Output Data Format
+
+The output data is a list of dictionaries `List[Dict[str, Any]]` where each 
dictionary contains a single key-value pair representing the processed result. 
The structure is generated from `OutputEvent` objects:
+
+```python
+[
+    {key_1: output_1},  # From first OutputEvent; key is randomly generated if 
it is not provided in input
+    {key_2: output_2},  # From second OutputEvent; key is randomly generated 
if it is not provided in input

Review Comment:
   What happens if the key of input is not specified - this should be explained 
in the input section



##########
docs/content/docs/operations/deployment.md:
##########
@@ -22,20 +22,142 @@ specific language governing permissions and limitations
 under the License.
 -->
 
+## Overview
+
+We provide a total of three ways to run the job: Local Run with Test Data, 
Local Run with Flink MiniCluster, and Run in Flink Cluster. The detailed 
differences are shown in the table below:
+
+| Deployment Mode                     | Language Support       | Typical Use 
Case                                           |
+|-------------------------------------|------------------------|------------------------------------------------------------|
+| Local Run with Test Data            | Only Python            | Validate the 
internal logic of the Agent.                  |
+| Local Run with Flink MiniCluster    | Python & Java          | Verify 
upstream/downstream connectivity and schema correctness. |
+| Run in Flink Cluster                | Python & Java          | Large-scale 
data and AI processing in production environments. |

Review Comment:
   1. Let's simplify this into two categories: run w/o flink & run in flink
   2. key difference: whether support java, datastream/table, from/to_list



##########
docs/content/docs/operations/deployment.md:
##########
@@ -22,20 +22,142 @@ specific language governing permissions and limitations
 under the License.
 -->
 
+## Overview
+
+We provide a total of three ways to run the job: Local Run with Test Data, 
Local Run with Flink MiniCluster, and Run in Flink Cluster. The detailed 
differences are shown in the table below:
+
+| Deployment Mode                     | Language Support       | Typical Use 
Case                                           |
+|-------------------------------------|------------------------|------------------------------------------------------------|
+| Local Run with Test Data            | Only Python            | Validate the 
internal logic of the Agent.                  |
+| Local Run with Flink MiniCluster    | Python & Java          | Verify 
upstream/downstream connectivity and schema correctness. |
+| Run in Flink Cluster                | Python & Java          | Large-scale 
data and AI processing in production environments. |
+
 ## Local Run with Test Data
 
-{{< hint warning >}}
-**TODO**: How to run with test data with LocalExecutorEnvironment.
-{{< /hint >}}
+After completing the [installation of flink-agents]({{< ref 
"docs/get-started/installation" >}}) and building your [ReAct Agent]({{< ref 
"docs/development/react_agent" >}}) or [workflow agent]({{< ref 
"docs/development/workflow_agent" >}}), you can test and execute your agent 
locally using a simple Python script. This allows you to validate logic without 
requiring a Flink cluster.
+
+### Key Features
+
+- **No Flink Required**: Local execution is ideal for development and testing.
+- **Test Data Simulation**: Easily inject mock inputs for validation.
+- **IDE Compatibility**: Run directly in your preferred development 
environment.
+
+### Example for Local Run with Test Data
+
+```python
+from flink_agents.api.execution_environment import AgentsExecutionEnvironment
+from my_module.agents import MyAgent  # Replace with your actual agent path
+
+if __name__ == "__main__":
+    # 1. Initialize environment
+    env = AgentsExecutionEnvironment.get_execution_environment()
+    
+    # 2. Prepare test data
+    input_data = [
+        {"key": "0001", "value": "Calculate the sum of 1 and 2."},
+        {"key": "0002", "value": "Tell me a joke about cats."}
+    ]
+    
+    # 3. Create agent instance
+    agent = MyAgent()
+    
+    # 4. Build pipeline
+    output_data = env.from_list(input_data) \
+                     .apply(agent) \
+                     .to_list()
+    
+    # 5. Execute and show results
+    env.execute()
+    
+    print("\nExecution Results:")
+    for record in output_data:
+        for key, value in record.items():
+            print(f"{key}: {value}")
+
+```
+
+#### Input Data Format
+
+The input data should be a list of dictionaries `List[Dict[str, Any]]` with 
the following structure:
+
+```python
+[
+    {
+       # Optional field: Input key
+        "key": "key_1",
+        
+        # Required field: Input content
+        # This becomes the `input` field in InputEvent
+        "value": "Calculate the sum of 1 and 2.",
+    },
+    ...
+]
+```
+
+#### Output Data Format
+
+The output data is a list of dictionaries `List[Dict[str, Any]]` where each 
dictionary contains a single key-value pair representing the processed result. 
The structure is generated from `OutputEvent` objects:
+
+```python
+[
+    {key_1: output_1},  # From first OutputEvent; key is randomly generated if 
it is not provided in input
+    {key_2: output_2},  # From second OutputEvent; key is randomly generated 
if it is not provided in input
+    ...
+]
+```
 
 ## Local Run with Flink MiniCluster
 
-{{< hint warning >}}
-**TODO**: How to run with Flink MiniCluster locally.
-{{< /hint >}}
+After completing the [installation of flink-agents]({{< ref 
"docs/get-started/installation" >}}) and [building your agent]({{< ref 
"docs/development/workflow_agent" >}}), you can test and execute your agent 
locally using a **Flink MiniCluster**. This allows you to have a lightweight 
Flink streaming environment without deploying to a full cluster.
+
+To run your job locally with the MiniCluster, use the following command:
+
+```bash
+python /path/to/flink_agents_job.py
+```
+
+For more details about how to integrate agents with Flink's `DataStream` or 
`Table`, please refer to the [Integrate with Flink]({{< ref 
"docs/development/integrate_with_flink" >}}) documentation.
 
 ## Run in Flink Cluster
 
-{{< hint warning >}}
-**TODO**: How to run in Flink Cluster.
-{{< /hint >}}
\ No newline at end of file
+### Prerequisites
+
+- **Operating System**: Unix-like environment (Linux, macOS, Cygwin, or WSL)  
+- **Python**: Version 3.10 or 3.11  
+- **Flink**: A running Flink cluster with version 1.20.3 and the Flink Agents 
dependency installed
+
+### Prepare Flink Agents
+
+We recommand creating a Python virtual environment to install the Flink Agents 
Python library.
+
+Follow the [instructions]({{< ref "docs/get-started/installation" >}}) to 
install the Flink Agents Python and Java libraries.
+
+### Submit to Flink Cluster
+
+Submitting Flink Agent jobs to the Flink Cluster is the same as submitting 
PyFlink jobs. For more details on all available options, please refer to the 
[Flink CLI 
documentation](https://nightlies.apache.org/flink/flink-docs-release-1.20/docs/deployment/cli/#submitting-pyflink-jobs).
+
+```bash
+# Run Flink Python Job
+# ------------------------------------------------------------------------
+# 1. Path Note:
+#    Replace "./flink-1.20.3" with the actual Flink installation directory.
+#
+# 2. Python Entry File:
+#    The "--python" parameter specifies the Python script to be executed.
+#    Replace "/path/to/flink_agents_job.py" with the full path to your job 
file.
+#
+# 3. JobManager Address:
+#    Replace "<jobmanagerHost>" with the hostname or IP address of the Flink 
JobManager.
+#    The default REST port is 8081.
+#
+# 4. Example:
+#    ./flink-1.20.3/bin/flink run \
+#        --jobmanager localhost:8081 \
+#        --python /home/user/flink_jobs/flink_agents_job.py
+# ------------------------------------------------------------------------
+./flink-1.20.3/bin/flink run \
+      --jobmanager <jobmanagerHost>:8081 \
+      --python /path/to/flink_agents_job.py

Review Comment:
   <FLINK_HOME>/bin/flink run \
         --jobmanager <FLINK_CLUSTER_ADDR> \
         --python <PATH_TO_YOUR_FLINK_AGENTS_JOB>



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] [docs] Update the deployment introduction for Flink agents [flink-agents]

Reply via email to