This is an automated email from the ASF dual-hosted git repository.
diwu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/doris-website.git
The following commit(s) were added to refs/heads/master by this push:
new 21fe36c8385 [doc](ecosystem) add flink connector 307 faq (#2438)
21fe36c8385 is described below
commit 21fe36c8385511b2ae31f4232537f90e62a45f81
Author: wudi <[email protected]>
AuthorDate: Thu Jun 5 11:35:19 2025 +0800
[doc](ecosystem) add flink connector 307 faq (#2438)
## Versions
- [x] dev
- [x] 3.0
- [x] 2.1
- [ ] 2.0
## Languages
- [x] Chinese
- [x] English
## Docs Checklist
- [ ] Checked by AI
- [ ] Test Cases Built
---
docs/ecosystem/flink-doris-connector.md | 6 +++++-
.../current/ecosystem/flink-doris-connector.md | 8 ++++++--
.../version-2.1/ecosystem/flink-doris-connector.md | 10 +++++++---
.../version-3.0/ecosystem/flink-doris-connector.md | 8 ++++++--
versioned_docs/version-2.1/ecosystem/flink-doris-connector.md | 6 +++++-
versioned_docs/version-3.0/ecosystem/flink-doris-connector.md | 6 +++++-
6 files changed, 34 insertions(+), 10 deletions(-)
diff --git a/docs/ecosystem/flink-doris-connector.md
b/docs/ecosystem/flink-doris-connector.md
index 4617ddd231e..08590d6799c 100644
--- a/docs/ecosystem/flink-doris-connector.md
+++ b/docs/ecosystem/flink-doris-connector.md
@@ -1138,4 +1138,8 @@ In the whole database synchronization tool provided by
the Connector, no additio
6. **How to configure when the network between Flink machines and BE machines
is not connected?**
- When Flink initiates writing to Doris, Doris will redirect the write
operation to BE. At this time, the returned address is the internal network IP
of BE, which is the IP seen through the `show backends` command. If Flink and
Doris have no network connectivity at this time, an error will be reported. In
this case, you can configure the external network IP of BE in `benodes`.
\ No newline at end of file
+ When Flink initiates writing to Doris, Doris will redirect the write
operation to BE. At this time, the returned address is the internal network IP
of BE, which is the IP seen through the `show backends` command. If Flink and
Doris have no network connectivity at this time, an error will be reported. In
this case, you can configure the external network IP of BE in `benodes`.
+
+7. **stream load error: HTTP/1.1 307 Temporary Redirect**
+
+ Flink will first request FE, and after receiving 307, it will request BE
after redirection. When FE is in FullGC/high pressure/network delay, HttpClient
will send data without waiting for a response within a certain period of time
(3 seconds) by default. Since the request body is InputStream by default, when
a 307 response is received, the data cannot be replayed and an error will be
reported directly. There are three ways to solve this problem: 1. Upgrade to
Connector25.1.0 or above [...]
\ No newline at end of file
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs/current/ecosystem/flink-doris-connector.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/ecosystem/flink-doris-connector.md
index ef247a42fda..c551f33cb70 100644
---
a/i18n/zh-CN/docusaurus-plugin-content-docs/current/ecosystem/flink-doris-connector.md
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/ecosystem/flink-doris-connector.md
@@ -1132,7 +1132,7 @@ from KAFKA_SOURCE;
4. **tablet writer write failed, tablet_id=190958, txn_id=3505530, err=-235**
-通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和
sink.batch.interval 参数来降低 Streamload 的频率。在 Connector1.1.0 之后,默认写入时机是由
Checkpoint 控制,可以通过增加 Checkpoint 间隔来降低写入频率。频率。
+ 通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和
sink.batch.interval 参数来降低 Streamload 的频率。在 Connector1.1.0 之后,默认写入时机是由
Checkpoint 控制,可以通过增加 Checkpoint 间隔来降低写入频率。频率。
5. **Flink 导入有脏数据,如何跳过?**
@@ -1140,4 +1140,8 @@ from KAFKA_SOURCE;
6. **Flink 机器与 BE 机器的网络不通,如何配置?**
-Flink 向 Doris 发起写入时,Doris 会重定向到 BE 进行写入,此时返回的地址是 BE 的内网 IP,即通过即通过`show
backends`看到的 IP,此时 Flink 与 Doris 网络不通的,会报错。这时可以在 benodes 中配置 BE 的外网 IP 即可。
+ Flink 向 Doris 发起写入时,Doris 会重定向到 BE 进行写入,此时返回的地址是 BE 的内网 IP,即通过即通过`show
backends`看到的 IP,此时 Flink 与 Doris 网络不通的,会报错。这时可以在 benodes 中配置 BE 的外网 IP 即可。
+
+7. **stream load error: HTTP/1.1 307 Temporary Redirect**
+
+ Flink 会先向 FE 请求,收到 307 后会向重定向后的 BE 请求。当 FE 在 FullGC/压力大/网络延迟的时候,HttpClient
默认会在一定时间 (3 秒) 没有等到响应会发送数据,由于默认情况下请求体是 InputStream,当收到 307
响应时,数据无法重放,会直接报错。有三种方式可以解决:1.升级到 Connector25.1.0 以上,调长了默认时间;2.修改
auto-redirect=false,直接向 BE 发起请求(不适用部分云上场景);3.主键模型可以开启攒批模式。
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/ecosystem/flink-doris-connector.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/ecosystem/flink-doris-connector.md
index 508276e4323..51952a54959 100644
---
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/ecosystem/flink-doris-connector.md
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/ecosystem/flink-doris-connector.md
@@ -1122,7 +1122,7 @@ from KAFKA_SOURCE;
2. **errCode = 2, detailMessage = transaction [19650] not found**
- 发生在 Commit 阶段,checkpoint 里面记录的事务 ID,在 FE 侧已经过期,此时再次 commit 就会出现上述错误。此时无法从
checkpoint 启动,后续可通过修改 fe.conf 的 `streaming_label_keep_max_second` 配置来延长过期时间,默认
12 小时。Doris2.0 版本后还会受到 fe.conf 中 `label_num_threshold` 配置的限制 (默认 2000)
,可以调大或者改为 -1(-1 表示只受时间限制)。
+ 发生在 Commit 阶段,checkpoint 里面记录的事务 ID,在 FE 侧已经过期,此时再次 commit 就会出现上述错误。此时无法从
checkpoint 启动,后续可通过修改 fe.conf 的 `streaming_label_keep_max_second` 配置来延长过期时间,默认
12 小时。Doris2.0 版本后还会受到 fe.conf 中 `label_num_threshold` 配置的限制 (默认 2000)
,可以调大或者改为 -1(-1 表示只受时间限制)。
3. **errCode = 2, detailMessage = current running txns on db 10006 is 100,
larger than limit 100**
@@ -1132,7 +1132,7 @@ from KAFKA_SOURCE;
4. **tablet writer write failed, tablet_id=190958, txn_id=3505530, err=-235**
-通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和
sink.batch.interval 参数来降低 Streamload 的频率。在 Connector1.1.0 之后,默认写入时机是由
Checkpoint 控制,可以通过增加 Checkpoint 间隔来降低写入频率。频率。
+ 通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和
sink.batch.interval 参数来降低 Streamload 的频率。在 Connector1.1.0 之后,默认写入时机是由
Checkpoint 控制,可以通过增加 Checkpoint 间隔来降低写入频率。频率。
5. **Flink 导入有脏数据,如何跳过?**
@@ -1140,4 +1140,8 @@ from KAFKA_SOURCE;
6. **Flink 机器与 BE 机器的网络不通,如何配置?**
-Flink 向 Doris 发起写入时,Doris 会重定向到 BE 进行写入,此时返回的地址是 BE 的内网 IP,即通过即通过`show
backends`看到的 IP,此时 Flink 与 Doris 网络不通的,会报错。这时可以在 benodes 中配置 BE 的外网 IP 即可。
+ Flink 向 Doris 发起写入时,Doris 会重定向到 BE 进行写入,此时返回的地址是 BE 的内网 IP,即通过即通过`show
backends`看到的 IP,此时 Flink 与 Doris 网络不通的,会报错。这时可以在 benodes 中配置 BE 的外网 IP 即可。
+
+7. **stream load error: HTTP/1.1 307 Temporary Redirect**
+
+ Flink 会先向 FE 请求,收到 307 后会向重定向后的 BE 请求。当 FE 在 FullGC/压力大/网络延迟的时候,HttpClient
默认会在一定时间 (3 秒) 没有等到响应会发送数据,由于默认情况下请求体是 InputStream,当收到 307
响应时,数据无法重放,会直接报错。有三种方式可以解决:1.升级到 Connector25.1.0 以上,调长了默认时间;2.修改
auto-redirect=false,直接向 BE 发起请求(不适用部分云上场景);3.主键模型可以开启攒批模式。
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/ecosystem/flink-doris-connector.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/ecosystem/flink-doris-connector.md
index 3c3dbc2cb0c..b52d857a37c 100644
---
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/ecosystem/flink-doris-connector.md
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/ecosystem/flink-doris-connector.md
@@ -1132,7 +1132,7 @@ from KAFKA_SOURCE;
4. **tablet writer write failed, tablet_id=190958, txn_id=3505530, err=-235**
-通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和
sink.batch.interval 参数来降低 Streamload 的频率。在 Connector1.1.0 之后,默认写入时机是由
Checkpoint 控制,可以通过增加 Checkpoint 间隔来降低写入频率。频率。
+ 通常发生在 Connector1.1.0 之前,是由于写入频率过快,导致版本过多。可以通过设置 sink.batch.size 和
sink.batch.interval 参数来降低 Streamload 的频率。在 Connector1.1.0 之后,默认写入时机是由
Checkpoint 控制,可以通过增加 Checkpoint 间隔来降低写入频率。频率。
5. **Flink 导入有脏数据,如何跳过?**
@@ -1140,4 +1140,8 @@ from KAFKA_SOURCE;
6. **Flink 机器与 BE 机器的网络不通,如何配置?**
-Flink 向 Doris 发起写入时,Doris 会重定向到 BE 进行写入,此时返回的地址是 BE 的内网 IP,即通过即通过`show
backends`看到的 IP,此时 Flink 与 Doris 网络不通的,会报错。这时可以在 benodes 中配置 BE 的外网 IP 即可。
+ Flink 向 Doris 发起写入时,Doris 会重定向到 BE 进行写入,此时返回的地址是 BE 的内网 IP,即通过即通过`show
backends`看到的 IP,此时 Flink 与 Doris 网络不通的,会报错。这时可以在 benodes 中配置 BE 的外网 IP 即可。
+
+7. **stream load error: HTTP/1.1 307 Temporary Redirect**
+
+ Flink 会先向 FE 请求,收到 307 后会向重定向后的 BE 请求。当 FE 在 FullGC/压力大/网络延迟的时候,HttpClient
默认会在一定时间 (3 秒) 没有等到响应会发送数据,由于默认情况下请求体是 InputStream,当收到 307
响应时,数据无法重放,会直接报错。有三种方式可以解决:1.升级到 Connector25.1.0 以上,调长了默认时间;2.修改
auto-redirect=false,直接向 BE 发起请求(不适用部分云上场景);3.主键模型可以开启攒批模式。
diff --git a/versioned_docs/version-2.1/ecosystem/flink-doris-connector.md
b/versioned_docs/version-2.1/ecosystem/flink-doris-connector.md
index 9f7f7820724..d433012906b 100644
--- a/versioned_docs/version-2.1/ecosystem/flink-doris-connector.md
+++ b/versioned_docs/version-2.1/ecosystem/flink-doris-connector.md
@@ -1138,4 +1138,8 @@ In the whole database synchronization tool provided by
the Connector, no additio
6. **How to configure when the network between Flink machines and BE machines
is not connected?**
- When Flink initiates writing to Doris, Doris will redirect the write
operation to BE. At this time, the returned address is the internal network IP
of BE, which is the IP seen through the `show backends` command. If Flink and
Doris have no network connectivity at this time, an error will be reported. In
this case, you can configure the external network IP of BE in `benodes`.
\ No newline at end of file
+ When Flink initiates writing to Doris, Doris will redirect the write
operation to BE. At this time, the returned address is the internal network IP
of BE, which is the IP seen through the `show backends` command. If Flink and
Doris have no network connectivity at this time, an error will be reported. In
this case, you can configure the external network IP of BE in `benodes`.
+
+7. **stream load error: HTTP/1.1 307 Temporary Redirect**
+
+ Flink will first request FE, and after receiving 307, it will request BE
after redirection. When FE is in FullGC/high pressure/network delay, HttpClient
will send data without waiting for a response within a certain period of time
(3 seconds) by default. Since the request body is InputStream by default, when
a 307 response is received, the data cannot be replayed and an error will be
reported directly. There are three ways to solve this problem: 1. Upgrade to
Connector25.1.0 or above [...]
\ No newline at end of file
diff --git a/versioned_docs/version-3.0/ecosystem/flink-doris-connector.md
b/versioned_docs/version-3.0/ecosystem/flink-doris-connector.md
index 9f7f7820724..d433012906b 100644
--- a/versioned_docs/version-3.0/ecosystem/flink-doris-connector.md
+++ b/versioned_docs/version-3.0/ecosystem/flink-doris-connector.md
@@ -1138,4 +1138,8 @@ In the whole database synchronization tool provided by
the Connector, no additio
6. **How to configure when the network between Flink machines and BE machines
is not connected?**
- When Flink initiates writing to Doris, Doris will redirect the write
operation to BE. At this time, the returned address is the internal network IP
of BE, which is the IP seen through the `show backends` command. If Flink and
Doris have no network connectivity at this time, an error will be reported. In
this case, you can configure the external network IP of BE in `benodes`.
\ No newline at end of file
+ When Flink initiates writing to Doris, Doris will redirect the write
operation to BE. At this time, the returned address is the internal network IP
of BE, which is the IP seen through the `show backends` command. If Flink and
Doris have no network connectivity at this time, an error will be reported. In
this case, you can configure the external network IP of BE in `benodes`.
+
+7. **stream load error: HTTP/1.1 307 Temporary Redirect**
+
+ Flink will first request FE, and after receiving 307, it will request BE
after redirection. When FE is in FullGC/high pressure/network delay, HttpClient
will send data without waiting for a response within a certain period of time
(3 seconds) by default. Since the request body is InputStream by default, when
a 307 response is received, the data cannot be replayed and an error will be
reported directly. There are three ways to solve this problem: 1. Upgrade to
Connector25.1.0 or above [...]
\ No newline at end of file
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]