This is an automated email from the ASF dual-hosted git repository.
liaoxin01 pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/doris-website.git
The following commit(s) were added to refs/heads/master by this push:
new 069d354b7ca [doc](load) add field examples and troubleshooting query
for routine_load_job (#3635)
069d354b7ca is described below
commit 069d354b7ca6d56177f4daef6d83c1d24beef889
Author: hui lai <[email protected]>
AuthorDate: Thu May 14 15:02:23 2026 +0800
[doc](load) add field examples and troubleshooting query for
routine_load_job (#3635)
---
.../information_schema/routine_load_job.md | 77 +++++++++++++-------
.../information_schema/routine_load_job.md | 77 +++++++++++++-------
.../information_schema/routine_load_job.md | 77 +++++++++++++-------
.../information_schema/routine_load_job.md | 77 +++++++++++++-------
.../information_schema/routine_load_job.md | 81 +++++++++++++++-------
.../information_schema/routine_load_job.md | 77 +++++++++++++-------
.../information_schema/routine_load_job.md | 77 +++++++++++++-------
.../information_schema/routine_load_job.md | 81 +++++++++++++++-------
.../information_schema/routine_load_job.md | 81 +++++++++++++++-------
.../information_schema/routine_load_job.md | 81 +++++++++++++++-------
.../information_schema/routine_load_job.md | 77 +++++++++++++-------
11 files changed, 591 insertions(+), 272 deletions(-)
diff --git
a/docs-next/admin-manual/system-tables/information_schema/routine_load_job.md
b/docs-next/admin-manual/system-tables/information_schema/routine_load_job.md
index f35b0ee1955..8acc853f73b 100644
---
a/docs-next/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/docs-next/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -10,31 +10,60 @@
Used to view information about routine load jobs.
-## Database
+## Database and Table
-`information_schema`
+`information_schema.routine_load_jobs`
## Table Information
-| Column Name | Type | Description
|
-| :-------------------- | :-------- |
:----------------------------------------- |
-| JOB_ID | text | Job ID
|
-| JOB_NAME | text | Job name
|
-| CREATE_TIME | text | Job creation time
|
-| PAUSE_TIME | text | Job pause time
|
-| END_TIME | text | Job end time
|
-| DB_NAME | text | Database name
|
-| TABLE_NAME | text | Table name
|
-| STATE | text | Job status
|
-| CURRENT_TASK_NUM | text | Current number of subtasks
|
-| JOB_PROPERTIES | text | Job property configurations
|
-| DATA_SOURCE_PROPERTIES| text | Data source property configurations
|
-| CUSTOM_PROPERTIES | text | Custom property configurations
|
-| STATISTIC | text | Job statistics information
|
-| PROGRESS | text | Job progress information
|
-| LAG | text | Job delay information
|
-| REASON_OF_STATE_CHANGED| text | Reason for job status change
|
-| ERROR_LOG_URLS | text | Error log URLs
|
-| USER_NAME | text | Username
|
-| CURRENT_ABORT_TASK_NUM| int | Current number of failed tasks
|
-| IS_ABNORMAL_PAUSE | boolean | Whether paused by system (non-user pause)
|
\ No newline at end of file
+| Column Name | Type | Description
| Example |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :------ |
+| JOB_ID | text | Job ID generated by Doris.
| `12025` |
+| JOB_NAME | text | Routine Load job name.
| `example_routine_load` |
+| CREATE_TIME | text | Job creation time.
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | Most recent job pause time. It is
`NULL` if the job has not been paused.
| `NULL` |
+| END_TIME | text | Job end time. It is `NULL` if the job
has not ended.
| `NULL` |
+| DB_NAME | text | Database name of the job.
| `default_cluster:testdb` |
+| TABLE_NAME | text | Target table name of the job. For
multi-table import jobs, this value is `multi-table`.
| `test_routineload_tbl` |
+| STATE | text | Job running status, including
`NEED_SCHEDULE`, `RUNNING`, `PAUSED`, `STOPPED`, and `CANCELLED`.
| `RUNNING` |
+| CURRENT_TASK_NUM | text | Number of subtasks currently being
scheduled or executed.
| `1` |
+| JOB_PROPERTIES | text | Job property configurations, including
batch size, concurrency, import format, column mapping, and error tolerance.
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | Data source property configurations.
For Kafka, this includes topic, broker list, and current Kafka partitions.
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | Custom properties configured when
creating the job. For Kafka, this usually includes offsets, group id, and Kafka
client parameters passed with `property.` prefixes. |
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | Job runtime statistics. Common fields
include `receivedBytes`, `loadedRows`, `errorRows`, `committedTaskNum`,
`abortedTaskNum`, `loadRowsRate`, and `taskExecuteTimeMs`. |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | Job running progress. For Kafka, it
shows the consumed offset of each partition.
| `{"0":"2"}` |
+| LAG | text | Job lag information. For Kafka, it
shows the consumption lag of each partition.
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | Reason for the job state change. It is
usually empty for normally running jobs, and records the specific reason when
the job is paused or cancelled abnormally. | `The number of failed task
exceeded max_error_number` |
+| ERROR_LOG_URLS | text | Error log URLs for viewing filtered
data that failed quality checks. It is empty if there is no error log.
|
`http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | User who created or operated the job.
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | Current number of failed subtasks.
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | Whether the job was paused abnormally
by the system instead of manually by a user. `true` indicates an abnormal
system pause, and `false` indicates no abnormal pause. | `false` |
+
+## Query Abnormal Jobs
+
+If a job is abnormally paused, has failed tasks, or is in the `RUNNING` state
but has no running subtasks while Kafka still has consumption lag, it should be
investigated. You can use the following SQL to query these jobs:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+After finding an abnormal job, switch to the corresponding database and use
`SHOW ROUTINE LOAD` to view the job details:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+`LAG` is the Kafka consumption lag information for each partition. `LAG REGEXP
':[[:space:]]*[1-9][0-9]*'` matches jobs where at least one partition has a lag
greater than 0.
diff --git
a/docs/admin-manual/system-tables/information_schema/routine_load_job.md
b/docs/admin-manual/system-tables/information_schema/routine_load_job.md
index f35b0ee1955..8acc853f73b 100644
--- a/docs/admin-manual/system-tables/information_schema/routine_load_job.md
+++ b/docs/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -10,31 +10,60 @@
Used to view information about routine load jobs.
-## Database
+## Database and Table
-`information_schema`
+`information_schema.routine_load_jobs`
## Table Information
-| Column Name | Type | Description
|
-| :-------------------- | :-------- |
:----------------------------------------- |
-| JOB_ID | text | Job ID
|
-| JOB_NAME | text | Job name
|
-| CREATE_TIME | text | Job creation time
|
-| PAUSE_TIME | text | Job pause time
|
-| END_TIME | text | Job end time
|
-| DB_NAME | text | Database name
|
-| TABLE_NAME | text | Table name
|
-| STATE | text | Job status
|
-| CURRENT_TASK_NUM | text | Current number of subtasks
|
-| JOB_PROPERTIES | text | Job property configurations
|
-| DATA_SOURCE_PROPERTIES| text | Data source property configurations
|
-| CUSTOM_PROPERTIES | text | Custom property configurations
|
-| STATISTIC | text | Job statistics information
|
-| PROGRESS | text | Job progress information
|
-| LAG | text | Job delay information
|
-| REASON_OF_STATE_CHANGED| text | Reason for job status change
|
-| ERROR_LOG_URLS | text | Error log URLs
|
-| USER_NAME | text | Username
|
-| CURRENT_ABORT_TASK_NUM| int | Current number of failed tasks
|
-| IS_ABNORMAL_PAUSE | boolean | Whether paused by system (non-user pause)
|
\ No newline at end of file
+| Column Name | Type | Description
| Example |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :------ |
+| JOB_ID | text | Job ID generated by Doris.
| `12025` |
+| JOB_NAME | text | Routine Load job name.
| `example_routine_load` |
+| CREATE_TIME | text | Job creation time.
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | Most recent job pause time. It is
`NULL` if the job has not been paused.
| `NULL` |
+| END_TIME | text | Job end time. It is `NULL` if the job
has not ended.
| `NULL` |
+| DB_NAME | text | Database name of the job.
| `default_cluster:testdb` |
+| TABLE_NAME | text | Target table name of the job. For
multi-table import jobs, this value is `multi-table`.
| `test_routineload_tbl` |
+| STATE | text | Job running status, including
`NEED_SCHEDULE`, `RUNNING`, `PAUSED`, `STOPPED`, and `CANCELLED`.
| `RUNNING` |
+| CURRENT_TASK_NUM | text | Number of subtasks currently being
scheduled or executed.
| `1` |
+| JOB_PROPERTIES | text | Job property configurations, including
batch size, concurrency, import format, column mapping, and error tolerance.
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | Data source property configurations.
For Kafka, this includes topic, broker list, and current Kafka partitions.
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | Custom properties configured when
creating the job. For Kafka, this usually includes offsets, group id, and Kafka
client parameters passed with `property.` prefixes. |
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | Job runtime statistics. Common fields
include `receivedBytes`, `loadedRows`, `errorRows`, `committedTaskNum`,
`abortedTaskNum`, `loadRowsRate`, and `taskExecuteTimeMs`. |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | Job running progress. For Kafka, it
shows the consumed offset of each partition.
| `{"0":"2"}` |
+| LAG | text | Job lag information. For Kafka, it
shows the consumption lag of each partition.
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | Reason for the job state change. It is
usually empty for normally running jobs, and records the specific reason when
the job is paused or cancelled abnormally. | `The number of failed task
exceeded max_error_number` |
+| ERROR_LOG_URLS | text | Error log URLs for viewing filtered
data that failed quality checks. It is empty if there is no error log.
|
`http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | User who created or operated the job.
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | Current number of failed subtasks.
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | Whether the job was paused abnormally
by the system instead of manually by a user. `true` indicates an abnormal
system pause, and `false` indicates no abnormal pause. | `false` |
+
+## Query Abnormal Jobs
+
+If a job is abnormally paused, has failed tasks, or is in the `RUNNING` state
but has no running subtasks while Kafka still has consumption lag, it should be
investigated. You can use the following SQL to query these jobs:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+After finding an abnormal job, switch to the corresponding database and use
`SHOW ROUTINE LOAD` to view the job details:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+`LAG` is the Kafka consumption lag information for each partition. `LAG REGEXP
':[[:space:]]*[1-9][0-9]*'` matches jobs where at least one partition has a lag
greater than 0.
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs-next/current/admin-manual/system-tables/information_schema/routine_load_job.md
b/i18n/zh-CN/docusaurus-plugin-content-docs-next/current/admin-manual/system-tables/information_schema/routine_load_job.md
index 019b45f01cf..3b85ae2a2c0 100644
---
a/i18n/zh-CN/docusaurus-plugin-content-docs-next/current/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs-next/current/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -10,31 +10,60 @@
用于查看routine load导入作业的信息
-## 所属数据库
+## 所属数据库和表
-`information_schema`
+`information_schema.routine_load_jobs`
## 表信息
-| 列名 | 类型 | 说明 |
-| :--------------------- | :-------- | :--------------------------------------
|
-| JOB_ID | text | 作业 ID |
-| JOB_NAME | text | 作业名称 |
-| CREATE_TIME | text | 作业创建时间 |
-| PAUSE_TIME | text | 作业暂停时间 |
-| END_TIME | text | 作业结束时间 |
-| DB_NAME | text | 数据库名称 |
-| TABLE_NAME | text | 表名称 |
-| STATE | text | 作业状态 |
-| CURRENT_TASK_NUM | text | 当前子任务数量 |
-| JOB_PROPERTIES | text | 作业属性配置 |
-| DATA_SOURCE_PROPERTIES | text | 数据源属性配置 |
-| CUSTOM_PROPERTIES | text | 自定义属性配置 |
-| STATISTIC | text | 作业统计信息 |
-| PROGRESS | text | 作业进度信息 |
-| LAG | text | 作业延迟信息 |
-| REASON_OF_STATE_CHANGED| text | 作业状态变更原因 |
-| ERROR_LOG_URLS | text | 错误日志 URL |
-| USER_NAME | text | 用户名 |
-| CURRENT_ABORT_TASK_NUM | int | 当前失败的任务数量 |
-| IS_ABNORMAL_PAUSE | boolean | 是否非用户暂停 |
+| 列名 | 类型 | 说明
| 样例 |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :--- |
+| JOB_ID | text | 作业 ID,由 Doris 自动生成。
| `12025` |
+| JOB_NAME | text | Routine Load 作业名称。
| `example_routine_load` |
+| CREATE_TIME | text | 作业创建时间。
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | 最近一次作业暂停时间。作业未暂停时为 `NULL`。
| `NULL` |
+| END_TIME | text | 作业结束时间。作业未结束时为 `NULL`。
| `NULL` |
+| DB_NAME | text | 作业所属数据库名称。
| `default_cluster:testdb` |
+| TABLE_NAME | text | 作业导入的目标表名称。多表导入时显示为 `multi-table`。
| `test_routineload_tbl` |
+| STATE | text | 作业运行状态,包括
`NEED_SCHEDULE`、`RUNNING`、`PAUSED`、`STOPPED`、`CANCELLED`。
| `RUNNING` |
+| CURRENT_TASK_NUM | text | 当前正在调度或执行的子任务数量。
| `1` |
+| JOB_PROPERTIES | text | 作业属性配置,包含批次大小、并发数、导入格式、列映射、错误容忍度等配置。
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | 数据源属性配置。Kafka 数据源中包含 topic、broker 列表和当前
Kafka 分区等信息。
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | 创建作业时设置的自定义属性。Kafka 数据源中通常包含消费
offset、group id 以及以 `property.` 前缀传入的 Kafka 客户端参数。
|
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | 作业运行统计信息。常用字段包括
`receivedBytes`、`loadedRows`、`errorRows`、`committedTaskNum`、`abortedTaskNum`、`loadRowsRate`、`taskExecuteTimeMs`
等。 |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | 作业运行进度。Kafka 数据源中显示每个分区当前已消费的 offset。
| `{"0":"2"}` |
+| LAG | text | 作业延迟信息。Kafka 数据源中显示每个分区的消费积压量。
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | 作业状态变更原因。正常运行时通常为空;异常暂停或取消时记录具体原因。
|
`The number of failed task exceeded max_error_number` |
+| ERROR_LOG_URLS | text | 错误日志 URL,用于查看被过滤的质量不合格数据。无错误日志时为空。
| `http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | 创建或操作该作业的用户。
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | 当前失败的子任务数量。
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | 是否为非用户手动触发的异常暂停。`true`
表示系统因异常原因暂停作业,`false` 表示没有异常暂停。
| `false` |
+
+## 查询异常作业
+
+如果作业异常暂停、存在失败任务,或处于 `RUNNING` 状态但没有正在执行的子任务,同时 Kafka
仍有消费积压,都可以视为需要进一步排查的异常作业。可以通过以下 SQL 查询这些作业:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+查询到异常作业后,可以切换到对应数据库,通过 `SHOW ROUTINE LOAD` 查看作业详情:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+其中 `LAG` 是 Kafka 各分区的消费积压信息。`LAG REGEXP ':[[:space:]]*[1-9][0-9]*'`
用于匹配至少有一个分区存在大于 0 的积压。
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
index 019b45f01cf..3b85ae2a2c0 100644
---
a/i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -10,31 +10,60 @@
用于查看routine load导入作业的信息
-## 所属数据库
+## 所属数据库和表
-`information_schema`
+`information_schema.routine_load_jobs`
## 表信息
-| 列名 | 类型 | 说明 |
-| :--------------------- | :-------- | :--------------------------------------
|
-| JOB_ID | text | 作业 ID |
-| JOB_NAME | text | 作业名称 |
-| CREATE_TIME | text | 作业创建时间 |
-| PAUSE_TIME | text | 作业暂停时间 |
-| END_TIME | text | 作业结束时间 |
-| DB_NAME | text | 数据库名称 |
-| TABLE_NAME | text | 表名称 |
-| STATE | text | 作业状态 |
-| CURRENT_TASK_NUM | text | 当前子任务数量 |
-| JOB_PROPERTIES | text | 作业属性配置 |
-| DATA_SOURCE_PROPERTIES | text | 数据源属性配置 |
-| CUSTOM_PROPERTIES | text | 自定义属性配置 |
-| STATISTIC | text | 作业统计信息 |
-| PROGRESS | text | 作业进度信息 |
-| LAG | text | 作业延迟信息 |
-| REASON_OF_STATE_CHANGED| text | 作业状态变更原因 |
-| ERROR_LOG_URLS | text | 错误日志 URL |
-| USER_NAME | text | 用户名 |
-| CURRENT_ABORT_TASK_NUM | int | 当前失败的任务数量 |
-| IS_ABNORMAL_PAUSE | boolean | 是否非用户暂停 |
+| 列名 | 类型 | 说明
| 样例 |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :--- |
+| JOB_ID | text | 作业 ID,由 Doris 自动生成。
| `12025` |
+| JOB_NAME | text | Routine Load 作业名称。
| `example_routine_load` |
+| CREATE_TIME | text | 作业创建时间。
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | 最近一次作业暂停时间。作业未暂停时为 `NULL`。
| `NULL` |
+| END_TIME | text | 作业结束时间。作业未结束时为 `NULL`。
| `NULL` |
+| DB_NAME | text | 作业所属数据库名称。
| `default_cluster:testdb` |
+| TABLE_NAME | text | 作业导入的目标表名称。多表导入时显示为 `multi-table`。
| `test_routineload_tbl` |
+| STATE | text | 作业运行状态,包括
`NEED_SCHEDULE`、`RUNNING`、`PAUSED`、`STOPPED`、`CANCELLED`。
| `RUNNING` |
+| CURRENT_TASK_NUM | text | 当前正在调度或执行的子任务数量。
| `1` |
+| JOB_PROPERTIES | text | 作业属性配置,包含批次大小、并发数、导入格式、列映射、错误容忍度等配置。
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | 数据源属性配置。Kafka 数据源中包含 topic、broker 列表和当前
Kafka 分区等信息。
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | 创建作业时设置的自定义属性。Kafka 数据源中通常包含消费
offset、group id 以及以 `property.` 前缀传入的 Kafka 客户端参数。
|
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | 作业运行统计信息。常用字段包括
`receivedBytes`、`loadedRows`、`errorRows`、`committedTaskNum`、`abortedTaskNum`、`loadRowsRate`、`taskExecuteTimeMs`
等。 |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | 作业运行进度。Kafka 数据源中显示每个分区当前已消费的 offset。
| `{"0":"2"}` |
+| LAG | text | 作业延迟信息。Kafka 数据源中显示每个分区的消费积压量。
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | 作业状态变更原因。正常运行时通常为空;异常暂停或取消时记录具体原因。
|
`The number of failed task exceeded max_error_number` |
+| ERROR_LOG_URLS | text | 错误日志 URL,用于查看被过滤的质量不合格数据。无错误日志时为空。
| `http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | 创建或操作该作业的用户。
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | 当前失败的子任务数量。
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | 是否为非用户手动触发的异常暂停。`true`
表示系统因异常原因暂停作业,`false` 表示没有异常暂停。
| `false` |
+
+## 查询异常作业
+
+如果作业异常暂停、存在失败任务,或处于 `RUNNING` 状态但没有正在执行的子任务,同时 Kafka
仍有消费积压,都可以视为需要进一步排查的异常作业。可以通过以下 SQL 查询这些作业:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+查询到异常作业后,可以切换到对应数据库,通过 `SHOW ROUTINE LOAD` 查看作业详情:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+其中 `LAG` 是 Kafka 各分区的消费积压信息。`LAG REGEXP ':[[:space:]]*[1-9][0-9]*'`
用于匹配至少有一个分区存在大于 0 的积压。
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
index 2ff10f68f49..3b85ae2a2c0 100644
---
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -2,39 +2,68 @@
{
"title": "routine_load_job",
"language": "zh-CN",
- "description": "用于查看routine load导入作业的信息, 从3.0.5版本引入这个特性。"
+ "description": "用于查看routine load导入作业的信息"
}
---
## 概述
-用于查看routine load导入作业的信息, 从3.0.5版本引入这个特性。
+用于查看routine load导入作业的信息
-## 所属数据库
+## 所属数据库和表
-`information_schema`
+`information_schema.routine_load_jobs`
## 表信息
-| 列名 | 类型 | 说明 |
-| :--------------------- | :-------- | :--------------------------------------
|
-| JOB_ID | text | 作业 ID |
-| JOB_NAME | text | 作业名称 |
-| CREATE_TIME | text | 作业创建时间 |
-| PAUSE_TIME | text | 作业暂停时间 |
-| END_TIME | text | 作业结束时间 |
-| DB_NAME | text | 数据库名称 |
-| TABLE_NAME | text | 表名称 |
-| STATE | text | 作业状态 |
-| CURRENT_TASK_NUM | text | 当前子任务数量 |
-| JOB_PROPERTIES | text | 作业属性配置 |
-| DATA_SOURCE_PROPERTIES | text | 数据源属性配置 |
-| CUSTOM_PROPERTIES | text | 自定义属性配置 |
-| STATISTIC | text | 作业统计信息 |
-| PROGRESS | text | 作业进度信息 |
-| LAG | text | 作业延迟信息 |
-| REASON_OF_STATE_CHANGED| text | 作业状态变更原因 |
-| ERROR_LOG_URLS | text | 错误日志 URL |
-| USER_NAME | text | 用户名 |
-| CURRENT_ABORT_TASK_NUM | int | 当前失败的任务数量 |
-| IS_ABNORMAL_PAUSE | boolean | 是否非用户暂停 |
+| 列名 | 类型 | 说明
| 样例 |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :--- |
+| JOB_ID | text | 作业 ID,由 Doris 自动生成。
| `12025` |
+| JOB_NAME | text | Routine Load 作业名称。
| `example_routine_load` |
+| CREATE_TIME | text | 作业创建时间。
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | 最近一次作业暂停时间。作业未暂停时为 `NULL`。
| `NULL` |
+| END_TIME | text | 作业结束时间。作业未结束时为 `NULL`。
| `NULL` |
+| DB_NAME | text | 作业所属数据库名称。
| `default_cluster:testdb` |
+| TABLE_NAME | text | 作业导入的目标表名称。多表导入时显示为 `multi-table`。
| `test_routineload_tbl` |
+| STATE | text | 作业运行状态,包括
`NEED_SCHEDULE`、`RUNNING`、`PAUSED`、`STOPPED`、`CANCELLED`。
| `RUNNING` |
+| CURRENT_TASK_NUM | text | 当前正在调度或执行的子任务数量。
| `1` |
+| JOB_PROPERTIES | text | 作业属性配置,包含批次大小、并发数、导入格式、列映射、错误容忍度等配置。
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | 数据源属性配置。Kafka 数据源中包含 topic、broker 列表和当前
Kafka 分区等信息。
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | 创建作业时设置的自定义属性。Kafka 数据源中通常包含消费
offset、group id 以及以 `property.` 前缀传入的 Kafka 客户端参数。
|
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | 作业运行统计信息。常用字段包括
`receivedBytes`、`loadedRows`、`errorRows`、`committedTaskNum`、`abortedTaskNum`、`loadRowsRate`、`taskExecuteTimeMs`
等。 |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | 作业运行进度。Kafka 数据源中显示每个分区当前已消费的 offset。
| `{"0":"2"}` |
+| LAG | text | 作业延迟信息。Kafka 数据源中显示每个分区的消费积压量。
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | 作业状态变更原因。正常运行时通常为空;异常暂停或取消时记录具体原因。
|
`The number of failed task exceeded max_error_number` |
+| ERROR_LOG_URLS | text | 错误日志 URL,用于查看被过滤的质量不合格数据。无错误日志时为空。
| `http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | 创建或操作该作业的用户。
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | 当前失败的子任务数量。
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | 是否为非用户手动触发的异常暂停。`true`
表示系统因异常原因暂停作业,`false` 表示没有异常暂停。
| `false` |
+
+## 查询异常作业
+
+如果作业异常暂停、存在失败任务,或处于 `RUNNING` 状态但没有正在执行的子任务,同时 Kafka
仍有消费积压,都可以视为需要进一步排查的异常作业。可以通过以下 SQL 查询这些作业:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+查询到异常作业后,可以切换到对应数据库,通过 `SHOW ROUTINE LOAD` 查看作业详情:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+其中 `LAG` 是 Kafka 各分区的消费积压信息。`LAG REGEXP ':[[:space:]]*[1-9][0-9]*'`
用于匹配至少有一个分区存在大于 0 的积压。
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
index 019b45f01cf..3b85ae2a2c0 100644
---
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -10,31 +10,60 @@
用于查看routine load导入作业的信息
-## 所属数据库
+## 所属数据库和表
-`information_schema`
+`information_schema.routine_load_jobs`
## 表信息
-| 列名 | 类型 | 说明 |
-| :--------------------- | :-------- | :--------------------------------------
|
-| JOB_ID | text | 作业 ID |
-| JOB_NAME | text | 作业名称 |
-| CREATE_TIME | text | 作业创建时间 |
-| PAUSE_TIME | text | 作业暂停时间 |
-| END_TIME | text | 作业结束时间 |
-| DB_NAME | text | 数据库名称 |
-| TABLE_NAME | text | 表名称 |
-| STATE | text | 作业状态 |
-| CURRENT_TASK_NUM | text | 当前子任务数量 |
-| JOB_PROPERTIES | text | 作业属性配置 |
-| DATA_SOURCE_PROPERTIES | text | 数据源属性配置 |
-| CUSTOM_PROPERTIES | text | 自定义属性配置 |
-| STATISTIC | text | 作业统计信息 |
-| PROGRESS | text | 作业进度信息 |
-| LAG | text | 作业延迟信息 |
-| REASON_OF_STATE_CHANGED| text | 作业状态变更原因 |
-| ERROR_LOG_URLS | text | 错误日志 URL |
-| USER_NAME | text | 用户名 |
-| CURRENT_ABORT_TASK_NUM | int | 当前失败的任务数量 |
-| IS_ABNORMAL_PAUSE | boolean | 是否非用户暂停 |
+| 列名 | 类型 | 说明
| 样例 |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :--- |
+| JOB_ID | text | 作业 ID,由 Doris 自动生成。
| `12025` |
+| JOB_NAME | text | Routine Load 作业名称。
| `example_routine_load` |
+| CREATE_TIME | text | 作业创建时间。
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | 最近一次作业暂停时间。作业未暂停时为 `NULL`。
| `NULL` |
+| END_TIME | text | 作业结束时间。作业未结束时为 `NULL`。
| `NULL` |
+| DB_NAME | text | 作业所属数据库名称。
| `default_cluster:testdb` |
+| TABLE_NAME | text | 作业导入的目标表名称。多表导入时显示为 `multi-table`。
| `test_routineload_tbl` |
+| STATE | text | 作业运行状态,包括
`NEED_SCHEDULE`、`RUNNING`、`PAUSED`、`STOPPED`、`CANCELLED`。
| `RUNNING` |
+| CURRENT_TASK_NUM | text | 当前正在调度或执行的子任务数量。
| `1` |
+| JOB_PROPERTIES | text | 作业属性配置,包含批次大小、并发数、导入格式、列映射、错误容忍度等配置。
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | 数据源属性配置。Kafka 数据源中包含 topic、broker 列表和当前
Kafka 分区等信息。
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | 创建作业时设置的自定义属性。Kafka 数据源中通常包含消费
offset、group id 以及以 `property.` 前缀传入的 Kafka 客户端参数。
|
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | 作业运行统计信息。常用字段包括
`receivedBytes`、`loadedRows`、`errorRows`、`committedTaskNum`、`abortedTaskNum`、`loadRowsRate`、`taskExecuteTimeMs`
等。 |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | 作业运行进度。Kafka 数据源中显示每个分区当前已消费的 offset。
| `{"0":"2"}` |
+| LAG | text | 作业延迟信息。Kafka 数据源中显示每个分区的消费积压量。
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | 作业状态变更原因。正常运行时通常为空;异常暂停或取消时记录具体原因。
|
`The number of failed task exceeded max_error_number` |
+| ERROR_LOG_URLS | text | 错误日志 URL,用于查看被过滤的质量不合格数据。无错误日志时为空。
| `http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | 创建或操作该作业的用户。
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | 当前失败的子任务数量。
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | 是否为非用户手动触发的异常暂停。`true`
表示系统因异常原因暂停作业,`false` 表示没有异常暂停。
| `false` |
+
+## 查询异常作业
+
+如果作业异常暂停、存在失败任务,或处于 `RUNNING` 状态但没有正在执行的子任务,同时 Kafka
仍有消费积压,都可以视为需要进一步排查的异常作业。可以通过以下 SQL 查询这些作业:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+查询到异常作业后,可以切换到对应数据库,通过 `SHOW ROUTINE LOAD` 查看作业详情:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+其中 `LAG` 是 Kafka 各分区的消费积压信息。`LAG REGEXP ':[[:space:]]*[1-9][0-9]*'`
用于匹配至少有一个分区存在大于 0 的积压。
diff --git
a/ja-source/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
b/ja-source/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
index 297e00c6e0c..cb94fa52bdf 100644
---
a/ja-source/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/ja-source/docusaurus-plugin-content-docs/current/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -9,31 +9,60 @@
routine load ジョブの情報を表示するために使用されます。
-## データベース
+## データベースとテーブル
-`information_schema`
+`information_schema.routine_load_jobs`
## テーブル情報
-| カラム名 | 型 | 説明 |
-| :-------------------- | :-------- |
:----------------------------------------- |
-| JOB_ID | text | ジョブ ID |
-| JOB_NAME | text | ジョブ名 |
-| CREATE_TIME | text | ジョブ作成時刻 |
-| PAUSE_TIME | text | ジョブ一時停止時刻 |
-| END_TIME | text | ジョブ終了時刻 |
-| DB_NAME | text | データベース名 |
-| TABLE_NAME | text | テーブル名 |
-| STATE | text | ジョブステータス |
-| CURRENT_TASK_NUM | text | 現在のサブタスク数 |
-| JOB_PROPERTIES | text | ジョブプロパティ設定 |
-| DATA_SOURCE_PROPERTIES| text | データソースプロパティ設定 |
-| CUSTOM_PROPERTIES | text | カスタムプロパティ設定 |
-| STATISTIC | text | ジョブ統計情報 |
-| PROGRESS | text | ジョブ進捗情報 |
-| LAG | text | ジョブ遅延情報 |
-| REASON_OF_STATE_CHANGED| text | ジョブステータス変更理由 |
-| ERROR_LOG_URLS | text | エラーログURL |
-| USER_NAME | text | ユーザー名 |
-| CURRENT_ABORT_TASK_NUM| int | 現在の失敗タスク数 |
-| IS_ABNORMAL_PAUSE | boolean | システムによる一時停止かどうか(ユーザー以外による一時停止) |
+| カラム名 | 型 | 説明
| 例 |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :-- |
+| JOB_ID | text | Doris によって自動生成されるジョブ ID。
| `12025` |
+| JOB_NAME | text | Routine Load ジョブ名。
| `example_routine_load` |
+| CREATE_TIME | text | ジョブ作成時刻。
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | 直近のジョブ一時停止時刻。ジョブが一時停止されていない場合は `NULL`。
| `NULL` |
+| END_TIME | text | ジョブ終了時刻。ジョブが終了していない場合は `NULL`。
| `NULL` |
+| DB_NAME | text | ジョブが属するデータベース名。
| `default_cluster:testdb` |
+| TABLE_NAME | text | ジョブのインポート先テーブル名。マルチテーブルインポートの場合は
`multi-table` と表示されます。
| `test_routineload_tbl` |
+| STATE | text |
ジョブ実行状態。`NEED_SCHEDULE`、`RUNNING`、`PAUSED`、`STOPPED`、`CANCELLED` があります。
| `RUNNING` |
+| CURRENT_TASK_NUM | text | 現在スケジュール中または実行中のサブタスク数。
| `1` |
+| JOB_PROPERTIES | text |
ジョブプロパティ設定。バッチサイズ、並行数、インポート形式、カラムマッピング、エラー許容設定などを含みます。
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | データソースプロパティ設定。Kafka の場合、topic、broker
リスト、現在の Kafka パーティションなどを含みます。 |
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | ジョブ作成時に設定されたカスタムプロパティ。Kafka の場合、通常は
offset、group id、および `property.` プレフィックスで渡された Kafka クライアントパラメータを含みます。 |
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | ジョブ実行統計情報。よく使われるフィールドには
`receivedBytes`、`loadedRows`、`errorRows`、`committedTaskNum`、`abortedTaskNum`、`loadRowsRate`、`taskExecuteTimeMs`
などがあります。 |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | ジョブ実行進捗。Kafka の場合、各パーティションの現在の消費済み
offset を表示します。
| `{"0":"2"}` |
+| LAG | text | ジョブ遅延情報。Kafka の場合、各パーティションの消費ラグを表示します。
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text |
ジョブ状態変更の理由。正常実行中は通常空で、異常一時停止またはキャンセル時に具体的な理由が記録されます。
| `The number of failed task exceeded max_error_number` |
+| ERROR_LOG_URLS | text | 品質チェックでフィルタされた不正データを確認するためのエラーログ
URL。エラーログがない場合は空です。 |
`http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | ジョブを作成または操作したユーザー。
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | 現在の失敗サブタスク数。
| `0` |
+| IS_ABNORMAL_PAUSE | boolean |
ユーザーの手動操作ではなく、システムによって異常一時停止されたかどうか。`true` はシステムによる異常一時停止、`false`
は異常一時停止なしを表します。 | `false` |
+
+## 異常ジョブの確認
+
+ジョブが異常一時停止している、失敗タスクが存在する、または `RUNNING` 状態だが実行中のサブタスクがなく Kafka
に消費ラグが残っている場合は、追加調査が必要な異常ジョブとみなせます。以下の SQL でこれらのジョブを確認できます。
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+異常ジョブが見つかったら、対応するデータベースに切り替え、`SHOW ROUTINE LOAD` でジョブ詳細を確認します。
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+`LAG` は Kafka の各パーティションの消費ラグ情報です。`LAG REGEXP ':[[:space:]]*[1-9][0-9]*'`
は、少なくとも 1 つのパーティションに 0 より大きいラグがあるジョブにマッチします。
diff --git
a/ja-source/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
b/ja-source/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
index db1c53ab79d..cb94fa52bdf 100644
---
a/ja-source/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/ja-source/docusaurus-plugin-content-docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -2,38 +2,67 @@
{
"title": "routine_load_job",
"language": "ja",
- "description": "ルーチンロードジョブに関する情報を表示するために使用されます。この機能はバージョン3.0.5で導入されました。"
+ "description": "ルーチンロードジョブに関する情報を表示するために使用されます。"
}
---
## 概要
-routine loadジョブに関する情報を表示するために使用されます。この機能はバージョン3.0.5で導入されました。
+routine load ジョブの情報を表示するために使用されます。
-## データベース
+## データベースとテーブル
-`information_schema`
+`information_schema.routine_load_jobs`
## テーブル情報
-| カラム名 | 型 | 説明 |
-| :-------------------- | :-------- |
:----------------------------------------- |
-| JOB_ID | text | ジョブID |
-| JOB_NAME | text | ジョブ名 |
-| CREATE_TIME | text | ジョブ作成時刻 |
-| PAUSE_TIME | text | ジョブ一時停止時刻 |
-| END_TIME | text | ジョブ終了時刻 |
-| DB_NAME | text | データベース名 |
-| TABLE_NAME | text | テーブル名 |
-| STATE | text | ジョブステータス |
-| CURRENT_TASK_NUM | text | 現在のサブタスク数 |
-| JOB_PROPERTIES | text | ジョブプロパティ設定 |
-| DATA_SOURCE_PROPERTIES| text | データソースプロパティ設定 |
-| CUSTOM_PROPERTIES | text | カスタムプロパティ設定 |
-| STATISTIC | text | ジョブ統計情報 |
-| PROGRESS | text | ジョブ進捗情報 |
-| LAG | text | ジョブ遅延情報 |
-| REASON_OF_STATE_CHANGED| text | ジョブステータス変更理由 |
-| ERROR_LOG_URLS | text | エラーログURL |
-| USER_NAME | text | ユーザー名 |
-| CURRENT_ABORT_TASK_NUM| int | 現在の失敗タスク数 |
-| IS_ABNORMAL_PAUSE | boolean | システムによる一時停止かどうか(ユーザー一時停止以外) |
+| カラム名 | 型 | 説明
| 例 |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :-- |
+| JOB_ID | text | Doris によって自動生成されるジョブ ID。
| `12025` |
+| JOB_NAME | text | Routine Load ジョブ名。
| `example_routine_load` |
+| CREATE_TIME | text | ジョブ作成時刻。
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | 直近のジョブ一時停止時刻。ジョブが一時停止されていない場合は `NULL`。
| `NULL` |
+| END_TIME | text | ジョブ終了時刻。ジョブが終了していない場合は `NULL`。
| `NULL` |
+| DB_NAME | text | ジョブが属するデータベース名。
| `default_cluster:testdb` |
+| TABLE_NAME | text | ジョブのインポート先テーブル名。マルチテーブルインポートの場合は
`multi-table` と表示されます。
| `test_routineload_tbl` |
+| STATE | text |
ジョブ実行状態。`NEED_SCHEDULE`、`RUNNING`、`PAUSED`、`STOPPED`、`CANCELLED` があります。
| `RUNNING` |
+| CURRENT_TASK_NUM | text | 現在スケジュール中または実行中のサブタスク数。
| `1` |
+| JOB_PROPERTIES | text |
ジョブプロパティ設定。バッチサイズ、並行数、インポート形式、カラムマッピング、エラー許容設定などを含みます。
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | データソースプロパティ設定。Kafka の場合、topic、broker
リスト、現在の Kafka パーティションなどを含みます。 |
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | ジョブ作成時に設定されたカスタムプロパティ。Kafka の場合、通常は
offset、group id、および `property.` プレフィックスで渡された Kafka クライアントパラメータを含みます。 |
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | ジョブ実行統計情報。よく使われるフィールドには
`receivedBytes`、`loadedRows`、`errorRows`、`committedTaskNum`、`abortedTaskNum`、`loadRowsRate`、`taskExecuteTimeMs`
などがあります。 |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | ジョブ実行進捗。Kafka の場合、各パーティションの現在の消費済み
offset を表示します。
| `{"0":"2"}` |
+| LAG | text | ジョブ遅延情報。Kafka の場合、各パーティションの消費ラグを表示します。
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text |
ジョブ状態変更の理由。正常実行中は通常空で、異常一時停止またはキャンセル時に具体的な理由が記録されます。
| `The number of failed task exceeded max_error_number` |
+| ERROR_LOG_URLS | text | 品質チェックでフィルタされた不正データを確認するためのエラーログ
URL。エラーログがない場合は空です。 |
`http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | ジョブを作成または操作したユーザー。
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | 現在の失敗サブタスク数。
| `0` |
+| IS_ABNORMAL_PAUSE | boolean |
ユーザーの手動操作ではなく、システムによって異常一時停止されたかどうか。`true` はシステムによる異常一時停止、`false`
は異常一時停止なしを表します。 | `false` |
+
+## 異常ジョブの確認
+
+ジョブが異常一時停止している、失敗タスクが存在する、または `RUNNING` 状態だが実行中のサブタスクがなく Kafka
に消費ラグが残っている場合は、追加調査が必要な異常ジョブとみなせます。以下の SQL でこれらのジョブを確認できます。
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+異常ジョブが見つかったら、対応するデータベースに切り替え、`SHOW ROUTINE LOAD` でジョブ詳細を確認します。
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+`LAG` は Kafka の各パーティションの消費ラグ情報です。`LAG REGEXP ':[[:space:]]*[1-9][0-9]*'`
は、少なくとも 1 つのパーティションに 0 より大きいラグがあるジョブにマッチします。
diff --git
a/ja-source/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
b/ja-source/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
index fce3ed016e2..cb94fa52bdf 100644
---
a/ja-source/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/ja-source/docusaurus-plugin-content-docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -2,38 +2,67 @@
{
"title": "routine_load_job",
"language": "ja",
- "description": "定期ロードジョブに関する情報を表示するために使用されます。"
+ "description": "ルーチンロードジョブに関する情報を表示するために使用されます。"
}
---
## 概要
-routine load ジョブに関する情報を表示するために使用されます。
+routine load ジョブの情報を表示するために使用されます。
-## データベース
+## データベースとテーブル
-`information_schema`
+`information_schema.routine_load_jobs`
## テーブル情報
-| カラム名 | 型 | 説明 |
-| :-------------------- | :-------- |
:----------------------------------------- |
-| JOB_ID | text | ジョブID |
-| JOB_NAME | text | ジョブ名 |
-| CREATE_TIME | text | ジョブ作成時刻 |
-| PAUSE_TIME | text | ジョブ一時停止時刻 |
-| END_TIME | text | ジョブ終了時刻 |
-| DB_NAME | text | データベース名 |
-| TABLE_NAME | text | テーブル名 |
-| STATE | text | ジョブ状態 |
-| CURRENT_TASK_NUM | text | 現在のサブタスク数 |
-| JOB_PROPERTIES | text | ジョブプロパティ設定 |
-| DATA_SOURCE_PROPERTIES| text | データソースプロパティ設定 |
-| CUSTOM_PROPERTIES | text | カスタムプロパティ設定 |
-| STATISTIC | text | ジョブ統計情報 |
-| PROGRESS | text | ジョブ進行状況情報 |
-| LAG | text | ジョブ遅延情報 |
-| REASON_OF_STATE_CHANGED| text | ジョブ状態変更の理由 |
-| ERROR_LOG_URLS | text | エラーログURL |
-| USER_NAME | text | ユーザー名 |
-| CURRENT_ABORT_TASK_NUM| int | 現在の失敗タスク数 |
-| IS_ABNORMAL_PAUSE | boolean | システムによる一時停止かどうか(ユーザー以外による一時停止) |
+| カラム名 | 型 | 説明
| 例 |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :-- |
+| JOB_ID | text | Doris によって自動生成されるジョブ ID。
| `12025` |
+| JOB_NAME | text | Routine Load ジョブ名。
| `example_routine_load` |
+| CREATE_TIME | text | ジョブ作成時刻。
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | 直近のジョブ一時停止時刻。ジョブが一時停止されていない場合は `NULL`。
| `NULL` |
+| END_TIME | text | ジョブ終了時刻。ジョブが終了していない場合は `NULL`。
| `NULL` |
+| DB_NAME | text | ジョブが属するデータベース名。
| `default_cluster:testdb` |
+| TABLE_NAME | text | ジョブのインポート先テーブル名。マルチテーブルインポートの場合は
`multi-table` と表示されます。
| `test_routineload_tbl` |
+| STATE | text |
ジョブ実行状態。`NEED_SCHEDULE`、`RUNNING`、`PAUSED`、`STOPPED`、`CANCELLED` があります。
| `RUNNING` |
+| CURRENT_TASK_NUM | text | 現在スケジュール中または実行中のサブタスク数。
| `1` |
+| JOB_PROPERTIES | text |
ジョブプロパティ設定。バッチサイズ、並行数、インポート形式、カラムマッピング、エラー許容設定などを含みます。
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | データソースプロパティ設定。Kafka の場合、topic、broker
リスト、現在の Kafka パーティションなどを含みます。 |
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | ジョブ作成時に設定されたカスタムプロパティ。Kafka の場合、通常は
offset、group id、および `property.` プレフィックスで渡された Kafka クライアントパラメータを含みます。 |
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | ジョブ実行統計情報。よく使われるフィールドには
`receivedBytes`、`loadedRows`、`errorRows`、`committedTaskNum`、`abortedTaskNum`、`loadRowsRate`、`taskExecuteTimeMs`
などがあります。 |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | ジョブ実行進捗。Kafka の場合、各パーティションの現在の消費済み
offset を表示します。
| `{"0":"2"}` |
+| LAG | text | ジョブ遅延情報。Kafka の場合、各パーティションの消費ラグを表示します。
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text |
ジョブ状態変更の理由。正常実行中は通常空で、異常一時停止またはキャンセル時に具体的な理由が記録されます。
| `The number of failed task exceeded max_error_number` |
+| ERROR_LOG_URLS | text | 品質チェックでフィルタされた不正データを確認するためのエラーログ
URL。エラーログがない場合は空です。 |
`http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | ジョブを作成または操作したユーザー。
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | 現在の失敗サブタスク数。
| `0` |
+| IS_ABNORMAL_PAUSE | boolean |
ユーザーの手動操作ではなく、システムによって異常一時停止されたかどうか。`true` はシステムによる異常一時停止、`false`
は異常一時停止なしを表します。 | `false` |
+
+## 異常ジョブの確認
+
+ジョブが異常一時停止している、失敗タスクが存在する、または `RUNNING` 状態だが実行中のサブタスクがなく Kafka
に消費ラグが残っている場合は、追加調査が必要な異常ジョブとみなせます。以下の SQL でこれらのジョブを確認できます。
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+異常ジョブが見つかったら、対応するデータベースに切り替え、`SHOW ROUTINE LOAD` でジョブ詳細を確認します。
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+`LAG` は Kafka の各パーティションの消費ラグ情報です。`LAG REGEXP ':[[:space:]]*[1-9][0-9]*'`
は、少なくとも 1 つのパーティションに 0 より大きいラグがあるジョブにマッチします。
diff --git
a/versioned_docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
b/versioned_docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
index bf412162cb7..8acc853f73b 100644
---
a/versioned_docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/versioned_docs/version-3.x/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -2,39 +2,68 @@
{
"title": "routine_load_job",
"language": "en",
- "description": "Used to view information about routine load jobs. This
feature was introduced in version 3.0.5."
+ "description": "Used to view information about routine load jobs."
}
---
## Overview
-Used to view information about routine load jobs. This feature was introduced
in version 3.0.5.
+Used to view information about routine load jobs.
-## Database
+## Database and Table
-`information_schema`
+`information_schema.routine_load_jobs`
## Table Information
-| Column Name | Type | Description
|
-| :-------------------- | :-------- |
:----------------------------------------- |
-| JOB_ID | text | Job ID
|
-| JOB_NAME | text | Job name
|
-| CREATE_TIME | text | Job creation time
|
-| PAUSE_TIME | text | Job pause time
|
-| END_TIME | text | Job end time
|
-| DB_NAME | text | Database name
|
-| TABLE_NAME | text | Table name
|
-| STATE | text | Job status
|
-| CURRENT_TASK_NUM | text | Current number of subtasks
|
-| JOB_PROPERTIES | text | Job property configurations
|
-| DATA_SOURCE_PROPERTIES| text | Data source property configurations
|
-| CUSTOM_PROPERTIES | text | Custom property configurations
|
-| STATISTIC | text | Job statistics information
|
-| PROGRESS | text | Job progress information
|
-| LAG | text | Job delay information
|
-| REASON_OF_STATE_CHANGED| text | Reason for job status change
|
-| ERROR_LOG_URLS | text | Error log URLs
|
-| USER_NAME | text | Username
|
-| CURRENT_ABORT_TASK_NUM| int | Current number of failed tasks
|
-| IS_ABNORMAL_PAUSE | boolean | Whether paused by system (non-user pause)
|
\ No newline at end of file
+| Column Name | Type | Description
| Example |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :------ |
+| JOB_ID | text | Job ID generated by Doris.
| `12025` |
+| JOB_NAME | text | Routine Load job name.
| `example_routine_load` |
+| CREATE_TIME | text | Job creation time.
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | Most recent job pause time. It is
`NULL` if the job has not been paused.
| `NULL` |
+| END_TIME | text | Job end time. It is `NULL` if the job
has not ended.
| `NULL` |
+| DB_NAME | text | Database name of the job.
| `default_cluster:testdb` |
+| TABLE_NAME | text | Target table name of the job. For
multi-table import jobs, this value is `multi-table`.
| `test_routineload_tbl` |
+| STATE | text | Job running status, including
`NEED_SCHEDULE`, `RUNNING`, `PAUSED`, `STOPPED`, and `CANCELLED`.
| `RUNNING` |
+| CURRENT_TASK_NUM | text | Number of subtasks currently being
scheduled or executed.
| `1` |
+| JOB_PROPERTIES | text | Job property configurations, including
batch size, concurrency, import format, column mapping, and error tolerance.
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | Data source property configurations.
For Kafka, this includes topic, broker list, and current Kafka partitions.
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | Custom properties configured when
creating the job. For Kafka, this usually includes offsets, group id, and Kafka
client parameters passed with `property.` prefixes. |
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | Job runtime statistics. Common fields
include `receivedBytes`, `loadedRows`, `errorRows`, `committedTaskNum`,
`abortedTaskNum`, `loadRowsRate`, and `taskExecuteTimeMs`. |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | Job running progress. For Kafka, it
shows the consumed offset of each partition.
| `{"0":"2"}` |
+| LAG | text | Job lag information. For Kafka, it
shows the consumption lag of each partition.
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | Reason for the job state change. It is
usually empty for normally running jobs, and records the specific reason when
the job is paused or cancelled abnormally. | `The number of failed task
exceeded max_error_number` |
+| ERROR_LOG_URLS | text | Error log URLs for viewing filtered
data that failed quality checks. It is empty if there is no error log.
|
`http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | User who created or operated the job.
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | Current number of failed subtasks.
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | Whether the job was paused abnormally
by the system instead of manually by a user. `true` indicates an abnormal
system pause, and `false` indicates no abnormal pause. | `false` |
+
+## Query Abnormal Jobs
+
+If a job is abnormally paused, has failed tasks, or is in the `RUNNING` state
but has no running subtasks while Kafka still has consumption lag, it should be
investigated. You can use the following SQL to query these jobs:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+After finding an abnormal job, switch to the corresponding database and use
`SHOW ROUTINE LOAD` to view the job details:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+`LAG` is the Kafka consumption lag information for each partition. `LAG REGEXP
':[[:space:]]*[1-9][0-9]*'` matches jobs where at least one partition has a lag
greater than 0.
diff --git
a/versioned_docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
b/versioned_docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
index f35b0ee1955..8acc853f73b 100644
---
a/versioned_docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
+++
b/versioned_docs/version-4.x/admin-manual/system-tables/information_schema/routine_load_job.md
@@ -10,31 +10,60 @@
Used to view information about routine load jobs.
-## Database
+## Database and Table
-`information_schema`
+`information_schema.routine_load_jobs`
## Table Information
-| Column Name | Type | Description
|
-| :-------------------- | :-------- |
:----------------------------------------- |
-| JOB_ID | text | Job ID
|
-| JOB_NAME | text | Job name
|
-| CREATE_TIME | text | Job creation time
|
-| PAUSE_TIME | text | Job pause time
|
-| END_TIME | text | Job end time
|
-| DB_NAME | text | Database name
|
-| TABLE_NAME | text | Table name
|
-| STATE | text | Job status
|
-| CURRENT_TASK_NUM | text | Current number of subtasks
|
-| JOB_PROPERTIES | text | Job property configurations
|
-| DATA_SOURCE_PROPERTIES| text | Data source property configurations
|
-| CUSTOM_PROPERTIES | text | Custom property configurations
|
-| STATISTIC | text | Job statistics information
|
-| PROGRESS | text | Job progress information
|
-| LAG | text | Job delay information
|
-| REASON_OF_STATE_CHANGED| text | Reason for job status change
|
-| ERROR_LOG_URLS | text | Error log URLs
|
-| USER_NAME | text | Username
|
-| CURRENT_ABORT_TASK_NUM| int | Current number of failed tasks
|
-| IS_ABNORMAL_PAUSE | boolean | Whether paused by system (non-user pause)
|
\ No newline at end of file
+| Column Name | Type | Description
| Example |
+| :----------------------- | :------ |
:------------------------------------------------------------------------------------------------------------------------------------------------------
| :------ |
+| JOB_ID | text | Job ID generated by Doris.
| `12025` |
+| JOB_NAME | text | Routine Load job name.
| `example_routine_load` |
+| CREATE_TIME | text | Job creation time.
| `2024-01-15 08:12:42` |
+| PAUSE_TIME | text | Most recent job pause time. It is
`NULL` if the job has not been paused.
| `NULL` |
+| END_TIME | text | Job end time. It is `NULL` if the job
has not ended.
| `NULL` |
+| DB_NAME | text | Database name of the job.
| `default_cluster:testdb` |
+| TABLE_NAME | text | Target table name of the job. For
multi-table import jobs, this value is `multi-table`.
| `test_routineload_tbl` |
+| STATE | text | Job running status, including
`NEED_SCHEDULE`, `RUNNING`, `PAUSED`, `STOPPED`, and `CANCELLED`.
| `RUNNING` |
+| CURRENT_TASK_NUM | text | Number of subtasks currently being
scheduled or executed.
| `1` |
+| JOB_PROPERTIES | text | Job property configurations, including
batch size, concurrency, import format, column mapping, and error tolerance.
|
`{"max_batch_rows":"200000","format":"csv","columnToColumnExpr":"user_id,name,age","max_filter_ratio":"1.0"}`
|
+| DATA_SOURCE_PROPERTIES | text | Data source property configurations.
For Kafka, this includes topic, broker list, and current Kafka partitions.
|
`{"topic":"test-topic","currentKafkaPartitions":"0","brokerList":"192.168.88.62:9092"}`
|
+| CUSTOM_PROPERTIES | text | Custom properties configured when
creating the job. For Kafka, this usually includes offsets, group id, and Kafka
client parameters passed with `property.` prefixes. |
`{"kafka_default_offsets":"OFFSET_BEGINNING","group.id":"example_routine_load_73daf600-884e-46c0-a02b-4e49fdf3b4dc"}`
|
+| STATISTIC | text | Job runtime statistics. Common fields
include `receivedBytes`, `loadedRows`, `errorRows`, `committedTaskNum`,
`abortedTaskNum`, `loadRowsRate`, and `taskExecuteTimeMs`. |
`{"receivedBytes":28,"runningTxns":[],"errorRows":0,"committedTaskNum":3,"loadedRows":3,"loadRowsRate":0,"abortedTaskNum":0,"errorRowsAfterResumed":0,"totalRows":3,"unselectedRows":0,"receivedBytesRate":0,"taskExecuteTimeMs":30069}`
|
+| PROGRESS | text | Job running progress. For Kafka, it
shows the consumed offset of each partition.
| `{"0":"2"}` |
+| LAG | text | Job lag information. For Kafka, it
shows the consumption lag of each partition.
| `{"0":0}` |
+| REASON_OF_STATE_CHANGED | text | Reason for the job state change. It is
usually empty for normally running jobs, and records the specific reason when
the job is paused or cancelled abnormally. | `The number of failed task
exceeded max_error_number` |
+| ERROR_LOG_URLS | text | Error log URLs for viewing filtered
data that failed quality checks. It is empty if there is no error log.
|
`http://fe_host:8030/api/_load_error_log?file=error.log` |
+| USER_NAME | text | User who created or operated the job.
| `root` |
+| CURRENT_ABORT_TASK_NUM | int | Current number of failed subtasks.
| `0` |
+| IS_ABNORMAL_PAUSE | boolean | Whether the job was paused abnormally
by the system instead of manually by a user. `true` indicates an abnormal
system pause, and `false` indicates no abnormal pause. | `false` |
+
+## Query Abnormal Jobs
+
+If a job is abnormally paused, has failed tasks, or is in the `RUNNING` state
but has no running subtasks while Kafka still has consumption lag, it should be
investigated. You can use the following SQL to query these jobs:
+
+```sql
+SELECT DB_NAME, JOB_NAME
+FROM information_schema.routine_load_jobs
+WHERE IS_ABNORMAL_PAUSE = TRUE
+ OR (
+ STATE = 'RUNNING'
+ AND (
+ CURRENT_ABORT_TASK_NUM > 0
+ OR (
+ CAST(CURRENT_TASK_NUM AS INT) = 0
+ AND `LAG` REGEXP ':[[:space:]]*[1-9][0-9]*'
+ )
+ )
+ );
+```
+
+After finding an abnormal job, switch to the corresponding database and use
`SHOW ROUTINE LOAD` to view the job details:
+
+```sql
+USE `db_name`;
+SHOW ROUTINE LOAD FOR `job_name`;
+```
+
+`LAG` is the Kafka consumption lag information for each partition. `LAG REGEXP
':[[:space:]]*[1-9][0-9]*'` matches jobs where at least one partition has a lag
greater than 0.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]