[
https://issues.apache.org/jira/browse/FLINK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17354767#comment-17354767
]
Yun Tang commented on FLINK-22806:
----------------------------------
I noticed that you have already opened related issue in Chinese user mailing
list, and replied in that thread.
I planed to close this ticket first as Apache JIRA is not somewhere to ask
questions. If some internal bugs really exist, we could then create related
ticket.
> The folder /checkpoint/shared state of FLINKSQL is getting bigger and bigger
> -----------------------------------------------------------------------------
>
> Key: FLINK-22806
> URL: https://issues.apache.org/jira/browse/FLINK-22806
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Checkpointing, Runtime / State Backends
> Affects Versions: 1.12.0
> Reporter: YUJIANBO
> Priority: Major
>
> I have added the parameter :
> 1、tableEnv.getConfig().setIdleStateRetention: 3600 (one hour),
> 2、state.checkpoints.num-retained:3
> 3、sql:
> {code:sql}
> // demo:
> select count(1), LISTAGG(concat(m,n)) from tabeA group by a, b,
> time_minute
> //details:
> CREATE TABLE user_behavior (
> `request_ip` STRING,
> `request_time` BIGINT,
> `header` STRING ,
> // timestamp converted to specific per minute
> `t_min` as cast(`request_time`-(`request_time` + 28800000)%60000 as
> BIGINT),
> `ts` as TO_TIMESTAMP(FROM_UNIXTIME(`request_time`/1000-28800,'yyyy-MM-dd
> HH:mm:ss')),
> WATERMARK FOR `ts` AS `ts` - INTERVAL '60' MINUTE)
> with (
> 'connector' = 'kafka',
> ........
> );
> CREATE TABLE blackhole_table (
> `cnt` BIGINT,
> `lists` STRING
> ) WITH (
> 'connector' = 'blackhole'
> );
> insert into blackhole_table
> select
> count(*) as cnt,
> LISTAGG(concat(`request_ip`, `header`, cast(`request_time` as STRING)))
> as lists
> from user_behavior
> group by `request_ip`,`header`,`t_min`;
> {code}
> 4、state.backend: rocksdb
> 5、state.backend.incremental is true
> I set the checkpoint state for one hour, but the size of the folder directory
> /checkpoint/shared is still growing. I observed it for two days and guessed
> that there was expired data in the /checkpoint/shared folder that had not
> been cleared?
> What else can limit the growth of state?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)