shared state of FLINKSQL is getting bigger and bigger

Yun Tang (Jira) Mon, 31 May 2021 20:13:08 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17354767#comment-17354767
 ]


Yun Tang commented on FLINK-22806:
----------------------------------

I noticed that you have already opened related issue in Chinese user mailing 
list, and replied in that thread.

I planed to close this ticket first as Apache JIRA is not somewhere to ask 
questions. If some internal bugs really exist, we could then create related 
ticket.

> The folder /checkpoint/shared state of  FLINKSQL is getting bigger and bigger
> -----------------------------------------------------------------------------
>
>                 Key: FLINK-22806
>                 URL: https://issues.apache.org/jira/browse/FLINK-22806
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Checkpointing, Runtime / State Backends
>    Affects Versions: 1.12.0
>            Reporter: YUJIANBO
>            Priority: Major
>
> I have added the parameter ：
>    1、tableEnv.getConfig().setIdleStateRetention:   3600 (one hour),  
>    2、state.checkpoints.num-retained:3
>    3、sql：
> {code:sql}
> // demo:  
>     select count(1),  LISTAGG(concat(m,n)) from tabeA group by a, b, 
> time_minute
> //details:
> CREATE TABLE user_behavior (
>    `request_ip` STRING,
>    `request_time` BIGINT,
>    `header` STRING ,
>    // timestamp  converted to specific per minute   
>    `t_min` as cast(`request_time`-(`request_time` + 28800000)%60000 as 
> BIGINT),
>    `ts` as TO_TIMESTAMP(FROM_UNIXTIME(`request_time`/1000-28800,'yyyy-MM-dd 
> HH:mm:ss')),
>    WATERMARK FOR `ts` AS `ts` - INTERVAL '60' MINUTE) 
> with (
>    'connector' = 'kafka',
>    ........ 
> );
> CREATE TABLE blackhole_table (
>    `cnt` BIGINT,
>    `lists` STRING
> ) WITH (
>  'connector' = 'blackhole'
> );
> insert into blackhole_table 
> select 
>     count(*) as cnt, 
>     LISTAGG(concat(`request_ip`, `header`, cast(`request_time` as STRING))) 
> as lists
> from user_behavior 
> group by `request_ip`,`header`,`t_min`;
> {code}
>    4、state.backend： rocksdb  
>    5、state.backend.incremental is true
> I set the checkpoint state for one hour, but the size of the folder directory 
> /checkpoint/shared is still growing.  I observed it for two days and guessed 
> that there was expired data in the  /checkpoint/shared folder that had not 
> been cleared?
> What else can limit the growth of state？



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (FLINK-22806) The folder /checkpoint/shared state of FLINKSQL is getting bigger and bigger

Reply via email to