[ https://issues.apache.org/jira/browse/FLINK-18235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Flink Jira Bot updated FLINK-18235: ----------------------------------- Labels: stale-major (was: ) > Improve the checkpoint strategy for Python UDF execution > -------------------------------------------------------- > > Key: FLINK-18235 > URL: https://issues.apache.org/jira/browse/FLINK-18235 > Project: Flink > Issue Type: Improvement > Components: API / Python > Reporter: Dian Fu > Priority: Major > Labels: stale-major > Fix For: 1.13.0 > > > Currently, when a checkpoint is triggered for the Python operator, all the > data buffered will be flushed to the Python worker to be processed. This will > increase the overall checkpoint time in case there are a lot of elements > buffered and Python UDF is slow. We should improve the checkpoint strategy to > improve this, e.g. buffering the data into state instead of flushing them > out. We can also let users to config the checkpoint strategy if needed. -- This message was sent by Atlassian Jira (v8.3.4#803005)