[ https://issues.apache.org/jira/browse/FLINK-16053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dian Fu updated FLINK-16053:
Description: We currently record metrics on how many elements have been
processed in the Python UDF. This information is unnecessary because the Java
operator already records it. In a simple test, removing these metrics improved
performance by about 5% - 10%. Moreover, since these metrics are not yet
exposed, they are safe to remove. (was: We have recorded the metrics about how many elements it has
processed in Python UDF. This kind of information is not necessary as there is
also this kind of information in the Java operator. I have performed a simple
test and find that removing it could improve the performance about 5% - 10%.)
> Remove redundant metrics in PyFlink
> ---
>
> Key: FLINK-16053
> URL: https://issues.apache.org/jira/browse/FLINK-16053
> Project: Flink
> Issue Type: Improvement
> Components: API / Python
> Reporter: Dian Fu
> Assignee: Dian Fu
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.11.0
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> We currently record metrics on how many elements have been processed in the
> Python UDF. This information is unnecessary because the Java operator already
> records it. In a simple test, removing these metrics improved performance by
> about 5% - 10%. Moreover, since these metrics are not yet exposed, they are
> safe to remove.