Hello,

Currently we have polaris.iceberg-metrics.reporting and the ability to persists 
those metrics to the backend. By default, this can be enabled by change log 
level for org.apache.polaris.service.reporting to INFO for log based metrics 
and polaris.iceberg-metrics.reporting.type to persistent if we want it to be 
persisted on the backend. Currently this setting is all or nothing. This means, 
with the settings enabled, all tables' metrics will be report/persist. Should 
we introduce a filter (include/exclusion type settings) which people can fine 
tune on what to include/exclude (and default to include all)?

There are couple use cases such as:
1. exclude noise tables
2. enable metrics for a given namespace during troubleshooting without enable 
all (e.g. only certain tables are user facing and we would want to close 
monitor the performance metrics on them while other tables may be batched and 
latency is not that sensitive compared to those)
3. avoid potential storage growth as there is lack of cleanup job atm (raised 
in a different ML) and avoid extra I/O to the backend RDS if metrics for 
majority of the tables are not necessary

Thanks,
Yong Zheng 

Reply via email to