Re: Vacuum statistics

Alena Rybakina Thu, 12 Mar 2026 10:43:05 -0700

Hi)
Thank you for your valuable feedback!

On 12.03.2026 18:28, Andrei Lepikhov wrote:

On 12/3/26 13:02, Andrei Lepikhov wrote:
On 9/3/26 16:46, Alena Rybakina wrote:
I discovered that my last patches were incorrectly formed. I updated the correct version.
I see that v29-0001-* is a quite separate feature itself at the moment. It makes sense to remove the commit message phrase for vm_new_frozen_pages and vm_new_visible_pages, introduced in later patches.
This patch itself looks good to me.
Since this patch is almost ready for commit, I reviewed it carefully. I noticed a documentation entry was missing, so I added it. Please see the attachment.

I had already added it to the documentation of the extension, as you noticed before, but I agree with your suggestion to move it into the core patch.

While updating the patch file, I also made a few small adjustments, including changing the parameter order in the struct and VIEW. The commit message is also fixed.

Thank you) I agree with your fixes)

In addition, it makes sense to discuss how these parameters are supposed to be used. I see the following use cases:
1. Which tables have the most VM churn? - monitoring rev_all_visible_pages normalised on the table size and its average tuple width might expose the most suspicious tables (in terms of table statistics).
2. DML Skew. Dividing rev_all_visible_pages by the number of tuple updates/deletes, normalised by the average table and tuple sizes, might indicate whether changes are localised within the table.
3. IndexOnlyScan effectiveness. Considering the speed of rev_all_visible_pages change, normalised to the value of the relallvisible statistic, we may detect tables where Index-Only Scan might be inefficiently used.
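
The three ratios above can be sketched roughly as follows. This is only an illustration under assumptions, not part of the patch: the TableStats class and the function names are hypothetical, the input values are assumed to come from pg_class (relpages, relallvisible), pg_stat_all_tables (n_tup_upd, n_tup_del) and the proposed rev_all_visible_pages counter, and the normalisation by average tuple width is left out for brevity.

```python
from dataclasses import dataclass


@dataclass
class TableStats:
    """Hypothetical per-table snapshot; field names mirror the catalog columns."""
    name: str
    relpages: int               # pg_class.relpages
    relallvisible: int          # pg_class.relallvisible
    rev_all_visible_pages: int  # proposed counter: all-visible bits cleared
    n_tup_upd: int              # pg_stat_all_tables.n_tup_upd
    n_tup_del: int              # pg_stat_all_tables.n_tup_del


def vm_churn(t: TableStats) -> float:
    """Use case 1: cleared all-visible bits per table page."""
    return t.rev_all_visible_pages / max(t.relpages, 1)


def dml_skew(t: TableStats) -> float:
    """Use case 2: cleared all-visible bits per updated or deleted tuple.

    A low value suggests many changes land on few pages (localised DML).
    """
    dml = t.n_tup_upd + t.n_tup_del
    return t.rev_all_visible_pages / dml if dml else 0.0


def ios_pressure(t: TableStats) -> float:
    """Use case 3: cleared all-visible bits relative to relallvisible."""
    return t.rev_all_visible_pages / max(t.relallvisible, 1)
```

To approximate the "speed of change" in use case 3, the same functions could be applied to the difference between two snapshots taken some time apart rather than to cumulative counters.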


Now it can also be useful to track which tables have their pages frozen by vacuum most often.

By analyzing the ratio of frozen to unfrozen pages, you can see how well the balance is maintained. Ideally, this ratio should approach 1. If we have a higher ratio of unfrozen to frozen pages, it means the backend is frequently accessing the table, which could indicate that this table potentially requires attention to how well it's being handled by the vacuum. There may be unpredictability or even a seasonal trend: a page is frequently accessed only during certain periods (this is purely my observation). Also, if the ratio of frozen pages is higher, the vacuum may be configured too aggressively.

With the parameters that were included before (pg_class_relallfrozen and relallvisible, https://github.com/MasaoFujii/postgresql/commit/99f8f3fbbc8f743290844e8c676d39dad11c5d5d) in the pg_stat_tables, I think I can provide an isolation test to prove it: I can use my isolation test vacuum-extending-in-repetable-read.spec, which I have added in the extension (ext_vacuum_statistics). What do you think?

Feel free to criticise it or add your own - I’m just a developer, not a DBA. Also, I’m not sure what use cases there are for the rev_all_frozen_pages parameter.

Also, I would like to ask you, if you don't mind, to review the code in the extension that I have provided to store and control vacuum statistics. Unfortunately, no one has looked at it yet, and any feedback is valuable now.

In addition, I'm currently working on a parameter that lets us track only some parts of the statistics. For example, we could track only buffer or WAL statistics. If you are interested, I'll send you the code on my GitHub. However, I have already noticed that it requires adding dynamic memory allocation based on the GUC value. I know that this requires a lot of attention during development, but it will help to save memory when storing statistics. What do you think about this idea? To be honest, it was suggested earlier in this thread and I'm trying to implement it.
