On Thu, 6 Jul 2023, Dimitrios Apostolou wrote:
+ Then I broke the command above in many small chunks
WITH rows AS (
DELETE FROM tbl_legacy AS t
WHERE (partition_key_column < $1)
RETURNING t.*
) INSERT INTO tbl SELECT * FROM rows;
COMMIT;
I increase the parameter $1 and keep going in a loop. At first this
goes OK, after one day though I notice that it has slowed down
significantly. My investigation shows that the simple query
SELECT min(partition_key_column) from tbl_legacy;
takes 38s, despite having an index on the column! A VACUUM fixes that,
so I guess the index has a ton of dead tuples. I guess autovacuum does
not do its job because the table is constantly busy.
Unfortunately VACUUM takes long (several hours) on this huge table, so I
can't add in the loop after the DELETE command.
Is there a better way to avoid the bloat in the loop above? Why can't
the DELETE command update the index by pointing the minimum element
beyond the dead tuples?
Any comments on this one? It annoys me that a simple loop deteriorated so
much and kept filling the table with bloat. What is that VACUUM does that
DELETE can't do, to keep the index fresh?