[HACKERS] Dead Space Map

Heikki Linnakangas Mon, 27 Feb 2006 09:53:54 -0800

Hi,

The idea of using a so-called dead space map to speed up vacuum has come up multiple times on this list in the last couple of years. I wrote an initial implementation of it to measure the performance impact it has on updates and on vacuum.


Potential uses for a dead space map are:

* speed up vacuum when there are few dead tuples

Vacuum will need to be modified to use index lookups to find index tuples corresponding to the dead heap tuples. Otherwise you have to scan through all the indexes anyway.


* vacuuming pages one by one as they're written by bgwriter

I'm not sure how much difference this would make, but it would be an interesting experiment. In theory, you could save a lot of total I/O, because you would not need to come back to vacuum the pages later, but you would have to read in any index pages pointing to the dead heap tuples inside bgwriter.


* implementation of index-only scans

An index scan would not have to check the visibility information of heap tuples on those heap pages that are marked as clean in the dead space map. This requires that the dead space map is implemented so that a page is reliably marked as dirty in all circumstances when it contains any tuples that are not visible to all backends.

The obvious drawback is that heap updates need to update the dead space map too.

My current implementation stores a bitmap of 32k bits in the special space of every 32k-th heap page. Each bit in the bitmap corresponds to one heap page. The bit is set every time a tuple is updated, and it's cleared by vacuum. This is a very simple approach, and doesn't take much space.
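
To make the bitmap layout concrete, here is a minimal in-memory sketch of the bookkeeping described above. It is not the actual patch: the names (DsmChunk, dsm_mark_dirty, dsm_next_dirty and so on) are hypothetical, the chunks live in a plain array rather than in a page's special space, and locking and WAL logging are ignored entirely.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Assumption: one bitmap chunk tracks 32k (32768) heap pages, one bit each. */
#define DSM_PAGES_PER_CHUNK 32768u
#define DSM_BYTES_PER_CHUNK (DSM_PAGES_PER_CHUNK / 8)
#define DSM_NO_PAGE UINT32_MAX

/* Hypothetical stand-in for the bitmap kept in a heap page's special space. */
typedef struct DsmChunk
{
    uint8_t bits[DSM_BYTES_PER_CHUNK];
} DsmChunk;

/* Which chunk covers heap page blkno, and which bit inside that chunk. */
static uint32_t dsm_chunk_index(uint32_t blkno) { return blkno / DSM_PAGES_PER_CHUNK; }
static uint32_t dsm_bit_index(uint32_t blkno) { return blkno % DSM_PAGES_PER_CHUNK; }

/* Called whenever a tuple on heap page blkno is updated. */
static void dsm_mark_dirty(DsmChunk *chunks, uint32_t nchunks, uint32_t blkno)
{
    uint32_t bit = dsm_bit_index(blkno);

    assert(dsm_chunk_index(blkno) < nchunks);
    chunks[dsm_chunk_index(blkno)].bits[bit / 8] |= (uint8_t) (1u << (bit % 8));
}

/* Called by vacuum once heap page blkno has been cleaned. */
static void dsm_mark_clean(DsmChunk *chunks, uint32_t nchunks, uint32_t blkno)
{
    uint32_t bit = dsm_bit_index(blkno);

    assert(dsm_chunk_index(blkno) < nchunks);
    chunks[dsm_chunk_index(blkno)].bits[bit / 8] &= (uint8_t) ~(1u << (bit % 8));
}

/* True if heap page blkno may contain dead tuples. */
static bool dsm_is_dirty(const DsmChunk *chunks, uint32_t nchunks, uint32_t blkno)
{
    uint32_t bit = dsm_bit_index(blkno);

    assert(dsm_chunk_index(blkno) < nchunks);
    return (chunks[dsm_chunk_index(blkno)].bits[bit / 8] >> (bit % 8)) & 1u;
}

/* First dirty heap page at or after start, or DSM_NO_PAGE if there is none. */
static uint32_t dsm_next_dirty(const DsmChunk *chunks, uint32_t nchunks, uint32_t start)
{
    uint32_t npages = nchunks * DSM_PAGES_PER_CHUNK;

    for (uint32_t blkno = start; blkno < npages; blkno++)
        if (dsm_is_dirty(chunks, nchunks, blkno))
            return blkno;
    return DSM_NO_PAGE;
}
```

In this sketch a vacuum pass would call dsm_next_dirty() repeatedly to visit only the flagged pages and dsm_mark_clean() after each one, while an index scan could consult dsm_is_dirty() before deciding whether it needs to check heap visibility. One bit per page means each chunk costs 4096 bytes per 32768 heap pages.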


Is there something I'm missing? Any ideas?

I'm going to have some spare time to hack PostgreSQL in the coming months, and I'm thinking of refining this if there's interest. Is anyone else working on this?


- Heikki

