Re: [PERFORM] GiST index performance

Yeb Havinga Mon, 22 Mar 2010 07:03:08 -0700

Matthew Wakeling wrote:

On Sat, 20 Mar 2010, Yeb Havinga wrote:
The gist virtual pages would then match more the original block sizes that were used in Guttman's R-tree paper (first google result, then figure 4.5). Since the nature/characteristics of the underlying datatypes and keys is not changed, it might be that with the disk pages getting larger, gist indexing has therefore become unexpectedly inefficient.
Yes, that is certainly a factor. For example, the page size for bioseg which we use here is 130 entries, which is very excessive, and doesn't allow very deep trees. On the other hand, it means that a single disc seek performs quite a lot of work.

Yeah, I only did in-memory fitting tests and wondered about increased I/O. However, I bet that even for bigger-than-RAM databases, the benefit of having to fan out to fewer pages still outweighs the over-general non-leaf nodes and might still result in less disk I/O. I redid some earlier benchmarking with other datatypes with a 1kB block size and also multicolumn gist, and the multicolumn variant had an even greater benefit than the single column indexes, for both equality and range scans (execution times down to about 20% of the original). If gist is important to you, I really recommend doing a test with 1kB blocks.
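
To put rough numbers on the fan-out versus depth trade-off discussed above, here is a back-of-the-envelope sketch. It is only an illustration under loud assumptions: pages are completely full, the entries per page scale linearly with block size (so 130 entries at the default 8kB becomes about 16 at 1kB), and the 10 million row count is made up. The function name tree_height is hypothetical and not part of PostgreSQL.

```python
def tree_height(n_entries: int, fanout: int) -> int:
    """Minimum number of levels so that fanout**levels >= n_entries.

    Assumes every page is completely full, which real GiST pages are not,
    so this is a lower bound on the depth, not a prediction.
    """
    if n_entries < 1 or fanout < 2:
        raise ValueError("need n_entries >= 1 and fanout >= 2")
    height = 1
    capacity = fanout
    while capacity < n_entries:
        capacity *= fanout
        height += 1
    return height


# Figure from the thread: about 130 bioseg entries per page (default 8kB block).
FANOUT_8KB = 130
# Assumption: entries per page scale linearly with block size, 130 / 8 ~= 16.
FANOUT_1KB = FANOUT_8KB // 8

# Illustrative row count, not taken from the thread.
N_ROWS = 10_000_000

for label, fanout in (("8kB", FANOUT_8KB), ("1kB", FANOUT_1KB)):
    print(f"{label} blocks: fanout {fanout}, minimum height {tree_height(N_ROWS, fanout)}")
```

Under these assumptions the smaller block gives a deeper tree with much narrower nodes. Whether that helps depends on how much the bounding keys in the non-leaf nodes overlap, which this sketch does not model, so it is no substitute for the 1kB benchmark.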


regards,
Yeb Havinga

--
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance
