Hi,

I am in need of a very robust tree-structure storage (especially fast to read, non-blocking on update); 95% of the trees have depth < 4, and the current maximum is 12. We have 10k-100k trees now and expect millions in the future. I have run many tests and benchmarks of the usual operations, and in the end a materialized path ('1.5.3' path notation) seems the most promising.

My final candidates for its storage are ltree and integer[]. So I am comparing the two table layouts below with exactly the same data (5 regular trees - each node except the leaves has 5 children, depth = 9, 2,441,405 nodes in total, ids numbered according to a breadth-first traversal):


*TABLES:*
A/ integer[]
CREATE TABLE test (
    id SERIAL,
    apath INTEGER[],
    CONSTRAINT test_pkey PRIMARY KEY(id)
);

CREATE INDEX test_idx1 ON test USING gin (apath);

B/ ltree
CREATE TABLE test (
    id SERIAL,
    lpath ltree,
    CONSTRAINT test_pkey PRIMARY KEY(id)
);

CREATE INDEX test_idx1 ON test USING gist (lpath gist_ltree_ops);

Each variant lives in its own separate single-table database; both were VACUUM ANALYZEd.
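For reference, one tree's worth of integer[] paths can be generated with a recursive CTE along these lines (a sketch of the idea, not my actual load script; it assumes 1-based breadth-first ids, so the children of node i are 5*(i-1)+2 .. 5*(i-1)+6, and builds one complete 5-ary tree of 488,281 nodes - the full data set repeats this for 5 trees):

-- Sketch: one complete 5-ary tree, depth 9, breadth-first ids.
-- Nodes with 5*(id-1)+2 > 488281 are leaves, so recursion stops there.
INSERT INTO test (id, apath)
WITH RECURSIVE t(id, apath) AS (
    SELECT 1, ARRAY[1]
    UNION ALL
    SELECT c.id, t.apath || c.id
    FROM t
    CROSS JOIN LATERAL
        generate_series(5*(t.id-1)+2, 5*(t.id-1)+6) AS c(id)
    WHERE 5*(t.id-1)+2 <= 488281
)
SELECT id, apath FROM t;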


*TESTING MACHINE:*
Windows 7, postgres 9.3.0, 4GB RAM
effective_cache_size = 2GB
work_mem = 512MB
shared_buffers = 1GB


*THE PROBLEM:*
My intuition says integer[] should not be much worse than ltree (rather the other way) but I am not able to reach such results. I believe I am making some trivial mistake rather than assuming false hypothesis. My general question is, what more can I do to get better performance.


*WHAT I DID:*

/*1/ I checked the gist index for integer[] via the intarray extension. Query plans for the <@ and @> operators do not use it (a reported bug/feature); that is why I am using gin.*/
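(The index from 1/ was of this general form - a sketch using intarray's standard opclass names, not my exact statements:)

CREATE EXTENSION intarray;
CREATE INDEX test_idx_gist ON test USING gist (apath gist__int_ops);
-- gist__intbig_ops is the lossy variant intended for larger arrays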

/*2/ Getting the whole subtree of a node (every row having node 1 on its path) - same qplans, ltree slightly wins:*/
A:
select * from test where apath@>(select apath from test where id=1)

Bitmap Heap Scan on test  (cost=159.04..33950.48 rows=12206 width=60) (actual time=80.912..224.182 rows=488281 loops=1)
  Recheck Cond: (apath @> $0)
  Buffers: shared hit=18280
  InitPlan 1 (returns $0)
    ->  Index Scan using test_pkey on test test_1  (cost=0.43..8.45 rows=1 width=56) (actual time=0.022..0.023 rows=1 loops=1)
          Index Cond: (id = 1)
          Buffers: shared hit=4
  ->  Bitmap Index Scan on test_idx1  (cost=0.00..147.54 rows=12206 width=0) (actual time=76.896..76.896 rows=488281 loops=1)
        Index Cond: (apath @> $0)
        Buffers: shared hit=369
Total runtime: 240.408 ms

B:
select * from test where lpath<@(select lpath from test where id=1)

Bitmap Heap Scan on test  (cost=263.81..8674.72 rows=2445 width=83) (actual time=85.395..166.683 rows=488281 loops=1)
  Recheck Cond: (lpath <@ $0)
  Buffers: shared hit=22448
  InitPlan 1 (returns $0)
    ->  Index Scan using test_pkey on test test_1  (cost=0.43..8.45 rows=1 width=79) (actual time=0.024..0.025 rows=1 loops=1)
          Index Cond: (id = 1)
          Buffers: shared hit=4
  ->  Bitmap Index Scan on test_idx1  (cost=0.00..254.75 rows=2445 width=0) (actual time=83.029..83.029 rows=488281 loops=1)
        Index Cond: (lpath <@ $0)
        Buffers: shared hit=12269
Total runtime: 182.239 ms
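(Side note: node 1 is a root, so its path should simply be ARRAY[1] / '1'. The same subtree test can then be written with a literal, avoiding the InitPlan; a sketch, not something I benchmarked:)

-- A: integer[] with gin
SELECT * FROM test WHERE apath @> ARRAY[1];
-- B: ltree with gist
SELECT * FROM test WHERE lpath <@ '1'::ltree;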

/*3/ Getting chosen nodes (eo) having chosen ancestors (ea) - integer[] performs very poorly; its qplan uses an additional Bitmap Heap Scan, though indices are used in both cases.*/

A:
select *
from test eo
where id in (select generate_series(3, 3000000, 5000)) and
    exists (
        select 1
        from test ea
        where ea.id in(select generate_series(1000, 3000, 3)) and
            ea.apath <@ eo.apath
    )

Nested Loop Semi Join  (cost=140.10..1302851104.53 rows=6103 width=60) (actual time=1768.862..210525.597 rows=104 loops=1)
  Buffers: shared hit=8420909
  ->  Nested Loop  (cost=17.94..1652.31 rows=1220554 width=60) (actual time=0.382..17.255 rows=489 loops=1)
        Buffers: shared hit=2292
        ->  HashAggregate  (cost=17.51..19.51 rows=200 width=4) (actual time=0.352..1.486 rows=600 loops=1)
              ->  Result  (cost=0.00..5.01 rows=1000 width=0) (actual time=0.009..0.100 rows=600 loops=1)
        ->  Index Scan using test_pkey on test eo  (cost=0.43..8.15 rows=1 width=60) (actual time=0.017..0.021 rows=1 loops=600)
              Index Cond: (id = (generate_series(3, 3000000, 5000)))
              Buffers: shared hit=2292
  ->  Hash Semi Join  (cost=122.16..1133.92 rows=6103 width=56) (actual time=430.482..430.482 rows=0 loops=489)
        Hash Cond: (ea.id = (generate_series(1000, 3000, 3)))
        Buffers: shared hit=8418617
        ->  Bitmap Heap Scan on test ea  (cost=94.65..251.23 rows=12206 width=60) (actual time=271.034..430.278 rows=8 loops=489)
              Recheck Cond: (apath <@ eo.apath)
              Rows Removed by Index Recheck: 444335
              Buffers: shared hit=8418617
              ->  Bitmap Index Scan on test_idx1  (cost=0.00..91.60 rows=12206 width=0) (actual time=155.355..155.355 rows=488281 loops=489)
                    Index Cond: (apath <@ eo.apath)
                    Buffers: shared hit=237668
        ->  Hash  (cost=15.01..15.01 rows=1000 width=4) (actual time=0.305..0.305 rows=667 loops=1)
              Buckets: 1024  Batches: 1  Memory Usage: 24kB
              ->  Result  (cost=0.00..5.01 rows=1000 width=0) (actual time=0.003..0.116 rows=667 loops=1)
Total runtime: 210526.004 ms

B:
select *
from test eo
where id in (select generate_series(3, 3000000, 5000)) and
    exists (
        select 1
        from test ea
        where ea.id in(select generate_series(1000, 3000, 3)) and
            ea.lpath @> eo.lpath
    )

Nested Loop Semi Join  (cost=45.86..276756955.40 rows=1223 width=83) (actual time=2.985..226.161 rows=104 loops=1)
  Buffers: shared hit=27675
  ->  Nested Loop  (cost=17.94..1687.51 rows=1222486 width=83) (actual time=0.660..5.987 rows=489 loops=1)
        Buffers: shared hit=2297
        ->  HashAggregate  (cost=17.51..19.51 rows=200 width=4) (actual time=0.632..1.008 rows=600 loops=1)
              ->  Result  (cost=0.00..5.01 rows=1000 width=0) (actual time=0.007..0.073 rows=600 loops=1)
        ->  Index Scan using test_pkey on test eo  (cost=0.43..8.33 rows=1 width=83) (actual time=0.007..0.007 rows=1 loops=600)
              Index Cond: (id = (generate_series(3, 3000000, 5000)))
              Buffers: shared hit=2297
  ->  Hash Semi Join  (cost=27.92..242.30 rows=1223 width=79) (actual time=0.449..0.449 rows=0 loops=489)
        Hash Cond: (ea.id = (generate_series(1000, 3000, 3)))
        Buffers: shared hit=25378
        ->  Index Scan using test_idx1 on test ea  (cost=0.41..43.43 rows=2445 width=83) (actual time=0.060..0.445 rows=8 loops=489)
              Index Cond: (lpath @> eo.lpath)
              Buffers: shared hit=25378
        ->  Hash  (cost=15.01..15.01 rows=1000 width=4) (actual time=0.178..0.178 rows=667 loops=1)
              Buckets: 1024  Batches: 1  Memory Usage: 24kB
              ->  Result  (cost=0.00..5.01 rows=1000 width=0) (actual time=0.003..0.071 rows=667 loops=1)
Total runtime: 226.308 ms

/*3.a/ If I turn off the index for A, the query is much faster:*/
Nested Loop Semi Join  (cost=35.88..20535547247.01 rows=6103 width=60) (actual time=17.257..1112.276 rows=104 loops=1)
  Join Filter: (ea.apath <@ eo.apath)
  Rows Removed by Join Filter: 287529
  Buffers: shared hit=1155469
  ->  Nested Loop  (cost=17.94..1652.31 rows=1220554 width=60) (actual time=0.334..5.307 rows=489 loops=1)
        Buffers: shared hit=2292
        ->  HashAggregate  (cost=17.51..19.51 rows=200 width=4) (actual time=0.297..0.598 rows=600 loops=1)
              ->  Result  (cost=0.00..5.01 rows=1000 width=0) (actual time=0.008..0.088 rows=600 loops=1)
        ->  Index Scan using test_pkey on test eo  (cost=0.43..8.15 rows=1 width=60) (actual time=0.007..0.007 rows=1 loops=600)
              Index Cond: (id = (generate_series(3, 3000000, 5000)))
              Buffers: shared hit=2292
  ->  Nested Loop  (cost=17.94..1652.31 rows=1220554 width=56) (actual time=0.004..1.946 rows=588 loops=489)
        Buffers: shared hit=1153177
        ->  HashAggregate  (cost=17.51..19.51 rows=200 width=4) (actual time=0.001..0.089 rows=588 loops=489)
              ->  Result  (cost=0.00..5.01 rows=1000 width=0) (actual time=0.005..0.103 rows=667 loops=1)
        ->  Index Scan using test_pkey on test ea  (cost=0.43..8.15 rows=1 width=60) (actual time=0.002..0.003 rows=1 loops=287633)
              Index Cond: (id = (generate_series(1000, 3000, 3)))
              Buffers: shared hit=1153177
Total runtime: 1112.411 ms

/*3.b/ With the index on, if I go down to effective_cache_size = 256MB, A also skips the index; the same qplan as in 3.a is used. The index is still used for both versions of query 2 and for the ltree version of query 3.*/
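(To reproduce 3.b without editing postgresql.conf - effective_cache_size only influences plan choice, so it can be lowered per session:)

SET effective_cache_size = '256MB';  -- affects planning only, no restart needed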


*QUESTIONS:*
1/ Is my hypothesis about similar performance of ltree and integer[] correct?
2/ If so, what should I do to get it?
3/ Is there a way to improve the performance of the <@ and @> operators? In fact, since I keep a tree path in the column, I only need to check whether path_a 'starts_with' path_b to get ancestors/descendants, so something more effective than 'contains' might be usable. Is there any way? (A sketch of what I mean follows below.)
4/ Do I understand correctly that the index on integer[] is much more memory-consuming, and that this is why the query plans / execution times differ?
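To illustrate 3/, the 'starts_with' test I mean could be written over integer[] as an ordered-prefix comparison - a sketch, with ARRAY[1,2] standing in for an arbitrary ancestor path:

-- 'eo starts with ea' as a slice comparison: true materialized-path
-- semantics (respects element order and position), unlike the unordered @>.
SELECT *
FROM test
WHERE apath[1:2] = ARRAY[1,2];   -- subtree of the node with path 1.2

As far as I can tell, the stock gin array opclass cannot accelerate this form directly, which is part of what I am asking about.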


Thanks for any help,

Jan
