Hello everyone!

Recently, we upgraded our AWS RDS instance from Postgres 12.14 to 15.4 and noticed extremely high disk read consumption when executing the following query:

select (exists (select 1 as "one" from "public"."indexed_commit" where "public"."indexed_commit"."repo_id" in (964992, 964994, 964999, ...)));

For some reason, the query planner starts using a Seq Scan instead of the index on the "repo_id" column when the query is executed by a user restricted by RLS. On production this happens when there are more than 316 IDs in the IN list; on stage, when there are more than 3. If we execute the query as a superuser, the planner always uses the "repo_id" index.
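
For clarity, this is roughly how we run the query in both cases; the role name and tenant UUID below are placeholders, not our real values:

-- RLS-limited user: the tenant_id policy condition is added to the query
SET ROLE app_user;  -- placeholder role name
SET app.current_tenant_id = '00000000-0000-0000-0000-000000000000';  -- placeholder tenant
EXPLAIN (ANALYZE, BUFFERS)
SELECT (EXISTS (SELECT 1 FROM public.indexed_commit
                WHERE repo_id IN (964992, 964994, 964999 /* , ... */)));

-- Superuser: RLS does not apply, so the planner only sees the repo_id condition
RESET ROLE;
EXPLAIN (ANALYZE, BUFFERS)
SELECT (EXISTS (SELECT 1 FROM public.indexed_commit
                WHERE repo_id IN (964992, 964994, 964999 /* , ... */)));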

Luckily, we can easily reproduce this on our stage database (which is smaller). If we add a multicolumn "repo_id, tenant_id" index, the planner uses it (Index Only Scan) under RLS with any number of IDs in the IN list.
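
The workaround index we tried on stage looks roughly like this (the index name is just illustrative):

CREATE INDEX indexed_commit_repo_id_tenant_id_idx
    ON public.indexed_commit (repo_id, tenant_id);

With repo_id and tenant_id both in the index, the tenant_id filter added by the RLS policy can be evaluated from the index alone, which is presumably why the planner is willing to use it in that case.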

Could you please clarify whether this is a Postgres bug? Should we include the "tenant_id" column in all our indexes to make them usable under RLS?


     Postgres version / Operating system+version


PostgreSQL 15.4 on aarch64-unknown-linux-gnu, compiled by gcc (GCC) 7.3.1 20180712 (Red Hat 7.3.1-6), 64-bit


     Full Table and Index Schema

\d indexed_commit
                        Table "public.indexed_commit"
    Column     |            Type             | Collation | Nullable | Default
---------------+-----------------------------+-----------+----------+---------
 id            | bigint                      |           | not null |
 commit_hash   | character varying(40)       |           | not null |
 parent_hash   | text                        |           |          |
 created_ts    | timestamp without time zone |           | not null |
 repo_id       | bigint                      |           | not null |
 lines_added   | bigint                      |           |          |
 lines_removed | bigint                      |           |          |
 tenant_id     | uuid                        |           | not null |
 author_id     | uuid                        |           | not null |
Indexes:
    "indexed-commit-repo-idx" btree (repo_id)
    "indexed_commit_commit_hash_repo_id_key" UNIQUE CONSTRAINT, btree (commit_hash, repo_id) REPLICA IDENTITY     "indexed_commit_repo_id_without_loc_idx" btree (repo_id) WHERE lines_added IS NULL OR lines_removed IS NULL
Policies:
    POLICY "commit_isolation_policy"
      USING ((tenant_id = (current_setting('app.current_tenant_id'::text))::uuid))
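
For completeness, the policy above corresponds to DDL along these lines (a sketch; the exact statements in our migrations may differ slightly):

ALTER TABLE public.indexed_commit ENABLE ROW LEVEL SECURITY;

CREATE POLICY commit_isolation_policy ON public.indexed_commit
    USING (tenant_id = current_setting('app.current_tenant_id')::uuid);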


     Table Metadata

SELECT relname, relpages, reltuples, relallvisible, relkind, relnatts, relhassubclass, reloptions, pg_table_size(oid)
FROM pg_class WHERE relname = 'indexed_commit';

    relname     | relpages |  reltuples   | relallvisible | relkind | relnatts | relhassubclass |                                                                  reloptions                                                                  | pg_table_size
----------------+----------+--------------+---------------+---------+----------+----------------+----------------------------------------------------------------------------------------------------------------------------------------------+---------------
 indexed_commit | 18170522 | 7.451964e+08 |      18104744 | r       |        9 | f              | {autovacuum_vacuum_scale_factor=0,autovacuum_analyze_scale_factor=0,autovacuum_vacuum_threshold=200000,autovacuum_analyze_threshold=100000}  |  148903337984


     EXPLAIN (ANALYZE, BUFFERS), not just EXPLAIN

Production queries:

316 IDs under an RLS-limited user: <https://explain.depesz.com/s/X7Iq>

392 IDs under an RLS-limited user: <https://explain.depesz.com/s/lbkX>

392 IDs under a superuser: <https://explain.depesz.com/s/uKSG>


     History

The query became slow after the upgrade to 15.4; we never had any issues with it before.


     Hardware

AWS DB class db.t4g.large + GP3 400GB disk


     Maintenance Setup

Are you running autovacuum? Yes

If so, with what settings?

autovacuum_vacuum_scale_factor=0,autovacuum_analyze_scale_factor=0,autovacuum_vacuum_threshold=200000,autovacuum_analyze_threshold=100000
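
These are per-table storage parameters; they were applied roughly like this (sketch):

ALTER TABLE public.indexed_commit SET (
    autovacuum_vacuum_scale_factor  = 0,
    autovacuum_analyze_scale_factor = 0,
    autovacuum_vacuum_threshold     = 200000,
    autovacuum_analyze_threshold    = 100000
);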

SELECT * FROM pg_stat_user_tables WHERE relname='indexed_commit';
 relid | schemaname |    relname     | seq_scan | seq_tup_read | idx_scan  | idx_tup_fetch | n_tup_ins | n_tup_upd | n_tup_del | n_tup_hot_upd | n_live_tup | n_dead_tup | n_mod_since_analyze | n_ins_since_vacuum | last_vacuum |        last_autovacuum        | last_analyze |       last_autoanalyze        | vacuum_count | autovacuum_count | analyze_count | autoanalyze_count
-------+------------+----------------+----------+--------------+-----------+---------------+-----------+-----------+-----------+---------------+------------+------------+---------------------+--------------------+-------------+-------------------------------+--------------+-------------------------------+--------------+------------------+---------------+-------------------
 24662 | public     | indexed_commit |     2485 |  49215378424 | 374533865 |    4050928807 | 764089750 |   2191615 |  18500311 |             0 |  745241398 |        383 |               46018 |              45343 |             | 2023-10-11 23:51:29.170378+00 |              | 2023-10-11 23:50:18.922351+00 |            0 |              672 |             0 |               753


     WAL Configuration

For data writing queries: have you moved the WAL to a different disk? Changed the settings? No.


     GUC Settings

What database configuration settings have you changed? We use the default settings; we have not changed anything ourselves.

What are their values?

SELECT * FROM pg_settings WHERE name IN ('effective_cache_size', 'shared_buffers', 'work_mem');

         name         | setting | unit |               category                |                               short_desc                               |                                                                               extra_desc                                                                               |  context   | vartype |       source       | min_val |  max_val   | enumvals | boot_val | reset_val | sourcefile | sourceline | pending_restart
----------------------+---------+------+---------------------------------------+-------------------------------------------------------------------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------+---------+--------------------+---------+------------+----------+----------+-----------+------------+------------+-----------------
 effective_cache_size | 494234  | 8kB  | Query Tuning / Planner Cost Constants | Sets the planner's assumption about the total size of the data caches. | That is, the total size of the caches (kernel cache and shared buffers) used for PostgreSQL data files. This is measured in disk pages, which are normally 8 kB each. | user       | integer | configuration file | 1       | 2147483647 |          | 524288   | 494234    |            |            | f
 shared_buffers       | 247117  | 8kB  | Resource Usage / Memory               | Sets the number of shared memory buffers used by the server.           |                                                                                                                                                                        | postmaster | integer | configuration file | 16      | 1073741823 |          | 16384    | 247117    |            |            | f
 work_mem             | 4096    | kB   | Resource Usage / Memory               | Sets the maximum memory to be used for query workspaces.               | This much memory can be used by each internal sort operation and hash table before switching to temporary disk files.                                                 | user       | integer | default            | 64      | 2147483647 |          | 4096     | 4096      |            |            | f


     Statistics: n_distinct, MCV, histogram

Useful to check statistics leading to bad join plan:

SELECT (SELECT sum(x) FROM unnest(most_common_freqs) x) frac_MCV,
       tablename, attname, inherited, null_frac, n_distinct,
       array_length(most_common_vals, 1) n_mcv,
       array_length(histogram_bounds, 1) n_hist,
       correlation
FROM pg_stats
WHERE attname = '...' AND tablename = '...'
ORDER BY 1 DESC;

Returns 0 rows.
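
Filled in for the columns involved in this query (repo_id from the IN list and tenant_id from the RLS policy), the statistics query would look like this:

SELECT (SELECT sum(x) FROM unnest(most_common_freqs) x) frac_MCV,
       tablename, attname, inherited, null_frac, n_distinct,
       array_length(most_common_vals, 1) n_mcv,
       array_length(histogram_bounds, 1) n_hist,
       correlation
FROM pg_stats
WHERE tablename = 'indexed_commit'
  AND attname IN ('repo_id', 'tenant_id')
ORDER BY 1 DESC;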


Kind regards,

Alexander
