Re: n_distinct off by a factor of 1000

Adrian Klaver Tue, 23 Jun 2020 07:15:04 -0700

On 6/23/20 7:05 AM, Fabio Pardi wrote:

On 23/06/2020 14:42, Klaudie Willis wrote:
I got my first hint of why this problem occurs when I looked at thestatistics. For the column in question, "instrument_ref" thestatistics claimed it to be:
The default_statistics_target=500, and analyze has been run.
select * from pg_stats where attname like 'instr%_ref'; -- Result:*40.000*select count(distinct instrumentid_ref) from bigtable -- Result: *33385 922 (!!)*
That is an astonishing difference of almost a 1000X.
I think you are counting 2 different things here.
The first query returns all the columns "like 'instr%_ref'" present inthe statistics (so in the whole cluster), while the second is countingthe actual number of different rows in bigtable.


I believe the OP actually meant the query to be:

select n_distinct from pg_stats where attname like 'instr%_ref';



regards,

fabio pardi



--
Adrian Klaver
[email protected]

Re: n_distinct off by a factor of 1000

Reply via email to