Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-25 Thread Mike Broers
Very helpful thank you for the additional insight - I'd never checked into
pg_stats and that does reveal a difference in the distribution of the
validation_status_code between qa and production:

prod:
│ most_common_vals   │ {P,F}  │
│ most_common_freqs  │ {0.925967,0.00093} │
│ histogram_bounds   │ ❏  │
│ correlation│ 0.995533   │

qa:
│ most_common_vals │ {P} │
│ most_common_freqs │ {0.861633} │
│ histogram_bounds │ ❏ │
│ correlation │ 0.61 │

so the way I am reading this is that there is likely no sensible way to
avoid postgres thinking it will just have to scan the whole table because
of these statistics.  I can force it by setting session parameters for this
particular query but I probably shouldnt be looking at system settings to
brutally force random fetches.

thanks again for the assistance!



On Wed, Sep 20, 2017 at 6:05 PM, David Rowley 
wrote:

> On 21 September 2017 at 04:15, Mike Broers  wrote:
> > Ultimately I think this is just highlighting the need in my environment
> to
> > set random_page_cost lower (we are on an SSD SAN anyway..), but I dont
> think
> > I have a satisfactory reason by the row estimates are so bad in the QA
> > planner and why it doesnt use that partition index there.
>
> Without the index there are no stats to allow the planner to perform a
> good estimate on "e.body->>'SID' is not null", so it applies a default
> of 99.5%. So, as a simple example, if you have a partition with 1
> million rows. If you apply 99.5% to that you get 995000 rows. Now if
> you add the selectivity for "e.validation_status_code = 'P' ", let's
> say that's 50%, the row estimate for the entire WHERE clause would be
> 497500 (100 * 0.995 * 0.5). Since the 99.5% is applied in both
> cases, then the only variable part is validation_status_code. Perhaps
> validation_status_code  = 'P' is much more common in QA than in
> production.
>
> You can look at the stats as gathered by ANALYZE with:
>
> \x on
> select * from pg_stats where tablename = 'event__' and attname
> = 'validation_status_code';
> \x off
>
> --
>  David Rowley   http://www.2ndQuadrant.com/
>  PostgreSQL Development, 24x7 Support, Training & Services
>


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-20 Thread David Rowley
On 21 September 2017 at 04:15, Mike Broers  wrote:
> Ultimately I think this is just highlighting the need in my environment to
> set random_page_cost lower (we are on an SSD SAN anyway..), but I dont think
> I have a satisfactory reason by the row estimates are so bad in the QA
> planner and why it doesnt use that partition index there.

Without the index there are no stats to allow the planner to perform a
good estimate on "e.body->>'SID' is not null", so it applies a default
of 99.5%. So, as a simple example, if you have a partition with 1
million rows. If you apply 99.5% to that you get 995000 rows. Now if
you add the selectivity for "e.validation_status_code = 'P' ", let's
say that's 50%, the row estimate for the entire WHERE clause would be
497500 (100 * 0.995 * 0.5). Since the 99.5% is applied in both
cases, then the only variable part is validation_status_code. Perhaps
validation_status_code  = 'P' is much more common in QA than in
production.

You can look at the stats as gathered by ANALYZE with:

\x on
select * from pg_stats where tablename = 'event__' and attname
= 'validation_status_code';
\x off

-- 
 David Rowley   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-20 Thread Mike Broers
I was able to add the suggested indexes
like stage.event__00075000((body->>'SID'::text)); and indeed these helped
the QA environment use those indexes instead of sequence scanning.

I'm still perplexed by my original question, why production uses the
partition index and qa does not?

Index Scan using ix_event__00014695_landing_id on event__00014695 e_3
(cost=0.56..39137.89
rows=37697 width=564)   │

│ Index Cond: (landing_id =
t_sap.landing_id)


Ultimately I think this is just highlighting the need in my environment to
set random_page_cost lower (we are on an SSD SAN anyway..), but I dont
think I have a satisfactory reason by the row estimates are so bad in the
QA planner and why it doesnt use that partition index there.




On Fri, Sep 15, 2017 at 3:59 PM, Mike Broers  wrote:

> That makes a lot of sense, thanks for taking a look.  An index like you
> suggest would probably further improve the query.   Is that suggestion
> sidestepping the original problem that production is evaluating the
> landing_id bit with the partition index and qa is sequence scanning instead?
>
> AND exists (select 1 from t_sap where e.landing_id = t_sap.landing_id)) as
> rankings;
>
> Based on the difference in row estimate I am attempting an analyze with a
> higher default_statistic_target (currently 100) to see if that helps.
>
>
>
>
> On Fri, Sep 15, 2017 at 3:42 PM, Tom Lane  wrote:
>
>> Mike Broers  writes:
>> > If Im reading this correctly postgres thinks the partition will return
>> 6.5
>> > million matching rows but actually comes back with 162k.  Is this a case
>> > where something is wrong with the analyze job?
>>
>> You've got a lot of scans there that're using conditions like
>>
>> > │   ->  Seq Scan on event__ e_1
>> (cost=0.00..2527828.05 rows=11383021 width=778) (actual
>> time=25522.389..747238.885 rows=42 loops=1)
>> > │ Filter: (((body ->> 'SID'::text) IS
>> NOT NULL) AND (validation_status_code = 'P'::bpchar))
>> > │ Rows Removed by Filter: 12172186
>>
>> While I'd expect the planner to be pretty solid on estimating the
>> validation_status_code condition, it's not going to have any idea about
>> that JSON field test.  That's apparently very selective, but you're just
>> getting a default estimate, which is not going to think that a NOT NULL
>> test will exclude lots of rows.
>>
>> One thing you could consider doing about this is creating an index
>> on (body ->> 'SID'::text), which would prompt ANALYZE to gather statistics
>> about that expression.  Even if the index weren't actually used in the
>> plan, this might improve the estimates and the resulting planning choices
>> enough to make it worth maintaining such an index.
>>
>> Or you could think about pulling that field out and storing it on its own.
>> JSON columns are great for storing random unstructured data, but they are
>> less great when you want to do relational-ish things on subfields.
>>
>> regards, tom lane
>>
>
>


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-15 Thread Tomas Vondra
On 09/16/2017 12:05 AM, Tom Lane wrote:
> Andres Freund  writes:
>> I'm wondering if we should extend the new CREATE STATISTICS
>> framework to be able to do that without requiring an index.
> 
> I think that's already on the roadmap --- it's one of the reasons we 
> ended up with a SELECT-like syntax for CREATE STATISTICS. But it 
> didn't get done for v10.
> 

Right. It's one of the things I'd like to be working on after getting in
the more complex statistics types (MCV & histograms).

> If we do look at that as a substitute for "make an expression index
> just so you get some stats", it would be good to have a way to
> specify that you only want the standard ANALYZE stats on that value
> and not the extended ones.
> 

Not sure I understand what you mean by "extended" - the statistics we
collect for expression indexes, or the CREATE STATISTICS stuff? I assume
the former, because if you don't want the latter then just don't create
the statistics. Or am I missing something?

regards

-- 
Tomas Vondra  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-15 Thread Tom Lane
Andres Freund  writes:
> I'm wondering if we should extend the new CREATE STATISTICS framework to
> be able to do that without requiring an index.

I think that's already on the roadmap --- it's one of the reasons we
ended up with a SELECT-like syntax for CREATE STATISTICS.  But it
didn't get done for v10.

If we do look at that as a substitute for "make an expression index just
so you get some stats", it would be good to have a way to specify that you
only want the standard ANALYZE stats on that value and not the extended
ones.

regards, tom lane


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-15 Thread Andres Freund


On September 15, 2017 1:42:23 PM PDT, Tom Lane  wrote:
>One thing you could consider doing about this is creating an index
>on (body ->> 'SID'::text), which would prompt ANALYZE to gather
>statistics
>about that expression.  Even if the index weren't actually used in the
>plan, this might improve the estimates and the resulting planning
>choices
>enough to make it worth maintaining such an index.

I'm wondering if we should extend the new CREATE STATISTICS framework to be 
able to do that without requiring an index. I.e. allow expressions and add a 
new type of stats that just correspond to what normal columns have.  Could even 
create that implicitly for expression indexes, but allow to drop it, if the 
overtrading isn't worth it.

Andres
-- 
Sent from my Android device with K-9 Mail. Please excuse my brevity.


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-15 Thread Mike Broers
That makes a lot of sense, thanks for taking a look.  An index like you
suggest would probably further improve the query.   Is that suggestion
sidestepping the original problem that production is evaluating the
landing_id bit with the partition index and qa is sequence scanning instead?

AND exists (select 1 from t_sap where e.landing_id = t_sap.landing_id)) as
rankings;

Based on the difference in row estimate I am attempting an analyze with a
higher default_statistic_target (currently 100) to see if that helps.




On Fri, Sep 15, 2017 at 3:42 PM, Tom Lane  wrote:

> Mike Broers  writes:
> > If Im reading this correctly postgres thinks the partition will return
> 6.5
> > million matching rows but actually comes back with 162k.  Is this a case
> > where something is wrong with the analyze job?
>
> You've got a lot of scans there that're using conditions like
>
> > │   ->  Seq Scan on event__ e_1
> (cost=0.00..2527828.05 rows=11383021 width=778) (actual
> time=25522.389..747238.885 rows=42 loops=1)
> > │ Filter: (((body ->> 'SID'::text) IS
> NOT NULL) AND (validation_status_code = 'P'::bpchar))
> > │ Rows Removed by Filter: 12172186
>
> While I'd expect the planner to be pretty solid on estimating the
> validation_status_code condition, it's not going to have any idea about
> that JSON field test.  That's apparently very selective, but you're just
> getting a default estimate, which is not going to think that a NOT NULL
> test will exclude lots of rows.
>
> One thing you could consider doing about this is creating an index
> on (body ->> 'SID'::text), which would prompt ANALYZE to gather statistics
> about that expression.  Even if the index weren't actually used in the
> plan, this might improve the estimates and the resulting planning choices
> enough to make it worth maintaining such an index.
>
> Or you could think about pulling that field out and storing it on its own.
> JSON columns are great for storing random unstructured data, but they are
> less great when you want to do relational-ish things on subfields.
>
> regards, tom lane
>


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-15 Thread Tom Lane
Mike Broers  writes:
> If Im reading this correctly postgres thinks the partition will return 6.5
> million matching rows but actually comes back with 162k.  Is this a case
> where something is wrong with the analyze job?

You've got a lot of scans there that're using conditions like

> │   ->  Seq Scan on event__ e_1 
> (cost=0.00..2527828.05 rows=11383021 width=778) (actual 
> time=25522.389..747238.885 rows=42 loops=1)
> │ Filter: (((body ->> 'SID'::text) IS NOT 
> NULL) AND (validation_status_code = 'P'::bpchar))
> │ Rows Removed by Filter: 12172186

While I'd expect the planner to be pretty solid on estimating the
validation_status_code condition, it's not going to have any idea about
that JSON field test.  That's apparently very selective, but you're just
getting a default estimate, which is not going to think that a NOT NULL
test will exclude lots of rows.

One thing you could consider doing about this is creating an index
on (body ->> 'SID'::text), which would prompt ANALYZE to gather statistics
about that expression.  Even if the index weren't actually used in the
plan, this might improve the estimates and the resulting planning choices
enough to make it worth maintaining such an index.

Or you could think about pulling that field out and storing it on its own.
JSON columns are great for storing random unstructured data, but they are
less great when you want to do relational-ish things on subfields.

regards, tom lane


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-15 Thread Mike Broers
Query finally came back with an explain analyze :)

If Im reading this correctly postgres thinks the partition will return 6.5
million matching rows but actually comes back with 162k.  Is this a case
where something is wrong with the analyze job?

Seq Scan on event__00071000 e_4  (cost=0.00..2204374.94 rows=6523419
width=785) (actual time=7020.509..448368.247 rows=162912 loops=1)




┌───┐

│
QUERY PLAN
  │

├───┤

│ Subquery Scan on rankings  (cost=45357272.27..47351629.37 rows=39887142
width=24) (actual time=6117566.189..6117619.805 rows=25190 loops=1)
  │

│   ->  WindowAgg  (cost=45357272.27..46952757.95 rows=39887142 width=772)
(actual time=6117566.101..6117611.266 rows=25190 loops=1)
  │

│ ->  Sort  (cost=45357272.27..45456990.12 rows=39887142 width=772)
(actual time=6117566.054..6117572.121 rows=25190 loops=1)
│

│   Sort Key: ((e.body ->> 'SID'::text)), ((e.body ->>
'Timestamp'::text)) DESC
  │

│   Sort Method: quicksort  Memory: 13757kB

│

│   ->  Hash Join  (cost=46.38..24740720.18 rows=39887142
width=772) (actual time=1511499.761..6117335.382 rows=25190 loops=1)
  │

│ Hash Cond: (e.landing_id = t_sap.landing_id)

│

│ ->  Append  (cost=0.00..24387085.38 rows=79774283
width=776) (actual time=25522.442..6116672.504 rows=2481659 loops=1)
│

│   ->  Seq Scan on event e  (cost=0.00..1.36
rows=1 width=97) (actual time=0.049..0.049 rows=0 loops=1)
  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by Filter: 24

│

│   ->  Seq Scan on event__ e_1
(cost=0.00..2527828.05
rows=11383021 width=778) (actual time=25522.389..747238.885 rows=42 loops=1)
  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by Filter: 12172186

│

│   ->  Seq Scan on event__00069000 e_2
(cost=0.00..1462613.93
rows=5957018 width=771) (actual time=4486.295..370098.760 rows=183696
loops=1)  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by Filter: 6956029

│

│   ->  Seq Scan on event__0007 e_3
(cost=0.00..1534702.41
rows=5991507 width=787) (actual time=3415.907..361606.800 rows=199081
loops=1)  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by Filter: 7177444

│

│   ->  Seq Scan on event__00071000 e_4
(cost=0.00..2204374.94
rows=6523419 width=785) (actual time=7020.509..448368.247 rows=162912
loops=1)  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by Filter: 8091470

│

│   ->  Seq Scan on event__00072000 e_5
(cost=0.00..1531430.89
rows=5814704 width=792) (actual time=25.304..343612.826 rows=214891 loops=1)
  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by Filter: 7301151

│

│   ->  Seq Scan on event__00073000 e_6
(cost=0.00..1384865.48
rows=5876959 width=767) (actual time=1631.133..424827.603 rows=163959
loops=1)  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by Filter: 6523673

│

│   ->  Seq Scan on event__00074000 e_7
(cost=0.00..1289048.37
rows=4747343 width=801) (actual time=3287.286..280317.057 rows=204394
loops=1)  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar))
  │

│ Rows Removed by 

Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-14 Thread Mike Broers
Thanks for the suggestions, I'll futz with random_page_cost  and
effective_cache_size a bit and follow up, as well as try to provide an
explain analyze on both (if the longer query ever returns!)

Most appreciated.

On Wed, Sep 13, 2017 at 4:57 PM, David Rowley 
wrote:

> On 14 September 2017 at 08:28, Mike Broers  wrote:
> > I have a query of a partitioned table that uses the partition index in
> > production but uses sequence scans in qa.  The only major difference I
> can
> > tell is the partitions are much smaller in qa.  In production the
> partitions
> > range in size from around 25 million rows to around 60 million rows, in
> QA
> > the partitions are between 4 and 12 million rows.  I would think this
> would
> > be big enough to get the planner to prefer the index but this is the
> major
> > difference between the two database as far as I can tell.
>
>
> QA:
>
> > │   ->  Seq Scan on event__ e_1
> > (cost=0.00..2527918.06 rows=11457484 width=782)│
> >
>
> Production:
> >
> > │   ->  Index Scan using
> > ix_event__00011162_landing_id on event__00011162 e_1
> (cost=0.56..15476.59
> > rows=23400 width=572)   │
>
>
> If QA has between 4 and 12 million rows, then the planner's row
> estimate for the condition thinks 11457484 are going to match, so a
> Seqscan is likely best here. If those estimates are off then it might
> be worth double checking your nightly analyze is working correctly on
> QA.
>
> The planner may be able to be coaxed into using the index with a
> higher effective_cache_size and/or a lower random_page_cost setting,
> although you really should be looking at those row estimates first.
> Showing us the EXPLAIN ANALYZE would have been much more useful so
> that we could have seen if those are accurate or not.
>
> --
>  David Rowley   http://www.2ndQuadrant.com/
>  PostgreSQL Development, 24x7 Support, Training & Services
>


Re: [PERFORM] query of partitioned object doesnt use index in qa

2017-09-13 Thread David Rowley
On 14 September 2017 at 08:28, Mike Broers  wrote:
> I have a query of a partitioned table that uses the partition index in
> production but uses sequence scans in qa.  The only major difference I can
> tell is the partitions are much smaller in qa.  In production the partitions
> range in size from around 25 million rows to around 60 million rows, in QA
> the partitions are between 4 and 12 million rows.  I would think this would
> be big enough to get the planner to prefer the index but this is the major
> difference between the two database as far as I can tell.


QA:

> │   ->  Seq Scan on event__ e_1
> (cost=0.00..2527918.06 rows=11457484 width=782)│
>

Production:
>
> │   ->  Index Scan using
> ix_event__00011162_landing_id on event__00011162 e_1  (cost=0.56..15476.59
> rows=23400 width=572)   │


If QA has between 4 and 12 million rows, then the planner's row
estimate for the condition thinks 11457484 are going to match, so a
Seqscan is likely best here. If those estimates are off then it might
be worth double checking your nightly analyze is working correctly on
QA.

The planner may be able to be coaxed into using the index with a
higher effective_cache_size and/or a lower random_page_cost setting,
although you really should be looking at those row estimates first.
Showing us the EXPLAIN ANALYZE would have been much more useful so
that we could have seen if those are accurate or not.

-- 
 David Rowley   http://www.2ndQuadrant.com/
 PostgreSQL Development, 24x7 Support, Training & Services


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


[PERFORM] query of partitioned object doesnt use index in qa

2017-09-13 Thread Mike Broers
Postgres 9.5

I have a query of a partitioned table that uses the partition index in
production but uses sequence scans in qa.  The only major difference I can
tell is the partitions are much smaller in qa.  In production the
partitions range in size from around 25 million rows to around 60 million
rows, in QA the partitions are between 4 and 12 million rows.  I would
think this would be big enough to get the planner to prefer the index but
this is the major difference between the two database as far as I can tell.

When I run the query in qa with enable seqscan=false I get the much faster
plan.  Both systems are manually vacuumed and analyzed each night.  Both
systems have identical settings for memory and are allocated the same for
system resources. Neither system is showing substantial index or table
bloat above .1-1% for any of the key indexes in question.



Here is the query with the seq scan plan in qa:

 explain select rankings from (select

e.body->>'SID' as temp_SID,

CASE WHEN e.source_id = 168 THEN e.body->>'Main Menu' ELSE e.body->>'Prompt
Selection 1' END as temp_ivr_selection_prompt1,

e.body->>'Existing Customer' as temp_ivr_selection_prompt2,

e.body->>'Business Services' as temp_ivr_selection_prompt3,

e.body->>'Prompt for ZIP' as temp_ivr_selection_zip,

rank() over (Partition by e.body->>'SID' order by e.body->>'Timestamp'
desc) as rank1

from stage.event e

where e.validation_status_code = 'P'

AND e.body->>'SID' is not null --So that matches are not made on NULL values


AND exists (select 1 from t_sap where e.landing_id = t_sap.landing_id)) as
rankings;

┌───┐

│QUERY PLAN
│

├───┤

│ Subquery Scan on rankings  (cost=42370434.66..44254952.76 rows=37690362
width=24) │

│   ->  WindowAgg  (cost=42370434.66..43878049.14 rows=37690362 width=769)
│

│ ->  Sort  (cost=42370434.66..42464660.56 rows=37690362 width=769)
  │

│   Sort Key: ((e.body ->> 'SID'::text)), ((e.body ->>
'Timestamp'::text)) DESC │

│   ->  Hash Join  (cost=46.38..22904737.49 rows=37690362
width=769)│

│ Hash Cond: (e.landing_id = t_sap.landing_id)
│

│ ->  Append  (cost=0.00..22568797.21 rows=75380725
width=773)  │

│   ->  Seq Scan on event e  (cost=0.00..1.36
rows=1 width=97)  │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__ e_1
(cost=0.00..2527918.06
rows=11457484 width=782)│

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__00069000 e_2
(cost=0.00..1462329.01
rows=5922843 width=772) │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__0007 e_3
(cost=0.00..1534324.60
rows=6003826 width=785) │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__00071000 e_4
(cost=0.00..2203954.48
rows=6508965 width=780) │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__00072000 e_5
(cost=0.00..1530805.89
rows=5759797 width=792) │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__00073000 e_6
(cost=0.00..1384818.75
rows=569 width=759) │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__00074000 e_7
(cost=0.00..1288777.54
rows=4734867 width=806) │

│ Filter: (((body ->> 'SID'::text) IS NOT
NULL) AND (validation_status_code = 'P'::bpchar)) │

│   ->  Seq Scan on event__00075000 e_8
(cost=0.00..1231949.17
rows=3934318 width=788)