Re: [HACKERS] drop/truncate table sucks for large values of shared buffers

Simon Riggs Wed, 01 Jul 2015 07:58:24 -0700

On 1 July 2015 at 15:39, Amit Kapila <[email protected]> wrote:



> Okay. I think we can maintain the list in similar way as we do for
> UNLINK_RELATION_REQUEST in RememberFsyncRequest(), but
> why to wait till 64 tables?
>

I meant once per checkpoint cycle OR every N tables, whichever is sooner. N
would need to vary according to size of Nbuffers.


> We already scan whole buffer list in each
> checkpoint cycle, so during that scan we can refer this dropped relation
> list and avoid syncing such buffer contents.  Also for ENOENT error
> handling for FileWrite, we can use this list to refer relations for which
> we
> need to ignore the error.  I think we are already doing something similar
> in
> mdsync to avoid the problem of Dropped tables, so it seems okay to
> have it in mdwrite as well.
>
> The crucial thing in this idea to think about is avoiding reassignment of
> relfilenode (due to wrapped OID's) before we have ensured that none of
> the buffers contains tag for that relfilenode.  Currently we avoid this for
> Fsync case by retaining the first segment of relation (which will avoid
> reassignment of relfilenode) till checkpoint ends, I think if we just
> postpone
> it till we have validated it in shared_buffers, then we can avoid this
> problem
> in new scheme and it should be delay of maximum one checkpoint cycle
> for unlinking such file assuming we refer dropped relation list in each
> checkpoint
> cycle during buffer scan.
>

Yes

So you are keeping more data around for longer, right? I think we would
need some way to trigger a scan when the amount of deferred dropped data
files hits a certain size.

-- 
Simon Riggs                http://www.2ndQuadrant.com/
<http://www.2ndquadrant.com/>
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: [HACKERS] drop/truncate table sucks for large values of shared buffers

Reply via email to