Re: [PERFORM] Very long deletion time on a 200 GB database

Reuven M. Lerner Thu, 23 Feb 2012 07:26:31 -0800

Hi, everyone. Thanks for all of the help and suggestions so far; I'lltry to respond to some of them soon. Andrew wrote:

How about:


DELETE FROM B
WHERE r_id IN (SELECT distinct R.id
FROM R WHERE r.end_date< (NOW() - (interval '1 day' * 30))

?


Or possibly without the DISTINCT. But I agree that the original query
shouldn't have B in the subquery - that alone could well make it crawl.

I put B in the subquery so as to reduce the number of rows that would bereturned, but maybe that was indeed backfiring on me. Now that I thinkabout it, B is a huge table, and R is a less-huge one, so including B inthe subselect was probably a mistake.


What is the distribution of end_dates? It might be worth running this in
several steps, deleting records older than, say, 90 days, 60 days, 30 days.

I've suggested something similar, but was told that we have limited timeto execute the DELETE, and that doing it in stages might not be possible.


Reuven


--
Reuven M. Lerner -- Web development, consulting, and training
Mobile: +972-54-496-8405 * US phone: 847-230-9795
Skype/AIM: reuvenlerner

--
Sent via pgsql-performance mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

Re: [PERFORM] Very long deletion time on a 200 GB database

Reply via email to