Re: [HACKERS] Read-ahead and parallelism in redo recovery

Heikki Linnakangas Fri, 29 Feb 2008 12:46:42 -0800

Decibel! wrote:

On Feb 29, 2008, at 8:10 AM, Florian Weimer wrote:
In the end, I wouldn't be surprised if for most loads, cache warming
effects dominated recovery times, at least when the machine is not
starved on RAM.
Uh... that's exactly what all the synchronous reads are doing... warmingthe cache. And synchronous reads are only fast if the system understandswhat's going on and reads a good chunk of data in at once. I don't knowthat that happens.
Perhaps a good short-term measure would be to have recovery allocate a16M buffer and read in entire xlog files at once.


The problem isn't reading the WAL. The OS prefetches that just fine.

The problem is the random reads, when we read in the blocks mentioned inthe WAL records, to replay the changes to them. The OS has no way ofguessing and prefetching those blocks, and we read them synchronously,one block at a time, no matter how big your RAID array is.

I used to think it's a big problem, but I believe the full-page-writeoptimization in 8.3 made it much less so. Especially with the smoothedcheckpoints: as checkpoints have less impact on response times, you canshorten checkpoint interval, which helps to keep the recovery timereasonable.

It'd still be nice to do the prefetching; I'm sure there's stillworkloads where it would be a big benefit. But as Tom pointed out, weshouldn't invent something new just for recovery. I think we should lookat doing prefetching for index accesses etc. first, and once we have theinfrastructure in place and tested, we can consider use it for recoveryas well.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

---------------------------(end of broadcast)---------------------------
TIP 1: if posting/reading through Usenet, please send an appropriate
      subscribe-nomail command to [EMAIL PROTECTED] so that your
      message can get through to the mailing list cleanly

Re: [HACKERS] Read-ahead and parallelism in redo recovery

Reply via email to