Dibyendu Majumdar wrote:
> I guess the main difference between ARIES and Derby's method is that in
> ARIES, checkpoints happen independently of Buffer Writes. However, in
> the end the amount of work done is the same, as the data pages must be
> written out anyway.
I do not think it is necessarily true that the amount of work will be
the same. Different checkpointing schemes may cause a page to be
written once for every update, or only once for every tenth update.
> 1. Make checkpoints lightweight like ARIES? If so, this would require
> major changes to recovery logic.
I think the main change would be that it would need to be possible to
start the redo activity from any position in the log, not only from the
position of a checkpoint log record.
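To illustrate, here is a rough sketch (hypothetical names, not Derby code): in an ARIES-style fuzzy checkpoint, the checkpoint record carries a copy of the dirty page table, and redo starts from the smallest recLSN recorded there rather than from the checkpoint record itself:

```java
import java.util.Map;

public class RedoStart {
    /**
     * Sketch of the ARIES redo-start computation. The checkpoint record
     * holds a snapshot of the dirty page table (pageId -> recLSN, the LSN
     * of the first log record that may have dirtied the page). Redo must
     * begin at the minimum recLSN, which can lie well before the
     * checkpoint record itself.
     */
    static long redoStartLsn(long checkpointLsn, Map<Long, Long> dirtyPageTable) {
        long start = checkpointLsn;
        for (long recLsn : dirtyPageTable.values()) {
            start = Math.min(start, recLsn);
        }
        return start;
    }

    public static void main(String[] args) {
        // pageId -> recLSN as captured in the checkpoint record
        Map<Long, Long> dpt = Map.of(7L, 120L, 3L, 95L, 9L, 200L);
        System.out.println(redoStartLsn(250L, dpt)); // prints 95
    }
}
```

The point of the sketch is only that recovery must be able to position the log scan at an arbitrary LSN, which is the change to the recovery logic referred to above.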
> 2. Optimize the IO during checkpoints? I guess if this is the goal, then
> one should concentrate on improving the current IO logic - potentially
> by ensuring that:
> a) IO is non-disruptive - i.e., pages are not locked out while they are
> being written.
> b) IO is optimized - batch write consecutive pages, etc.
> c) Have multiple concurrent threads handle IO.
> However these changes are useful in large scale environments and will
> actually have a negative impact on a small scale system. No point in
> increasing IO if there is a single disk, for example.
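The batching idea in (b) could be sketched like this (hypothetical names, not Derby's cache code): sort the dirty page numbers and coalesce adjacent ones into runs, so that each run can be issued as a single sequential write:

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.List;

public class BatchWrites {
    /**
     * Coalesce dirty page numbers into {startPage, runLength} pairs.
     * Each run covers consecutive pages and can be flushed with one
     * sequential write instead of many single-page writes.
     */
    static List<long[]> coalesce(Collection<Long> dirtyPages) {
        List<Long> sorted = new ArrayList<>(dirtyPages);
        Collections.sort(sorted);
        List<long[]> runs = new ArrayList<>();
        for (long p : sorted) {
            if (!runs.isEmpty()) {
                long[] last = runs.get(runs.size() - 1);
                if (last[0] + last[1] == p) {  // extends the current run
                    last[1]++;
                    continue;
                }
            }
            runs.add(new long[] { p, 1 });    // start a new run
        }
        return runs;
    }

    public static void main(String[] args) {
        for (long[] r : coalesce(List.of(4L, 5L, 6L, 9L, 10L, 2L))) {
            System.out.println("write pages " + r[0] + ".." + (r[0] + r[1] - 1));
        }
        // prints: write pages 2..2 / write pages 4..6 / write pages 9..10
    }
}
```

As noted above, whether this pays off depends on the hardware: with a single disk the seek savings are the whole benefit, and extra writer threads (c) would only add contention.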
> 3. Slow down the IO during checkpoints? I think this is what Mike wants
> to do. As far as I know, increasing the time taken to perform a
> checkpoint should have no adverse effect. For example, you could reduce
> the sleep time between checkpoints but slow down the checkpoint itself.
> This way you could keep the total number of pages written fairly
> constant without causing peaks of IO.
I think the goal should be, as you state in 3, to spread the disk I/O
evenly over time and avoid bursts of IO when checkpointing.
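A minimal sketch of such a throttled checkpoint (hypothetical names, not the actual Derby checkpoint writer): flush the dirty pages in small batches and sleep between batches, so the writes are spread over a target duration instead of being issued in one burst:

```java
import java.util.ArrayList;
import java.util.List;

public class ThrottledCheckpoint {
    static final List<Integer> written = new ArrayList<>();

    // Stand-in for flushing one buffer page to disk.
    static void writePage(int pageId) { written.add(pageId); }

    /**
     * Flush dirtyPages pages in batches of batchSize, pausing between
     * batches so that the whole checkpoint takes roughly targetMillis.
     * Returns the pause (ms) used between batches.
     */
    static long checkpoint(int dirtyPages, int batchSize, long targetMillis)
            throws InterruptedException {
        int batches = (dirtyPages + batchSize - 1) / batchSize;
        long pause = batches > 1 ? targetMillis / (batches - 1) : 0;
        for (int p = 0; p < dirtyPages; p++) {
            writePage(p);
            boolean batchDone = (p + 1) % batchSize == 0;
            if (batchDone && p + 1 < dirtyPages) {
                Thread.sleep(pause);  // spread the I/O over targetMillis
            }
        }
        return pause;
    }

    public static void main(String[] args) throws InterruptedException {
        long pause = checkpoint(100, 10, 90);
        System.out.println(written.size() + " pages, " + pause
                + " ms between batches");  // prints: 100 pages, 10 ms between batches
    }
}
```

The same number of pages reaches disk either way; the throttle only trades checkpoint duration for a flatter I/O profile, which matches the goal of avoiding bursts.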
--
Øystein