Hi,
Mohan's paper on ARIES describes a way to perform lightweight
checkpoints. The basic idea is:
+ Checkpoints don't cause data pages to be flushed. Instead, a list of
dirty pages is recorded in the checkpoint. Along with the id of each
dirty page, its Recovery LSN is stored, which is the LSN of the oldest
log record that may have made changes to the page (see the sketch of
such a checkpoint record after this list).
+ Dirty pages are flushed separately using a Buffer Writer.
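To make the idea concrete, here is a minimal Java sketch of what such a
checkpoint record might carry. All of the names below (CheckpointRecordSketch,
DirtyPageEntry, noteDirtyPage, redoStartLsn) are made up for illustration -
they are not Derby, SimpleDBM or ARIES-paper APIs. The point is only that the
checkpoint records a dirty page table of (page id, recovery LSN) pairs instead
of flushing the pages themselves:

    import java.util.HashMap;
    import java.util.Map;

    public class CheckpointRecordSketch {

        // One entry in the dirty page table: the page id plus its
        // recovery LSN (the oldest log record that may have dirtied it).
        static final class DirtyPageEntry {
            final long pageId;
            final long recoveryLsn;

            DirtyPageEntry(long pageId, long recoveryLsn) {
                this.pageId = pageId;
                this.recoveryLsn = recoveryLsn;
            }
        }

        // The checkpoint only records which pages are dirty; it does
        // not flush them.
        static final class CheckpointRecord {
            final Map<Long, DirtyPageEntry> dirtyPageTable = new HashMap<>();

            void noteDirtyPage(long pageId, long recoveryLsn) {
                // Keep the oldest recovery LSN seen for the page.
                dirtyPageTable.merge(pageId,
                        new DirtyPageEntry(pageId, recoveryLsn),
                        (a, b) -> a.recoveryLsn <= b.recoveryLsn ? a : b);
            }

            // Redo can start from the smallest recovery LSN in the table.
            long redoStartLsn() {
                return dirtyPageTable.values().stream()
                        .mapToLong(e -> e.recoveryLsn)
                        .min().orElse(Long.MAX_VALUE);
            }
        }

        public static void main(String[] args) {
            CheckpointRecord cp = new CheckpointRecord();
            cp.noteDirtyPage(42L, 1001L);
            cp.noteDirtyPage(7L, 980L);
            System.out.println("redo starts at LSN " + cp.redoStartLsn());
        }
    }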
Derby at present implements the fuzzy checkpoint algorithm described in
section 11.3 of the TPCT book (Gray & Reuter, Transaction Processing:
Concepts and Techniques).
For anyone interested in a pure ARIES implementation - please have a
look at my project - www.simpledbm.org.
I guess the main difference between ARIES and Derby's method is that in
ARIES, checkpoints happen independently of the Buffer Writer. In the
end, though, the amount of work done is the same, as the dirty data
pages must be written out eventually anyway.
I think I agree with Mike's comment that we need to be clear about what
it is that we are trying to solve. Are you trying to:
1. Make checkpoints lightweight like ARIES? If so, this would require
major changes to recovery logic.
2. Optimize the IO during checkpoints? I guess if this is the goal, then
one should concentrate on improving the current IO logic - potentially
by ensuring that:
a) IO is non-disruptive - i.e. pages are not locked out while they are
being written.
b) IO is optimized - batch write consecutive pages, etc.
c) IO is parallelized - multiple concurrent threads handle the IO.
However, these changes are mainly useful in large-scale environments and
may actually have a negative impact on a small-scale system. There is no
point in increasing IO concurrency if there is only a single disk, for
example.
3. Slow down the IO during checkpoints? I think this is what Mike wants
to do. As far as I know, increasing the time taken to perform a
checkpoint should have no adverse effect. For example, you could reduce
the sleep time between checkpoints but slow down the checkpoint itself.
This way you could keep the total number of pages written fairly
constant without causing peaks of IO (a rough sketch of what I mean
follows below).
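To illustrate option 3, here is a rough Java sketch of what I mean by
slowing the checkpoint down. The Page type and writeToDisk() call are
placeholders, not Derby's actual buffer manager API; the idea is simply
to write the dirty pages in small batches and pause between batches, so
the same total number of pages gets written but the IO is spread over a
longer period:

    import java.util.List;

    public class ThrottledCheckpointSketch {

        // Placeholder for a dirty page that knows how to write itself.
        interface Page {
            void writeToDisk();
        }

        // Write the dirty pages in batches of batchSize, pausing
        // pauseMillis between batches. Larger pauses make the checkpoint
        // take longer but flatten the IO spike.
        static void writeWithThrottle(List<Page> dirtyPages,
                                      int batchSize,
                                      long pauseMillis)
                throws InterruptedException {
            int written = 0;
            for (Page p : dirtyPages) {
                p.writeToDisk();
                written++;
                if (written % batchSize == 0) {
                    Thread.sleep(pauseMillis); // spread the writes over time
                }
            }
        }
    }

Tuning batchSize and pauseMillis is then a trade-off between how long
the checkpoint takes and how smooth the IO is.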
Regards
Dibyendu