Re: [jira] Commented: (DERBY-239) Need a online backup feature that does not block update operations when online backup is in progress.

David W. Van Couvering Thu, 06 Oct 2005 06:22:26 -0700

Another candidate for the Wiki site?

David


Suresh Thalamati wrote:

Attached is the functional/design spec based on the input I got fromeveryone on this topic. I think, most of the issues are covered in thisdoc, except the backup of jars stored inside the database.
Thanks
-suresh


------------------------------------------------------------------------

Online Backup Functional/Design Spec
    Overview
Support real-time */online backup/* mechanism that does not blockdatabase operations when backup is in progress. When backup is running,currently Derby only permits reads from the database. By adding supportto permit writes, it will be easier for the users to schedule backupswithout being concerned about the impact of backup operation on theapplications that are accessing the database.
    Introduction
New online backup mechanism will not change the way database can berestored from the backup. In the current system, if there any non-loggedoperations when backup starts, they will be rolled back on restorebecause database is frozen at the start of the backup. To supportreal-time online backups that do not block user operations, it isnecessary to have the transaction log for all the operations that happenafter the backup starts to bring the database to the consistent stateduring restore. Because a transaction that has non-logged operationscan commit when backup is in progress. To make a consistent backup ofthe database, there should not be any pending transactions withnon-logged operations.
    JDBC
Backup operations are support through system procedures. They can becalled using JDBC program or through IJ.
      Existing Backup Procedures:
After this enhancement, existing backup procedures(SYSCS_UTIL.SYSCS_BACKUP_DATABASE ...etc) will allow writes to thedatabase.
Information about existing online backup procedures can be found in theDerby admin guide. Derby Admin Guide<http://db.apache.org/derby/docs/dev/adminguide/> . These backupprocedures will *WAIT *for the transaction with non-logged operations tocommit before proceeding with the backup. __
      New Procedures:
Two new procedures are added to make backup wait/return error, if anynon-logged operations are pending at the start of the backup.
Full backup:

*/SYSCS_UTIL.SYSCS_BACKUP_DATABASE(IN BACKUPDIR VARCHAR(), /*
*/ /_IN WAIT_FOR_NONLOGGED_OPERATIONSSMALLINT_/) /*
Full backup with log archive mode:
*SYSCS_UTIL.SYSCS_BACKUP_DATABASE_AND_ENABLE_LOG_ARCHIVE_MODE
     (IN BACKUPDIR VARCHAR(32672), *

*      IN SMALLINT DELETE_ARCHIVED_LOG_FILES,
      /_IN WAIT_FOR_NONLOGGED_OPERATIONS SMALLINT_/_))_

*
Arguments
*/BACKUPDIR:/**//*
An input argument of type VARCHAR(32672) that specifies the full systempath to the database directory to be backed up.
*/DELETE_ARCHIVED_LOG_FILES:/*
A non-zero input argument of type SMALLINT will make backup proceduresto delete existing online archived log files that were created beforethis backup will be deleted. The log files are deleted only after asuccessful backup.
* *

*/WAIT_FOR_NONLOGGED_OPERATIONS/* :
A non-zero input argument of type SMALLINT will make backup proceduresto wait for the non-logged operation that are currently running tocommit, while an argument of 0 will make backup fail if there are anyuncommitted non-logged operations.
    Design
Current Online backup copies all the database files (log, seg0 ...Etc)to the specified location when the database is frozen (writes are blocked).
In the new scheme, goal is to copy the database information to thebackup location, without blocking any user operation for the wholeduration of the backup. Following sections discuss how the informationis copied into the backup.
      Containers:
At the start of the backup, all the containers that need to be backed upare found by finding all the files in the data segment (seg0). Then eachcontainer is copied to the backup by reading through the page cacheinstead of the direct file copy.
· Individual pages are latched, when they are being copied to thebackup to prevent modifications to the page.
· Any new containers created when backup is in progress will berecreated on restore using the information from the transaction log.
· Containers that were in the process of creation will not bebacked up, if the creation is not complete. Container cache will makesure that backup can not get handle to the containers that are beingcreated.· If committed drop occurs on a container that is being backedup, backup will wait for the committed drop to complete and then copythe stub to the backup.
· Committed container stubs that are created after the checkpointfor the backup will not be deleted until the backup is complete. Acheckpoint that occurs after the backup garbage collects them.
· All requests to open containers in non-logged mode will beautomatically converted to logged mode after the backup starts.
      Log files:
Transaction log is used to bring the database to the consistent state onrestore from the backup. A checkpoint is made at the start of thebackup; this checkpoint will be starting point for the recovery from thebackup. Log files are copied to the backup using regular file copy.
    * Backup Checkpoint information in the log control files is copied
      to the backup location at the start.
    * All the transaction log files starting from the backup checkpoint
      to the log file that is active at the end of the backup are copied
      into the backup.
    * State of the database in the backup depends on last log file that
      goes into the back up.  After all the containers are copied to the
backup , last log file that goes into the backup is determined.* Database engine wills not delete the log files that are to yet to
      be backed up, when backup is running even if they are no longer
required for regular crash recovery.* Checkpoints can occur in parallel when the backup is progress.
      Jar files:
Storing Jar files in the database is not a logged operations and theydon’t go through the page cache; they will be copied to the backupdirectly as it is done now. I am not sure at the moment, if that willlead to any issues. Any input about jar files will be helpful.
      Other files:

    * Database bootstrap file Service.Properties will be to be copied to
      the backup.  Need to check if any synchronization is required to
      copy this file.
    * Copy the BACKUP.HISTORY file at the end of the backup.
    *  Lock files should not be copied to the backup.
    * Tmp directory should not be backed up.
    Testing:
Most of the current backup tests are single threaded. I will add newfunctional tests case that will execute read/writes on the database,when backup is running. Some of the testing scenarios:
1. Test the new procedure calls.
2.      Checkpoint in parallel to the backup.

3.      DML operations in parallel to the backup.

4.      Grow the container after it is backed up / parallel.
5. Non-logged operations in parallel to the backup and pending whenbackup starts.
6.      DDL operations in parallel to the backup.

7.      In-Place compress in parallel to the backup.

8.      ….etc.
      References:
· Derby Admin Guide Backup Section<http://db.apache.org/derby/docs/dev/adminguide/>
· Derby-Dev Thread<http://nagoya.apache.org/eyebrowse/ReadMsg?listId=270&msgNo=5038>
·        Version 1. 10/5/2005 :  [EMAIL PROTECTED]

begin:vcard
fn:David W Van Couvering
n:Van Couvering;David W
org:Sun Microsystems, Inc.;Database Technology Group
email;internet:[EMAIL PROTECTED]
title:Senior Staff Software Engineer
tel;work:510-550-6819
tel;cell:510-684-7281
x-mozilla-html:TRUE
version:2.1
end:vcard

Re: [jira] Commented: (DERBY-239) Need a online backup feature that does not block update operations when online backup is in progress.

Reply via email to