[Bacula-users] Restore Errors, terminates early

2008-11-06 Thread Roland Roberts
Below is the console log from my failing restore job.  As you can see, the
number of files restored is WAY low.  I'm trying to figure out how to get
what I can out of this restore.

I've done small restores before, a file or two, a even a small directory. 
This is the first time I've had to restore a lot of stuff.

Unfortunately, this is not a test :-(

Any ideas?

TIA,

roland


Run Restore job
JobName:RestoreFiles
Bootstrap:  /var/spool/bacula/archos-dir.restore.3.bsr
Where:  /tmp/bacula-restores
Replace:always
FileSet:System Set
Client: aristarchus-fd
Storage:File
When:   2008-11-06 14:06:05
Catalog:MyCatalog
Priority:   10
OK to run? (yes/mod/no): yes
Job queued. JobId=289
06-Nov 14:06 archos-dir: Start Restore Job RestoreFiles.2008-11-06_14.06.08
06-Nov 14:06 archos-sd: Ready to read from volume Aristarchus-0004 on
device FileStorage (/backup).
06-Nov 14:06 archos-sd: Forward spacing Volume Aristarchus-0004 to
file:block 15:2571201622.
06-Nov 14:10 archos-sd: RestoreFiles.2008-11-06_14.06.08 Error: block.c:317
Volume data error at 18:63957951!
Block checksum mismatch in block=1199365 len=64512: calc=90c6023e blk=c0deabb4
06-Nov 14:10 aristarchus-fd JobId 289: Error: attribs.c:421 File size of
restored file
/tmp/bacula-restores/home/roland/tmp/20080429-AstroTrac/img_3470.png not
correct. Original 36831200, restored 26476544.
06-Nov 14:10 archos-dir: RestoreFiles.2008-11-06_14.06.08 Error: Bacula
2.0.3 (06Mar07): 06-Nov-2008 14:10:12
  JobId:  289
  Job:RestoreFiles.2008-11-06_14.06.08
  Client: aristarchus-fd
  Start time: 06-Nov-2008 14:06:10
  End time:   06-Nov-2008 14:10:12
  Files Expected: 126,091
  Files Restored: 14,446
  Bytes Restored: 8,439,830,107
  Rate:   34875.3 KB/s
  FD Errors:  1
  FD termination status:  Error
  SD termination status:  Error
  Termination:*** Restore Error ***

06-Nov 14:10 archos-dir: Begin pruning Jobs.
06-Nov 14:10 archos-dir: No Jobs found to prune.
06-Nov 14:10 archos-dir: Begin pruning Files.
06-Nov 14:10 archos-dir: No Files found to prune.
06-Nov 14:10 archos-dir: End auto prune.




-- 
   PGP Key ID: 66 BC 3B CD
Roland B. Roberts, PhD RL Enterprises
[EMAIL PROTECTED]6818 Madeline Court
[EMAIL PROTECTED]   Brooklyn, NY 11220


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Restore Errors, terminates early

2008-11-06 Thread Roland Roberts

Roland Roberts wrote:
 Below is the console log from my failing restore job.  As you can see, the
 number of files restored is WAY low.  I'm trying to figure out how to get
 what I can out of this restore.

 I've done small restores before, a file or two, a even a small directory.
 This is the first time I've had to restore a lot of stuff.

 Unfortunately, this is not a test :-(

It would appear the problem is in the backend with quoting file names.  I
have some configuration files that were created via a Java webstart task. 
Who cares?  Well, they are arguably misconfigured 'cause they create their
config files as c:\jobwatch.properties which ends up in my home directory
as /home/roland/c\:\\jobwatch.properties.  That name doesn't get quoted
correctly in the SQL query that goes to PostgreSQL, so the query fails (and
I get an error in syslog from the postmaster).

I've just unmarked those files and will see how far I can get now.  It is
looking better (since it is still running).

I assume I should log this as a bug

roland
-- 
   PGP Key ID: 66 BC 3B CD
Roland B. Roberts, PhD RL Enterprises
[EMAIL PROTECTED]6818 Madeline Court
[EMAIL PROTECTED]   Brooklyn, NY 11220


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Restore Errors, terminates early

2008-11-06 Thread Roland Roberts

Roland Roberts wrote:

 It would appear the problem is in the backend with quoting file names.  I
 have some configuration files that were created via a Java webstart task.
 Who cares?  Well, they are arguably misconfigured 'cause they create their
 config files as c:\jobwatch.properties which ends up in my home directory
 as /home/roland/c\:\\jobwatch.properties.  That name doesn't get quoted
 correctly in the SQL query that goes to PostgreSQL, so the query fails (and
 I get an error in syslog from the postmaster).

 I've just unmarked those files and will see how far I can get now.  It is
 looking better (since it is still running).

Well, I spoke too soon.

It's clear that this is not the whole story.  I'm not getting any logs on
the server side to help me with this.  It's still quitting early, and syslog
does show postgresql errors coincident with the job termination.  They look
like this:

Nov  6 17:14:06 archos postgres[31135]: [30-1] ERROR:  table delcandidates
does not exist
Nov  6 17:14:06 archos postgres[31135]: [30-2] STATEMENT:  DROP TABLE
DelCandidates
Nov  6 17:14:06 archos postgres[31135]: [31-1] ERROR:  index delinx1 does
not exist
Nov  6 17:14:06 archos postgres[31135]: [31-2] STATEMENT:  DROP INDEX DelInx1
Nov  6 17:14:06 archos postgres[31135]: [32-1] ERROR:  index delinx1 does
not exist
Nov  6 17:14:06 archos postgres[31135]: [32-2] STATEMENT:  DROP INDEX DelInx1

But that may be innocuous as I also seem to get this message when I *issue*
the restore command:

Nov  6 17:22:09 archos postgres[31135]: [33-1] ERROR:  table temp does not
exist
Nov  6 17:22:09 archos postgres[31135]: [33-2] STATEMENT:  DROP TABLE temp
Nov  6 17:22:09 archos postgres[31135]: [34-1] ERROR:  table temp1 does
not exist
Nov  6 17:22:09 archos postgres[31135]: [34-2] STATEMENT:  DROP TABLE temp1

The restore seems to terminate when it gets any error like a file size not
matching.  This isn't what I expected from the manual where I expected it to
continue on until all files were restored as best as possible.

I'm now picking directories, one at a time, and restoring them.

Any better ideas on tracking this down?

roland

-- 
   PGP Key ID: 66 BC 3B CD
Roland B. Roberts, PhD RL Enterprises
[EMAIL PROTECTED]6818 Madeline Court
[EMAIL PROTECTED]   Brooklyn, NY 11220


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users