I think that this (serious) bug could be forwarded to -devel list ... as
it seems no to be a configuration mistake ?


On Mon, 2007-07-30 at 16:56 +0300, Doytchin Spiridonov wrote:
> Hello,
> 
> unfortunately spooling didn't helped here, so may you just had a
> chance not getting errors in your case? A rule is that if it hapens
> once that means it could happen more. And here is what we've got in
> one of our tests:
> 
> ---
> enabled spooling (Maximum Job Spool Size
> = 500m) performed the same and AGAIN the first job I tested to restore
> ~44K files are missing:
> 
>   Files Expected:         348,120
>   Files Restored:         304,654
> 
> Another restore jobs is similar:
>   Files Expected:         190,741
>   Files Restored:         154,016
> 
> The case is slightly different as this time there is NO other errors
> generated (like file with wrong size or the "Record header file index
> not equal").
> ---
> 
> As it shows spooling doesn't solve the problem of concurrent jobs.
> 
> Regards.
> 
> 
> Monday, July 30, 2007, 11:03:56 AM:
> 
> MWa> Hello,
> 
> MWa> In my case spooling brought a remarkable improvement. Where as I had
> MWa> hundreds of errors on one restore I hardly see them now again with
> MWa> spooling in place. 
> 
> MWa> Doytchin,
> MWa> I also saw the same behavior like you did. With the concurrent jobs = 1,
> MWa> there were no errors. In this case no matter with or without spooling.
> 
> MWa> Unfortunately setting the concurrent jobs = 1 is not an option in our
> MWa> environment. So my current setting is spooling = on and concurrent jobs
> MWa> = 5. 
> 
> MWa> With this settings, it looks like that the Linux systems (OpenSuse and
> MWa> red hat) are ok but Solaris still has problems.
> 
> MWa> For example below is a restore I did on Friday. Of course I don't mind
> MWa> about the door files. But the difference between expected and restored
> MWa> files is ways to much. And even worse, I have no idea what happened to
> MWa> the missing files. I don't know if this has to do something with the
> MWa> restore errors we saw. This could also be something different.
> 
> MWa> During the weekend I moved bacula to a new and separate server. It runs
> MWa> on OpenSuse 10.2 with the latest patches in place now. The few restore
> MWa> jobs I've done so far with this went ok. 
> 
> MWa> All in all I still don't feel very comfortable with this, it needs more
> MWa> tests to be done. I will continue with testing and keep you updated. 
> 
> MWa> Wolfgang
> 
> 
> MWa> 27-Jul 11:07 porsche-dir: Start Restore Job
> MWa> RestoreFiles.2007-07-27_11.07.03 27-Jul 11:07 porsche-sd: Ready to read
> MWa> from volume "full-27-7-2007.20" on device "FileStorageFull"
> MWa> (/export/bacula-dump).
> MWa> 27-Jul 11:07 porsche-sd: Forward spacing Volume "full-27-7-2007.20" to
> MWa> file:block 0:3999802558.
> MWa> 27-Jul 11:08 porsche-sd: End of file 1 on device "FileStorageFull"
> MWa> (/export/bacula-dump), Volume "full-27-7-2007.20"
> MWa> 27-Jul 11:08 porsche-sd: End of Volume at file 1 on device
> MWa> "FileStorageFull" (/export/bacula-dump), Volume "full-27-7-2007.20"
> MWa> 27-Jul 11:08 porsche-sd: Ready to read from volume "full-27-7-2007.21"
> MWa> on device "FileStorageFull" (/export/bacula-dump).
> MWa> 27-Jul 11:08 porsche-sd: Forward spacing Volume "full-27-7-2007.21" to
> MWa> file:block 0:200.
> MWa> 27-Jul 11:33 prinz-fd: RestoreFiles.2007-07-27_11.07.03 Error:
> MWa> create_file.c:245 Cannot make node /export/xxx/dev/.zone_reg_door:
> MWa> ERR=Invalid argument 27-Jul 11:33 prinz-fd:
> MWa> RestoreFiles.2007-07-27_11.07.03 Error: create_file.c:245 Cannot make
> MWa> node /export/xxx/dev/.devfsadm_synch_door: ERR=Invalid argument 27-Jul
> MWa> 11:33 prinz-fd: RestoreFiles.2007-07-27_11.07.03 Error:
> MWa> create_file.c:245 Cannot make node
> MWa> /export/xxx/etc/sysevent/devfsadm_event_channel/reg_door: ERR=Invalid
> MWa> argument 27-Jul 11:33 prinz-fd: RestoreFiles.2007-07-27_11.07.03 Error:
> MWa> create_file.c:245 Cannot make node
> MWa> /export/xxx/etc/sysevent/devfsadm_event_channel/1: ERR=Invalid argument
> MWa> 27-Jul 11:33 prinz-fd: RestoreFiles.2007-07-27_11.07.03 Error:
> MWa> create_file.c:245 Cannot make node
> MWa> /export/xxx/etc/sysevent/syseventconfd_event_channel/reg_door:
> MWa> ERR=Invalid argument 27-Jul 11:33 prinz-fd:
> MWa> RestoreFiles.2007-07-27_11.07.03 Error: create_file.c:245 Cannot make
> MWa> node /export/xxx/etc/sysevent/sysevent_door: ERR=Invalid argument 27-Jul
> MWa> 11:33 prinz-fd: RestoreFiles.2007-07-27_11.07.03 Error:
> MWa> create_file.c:245 Cannot make node
> MWa> /export/xxx/etc/sysevent/piclevent_door: ERR=Invalid argument 27-Jul
> MWa> 11:15 porsche-sd: End of file 1 on device "FileStorageFull"
> MWa> (/export/bacula-dump), Volume "full-27-7-2007.21"
> MWa> 27-Jul 11:15 porsche-sd: End of Volume at file 1 on device
> MWa> "FileStorageFull" (/export/bacula-dump), Volume "full-27-7-2007.21"
> MWa> 27-Jul 11:15 porsche-sd: Ready to read from volume "full-27-7-2007.22"
> MWa> on device "FileStorageFull" (/export/bacula-dump).
> MWa> 27-Jul 11:15 porsche-sd: Forward spacing Volume "full-27-7-2007.22" to
> MWa> file:block 0:200.
> MWa> 27-Jul 11:21 porsche-sd: End of file 1 on device "FileStorageFull"
> MWa> (/export/bacula-dump), Volume "full-27-7-2007.22"
> MWa> 27-Jul 11:21 porsche-sd: End of Volume at file 1 on device
> MWa> "FileStorageFull" (/export/bacula-dump), Volume "full-27-7-2007.22"
> MWa> 27-Jul 11:21 porsche-sd: Ready to read from volume "full-27-7-2007.23"
> MWa> on device "FileStorageFull" (/export/bacula-dump).
> MWa> 27-Jul 11:21 porsche-sd: Forward spacing Volume "full-27-7-2007.23" to
> MWa> file:block 0:200.
> MWa> 27-Jul 11:28 porsche-sd: End of file 1 on device "FileStorageFull"
> MWa> (/export/bacula-dump), Volume "full-27-7-2007.23"
> MWa> 27-Jul 11:28 porsche-sd: End of Volume at file 1 on device
> MWa> "FileStorageFull" (/export/bacula-dump), Volume "full-27-7-2007.23"
> MWa> 27-Jul 11:28 porsche-sd: End of all volumes.
> MWa> 27-Jul 11:28 porsche-dir: RestoreFiles.2007-07-27_11.07.03 Error: Bacula
> MWa> 2.0.3 (06Mar07): 27-Jul-2007 11:28:34
> MWa>   JobId:                  40
> MWa>   Job:                    RestoreFiles.2007-07-27_11.07.03
> MWa>   Client:                 prinz-fd
> MWa>   Start time:             27-Jul-2007 11:07:05
> MWa>   End time:               27-Jul-2007 11:28:34
> MWa>   Files Expected:         303,761
> MWa>   Files Restored:         301,923
> MWa>   Bytes Restored:         27,412,500,483
> MWa>   Rate:                   21266.5 KB/s
> MWa>   FD Errors:              7
> MWa>   FD termination status:  Error
> MWa>   SD termination status:  OK
> MWa>   Termination:            *** Restore Error ***
> 
> MWa> 27-Jul 11:28 porsche-dir: Begin pruning Jobs.
> MWa> 27-Jul 11:28 porsche-dir: No Jobs found to prune.
> MWa> 27-Jul 11:28 porsche-dir: Begin pruning Files.
> MWa> 27-Jul 11:28 porsche-dir: No Files found to prune.
> MWa> 27-Jul 11:28 porsche-dir: End auto prune.
> 
> 
> MWa>  
> 
> MWa> -----Original Message-----
> MWa> From: [EMAIL PROTECTED]
> MWa> [mailto:[EMAIL PROTECTED] On Behalf Of
> MWa> Doytchin Spiridonov
> MWa> Sent: Saturday, July 28, 2007 02:02
> MWa> To: bacula-users
> MWa> Subject: Re: [Bacula-users] Restore errors
> 
> MWa> Hello,
> 
> MWa> just to note that several days after a full backup and incremental
> MWa> bacpus, restores are OK, which again proves that the problem was caused
> MWa> by running concurrent jobs.
> 
> MWa> Wolfgang do you have the same results?
> 
> MWa> Regards
> 
> 
> MWa> Wednesday, July 25, 2007, 8:12:25 PM:
> 
> DS>> Hello,
> 
> DS>> 2nd day w/o concurrent jobs: we have 1xFULL and 1xINCREMENTAL for 
> DS>> all clients.
> 
> DS>> Restore OK of all jobs.
> 
> DS>> Seems this (concurrent jobs) is the problem.
> 
> DS>> Regards.
> 
> 
> DS>> Tuesday, July 24, 2007, 9:57:35 PM:
> 
> DS>>> I don't have any other ideas to check with to provide more cases. 
> DS>>> It's developers turn now...
> 
> 
> 
> MWa> ------------------------------------------------------------------------
> MWa> -
> MWa> This SF.net email is sponsored by: Splunk Inc.
> MWa> Still grepping through log files to find problems?  Stop.
> MWa> Now Search log events and configuration files using AJAX and a browser.
> MWa> Download your FREE copy of Splunk now >>  http://get.splunk.com/
> MWa> _______________________________________________
> MWa> Bacula-users mailing list
> MWa> Bacula-users@lists.sourceforge.net
> MWa> https://lists.sourceforge.net/lists/listinfo/bacula-users
> 
> 
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >>  http://get.splunk.com/
> _______________________________________________
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users


-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >>  http://get.splunk.com/
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to