Scott,

On Monday 01 October 2007 23:50, Scott Ruckh wrote:
> This is what you said Josh Fisher
>
> > Scott Ruckh wrote:
> >> I am running CentOS 4.5 x64 with self-compiled kernel 2.6.13.4.  I have
> >> installed the latest bacula 2.2.4 Director, Client, and Storage daemon on
> >> this server.
> >>
> >> Bacula runs flawlessly most of the time.
> >>
> >> Unfortunately when I run VMWare Workstation and have a virtual machine
> >> running, Bacula crashes the entire system.  I am running the latest
> >> version of the 5.x series of VMWare.  I am also running fluxbox as my
> >> Window Manager, although I don't think that has anything to do with the
> >> problem.
> >>
> >> This problem has existed with all versions of Bacula starting with
> >> 1.38.11, which is the first version I installed.
> >>
> >> My backups are to an external USB disk connected to the Host OS.
> >>
> >> The virtual machine is not configured with USB port because at one time I
> >> thought there might be contention between the Physical USB disk and the
> >> USB port configured with the Virtual Machine.
> >>
> >> All virtual machines and host system work fine when bacula backups are not
> >> running.
> >>
> >> I have tested this with all sorts of guest Virtual Machines and the
> >> results are the same; bacula will crash (completely unusable) the host
> >> machine.  The only recovery method is to reboot the server.
> >
> > All bacula daemons are running in user mode and only the FD is running
> > as root. A hard lock like this means something is amiss at the kernel
> > mode level. You could try backing up to a directory on SCSI/SATA/IDE
> > disk as a test. If all works as expected when backing up to the non-USB
> > drive, then maybe something is wrong in the way USB is being
> > virtualized/configured.
> >
> >> Is anyone else successfully running a similar environment?  As bacula is
> >> the only program that appears to cause the problem I am making an
> >> assumption it is a problem with bacula.
> >>
> >> If anyone has this type of environment working successfully I would like
> >> to hear about it.
>
> I had reported that not backing up the VMware disk files of a live
> virtual machine appeared to solve the problem.  Unfortunately I was wrong
> and the problem persists.
>
> Now that more testing has been completed, I can officially say the problem
> still exists.
>
> I recompiled the kernel, upgraded VMWare Workstation (to version 6),
> Upgraded VMWare Tools, and rebuilt all the VMWare modules using the same
> gcc that the kernel was built with, but none of that helped.  I have the
> exact same issue.
>
> Unfortunately the problem is not always present as the backups do complete
> some of the time.  The only constant is that the hard crash only happens
> when bacula is running.
>
> Although the VM is not configured with a USB device there might possibly
> be some contention with the physical USB drive I back up to and the VM
> instance.  As the crash does not happen every time it is hard to determine
> what exactly is going on.
>
> As no other programs really access the USB device, and the crash only
> happens when bacula is running, I am just guessing the problem might be
> with that combination.
>
> I know this is not really a priority on anyone's list, but I thought I
> would keep everyone informed who might have been following this thread.

I'm rather late to the party, but I suspect this is a RAM problem or, to be
specific, that the host is running out of free RAM while the backup job
runs. I encountered something similar a year or two ago. The good news is
that the solution was simply a matter of adjusting one kernel parameter: I
set vm.min_free_kbytes to 16224 instead of the default, which was 4060 on
my system.
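
For what it's worth, here is a rough sketch of how you could check the
current value before changing anything. It only reads the setting; the
16224 figure is just the number that worked on my machine, not a general
recommendation, and the helper names are made up for illustration.

```python
from pathlib import Path

# procfs file that backs the vm.min_free_kbytes sysctl on Linux.
MIN_FREE_PATH = Path("/proc/sys/vm/min_free_kbytes")
# Value quoted in this message; it worked on one machine only.
SUGGESTED_KB = 16224


def parse_kbytes(text):
    """Parse the single integer that the procfs file contains."""
    return int(text.strip())


def needs_raise(current_kb, target_kb=SUGGESTED_KB):
    """Return True when the current reserve is smaller than the target."""
    return current_kb < target_kb


if __name__ == "__main__" and MIN_FREE_PATH.exists():
    current = parse_kbytes(MIN_FREE_PATH.read_text())
    print(f"vm.min_free_kbytes = {current}")
    if needs_raise(current):
        # Changing the value requires root, for example:
        #   sysctl -w vm.min_free_kbytes=16224
        # plus a matching line in /etc/sysctl.conf to survive a reboot.
        print(f"below {SUGGESTED_KB}; raising it may be worth a try")
```

The script deliberately does not write the new value itself, so it is safe
to run as an ordinary user.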

Take a look at your kernel logs while the backup job is running and/or run
vmstat. If this is the problem you will quickly see it. I'm afraid that I
don't remember the details of why this worked or how I chose that particular
value. I stumbled upon some emails discussing it after googling the
particular kernel error that I was receiving, made the change, and then
forgot about it.
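
If you would rather log free memory than eyeball vmstat, something along
these lines could be left running during a backup. Treat it as a sketch:
comparing MemFree against a single floor is my own simplification (the
kernel actually tracks watermarks per memory zone), and the function names
and sampling defaults are invented for the example.

```python
import time
from pathlib import Path

MEMINFO = Path("/proc/meminfo")


def mem_free_kb(meminfo_text):
    """Return the MemFree value in kB from /proc/meminfo-style text."""
    for line in meminfo_text.splitlines():
        if line.startswith("MemFree:"):
            return int(line.split()[1])
    raise ValueError("MemFree line not found")


def is_low(free_kb, floor_kb):
    """Return True when free memory is at or below the chosen floor."""
    return free_kb <= floor_kb


def watch(floor_kb, samples=3, interval_s=1.0):
    """Print MemFree a few times, flagging samples at or below the floor."""
    for i in range(samples):
        free = mem_free_kb(MEMINFO.read_text())
        flag = "  <-- at or below floor" if is_low(free, floor_kb) else ""
        print(f"MemFree: {free} kB{flag}")
        if i < samples - 1:
            time.sleep(interval_s)


if __name__ == "__main__" and MEMINFO.exists():
    # 16224 is the value from this message, used here only as a floor.
    watch(floor_kb=16224)
```

For a real backup run you would raise samples and redirect the output to a
file, so there is something to read after a hard lock and reboot.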

HTH!

cmr


-- 
Debian 'Etch' - Registered Linux User #241964
--------
"More laws, less justice." -- Marcus Tullius Cicero, ca. 42 BC

_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users