Re: [Bacula-users] Problems with version 9.0.3 failing since the upgrade

2017-12-02 Thread Bill Arlofski
On 12/01/2017 10:35 AM, Jerry Lowry wrote:
> list,
> 
> I am re-iterating my problem, due to no response.  As a side note the storage
> server and director are running on the same server.  This happens on a copy
> job from one disk to another disk on the same system.
> It is a definite problem as I am losing backup data!


Hi Jerry,

I am pretty sure that you will need to upgrade to v9.0.6, which was recently
announced and released. 9.0.6 fixes several problems; one that I am sure of
involves jobs spanning more than one volume.

Please try this upgrade and report back so we can help if the upgrade does not
solve this issue for you.

Best regards,

Bill


-- 
Bill Arlofski
http://www.revpol.com/bacula
-- Not responsible for anything below this line --

--
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Problems with version 9.0.3 failing since the upgrade

2017-12-01 Thread Jerry Lowry
list,

I am reiterating my problem, since I got no response.  As a side note, the
storage server and director are running on the same server.  This happens
on a copy job from one disk to another disk on the same system.
It is a definite problem, as I am losing backup data!

bacula 9.0.3
mariadb 10.2.8
centos 6.9

I upgraded Bacula from 5.2.13, which worked very well, to version 9.0.3.
Basically, I installed the new version from source and then upgraded the
database structure.  The source was configured with the following:

./configure --sbindir=/usr/bacula/bin --sysconfdir=/usr/bacula/bin \
  --with-pid-dir=/var/run/bacula --with-subsys-dir=/var/run/bacula/working \
  --enable-smartalloc --with-mysql --with-working-dir=/usr/bacula/bin/working \
  --with-dump-email=u...@domain.com --with-job-email=u...@domain.com \
  --with-smtp-host=smtp.googlemail.com --enable-bat

The problem started with my offsite backups. I get the following error:

13-Nov 01:18 distress JobId 33429: Fatal error: Socket error on Data
received command: ERR=No data available
13-Nov 01:18 distress JobId 33429: Fatal error: fd_cmds.c:157 Read
data not accepted

And the backup fails. Most of the time it is a backup that spans
multiple disks.  So, I chatted with the ATTO RAID support folks, and they
suggested that I use a different hot-swap RAID enclosure because, in their
opinion, the one I was using was not very reliable, although this enclosure
had worked very reliably for well into 10 years without a problem!  So, I
moved everything to a completely new system (Supermicro with ATTO RAID).
The problem still persists!  I have rebuilt the RAID disk structure and
changed the working of the backups, to no avail!
My backups worked flawlessly before the upgrade!  Since going to v9 I
cannot count how many offsite backups have failed to complete with this
type of error.  I also get

13-Nov 01:23 distress JobId 33430: Warning: mount.c:210 Open of File
device "BottomSwap" (/BottomSwap) Volume "dcBS-104" failed:
ERR=file_dev.c:190 Could not
open(/BottomSwap/dcBS-104,OPEN_READ_WRITE,0640): ERR=No such file or
directory
and
15-Nov 17:20 kilchis JobId 35825: Error: bsock.c:849 Read error from
Storage daemon:kilchis:9103: ERR=Connection reset by peer
15-Nov 17:20 kilchis JobId 35825: Fatal error: append.c:271 Network
error reading from FD. ERR=Connection reset by peer

All of this happens on one storage server. Well, actually two storage
servers, but they service two different subnets/domains. It all started
with the upgrade!

Please tell me that you have fixed this with a new version!

thanks
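
For anyone triaging similar output, here is a minimal sketch (not part of Bacula) that groups job messages like the ones above by JobId. It assumes the `DD-Mon HH:MM host JobId N: Severity: text` shape seen in this message and treats any non-matching line as the mail-wrapped continuation of the previous message; other Bacula message formats are not handled. `group_by_job` is a hypothetical helper name.

```python
import re
from collections import defaultdict

# Assumed line shape, taken from the messages quoted in this thread:
#   13-Nov 01:18 distress JobId 33429: Fatal error: <text>
LINE_RE = re.compile(
    r"^(?P<date>\d{2}-[A-Z][a-z]{2}) (?P<time>\d{2}:\d{2}) (?P<host>\S+) "
    r"JobId (?P<jobid>\d+): (?P<severity>Fatal error|Error|Warning): (?P<text>.*)$"
)


def group_by_job(lines):
    """Return {jobid: [(severity, text), ...]}, re-joining mail-wrapped lines."""
    jobs = defaultdict(list)
    last = None  # JobId of the most recent matched line
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        m = LINE_RE.match(line)
        if m:
            last = int(m.group("jobid"))
            jobs[last].append((m.group("severity"), m.group("text")))
        elif last is not None:
            # Continuation of a wrapped message: append to the previous entry.
            severity, text = jobs[last][-1]
            jobs[last][-1] = (severity, text + " " + line)
    return dict(jobs)


sample = """\
13-Nov 01:18 distress JobId 33429: Fatal error: Socket error on Data
received command: ERR=No data available
13-Nov 01:18 distress JobId 33429: Fatal error: fd_cmds.c:157 Read
data not accepted
15-Nov 17:20 kilchis JobId 35825: Error: bsock.c:849 Read error from
Storage daemon:kilchis:9103: ERR=Connection reset by peer
""".splitlines()

for jobid, messages in group_by_job(sample).items():
    for severity, text in messages:
        print(jobid, severity, "-", text)
```

Grouping by JobId makes it easier to see whether the socket errors and the "Could not open" warnings belong to the same jobs or to separate ones.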


On Mon, Nov 27, 2017 at 7:45 AM, Jerry Lowry wrote:

> [...]

Re: [Bacula-users] Problems with version 9.0.3 failing since the upgrade

2017-11-28 Thread Ralf Brinkmann

On 28.11.2017 at 13:50, Martin Simmons wrote:
> On Tue, 28 Nov 2017 12:06:20 +0100, Ralf Brinkmann said:
>
> > Bacula 9.0.6 - static client Ubuntu generated
> >
> > MAIL:   23-Nov 18:05 ubuntu-client-fd JobId 3286: Error: bsock.c:223
> > gethostbyname() for host "bacula-host.wemhoener.de" failed: ERR=Name or
> > service not known
>
> I think the system C library loads dynamic libraries for each type of lookup
> defined in /etc/nsswitch.conf, so linking it statically might break DNS
> lookups.
>
> __Martin


Thank you, Martin.

As a test on one client, I just added the Bacula host to the client's
/etc/hosts file.

--
Ralf Brinkmann
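
The workaround above amounts to a static name-to-address entry in the client's /etc/hosts. Here is a minimal sketch for checking whether hosts-file text already maps a given name. `hosts_entry_for` is a hypothetical helper, and 192.0.2.10 is a documentation placeholder address, not the real address of the Bacula host.

```python
def hosts_entry_for(hosts_text, hostname):
    """Return the address mapped to hostname in hosts-file text, or None."""
    wanted = hostname.lower()
    for raw in hosts_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        fields = line.split()
        address, names = fields[0], fields[1:]
        if wanted in (name.lower() for name in names):
            return address
    return None


# Placeholder content; 192.0.2.10 is a documentation-only address (RFC 5737).
example = """\
127.0.0.1   localhost
192.0.2.10  bacula-host.wemhoener.de bacula-host  # Bacula host
"""

print(hosts_entry_for(example, "bacula-host.wemhoener.de"))
print(hosts_entry_for(example, "other-host"))
```

On a real client the text would come from reading /etc/hosts instead of the inline example.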



Re: [Bacula-users] Problems with version 9.0.3 failing since the upgrade

2017-11-28 Thread Martin Simmons
> On Tue, 28 Nov 2017 12:06:20 +0100, Ralf Brinkmann said:
> 
> Bacula 9.0.6 - static client Ubuntu generated
> 
> > MAIL:   23-Nov 18:05 ubuntu-client-fd JobId 3286: Error: bsock.c:223 
> > gethostbyname() for host "bacula-host.wemhoener.de" failed: ERR=Name or 
> > service not known

I think the system C library loads dynamic libraries for each type of lookup
defined in /etc/nsswitch.conf, so linking it statically might break DNS
lookups.

__Martin
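
One way to follow up on this on an affected client is to look at which sources the `hosts:` line of /etc/nsswitch.conf names. On glibc systems each source is normally served by a shared module (for example libnss_files or libnss_dns) that is loaded at run time, which is the step a statically linked binary may not manage. A minimal sketch, assuming the usual nsswitch.conf syntax; `hosts_sources` is a hypothetical helper and the sample text is illustrative, not taken from Ralf's machine.

```python
import re


def hosts_sources(nsswitch_text):
    """Return the lookup sources named on the 'hosts:' line, in order."""
    for raw in nsswitch_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if line.startswith("hosts:"):
            spec = line[len("hosts:"):]
            # Remove action items such as [NOTFOUND=return]
            spec = re.sub(r"\[[^\]]*\]", " ", spec)
            return spec.split()
    return []


# Illustrative content only.
example = """\
passwd: files
hosts:  files mdns4_minimal [NOTFOUND=return] dns
"""

print(hosts_sources(example))
```

If `files` is listed first, an /etc/hosts entry is consulted before DNS, though whether that helps a static binary still depends on the NSS modules being loadable at run time.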



Re: [Bacula-users] Problems with version 9.0.3 failing since the upgrade

2017-11-28 Thread Ralf Brinkmann

I successfully switched the Bacula host to the latest version, and several
clients to a static 9.0.5 file daemon as well, with the exception of two
Ubuntu servers. After several attempts I moved those back to the older
dynamically linked file daemons.

Last two attempts:

Bacula 9.0.5 - static client Suse generated


SYSLOG: Nov 22 18:05:02 ubuntu-client bacula-fd: Bacula interrupted by signal 
11: Segmentation violation
MAIL:   22-Nov 18:05 bacula-host-dir JobId 3278: Fatal error: Socket error on 
Storage command: ERR=Keine Daten verfügbar [No data available]


Bacula 9.0.6 - static client Ubuntu generated


MAIL:   23-Nov 18:05 ubuntu-client-fd JobId 3286: Error: bsock.c:223 gethostbyname() for 
host "bacula-host.wemhoener.de" failed: ERR=Name or service not known


--
Ralf Brinkmann
