Problem finishing old backup .. !?

2005-01-10 Thread oliver simon
This morning, I cam back from holiday, and had to see that there was no 
successful backup since some days ... :-(

amstatus shows me, that almost all hosts have been backed up, but some 
still have "dumping to tape" running for over 26 hours, although there 
is no running amanda-process or anything else ...

I suppose, that amandas logfiles are in an inconsistent state !?
I can run amcheck without any error ...
...
0 dumpers busy :  0:00:00  (  0.00%)
 1 dumper busy  :  5:49:25  (  9.80%)not-idle:  5:49:25 
(100.00%)
 2 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
 3 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
 4 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
 5 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
 6 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
 7 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
 8 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
 9 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
10 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
11 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
12 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
13 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
14 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
15 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
16 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
17 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
18 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
19 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
20 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
21 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
22 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
23 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
24 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
25 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
26 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
27 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
28 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
29 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
30 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
31 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
32 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
33 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
34 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
35 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
36 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
37 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
38 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
39 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
40 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
41 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)
42 dumpers busy :  0:00:30  (  0.01%)not-idle:  0:00:30 
(100.00%)


Were great, anybody had a hint ...
...olli



amanda + virtual interfaces

2004-06-18 Thread Oliver Simon
Hi everyone,
Has anyone of you experiences or does anyone know if it is possible to 
let amanda run over a virtual interface ?

We are trying to implement a new backupserver, which shall replace the 
old one after some weeks of parrallel running.

The problem is, the we have to put the new server in 46 hosts´ 
.amandahosts and don´t want to do the same again in some weeks. So we 
thought about giving the new server a virtual ip, attach a dnsname like 
backup1 to it, and put that in all .amandahosts

BigPic:
- lemmy -- current hot backupservers dns-name
- neo-lemmy -- current new hot backupservers dns-name
- neo-lemmy2 -- current cold-standby backup, will become hot in the next 
weeks.

--
- lemmy is in all .amandahosts
- neo-lemmy has to be put in
if lemmy goes offline, name neo-lemmy will be switched to name lemmy
the aim would be, put -for example- backup1 in all .amandahosts. Give 
neo-lemmy a virtual-ip, and attach the name backup1 to that ip.

In this way, we could switch the primary hostname to lucifer or 
whatever, and had no problem with the .amandahosts files on the clients.

Thanks in advance,
...olli


sporadic errors - could not connect ...

2004-06-07 Thread Oliver Simon
Hi List !
Were great anyone could help ...
We are experiencing strange errors at the time.
We have an amand-server that backs up about 30 hosts a night.
Most of the time it went well, but now there are very often etrange errors.
It sais:
x /opt lev 0 FAILED 20040606[could not connect to x]
And that on about 15 of 30 hosts. The more interesting is, that the 
first filesystems (or dle´s) are fine, only some of them are failing.
Next day everything is fine, or other filesystems on other hosts are 
failing .. !?
On some days
There seem to be no networking problems, all hosts are 100 Mbit 
connected by a 1 Gbit HP Switch separated in VLANS.
The amanda-server is a 1GHz Pentium 3 with 2 2TB RAids attached to it, 
so Space should not be the problem.
I wrote a little script, that monitors wih amdump the backup every 3 
minutes in a file named date-time, but the output doesnt´t give any 
hints ...
Neither the amanda-server, nor the clients tell anything in the 
log-files ... !?

Would it be an idea to set (if existent) a higher debug-level at the 
amanda-server, and where should i do that ?

Thanks in Advance ...
...olli


Re: amandad Problem on redhat 7.2

2004-04-30 Thread Oliver Simon
Hi group !
Thanks for your advices !
I tried Pauls suggestion an corrected the ownership-problem Bernard 
found ...

All fine now ! Thanks a lot !
...olli
Oliver Simon wrote:
Hi List-Members ...
At the moment we are experiencing another problem on one of our clients.
Were great anyone had a hint for us to solve this.
After starting xinetd on the client ---

# default: off
# description: Amanda backup client
#
service amanda
{
socket_type = dgram
protocol= udp
wait= no
user= amanda
group   = disk
server  = /usr/lib/amanda/amandad
disable = no
}

(#> ll /usr/lib/amanda/amandad
-rwxr-xr-x1 rpm  disk42617 Sep 12  2002 
/usr/lib/amanda/amandad )

--- everything looks fine. If we start an "amcheck DailySet1" the only 
client that fails is this redhat machine.
In /var/log/messages you can see the following:

Apr 30 11:47:52 god xinetd[7397]: Started working: 3 available services
Apr 30 11:47:54 god xinetd: Starten von xinetd succeeded
Apr 30 11:47:59 god xinetd[7397]: file descriptor of service amanda has 
been closed
Apr 30 11:47:59 god xinetd[7397]: select reported EBADF but no bad file 
descriptors were found

Machine Data is:
-
Linux version 2.4.7-10smp ([EMAIL PROTECTED]) (gcc 
version 2.96 2731 (Red Hat Linux 7.1 2.96-98))
-
build: VERSION="Amanda-2.4.2p2"
   BUILT_DATE="Thu Sep 12 18:06:07 UTC 2002"
   BUILT_MACH="Linux D5 2.4.19 #1 SMP Tue Sep 10 00:06:19 UTC 2002 
i686 unknown"
   CC="gcc"
-
xinetd from rpm: xinetd-2.3.11-1.7x
-

Can anyone help ?
Thanks in advance,
...olli


amandad Problem on redhat 7.2

2004-04-30 Thread Oliver Simon
Hi List-Members ...
At the moment we are experiencing another problem on one of our clients.
Were great anyone had a hint for us to solve this.
After starting xinetd on the client ---

# default: off
# description: Amanda backup client
#
service amanda
{
socket_type = dgram
protocol= udp
wait= no
user= amanda
group   = disk
server  = /usr/lib/amanda/amandad
disable = no
}

(#> ll /usr/lib/amanda/amandad
-rwxr-xr-x1 rpm  disk42617 Sep 12  2002 
/usr/lib/amanda/amandad )

--- everything looks fine. If we start an "amcheck DailySet1" the only 
client that fails is this redhat machine.
In /var/log/messages you can see the following:

Apr 30 11:47:52 god xinetd[7397]: Started working: 3 available services
Apr 30 11:47:54 god xinetd: Starten von xinetd succeeded
Apr 30 11:47:59 god xinetd[7397]: file descriptor of service amanda has 
been closed
Apr 30 11:47:59 god xinetd[7397]: select reported EBADF but no bad file 
descriptors were found

Machine Data is:
-
Linux version 2.4.7-10smp ([EMAIL PROTECTED]) (gcc 
version 2.96 2731 (Red Hat Linux 7.1 2.96-98))
-
build: VERSION="Amanda-2.4.2p2"
   BUILT_DATE="Thu Sep 12 18:06:07 UTC 2002"
   BUILT_MACH="Linux D5 2.4.19 #1 SMP Tue Sep 10 00:06:19 UTC 2002 
i686 unknown"
   CC="gcc"
-
xinetd from rpm: xinetd-2.3.11-1.7x
-

Can anyone help ?
Thanks in advance,
...olli


Problems while amanda is running !?

2004-04-20 Thread Oliver Simon
Hi group !

Maybe anyone can help me solving this problem or had the same probs ?
After running amanda for quite long time without any problems, the following messages 
crop up in my mails every morning.
Can anyone help ?
The only thing we changed, was to put a new switch online, that has GBit-Capabilities.
The mail every morning really looks shi$&, because of 272 DLE´s and not a half of them 
can be done without problems. 

-

hal/var lev 1 FAILED 20040420[could not connect to hal]
hydra  /opt lev 0 FAILED 20040420[could not connect to hydra]
helena1/boot lev 0 FAILED 20040420[could not connect to helena1]

-

Were great, anyone had a hint !

Greetings, ...oliver



Amplot -> ps-File ?

2004-02-24 Thread Oliver Simon
Hi Group !

It´s just a "nice to have", but maybe anyoune can help .. ?

Does anyone know a possibility to put amplot-stats in ps-Files to move
them to jpg´s or something like that ?
Amplot has only the ablility to push the graphics to display (?), 
ps-files were much better for our needs to organize them in a directory
structure and view them as html-embedded jpgs.

Looking forward to your answers ...olli



Re: Urgent help needed, degraded mode, tapes too small ?

2004-01-27 Thread Oliver Simon
Hi again ...

Am Di, 2004-01-27 um 16.43 schrieb Jon LaBadie:
> On Tue, Jan 27, 2004 at 12:32:06PM +0100, Oliver Simon wrote:
> > Hi Group !
> > 
> > 
> > This smorning we got the problem, that many servers were backuped in
> > "degraded mode (host /disk lev 1 FAILED)". I think this means, the
> > defined hdd-tapes became too small, right ?
> > 
> > 1) We have 17 tapes defined each one with 75000 MB. 
> > Can I expand this to 10 MB without loosing the current backups ?
> 
> Stefan provided answers, I'm curious though.
> 
> Each of your "vtapes" is 75GB.  From your past experience in the
> amdump reports, how full have they been?  If they have regularly
> been half or more, say 40GB/amdump run, does your total disk
> storage allocated for vtapes support this?  I.e. it will take
> perhaps 17*40GB or 680GB total for your collection of vtapes.
> Of course now that you are filling 75GB vtapes, and considering
> making them 100GB, will you be allocating a terrabyte of disk
> storage to your vtapes?

FilesystemSize  Used Avail Use% Mounted on
/dev/sdb1 1.3T 1006G  254G  80% /r01
/dev/sdc1 2.8T  611G  2.2T  24% /r02 ---> DailySet1 
/dev/sdd1 2.0T  1.3T  712G  65% /r03


> I guess what I'm driving at, is your problem that 75GB per vtape
> is too small or is the "total" storage capacity too small.  Did
> the file system run out of space?

Nope, I don´t think so ...



Urgent help needed, degraded mode, tapes too small ?

2004-01-27 Thread Oliver Simon
Hi Group !


This smorning we got the problem, that many servers were backuped in
"degraded mode (host /disk lev 1 FAILED)". I think this means, the
defined hdd-tapes became too small, right ?

1) We have 17 tapes defined each one with 75000 MB. 
Can I expand this to 10 MB without loosing the current backups ?

2) What other options do I have ? Last night, the backups partial ran in
degraded mode, will amanda try again to do those dumps additional to the
normal run, will it continue normally, or what ?

3) What if I remove some entries from the disklist ? Will amanda
recognize the removed parts although these could not be backuped fine
last night ? Some of the data could be removed from the disklist, until
we get the new backup-servers and raids to backup to, but until that, we
still need to do a backup .. !? Backup needs about 4 1/2 hours by now,
if amanda would also try to do something from last night, we will run
into some problems ... 

Thanks in advance,

...olli



Re: disk offline ?

2004-01-08 Thread Oliver Simon
Hello again ...

Am Do, 2004-01-08 um 17.37 schrieb Frank Smith:
> --On Thursday, January 08, 2004 17:21:48 +0100 Oliver Simon <[EMAIL PROTECTED]> 
> wrote:
> 
> > I can su to amanda and chdir to usr and everything else thats failling ?
> > Is it maybe because of the filesystem xfs on the target-machine ?
> > I think amanda wants to do a xfsdump there, right ? I tried to run it as
> > amanda and got it complaining to be root to run ... !?
> 
> You must use the right version of dump for the filesystem.  Was Amanda
> built with support for xfsdump ? 

Yes, we have about 15 (SuSI´s/SLES/SLOX) hosts running with xfs. No
problem there.

>  Are the disks on all your clients xfs or just on the one new client?

The rest is 10 Solaris (ufs) and some other linux´ (redhat ext3).

>   If you have a mixture of filesystem types, I'm not sure how you configure multiple 
> flavors of dump in one
> setup, hopefully someone else here has done it and will say how.

Thought, amanda examines it itself, and takes the correct version
(dump/xfsdump) from the clients ??

> 
> Frank


...olli
> 
> > 
> > Thanks in advance 
> > 
> > ...olli
> > 
> > Am Do, 2004-01-08 um 16.39 schrieb Frank Smith:
> >> --On Thursday, January 08, 2004 15:51:21 +0100 Oliver Simon <[EMAIL PROTECTED]> 
> >> wrote:
> >> 
> >> > Hi Group !
> >> > 
> >> > Maybe anyone has a hint for me ?
> >> > We have about 30 Servers, all UNIX. They are all backed up with amanda,
> >> > which worked very good, until some problems occured.
> >> > 
> >> > I defined a TestSet an tried to put in a new host. But I don´t get it to
> >> > work. I want to do a dump of the following partitions, which are really
> >> > partitions on the target-host.
> >> > 
> >> > -
> >> > daisy   /   comp-hd-dump
> >> > daisy   /boot   comp-hd-dump
> >> > daisy   /optcomp-hd-dump
> >> > daisy   /usrcomp-hd-dump
> >> > -
> >> > 
> >> > If I put comp-hd-tar in here, all works fine ... !?
> >> > 
> >> > -
> >> > FAILURE AND STRANGE DUMP SUMMARY:
> >> >   daisy  /usr lev 0 FAILED [disk /usr offline on daisy?]
> >> >   daisy  /opt lev 0 FAILED [disk /opt offline on daisy?]
> >> >   daisy  /lev 0 FAILED [disk / offline on daisy?]
> >> 
> >> Disk offline messages are generally a result of permissions
> >> problems (if the disk were really offline you would have much
> >> bigger problems).  Make sure your backup user is part of the
> >> correct group to have access to the disk.  Try to 'su - backupuser'
> >> and check if you can see those disks.
> >>Tar works because Aamnda uses a suid runtar wrapper script that
> >> runs tar as root (although it isn't used by amcheck, so you would
> >> still get errors from amcheck even though amdump would work).
> >> 
> >> Frank
> >> 
> >> > 
> >> > ...
> >> > 
> >> > DUMP SUMMARY: DUMPER STATSTAPER STATS 
> >> > HOSTNAME  DISK  L ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
> >> > -- - 
> >> > daisy /   0 FAILED ---
> >> > daisy /boot   06410   5472  85.4   0:023511.7   0:023484.9
> >> > daisy /opt0 FAILED ---
> >> > daisy /usr0 FAILED ---
> >> > 
> >> > 
> >> > 
> >> > Why does /boot work ??? Ups, just saw it worked only the first time ...
> >> > Now its the same message ...
> >> > 
> >> > And, does comp-hd-tar eat more cpu ? I would think so, or am I false ?
> >> > Last Backup this night 
> >> > 
> >> > Were great anyone had a tip !
> >> > 
> >> > ...olli




Re: disk offline ?

2004-01-08 Thread Oliver Simon
I can su to amanda and chdir to usr and everything else thats failling ?
Is it maybe because of the filesystem xfs on the target-machine ?
I think amanda wants to do a xfsdump there, right ? I tried to run it as
amanda and got it complaining to be root to run ... !?

Thanks in advance 

...olli

Am Do, 2004-01-08 um 16.39 schrieb Frank Smith:
> --On Thursday, January 08, 2004 15:51:21 +0100 Oliver Simon <[EMAIL PROTECTED]> 
> wrote:
> 
> > Hi Group !
> > 
> > Maybe anyone has a hint for me ?
> > We have about 30 Servers, all UNIX. They are all backed up with amanda,
> > which worked very good, until some problems occured.
> > 
> > I defined a TestSet an tried to put in a new host. But I don´t get it to
> > work. I want to do a dump of the following partitions, which are really
> > partitions on the target-host.
> > 
> > -
> > daisy   /   comp-hd-dump
> > daisy   /boot   comp-hd-dump
> > daisy   /optcomp-hd-dump
> > daisy   /usrcomp-hd-dump
> > -
> > 
> > If I put comp-hd-tar in here, all works fine ... !?
> > 
> > -
> > FAILURE AND STRANGE DUMP SUMMARY:
> >   daisy  /usr lev 0 FAILED [disk /usr offline on daisy?]
> >   daisy  /opt lev 0 FAILED [disk /opt offline on daisy?]
> >   daisy  /lev 0 FAILED [disk / offline on daisy?]
> 
> Disk offline messages are generally a result of permissions
> problems (if the disk were really offline you would have much
> bigger problems).  Make sure your backup user is part of the
> correct group to have access to the disk.  Try to 'su - backupuser'
> and check if you can see those disks.
>Tar works because Aamnda uses a suid runtar wrapper script that
> runs tar as root (although it isn't used by amcheck, so you would
> still get errors from amcheck even though amdump would work).
> 
> Frank
> 
> > 
> > ...
> > 
> > DUMP SUMMARY: DUMPER STATSTAPER STATS 
> > HOSTNAME  DISK  L ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
> > -- - 
> > daisy /   0 FAILED ---
> > daisy /boot   06410   5472  85.4   0:023511.7   0:023484.9
> > daisy /opt0 FAILED ---
> > daisy /usr0 FAILED ---
> > 
> > 
> > 
> > Why does /boot work ??? Ups, just saw it worked only the first time ...
> > Now its the same message ...
> > 
> > And, does comp-hd-tar eat more cpu ? I would think so, or am I false ?
> > Last Backup this night 
> > 
> > Were great anyone had a tip !
> > 
> >...olli



disk offline ?

2004-01-08 Thread Oliver Simon
Hi Group !

Maybe anyone has a hint for me ?
We have about 30 Servers, all UNIX. They are all backed up with amanda,
which worked very good, until some problems occured.

I defined a TestSet an tried to put in a new host. But I don´t get it to
work. I want to do a dump of the following partitions, which are really
partitions on the target-host.

-
daisy   /   comp-hd-dump
daisy   /boot   comp-hd-dump
daisy   /optcomp-hd-dump
daisy   /usrcomp-hd-dump
-

If I put comp-hd-tar in here, all works fine ... !?

-
FAILURE AND STRANGE DUMP SUMMARY:
  daisy  /usr lev 0 FAILED [disk /usr offline on daisy?]
  daisy  /opt lev 0 FAILED [disk /opt offline on daisy?]
  daisy  /lev 0 FAILED [disk / offline on daisy?]

...

DUMP SUMMARY: DUMPER STATSTAPER STATS 
HOSTNAME  DISK  L ORIG-KB OUT-KB COMP% MMM:SS  KB/s MMM:SS  KB/s
-- - 
daisy /   0 FAILED ---
daisy /boot   06410   5472  85.4   0:023511.7   0:023484.9
daisy /opt0 FAILED ---
daisy /usr0 FAILED ---



Why does /boot work ??? Ups, just saw it worked only the first time ...
Now its the same message ...

And, does comp-hd-tar eat more cpu ? I would think so, or am I false ?
Last Backup this night 

Were great anyone had a tip !

...olli