subject:"amandad"

RE: amandad paths

2021-11-19 Thread David Simpson

Thanks Jose, when I find time I will be making the ansible role smarter. It 
will detect which of the amandad paths is needed (for the host being 
configured) by searching through a list.

These entries will be part of that search.

-
David Simpson - Senior Systems Engineer
ARCCA, Redwood Building,
King Edward VII Avenue,
Cardiff, CF10 3NB   
    

David Simpson - peiriannydd uwch systemau
ARCCA, Adeilad Redwood,
King Edward VII Avenue,
Caerdydd, CF10 3NB


-Original Message-
From: Jose M Calhariz  
Sent: 17 November 2021 13:27
To: David Simpson 
Cc: amanda-users@amanda.org
Subject: Re: amandad paths

External email to Cardiff University - Take care when replying/opening 
attachments or links.
Nid ebost mewnol o Brifysgol Caerdydd yw hwn - Cymerwch ofal wrth ateb/agor 
atodiadau neu ddolenni.

Re: amandad paths

2021-11-17 Thread Jose M Calhariz

Let me add what dumptypes I have:

define dumptype bsd {
  amandad_path "/usr/local/libexec/amanda/amandad"
  client_username "amanda"
}

define dumptype rhel {
  amandad-path "/usr/lib64/amanda/amandad"
  client_username "amandabackup"
}

define dumptype osx {
  client_username "amandabackup"
}

define dumptype upstream {
  amandad-path "/usr/libexec/amanda/amandad"
  client_username "amandabackup"
}

define dumptype source {
  amandad-path "/usr/local/libexec/amanda"
  client_username "amanda"
}


Kind regards
Jose M Calhariz




On Thu, Nov 11, 2021 at 04:47:25PM +, David Simpson wrote:
> Appear to have come across 3 possible amandad paths so far:
> 
> 
> Debian 11 (from repo): /usr/lib/amanda/amandad
> 
> Alamlinux 8 (from repo @appstream): /usr/lib64/amanda/amandad
> 
> RedHat 7 Zmanda supplied RPM (from website):  /usr/libexec/amanda/amandad
> 
> 
> 
> Do these cover all known (and current) combinations/everything?
> 
> David
> 
> 
> 
> -
> David Simpson - Senior Systems Engineer
> ARCCA, Redwood Building,
> King Edward VII Avenue,
> Cardiff, CF10 3NB
> 
> David Simpson - peiriannydd uwch systemau
> ARCCA, Adeilad Redwood,
> King Edward VII Avenue,
> Caerdydd, CF10 3NB

-- 
--
Bill Gates e Scott McNealy estavam jogando frisbee perto de um lago.
Constantemente, Bill, demonstrando horrível pontaria, jogava muito alto e o
frisbee caía no meio do lago. Scott ia até lá, andava sobre a água e buscava o
frisbee, e o jogo prosseguia.

Na manhã seguinte, a Microsoft publica dois press-releases:

- Lançamentos da Microsoft superam expectativas!

- Scott não sabe nadar


signature.asc
Description: PGP signature

amandad paths

2021-11-11 Thread David Simpson

Appear to have come across 3 possible amandad paths so far:


Debian 11 (from repo): /usr/lib/amanda/amandad

Alamlinux 8 (from repo @appstream): /usr/lib64/amanda/amandad

RedHat 7 Zmanda supplied RPM (from website):  /usr/libexec/amanda/amandad



Do these cover all known (and current) combinations/everything?

David



-
David Simpson - Senior Systems Engineer
ARCCA, Redwood Building,
King Edward VII Avenue,
Cardiff, CF10 3NB

David Simpson - peiriannydd uwch systemau
ARCCA, Adeilad Redwood,
King Edward VII Avenue,
Caerdydd, CF10 3NB

Re: amandad defunct on client calcsize/sendsize hung

2019-10-29 Thread Nathan Stratton Treadway

On Fri, Sep 20, 2019 at 13:11:08 +, Robert Reilly wrote:
> Nathan, the backup completed successfully with setting the estimate to
> server. Here is the planner log from last nights failed backup.
> https://gist.github.com/rreilly-edr/4f9f7373049124b28fd1da75ffcfe917

Robert, did you ever have any luck tracking down what was causing the
calcsize hangs in your environment?

Nathan


Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-20 Thread Nathan Stratton Treadway

On Fri, Sep 20, 2019 at 13:28:42 +, Robert Reilly wrote:
> 
> First one is the failed backup second is the successful backup
> 
> https://gist.github.com/rreilly-edr/debff9f0ce3a1759993083b54a966cda

It would still be helpeful to see your actual disklist file, but it
looks like you have two Amanda clients configured via SSH and one using
BSDTCP auth.  

In the successful run, all three clients respond to the "estimate
server" request essentially immediately and the planner shuts down
cleanly, around 2 seconds after it starts.

In the failed run, the BSDTCP client reponds with its size estimates
within about 18 seconds, but the other two clients never send any
calcsize data back to the planner, and after (I believe) the etimeout
period expires the planner gives up and shuts down.

So, I guess the next thing to look at is the exact status of the
"calcsize" processes on the clients during this waiting period.  Data
from those processes has to go back to the server via a complicated
chain (simplified, this chain seems to be something like: calcsize ->
sendsize -> amandad -> sshd ---[network to server]--> ssh-auth-driver ->
planner), so it could be somewhere in there

Or it could be that "calcsize" just isn't running/terminating, for some
reason (though this seems less likely, since the problem seems to have
something to do with upgrading to 3.5.1 on the server side...).

Anyway, in addition to looking at processes on the client and
strace/lsof of the calcsize (and sendsize) processes during the waiting
period, more generally looking at the "sendsize" debug files on the
clients might tell us have far along the clients are getting and if
there are any errors along the way.

Nathan

Nathan Stratton Treadway - natha...@ontko.com - Mid-Atlantic region Ray
Ontko & Co.  - Software consulting services - http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-20 Thread Nathan Stratton Treadway

On Fri, Sep 20, 2019 at 13:28:42 +, Robert Reilly wrote:
> 
> First one is the failed backup second is the successful backup
> 
> https://gist.github.com/rreilly-edr/debff9f0ce3a1759993083b54a966cda

Okay, thanks.  I see you posted a link to your amanda.conf earlier in
the thread; can you also post your disklist file?  (I take it it has
only four entries, right?)

I will try to take a closer look at the planner logs later today... 

Nathan

Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

RE: amandad defunct on client calcsize/sendsize hung

2019-09-20 Thread Robert Reilly



First one is the failed backup second is the successful backup

https://gist.github.com/rreilly-edr/debff9f0ce3a1759993083b54a966cda

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Nathan Stratton Treadway
Sent: Friday, September 20, 2019 9:21 AM
To: amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

On Fri, Sep 20, 2019 at 13:11:08 +, Robert Reilly wrote:
> Nathan, the backup completed successfully with setting the estimate to 
> server. Here is the planner log from last nights failed backup.
> https://gist.github.com/rreilly-edr/4f9f7373049124b28fd1da75ffcfe917

Can you post the planner log from the successful estimate-server run as well? 
We might learn something more by comparing the two

Nathan



Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-20 Thread Nathan Stratton Treadway

On Fri, Sep 20, 2019 at 13:11:08 +, Robert Reilly wrote:
> Nathan, the backup completed successfully with setting the estimate to
> server. Here is the planner log from last nights failed backup.
> https://gist.github.com/rreilly-edr/4f9f7373049124b28fd1da75ffcfe917

Can you post the planner log from the successful estimate-server run as
well? We might learn something more by comparing the two

Nathan



Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

RE: amandad defunct on client calcsize/sendsize hung

2019-09-20 Thread Robert Reilly

Nathan, the backup completed successfully with setting the estimate to server. 
Here is the planner log from last nights failed backup.
https://gist.github.com/rreilly-edr/4f9f7373049124b28fd1da75ffcfe917

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Nathan Stratton Treadway
Sent: Thursday, September 19, 2019 10:06 PM
To: amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

On Wed, Sep 11, 2019 at 17:21:10 +, Robert Reilly wrote:
> not sure what other debug logs make sense to send..

"amdump" in a way is just a wrapper that kicks of several other components to 
do the real work.

In this case, it's the *planner* log on the server side that would show you 
exactly what the server is sending to the client at the time that the sendsize 
step gets hung.

Nathan

Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

RE: amandad defunct on client calcsize/sendsize hung

2019-09-20 Thread Robert Reilly

Nathan, I will try with the server estimate, also I am able to run calcsize 
manually with no problem. I will try this now and report back.
Thanks
Robert

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Nathan Stratton Treadway
Sent: Thursday, September 19, 2019 9:56 PM
To: amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

On Thu, Sep 12, 2019 at 17:48:28 +, Robert Reilly wrote:
> I did a while loop started the backup and I see that noop is also 
> defunct but dose get cleaned up
> 
> backup8919  8829  0 17:45 ?00:00:00 sshd: backup@notty
> backup8928  8919  0 17:45 ?00:00:00 /usr/lib/amanda/amandad 
> -auth=ssh
> backup9056  8928  0 17:45 ?00:00:00 [noop] 
> backup9057  8928  0 17:45 ?    00:00:00 [amandad] 

(I am not sure how long you had between cycles of your while loop, but I as far 
as I can tell it's normal operation for the spawned "noop"
process to remain defunct for a short period of time while the main amandad 
process copies the noop process's data back over the network to the server, but 
then once that copying has finished the "noop" process is cleaned up.

Probably the same thing would happen with the "sendsize" process -- but since 
that is never finishing you never get to the point that there's a defunct 
sendsize process.

Then defunct "amandad" processes, on the other hand, seem to be subproceses 
amandad spawns of itself to offload some work -- and as far as I can tell those 
may not get cleaned up until amandad as a whole
exits.)

Nathan

Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-19 Thread Nathan Stratton Treadway

On Wed, Sep 11, 2019 at 17:21:10 +, Robert Reilly wrote:
> not sure what other debug logs make sense to send..

"amdump" in a way is just a wrapper that kicks of several other
components to do the real work.

In this case, it's the *planner* log on the server side that would show
you exactly what the server is sending to the client at the time that
the sendsize step gets hung.

Nathan


Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-19 Thread Nathan Stratton Treadway

On Thu, Sep 12, 2019 at 17:48:28 +, Robert Reilly wrote:
> I did a while loop started the backup and I see that noop is also
> defunct but dose get cleaned up
> 
> backup8919  8829  0 17:45 ?00:00:00 sshd: backup@notty
> backup8928  8919  0 17:45 ?00:00:00 /usr/lib/amanda/amandad 
> -auth=ssh
> backup9056  8928  0 17:45 ?00:00:00 [noop] 
> backup9057  8928  0 17:45 ?    00:00:00 [amandad] 

(I am not sure how long you had between cycles of your while loop, but I
as far as I can tell it's normal operation for the spawned "noop"
process to remain defunct for a short period of time while the main
amandad process copies the noop process's data back over the network to
the server, but then once that copying has finished the "noop" process
is cleaned up.

Probably the same thing would happen with the "sendsize" process -- but
since that is never finishing you never get to the point that there's a
defunct sendsize process.

Then defunct "amandad" processes, on the other hand, seem to be
subproceses amandad spawns of itself to offload some work -- and as far
as I can tell those may not get cleaned up until amandad as a whole
exits.)

Nathan

Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-19 Thread Nathan Stratton Treadway

On Wed, Sep 11, 2019 at 17:21:10 +, Robert Reilly wrote:
> Here is the latest amandad debug file from the client 
> https://gist.github.com/rreilly-edr/fcc7660a9393d292bfd1437c7636826a
> 
> amdump log from server
> https://gist.github.com/rreilly-edr/cf306ce10c594bc7b9bf5c06a98722b2
> 
> not sure what other debug logs make sense to send..

I take it you did set etimeout to "3600"?   Looks like what's happening
now is that the Amanda server side times out (after that 1 hour period)
when it hasn't heard back from the client, whereas before it was amandad
on the client which timed out (after 5.5 hours).  But both would seem to
be consistent with the underlying problem of calcsize never completing,
for whatever reason...

Have you tried switching to e.g. "estimate server" on one of the
affected DLEs?  That would give a hint as to whether the problem is with
"calcsize" per se or a more general problem with communication between
the clinet and the server.  (That is, with "estimate server" the client
shoudln't need to run calcsize and thus there wouldn't be a calcsize
process to get stuck... but then does the same thing happen with the
next activity that the client tries to do?)

Nathan

Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-19 Thread Nathan Stratton Treadway

On Thu, Sep 12, 2019 at 17:30:44 +, Robert Reilly wrote:
> KJ, unfortunately the something amandad on the client gets defunct and
> there procs do not go away until I kill them. The procs hang ( I think
> until timeout ) and get an EOF in the log.
> 
> The fact that amandad is defunct must be what is hanging the backup, I
> have not been able to figure out why amandad is going defunct though.
> rob
> 
> sshd?????amandad?2*[amandad]
>??sendsize?2*[sendsize?calcsize]
> 
> root 20206  1322  0 17:18 ?00:00:00 sshd: backup [priv]
> backup   20217 1  0 17:18 ?00:00:00 /lib/systemd/systemd --user
> backup   20218 20217  0 17:18 ?00:00:00 (sd-pam)
> backup   20255 20206  0 17:18 ?00:00:00 sshd: backup@notty
> backup   20256 20255  0 17:18 ?00:00:00 /usr/lib/amanda/amandad 
> -auth=ssh
> backup   20261 20256  0 17:18 ?00:00:00 [amandad] 
> backup   20288 20256  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
> amandad ssh
> backup   20289 20256  0 17:18 ?00:00:00 [amandad] 
> backup   20290 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
> amandad ssh
> backup   20291 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
> amandad ssh
> backup   20292 20290  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR / / 
> 0 0
> backup   20293 20291  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR 
> /opt /opt 0 0 

I'm still trying to look though all the emails and linked log files in
this thread, but may not be able to do that today, so thought I would
sent a quick note for now:

I don't use ssh auth myself and it's not currently clear to me why there
would be two defunct amandad processes like this, but amandad definitely
spawns subprocesses for each "service" it's running, and I'm fairly
certain that it doesn't reap child processes until it is shutting
down

So in this case I think the fact that there are defunct amandad
processes is probably just a harmless side-effect of the true problem,
which seems to be that the calcsize proceses aren't finishing. 

Are you still getting this same behavior?  Can you lsof/strace the
calcsize process and/or look closely in the
/var/log/amanda/client/DailySet1/sendsize.*.debug log to find out what
it causing it to hang?

(Once you've finished investigating the calcisize processes, you could
test my first theory by killing those processes and seeing if amandad at
that point cleans up the defunct processes and exits cleanly...)

Nathan




Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Kamil Jońca

Robert Reilly  writes:

> All, Is there a way to run the backup manually to see whats happening on the 
> client outside of running strace on amdump ?
> Rob

Please:
 1. clear environment (kill zombies, hung processes, etc)
 2. try to backup single host
 3. try to backup all hosts EXCEPT amanda server
I am curious if it is my problem (see:
 https://www.mail-archive.com/amanda-users@amanda.org/msg49744.html)

KJ

-- 
http://stopstopnop.pl/stop_stopnop.pl_o_nas.html
There are no manifestos like cannon and musketry.
-- The Duke of Wellington

RE: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Robert Reilly

I did a while loop started the backup and I see that noop is also defunct but 
dose  get cleaned up

backup8919  8829  0 17:45 ?00:00:00 sshd: backup@notty
backup8928  8919  0 17:45 ?00:00:00 /usr/lib/amanda/amandad 
-auth=ssh
backup9056  8928  0 17:45 ?00:00:00 [noop] 
backup9057  8928  0 17:45 ?00:00:00 [amandad] 
Rob

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Robert Reilly
Sent: Thursday, September 12, 2019 1:31 PM
To: Kamil Jońca ; amanda-users@amanda.org
Subject: RE: amandad defunct on client calcsize/sendsize hung

KJ, unfortunately the something amandad on the client gets defunct and there 
procs do not go away until I kill them. The procs hang ( I think until timeout 
) and get an EOF in the log. 

The fact that amandad is defunct must be what is hanging the backup, I have not 
been able to figure out why amandad is going defunct though.
rob

sshd───amandad─┬─2*[amandad]
   └─sendsize───2*[sendsize───calcsize]

root 20206  1322  0 17:18 ?00:00:00 sshd: backup [priv]
backup   20217 1  0 17:18 ?00:00:00 /lib/systemd/systemd --user
backup   20218 20217  0 17:18 ?00:00:00 (sd-pam)
backup   20255 20206  0 17:18 ?00:00:00 sshd: backup@notty
backup   20256 20255  0 17:18 ?00:00:00 /usr/lib/amanda/amandad 
-auth=ssh
backup   20261 20256  0 17:18 ?00:00:00 [amandad] 
backup   20288 20256  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20289 20256  0 17:18 ?00:00:00 [amandad] 
backup   20290 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20291 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20292 20290  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR / / 0 0
backup   20293 20291  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR /opt 
/opt 0 0 

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Kamil Jonca
Sent: Thursday, September 12, 2019 1:05 PM
To: amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

Robert Reilly  writes:

> Hi, All I have setup Amanda, it works great for the Amanda server itself but 
> fails to backup any clients.
> Server:
> Ubuntu 18.04
> Amanda 3.5.1 ( ubuntu repo pkg )

Whats happening when you try to backup single client?

i.e. amdump  clienthost?

does it works?
KJ

-- 
http://stopstopnop.pl/stop_stopnop.pl_o_nas.html
Epperson's law:
When a man says it's a silly, childish game, it's probably
something his wife can beat him at.

RE: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Robert Reilly

All, Is there a way to run the backup manually to see whats happening on the 
client outside of running strace on amdump ?
Rob

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Robert Reilly
Sent: Thursday, September 12, 2019 1:31 PM
To: Kamil Jońca ; amanda-users@amanda.org
Subject: RE: amandad defunct on client calcsize/sendsize hung

KJ, unfortunately the something amandad on the client gets defunct and there 
procs do not go away until I kill them. The procs hang ( I think until timeout 
) and get an EOF in the log. 

The fact that amandad is defunct must be what is hanging the backup, I have not 
been able to figure out why amandad is going defunct though.
rob

sshd───amandad─┬─2*[amandad]
   └─sendsize───2*[sendsize───calcsize]

root 20206  1322  0 17:18 ?00:00:00 sshd: backup [priv]
backup   20217 1  0 17:18 ?00:00:00 /lib/systemd/systemd --user
backup   20218 20217  0 17:18 ?00:00:00 (sd-pam)
backup   20255 20206  0 17:18 ?00:00:00 sshd: backup@notty
backup   20256 20255  0 17:18 ?00:00:00 /usr/lib/amanda/amandad 
-auth=ssh
backup   20261 20256  0 17:18 ?00:00:00 [amandad] 
backup   20288 20256  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20289 20256  0 17:18 ?00:00:00 [amandad] 
backup   20290 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20291 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20292 20290  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR / / 0 0
backup   20293 20291  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR /opt 
/opt 0 0 

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Kamil Jonca
Sent: Thursday, September 12, 2019 1:05 PM
To: amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

Robert Reilly  writes:

> Hi, All I have setup Amanda, it works great for the Amanda server itself but 
> fails to backup any clients.
> Server:
> Ubuntu 18.04
> Amanda 3.5.1 ( ubuntu repo pkg )

Whats happening when you try to backup single client?

i.e. amdump  clienthost?

does it works?
KJ

-- 
http://stopstopnop.pl/stop_stopnop.pl_o_nas.html
Epperson's law:
When a man says it's a silly, childish game, it's probably
something his wife can beat him at.

RE: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Robert Reilly

KJ, unfortunately the something amandad on the client gets defunct and there 
procs do not go away until I kill them. The procs hang ( I think until timeout 
) and get an EOF in the log. 

The fact that amandad is defunct must be what is hanging the backup, I have not 
been able to figure out why amandad is going defunct though.
rob

sshd───amandad─┬─2*[amandad]
   └─sendsize───2*[sendsize───calcsize]

root 20206  1322  0 17:18 ?00:00:00 sshd: backup [priv]
backup   20217 1  0 17:18 ?00:00:00 /lib/systemd/systemd --user
backup   20218 20217  0 17:18 ?00:00:00 (sd-pam)
backup   20255 20206  0 17:18 ?00:00:00 sshd: backup@notty
backup   20256 20255  0 17:18 ?00:00:00 /usr/lib/amanda/amandad 
-auth=ssh
backup   20261 20256  0 17:18 ?00:00:00 [amandad] 
backup   20288 20256  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20289 20256  0 17:18 ?00:00:00 [amandad] 
backup   20290 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20291 20288  0 17:18 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   20292 20290  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR / / 0 0
backup   20293 20291  0 17:18 ?00:00:00 calcsize DailySet1 GNUTAR /opt 
/opt 0 0 

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Kamil Jonca
Sent: Thursday, September 12, 2019 1:05 PM
To: amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

Robert Reilly  writes:

> Hi, All I have setup Amanda, it works great for the Amanda server itself but 
> fails to backup any clients.
> Server:
> Ubuntu 18.04
> Amanda 3.5.1 ( ubuntu repo pkg )

Whats happening when you try to backup single client?

i.e. amdump  clienthost?

does it works?
KJ

-- 
http://stopstopnop.pl/stop_stopnop.pl_o_nas.html
Epperson's law:
When a man says it's a silly, childish game, it's probably
something his wife can beat him at.

Re: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Kamil Jońca

Robert Reilly  writes:

> Hi, All I have setup Amanda, it works great for the Amanda server itself but 
> fails to backup any clients.
> Server:
> Ubuntu 18.04
> Amanda 3.5.1 ( ubuntu repo pkg )

Whats happening when you try to backup single client?

i.e. amdump  clienthost?

does it works?
KJ

-- 
http://stopstopnop.pl/stop_stopnop.pl_o_nas.html
Epperson's law:
When a man says it's a silly, childish game, it's probably
something his wife can beat him at.

Re: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Jens Berg

Too bad that it did not help. I still have no idea what the reason for
that problem is.

I assume you are using the packages amanda-server and amanda-client from
Ubuntu's repository. I remember there were several issues with them
under Debian and Ubuntu in the past. Maybe it would help to build amanda
from the source tarball?

Not directly related to your problem (I think) but probably of interest
if you are using package amanda-client on Ubuntu 18.04:
https://utcc.utoronto.ca/~cks/space/blog/linux/Ubuntu1804AmandaProblem

Jens

On 9/11/2019 6:34 PM, Robert Reilly wrote:
> I added the timeouts but still see the following in the amandad log and
> backup does not complete and the procs are still defunct. If I run
> caclsize manually completes:
> 
> time ./calcsize DailySet1 GNUTAR / / 0 0
> 
> / 0 SIZE 3472863
> 
>  
> 
> real    0m0.428s
> 
> user   0m0.132s
> 
> sys  0m0.292s
> 
>  
> 
> “Wed Sep 11 15:59:02 2019: thd-0x5575c3fe1400: amandad: receive error:
> EOF on read from tf-p-ubu-amanda-00.xx.com”
> 
>  
> 
> *From:* owner-amanda-us...@amanda.org 
> *On Behalf Of *Robert Reilly
> *Sent:* Wednesday, September 11, 2019 8:31 AM
> *To:* amanda-users@amanda.org
> *Subject:* RE: amandad defunct on client calcsize/sendsize hung
> 
>  
> 
> All,
> 
> I also tried this on an 18.04 client wit 3.5.1 amanda client and I get
> the same result defunct process and no backup. I am at a loss as what
> could be causing this I have spent many hours debugging. Any thoughts ?
> 
>  
> 
> *From:* owner-amanda-us...@amanda.org
> <mailto:owner-amanda-us...@amanda.org>  <mailto:owner-amanda-us...@amanda.org>> *On Behalf Of *Robert Reilly
> *Sent:* Friday, September 6, 2019 10:55 AM
> *To:* amanda-users@amanda.org <mailto:amanda-users@amanda.org>
> *Subject:* amandad defunct on client calcsize/sendsize hung
> 
>  
> 
> Hi, All I have setup Amanda, it works great for the Amanda server itself
> but fails to backup any clients.
> 
> Server:
> 
> Ubuntu 18.04
> 
> Amanda 3.5.1 ( ubuntu repo pkg )
> 
>  Client
> 
> Ubuntu 16.04
> 
> Amanda 3.3.6
> 
>  
> 
> Immediately after starting the backup this is the state of the procs on
> the client
> 
> backup   30608 1  0 11:36 ?    00:00:00 /lib/systemd/systemd --user
> 
> backup   30609 30608  0 11:36 ?    00:00:00 (sd-pam)
> 
> backup   30645 30606  0 11:36 ?    00:00:00 sshd: backup@notty
> 
> backup   30646 30645  0 11:36 ?    00:00:00 /usr/lib/amanda/amandad
> -auth=ssh
> 
> backup   30648 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30679 30646  0 11:36 ?    00:00:00 /usr/lib/amanda/sendsize
> amandad ssh
> 
> backup   30680 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30683 30679  0 11:36 ?    00:00:00 /usr/lib/amanda/sendsize
> amandad ssh
> 
>  
> 
>  Amanda.conf
> 
> https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb
> 
>  
> 
> #
> 
>  
> 
> Client Debug Log
> 
>  
> 
> https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d
> 
>  
> 
> ##
> 
> Thanks in advance..
> 
> Robert
> 
>  
> 
>  
>

Re: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Jens Berg

Hi,

the hanging processes are strange, no idea about that.

But I noticed you haven't defined "etimeout" (the time amanda waits for
estimate results in seconds) in your amanda.conf. IIRC it defaults to
something around 10 min. and that might be too short for the client.
Does it help to set
etimeout 3600
in amanda.conf? If yes, it's probably a good idea to define "dtimeout"
as well and set it to twice or thrice the value of etimeout.

Jens

On 9/11/2019 2:30 PM, Robert Reilly wrote:
> All,
> 
> I also tried this on an 18.04 client wit 3.5.1 amanda client and I get
> the same result defunct process and no backup. I am at a loss as what
> could be causing this I have spent many hours debugging. Any thoughts ?
> 
>  
> 
> *From:* owner-amanda-us...@amanda.org 
> *On Behalf Of *Robert Reilly
> *Sent:* Friday, September 6, 2019 10:55 AM
> *To:* amanda-users@amanda.org
> *Subject:* amandad defunct on client calcsize/sendsize hung
> 
>  
> 
> Hi, All I have setup Amanda, it works great for the Amanda server itself
> but fails to backup any clients.
> 
> Server:
> 
> Ubuntu 18.04
> 
> Amanda 3.5.1 ( ubuntu repo pkg )
> 
>  Client
> 
> Ubuntu 16.04
> 
> Amanda 3.3.6
> 
>  
> 
> Immediately after starting the backup this is the state of the procs on
> the client
> 
> backup   30608 1  0 11:36 ?    00:00:00 /lib/systemd/systemd --user
> 
> backup   30609 30608  0 11:36 ?    00:00:00 (sd-pam)
> 
> backup   30645 30606  0 11:36 ?    00:00:00 sshd: backup@notty
> 
> backup   30646 30645  0 11:36 ?    00:00:00 /usr/lib/amanda/amandad
> -auth=ssh
> 
> backup   30648 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30679 30646  0 11:36 ?    00:00:00 /usr/lib/amanda/sendsize
> amandad ssh
> 
> backup   30680 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30683 30679  0 11:36 ?    00:00:00 /usr/lib/amanda/sendsize
> amandad ssh
> 
>  
> 
>  Amanda.conf
> 
> https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb
> 
>  
> 
> #
> 
>  
> 
> Client Debug Log
> 
>  
> 
> https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d
> 
>  
> 
> ##
> 
> Thanks in advance..
> 
> Robert
> 
>  
> 
>  
>

RE: amandad defunct on client calcsize/sendsize hung

2019-09-12 Thread Robert Reilly

Jens, I am using the included packages was trying to quickly POC Amanda for our 
env for a few file servers. I will build from source and see if I get anywhere.
Thanks !
rob

-Original Message-
From: Jens Berg  
Sent: Thursday, September 12, 2019 3:19 AM
To: Robert Reilly ; amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

Too bad that it did not help. I still have no idea what the reason for that 
problem is.

I assume you are using the packages amanda-server and amanda-client from 
Ubuntu's repository. I remember there were several issues with them under 
Debian and Ubuntu in the past. Maybe it would help to build amanda from the 
source tarball?

Not directly related to your problem (I think) but probably of interest if you 
are using package amanda-client on Ubuntu 18.04:
https://utcc.utoronto.ca/~cks/space/blog/linux/Ubuntu1804AmandaProblem

Jens

On 9/11/2019 6:34 PM, Robert Reilly wrote:
> I added the timeouts but still see the following in the amandad log 
> and backup does not complete and the procs are still defunct. If I run 
> caclsize manually completes:
> 
> time ./calcsize DailySet1 GNUTAR / / 0 0
> 
> / 0 SIZE 3472863
> 
>  
> 
> real    0m0.428s
> 
> user   0m0.132s
> 
> sys  0m0.292s
> 
>  
> 
> "Wed Sep 11 15:59:02 2019: thd-0x5575c3fe1400: amandad: receive error:
> EOF on read from tf-p-ubu-amanda-00.xx.com"
> 
>  
> 
> *From:* owner-amanda-us...@amanda.org  
> *On Behalf Of *Robert Reilly
> *Sent:* Wednesday, September 11, 2019 8:31 AM
> *To:* amanda-users@amanda.org
> *Subject:* RE: amandad defunct on client calcsize/sendsize hung
> 
>  
> 
> All,
> 
> I also tried this on an 18.04 client wit 3.5.1 amanda client and I get 
> the same result defunct process and no backup. I am at a loss as what 
> could be causing this I have spent many hours debugging. Any thoughts ?
> 
>  
> 
> *From:* owner-amanda-us...@amanda.org
> <mailto:owner-amanda-us...@amanda.org>  <mailto:owner-amanda-us...@amanda.org>> *On Behalf Of *Robert Reilly
> *Sent:* Friday, September 6, 2019 10:55 AM
> *To:* amanda-users@amanda.org <mailto:amanda-users@amanda.org>
> *Subject:* amandad defunct on client calcsize/sendsize hung
> 
>  
> 
> Hi, All I have setup Amanda, it works great for the Amanda server 
> itself but fails to backup any clients.
> 
> Server:
> 
> Ubuntu 18.04
> 
> Amanda 3.5.1 ( ubuntu repo pkg )
> 
>  Client
> 
> Ubuntu 16.04
> 
> Amanda 3.3.6
> 
>  
> 
> Immediately after starting the backup this is the state of the procs 
> on the client
> 
> backup   30608 1  0 11:36 ?    00:00:00 /lib/systemd/systemd 
> --user
> 
> backup   30609 30608  0 11:36 ?    00:00:00 (sd-pam)
> 
> backup   30645 30606  0 11:36 ?    00:00:00 sshd: backup@notty
> 
> backup   30646 30645  0 11:36 ?    00:00:00 
> /usr/lib/amanda/amandad -auth=ssh
> 
> backup   30648 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30679 30646  0 11:36 ?    00:00:00 
> /usr/lib/amanda/sendsize amandad ssh
> 
> backup   30680 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30683 30679  0 11:36 ?    00:00:00 
> /usr/lib/amanda/sendsize amandad ssh
> 
>  
> 
>  Amanda.conf
> 
> https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb
> 
>  
> 
> #
> 
>  
> 
> Client Debug Log
> 
>  
> 
> https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d
> 
>  
> 
> ##
> 
> Thanks in advance..
> 
> Robert
> 
>  
> 
>  
>

RE: amandad defunct on client calcsize/sendsize hung

2019-09-11 Thread Robert Reilly

Here is the latest amandad debug file from the client 
https://gist.github.com/rreilly-edr/fcc7660a9393d292bfd1437c7636826a

amdump log from server
https://gist.github.com/rreilly-edr/cf306ce10c594bc7b9bf5c06a98722b2

not sure what other debug logs make sense to send..
Thanks
Rob

-Original Message-
From: owner-amanda-us...@amanda.org  On Behalf 
Of Robert Reilly
Sent: Wednesday, September 11, 2019 10:30 AM
To: Jens Berg ; amanda-users@amanda.org
Subject: RE: amandad defunct on client calcsize/sendsize hung

Thanks I will give it a try and report back.

-Original Message-
From: Jens Berg 
Sent: Wednesday, September 11, 2019 10:18 AM
To: Robert Reilly ; amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

Hi,

the hanging processes are strange, no idea about that.

But I noticed you haven't defined "etimeout" (the time amanda waits for 
estimate results in seconds) in your amanda.conf. IIRC it defaults to something 
around 10 min. and that might be too short for the client.
Does it help to set
etimeout 3600
in amanda.conf? If yes, it's probably a good idea to define "dtimeout"
as well and set it to twice or thrice the value of etimeout.

Jens

On 9/11/2019 2:30 PM, Robert Reilly wrote:
> All,
> 
> I also tried this on an 18.04 client wit 3.5.1 amanda client and I get 
> the same result defunct process and no backup. I am at a loss as what 
> could be causing this I have spent many hours debugging. Any thoughts ?
> 
>  
> 
> *From:* owner-amanda-us...@amanda.org  
> *On Behalf Of *Robert Reilly
> *Sent:* Friday, September 6, 2019 10:55 AM
> *To:* amanda-users@amanda.org
> *Subject:* amandad defunct on client calcsize/sendsize hung
> 
>  
> 
> Hi, All I have setup Amanda, it works great for the Amanda server 
> itself but fails to backup any clients.
> 
> Server:
> 
> Ubuntu 18.04
> 
> Amanda 3.5.1 ( ubuntu repo pkg )
> 
>  Client
> 
> Ubuntu 16.04
> 
> Amanda 3.3.6
> 
>  
> 
> Immediately after starting the backup this is the state of the procs 
> on the client
> 
> backup   30608 1  0 11:36 ?    00:00:00 /lib/systemd/systemd 
> --user
> 
> backup   30609 30608  0 11:36 ?    00:00:00 (sd-pam)
> 
> backup   30645 30606  0 11:36 ?    00:00:00 sshd: backup@notty
> 
> backup   30646 30645  0 11:36 ?    00:00:00 
> /usr/lib/amanda/amandad -auth=ssh
> 
> backup   30648 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30679 30646  0 11:36 ?    00:00:00 
> /usr/lib/amanda/sendsize amandad ssh
> 
> backup   30680 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30683 30679  0 11:36 ?    00:00:00 
> /usr/lib/amanda/sendsize amandad ssh
> 
>  
> 
>  Amanda.conf
> 
> https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb
> 
>  
> 
> #
> 
>  
> 
> Client Debug Log
> 
>  
> 
> https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d
> 
>  
> 
> ##
> 
> Thanks in advance..
> 
> Robert
> 
>  
> 
>  
>

RE: amandad defunct on client calcsize/sendsize hung

2019-09-11 Thread Robert Reilly

I added the timeouts but still see the following in the amandad log and backup 
does not complete and the procs are still defunct. If I run caclsize manually 
completes:
time ./calcsize DailySet1 GNUTAR / / 0 0
/ 0 SIZE 3472863

real0m0.428s
user   0m0.132s
sys  0m0.292s

"Wed Sep 11 15:59:02 2019: thd-0x5575c3fe1400: amandad: receive error: EOF on 
read from tf-p-ubu-amanda-00.xx.com"

From: owner-amanda-us...@amanda.org  On Behalf 
Of Robert Reilly
Sent: Wednesday, September 11, 2019 8:31 AM
To: amanda-users@amanda.org
Subject: RE: amandad defunct on client calcsize/sendsize hung

All,
I also tried this on an 18.04 client wit 3.5.1 amanda client and I get the same 
result defunct process and no backup. I am at a loss as what could be causing 
this I have spent many hours debugging. Any thoughts ?

From: owner-amanda-us...@amanda.org<mailto:owner-amanda-us...@amanda.org> 
mailto:owner-amanda-us...@amanda.org>> On Behalf 
Of Robert Reilly
Sent: Friday, September 6, 2019 10:55 AM
To: amanda-users@amanda.org<mailto:amanda-users@amanda.org>
Subject: amandad defunct on client calcsize/sendsize hung

Hi, All I have setup Amanda, it works great for the Amanda server itself but 
fails to backup any clients.
Server:
Ubuntu 18.04
Amanda 3.5.1 ( ubuntu repo pkg )
 Client
Ubuntu 16.04
Amanda 3.3.6

Immediately after starting the backup this is the state of the procs on the 
client
backup   30608 1  0 11:36 ?00:00:00 /lib/systemd/systemd --user
backup   30609 30608  0 11:36 ?00:00:00 (sd-pam)
backup   30645 30606  0 11:36 ?00:00:00 sshd: backup@notty
backup   30646 30645  0 11:36 ?    00:00:00 /usr/lib/amanda/amandad 
-auth=ssh
backup   30648 30646  0 11:36 ?00:00:00 [amandad] 
backup   30679 30646  0 11:36 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   30680 30646  0 11:36 ?00:00:00 [amandad] 
backup   30683 30679  0 11:36 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh

 Amanda.conf
https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb

#

Client Debug Log

https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d

##
Thanks in advance..
Robert

RE: amandad defunct on client calcsize/sendsize hung

2019-09-11 Thread Robert Reilly

Thanks I will give it a try and report back.

-Original Message-
From: Jens Berg  
Sent: Wednesday, September 11, 2019 10:18 AM
To: Robert Reilly ; amanda-users@amanda.org
Subject: Re: amandad defunct on client calcsize/sendsize hung

Hi,

the hanging processes are strange, no idea about that.

But I noticed you haven't defined "etimeout" (the time amanda waits for 
estimate results in seconds) in your amanda.conf. IIRC it defaults to something 
around 10 min. and that might be too short for the client.
Does it help to set
etimeout 3600
in amanda.conf? If yes, it's probably a good idea to define "dtimeout"
as well and set it to twice or thrice the value of etimeout.

Jens

On 9/11/2019 2:30 PM, Robert Reilly wrote:
> All,
> 
> I also tried this on an 18.04 client wit 3.5.1 amanda client and I get 
> the same result defunct process and no backup. I am at a loss as what 
> could be causing this I have spent many hours debugging. Any thoughts ?
> 
>  
> 
> *From:* owner-amanda-us...@amanda.org  
> *On Behalf Of *Robert Reilly
> *Sent:* Friday, September 6, 2019 10:55 AM
> *To:* amanda-users@amanda.org
> *Subject:* amandad defunct on client calcsize/sendsize hung
> 
>  
> 
> Hi, All I have setup Amanda, it works great for the Amanda server 
> itself but fails to backup any clients.
> 
> Server:
> 
> Ubuntu 18.04
> 
> Amanda 3.5.1 ( ubuntu repo pkg )
> 
>  Client
> 
> Ubuntu 16.04
> 
> Amanda 3.3.6
> 
>  
> 
> Immediately after starting the backup this is the state of the procs 
> on the client
> 
> backup   30608 1  0 11:36 ?    00:00:00 /lib/systemd/systemd 
> --user
> 
> backup   30609 30608  0 11:36 ?    00:00:00 (sd-pam)
> 
> backup   30645 30606  0 11:36 ?    00:00:00 sshd: backup@notty
> 
> backup   30646 30645  0 11:36 ?    00:00:00 
> /usr/lib/amanda/amandad -auth=ssh
> 
> backup   30648 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30679 30646  0 11:36 ?    00:00:00 
> /usr/lib/amanda/sendsize amandad ssh
> 
> backup   30680 30646  0 11:36 ?    00:00:00 [amandad] 
> 
> backup   30683 30679  0 11:36 ?    00:00:00 
> /usr/lib/amanda/sendsize amandad ssh
> 
>  
> 
>  Amanda.conf
> 
> https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb
> 
>  
> 
> #
> 
>  
> 
> Client Debug Log
> 
>  
> 
> https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d
> 
>  
> 
> ##
> 
> Thanks in advance..
> 
> Robert
> 
>  
> 
>  
>

RE: amandad defunct on client calcsize/sendsize hung

2019-09-11 Thread Robert Reilly

All,
I also tried this on an 18.04 client wit 3.5.1 amanda client and I get the same 
result defunct process and no backup. I am at a loss as what could be causing 
this I have spent many hours debugging. Any thoughts ?

From: owner-amanda-us...@amanda.org  On Behalf 
Of Robert Reilly
Sent: Friday, September 6, 2019 10:55 AM
To: amanda-users@amanda.org
Subject: amandad defunct on client calcsize/sendsize hung

Hi, All I have setup Amanda, it works great for the Amanda server itself but 
fails to backup any clients.
Server:
Ubuntu 18.04
Amanda 3.5.1 ( ubuntu repo pkg )
 Client
Ubuntu 16.04
Amanda 3.3.6

Immediately after starting the backup this is the state of the procs on the 
client
backup   30608 1  0 11:36 ?00:00:00 /lib/systemd/systemd --user
backup   30609 30608  0 11:36 ?00:00:00 (sd-pam)
backup   30645 30606  0 11:36 ?00:00:00 sshd: backup@notty
backup   30646 30645  0 11:36 ?00:00:00 /usr/lib/amanda/amandad 
-auth=ssh
backup   30648 30646  0 11:36 ?00:00:00 [amandad] 
backup   30679 30646  0 11:36 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   30680 30646  0 11:36 ?00:00:00 [amandad] 
backup   30683 30679  0 11:36 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh

 Amanda.conf
https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb

#

Client Debug Log

https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d

##
Thanks in advance..
Robert

amandad defunct on client calcsize/sendsize hung

2019-09-06 Thread Robert Reilly

Hi, All I have setup Amanda, it works great for the Amanda server itself but 
fails to backup any clients.
 Server:
Ubuntu 18.04
Amanda 3.5.1 ( ubuntu repo pkg )
 Client
Ubuntu 16.04
Amanda 3.3.6

Immediately after starting the backup this is the state of the procs on the 
client
backup   30608 1  0 11:36 ?00:00:00 /lib/systemd/systemd --user
backup   30609 30608  0 11:36 ?00:00:00 (sd-pam)
backup   30645 30606  0 11:36 ?00:00:00 sshd: backup@notty
backup   30646 30645  0 11:36 ?00:00:00 /usr/lib/amanda/amandad 
-auth=ssh
backup   30648 30646  0 11:36 ?00:00:00 [amandad] 
backup   30679 30646  0 11:36 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh
backup   30680 30646  0 11:36 ?00:00:00 [amandad] 
backup   30683 30679  0 11:36 ?00:00:00 /usr/lib/amanda/sendsize 
amandad ssh

 Amanda.conf
https://gist.github.com/rreilly-edr/0fd11aff9cb4e157b614cf74c9b51ddb

#

Client Debug Log

https://gist.github.com/rreilly-edr/37ea6314b19ecc96f31c3d846df0219d

##
Thanks in advance..
Robert

Re: amanda 3.5.1 ignoring option amandad-path in amanda.conf

2018-10-30 Thread Jose M Calhariz

On Tue, Oct 30, 2018 at 07:08:31AM -0400, Nathan Stratton Treadway wrote:
> On Mon, Oct 29, 2018 at 19:01:45 +, Jose M Calhariz wrote:
> > To workaround this I am using a dumptype with:
> > 
> > amandad-path "/usr/amanda/amandad"
> > 
> > After some changes in disklist I think this option is not working.
> > That is strange.  Is anyone using successfully this option?  What
> > version of amanda are you using in the server?  Anyone can check the
> > code?
> 
> I haven't used this option myself, but from the amanda.conf man page
> entry for it the first question to ask is whether you are indeed using
> ssh authentication in this particular dumptype?

Yes, I am using only ssh authentication on this server.


> 
> In a quick scan of the (upstream) git repo changelog I don't see any
> recent changes that obviously affect this option.
> 
> If no one else with actual experience using this option chimes in, I
> could try to take a closer look at the code, but it would probably be
> easier to track down where in the source to look if you posted the full
> definition of the dumptype in question and also the exact text of the
> related messages you are finding in the log files

This is the debug dumptype, the comments are my try to pinpoint the
problem.


define dumptype special {
#  Global definitions
#  global
#  lev-medium
#  rhel

  # From global
  # comment "Global definitions"
  # index yes
auth "ssh"
ssh_keys "/var/backups/.ssh/id_rsa_amdump"
  # max-warnings 111
  # comprate 99, 99
  # compress best
exclude list  optional ".amanda-exclude.list"

  # From lev-medium
  # program "APPLICATION"
  # application "amgtar"
  # priority medium
  # maxpromoteday 6

  # From rhel
  client_username "amandabackup"
  amandad-path "/usr/lib64/amanda/amandad"
  }



The relevant error on amanda server, when I run "amcheck Daily -c
host.fqdn" is:

sh: /usr/lib/amanda/amandad

> 
>   Nathan
> 
> 
> Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
> Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
>  GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
>  Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239
> 
>

Kind regards
Jose M Calhariz



-- 
--

Todo homem deveria ser incendiário até os 25 anos de idade, e bombeiro dali em 
diante

--Alceu Amoroso Lima


signature.asc
Description: PGP signature

Re: amanda 3.5.1 ignoring option amandad-path in amanda.conf

2018-10-30 Thread Nathan Stratton Treadway

On Mon, Oct 29, 2018 at 19:01:45 +, Jose M Calhariz wrote:
> To workaround this I am using a dumptype with:
> 
> amandad-path "/usr/amanda/amandad"
> 
> After some changes in disklist I think this option is not working.
> That is strange.  Is anyone using successfully this option?  What
> version of amanda are you using in the server?  Anyone can check the
> code?

I haven't used this option myself, but from the amanda.conf man page
entry for it the first question to ask is whether you are indeed using
ssh authentication in this particular dumptype?

In a quick scan of the (upstream) git repo changelog I don't see any
recent changes that obviously affect this option.

If no one else with actual experience using this option chimes in, I
could try to take a closer look at the code, but it would probably be
easier to track down where in the source to look if you posted the full
definition of the dumptype in question and also the exact text of the
related messages you are finding in the log files

Nathan


Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

amanda 3.5.1 ignoring option amandad-path in amanda.conf

2018-10-29 Thread Jose M Calhariz


Hi,

I am using the latest amanda 3.5.1 in Debian stretch and I want to do
backup of other Linux systems, in this case CentOS.  The amandad
daemon in Debian packages is located in /usr/lib/amanda/amandad but in
CentOS rpms is located in /usr/lib64/amanda/amandad.

To workaround this I am using a dumptype with:

amandad-path "/usr/amanda/amandad"

After some changes in disklist I think this option is not working.
That is strange.  Is anyone using successfully this option?  What
version of amanda are you using in the server?  Anyone can check the
code?

Kind regards
Jose M Calhariz

-- 
--

Eu quero ficar rica, mas não quero fazer o que é preciso para ficar rica

--Gertrude Stein


signature.asc
Description: PGP signature

Re: how to change the amandad server (3.4.3) listening service port?

2017-03-29 Thread Charles Curley

On Wed, 29 Mar 2017 14:45:18 -0400
Jean-Louis Martineau  wrote:

> It is xinetd that is listening, so it must be changed there:
> In the xinetd file:
>service 11080
>...

Or set the 3.3.9 instance to use ssh. It is a bit more work, but will
allow you to get rid of xinetd entirely and make your installation just
a bit more secure.

-- 

The right of the people to be secure in their persons, houses, papers,
and effects, against unreasonable searches and seizures, shall not be
violated, and no Warrants shall issue, but upon probable cause,
supported by Oath or affirmation, and particularly describing the
place to be searched, and the persons or things to be seized.
-- U.S. Const. Amendment IV

Key fingerprint = CE5C 6645 A45A 64E4 94C0  809C FFF6 4C48 4ECD DFDB

Re: how to change the amandad server (3.4.3) listening service port?

2017-03-29 Thread Jean-Francois Malouin

* Jean-Louis Martineau  [20170329 14:47]:
> On 29/03/17 02:34 PM, Jean-Francois Malouin wrote:
> > Hi,
> >
> > For whatever reasons I'm stuck for a little while with an amanda server
> > with 2 different installations (one is community version 3.4.3 and the
> > other is a zmanda enterprise edition running 3.3.9). I'd like to have
> > them running concurrently and listening to different tcp ports under
> > xinetd until one of them (the 3.4.3) can be disposed of -- a few weeks I
> > guess until all the client DLEs are moved over.
> > Both are running bsdtcp auth.
> >
> > How can I have one instance of amanda running on the default amanda
> > service port 10080/tcp (the 3.3.9 server) and the other one (the 3.4.3
> > server) some other port? The 3.4.3 server is backing up only one client
> > running 3.3.0. I assume that for this lone client I can set the option
> > 'client-port [ int | string ]' in its dumptype but how do I make the
> > server listen to some other port than the default amanda port 10080? Say
> > 11080. Which configure/compile options do I have to set while
> > recompiling the 3.4.3 server? Any hints?
> 
> It is xinetd that is listening, so it must be changed there:
> In the xinetd file:
>service 11080
>...

Of course!  I'll give it a try.
Such a simple answer for a convoluted question!

Thanks Jean-Louis!
jf

> Jean-Louis

Re: how to change the amandad server (3.4.3) listening service port?

2017-03-29 Thread Jean-Louis Martineau

On 29/03/17 02:34 PM, Jean-Francois Malouin wrote:
> Hi,
>
> For whatever reasons I'm stuck for a little while with an amanda server
> with 2 different installations (one is community version 3.4.3 and the
> other is a zmanda enterprise edition running 3.3.9). I'd like to have
> them running concurrently and listening to different tcp ports under
> xinetd until one of them (the 3.4.3) can be disposed of -- a few weeks I
> guess until all the client DLEs are moved over.
> Both are running bsdtcp auth.
>
> How can I have one instance of amanda running on the default amanda
> service port 10080/tcp (the 3.3.9 server) and the other one (the 3.4.3
> server) some other port? The 3.4.3 server is backing up only one client
> running 3.3.0. I assume that for this lone client I can set the option
> 'client-port [ int | string ]' in its dumptype but how do I make the
> server listen to some other port than the default amanda port 10080? Say
> 11080. Which configure/compile options do I have to set while
> recompiling the 3.4.3 server? Any hints?

It is xinetd that is listening, so it must be changed there:
In the xinetd file:
   service 11080
   ...

Jean-Louis
This message is the property of CARBONITE, INC. and may contain confidential or 
privileged information.
If this message has been delivered to you by mistake, then do not copy or 
deliver this message to anyone.  Instead, destroy it and notify me by reply 
e-mail

how to change the amandad server (3.4.3) listening service port?

2017-03-29 Thread Jean-Francois Malouin

Hi,

For whatever reasons I'm stuck for a little while with an amanda server
with 2 different installations (one is community version 3.4.3 and the
other is a zmanda enterprise edition running 3.3.9). I'd like to have
them running concurrently and listening to different tcp ports under
xinetd until one of them (the 3.4.3) can be disposed of -- a few weeks I
guess until all the client DLEs are moved over. 
Both are running bsdtcp auth.

How can I have one instance of amanda running on the default amanda
service port 10080/tcp (the 3.3.9 server) and the other one (the 3.4.3
server) some other port? The 3.4.3 server is backing up only one client
running 3.3.0. I assume that for this lone client I can set the option
'client-port [ int | string ]' in its dumptype but how do I make the
server listen to some other port than the default amanda port 10080? Say
11080. Which configure/compile options do I have to set while
recompiling the 3.4.3 server? Any hints?

Thanks!
jf

Re: amandad running as the wrong user

2012-08-11 Thread Subscriptions

Thanks for the detailed info.

Given that 3.3.2 for Ubuntu was released recently I uninstalled the
server,common and the client, installed 3.3.2 and everything now works
as expected; no changes needed anywhere.

Thanks

Leo


On 12/08/12 09:32, Nathan Stratton Treadway wrote:
> On Mon, Aug 06, 2012 at 22:23:30 +1000, Subscriptions wrote:
>> Hi have posted this in the forum too, but have had no replies
>>
>> I have now downgraded/installed 3.3.0 on Ubuntu 12.04 LTS since the
>> problem has been fixed but am now getting the following error
>>
>> WARNING: ml.zudiewiener.com: selfcheck request failed:
>> tcpm_recv_token: invalid size: "amandad: running as user
>> \"amandabackup\" instead of \"backup\"\n"
>> Client check: 1 host checked in 0.045 seconds. 1 problem found.
>>
>>
>> Everything is set up to use amandabackup as the user (and this
>> worked for months prior to upgrading to 12.04), so not sure why it
>> expects the user to be 'backup'
> Debian (and thus Ubuntu) have always used the "backup" user in their
> standard Amanda packages.  So if your system was in fact using
> "amandabackup" before 12.04, then you must have installed Amanda some
> other way back then (e.g. from source, or using the .deb file from
> amanda.org).
>
> Note, though, that there are accounts in use on both the client and the
> server; it's certainly possible that your Ubuntu system was using
> "backup" as the account on the client side while your server has been
> using "amandabackup" all along.
>
> However, regarding your current problem: it does appear that the
> /etc/xinetd.d/amanda file included in the 12.04 amanda-common package
> specifies "user=amandabackup" for some reason; I assume this is a bug in
> the packaging.
>
> (If you do a 
>   $ grep inetd /var/lib/dpkg/info/amanda-common.postinst
> , you'll see that the setup command used for other-than-xinetd internet
> "super-server" daemons is
>   update-inetd --add "amanda stream tcp nowait backup /usr/lib/amanda/amandad 
> amandad -auth=bsdtcp amdump amindexd amidxtaped"
> ... i.e. it uses "backup" as the user.)
>
> So, if you haven't already done so, I think updating
> /etc/xinetd.d/amanda so that the user= line uses "backup" should fix
> your problem.
>
> Note that you probably don't need to edit the /etc/amandahosts file,
> since that grants incoming permission to the account on the server side
> (which presumably hasn't changed).
>  
> Hope that helps.
>
>   Nathan
>
> 
> Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
> Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
>  GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
>  Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad running as the wrong user

2012-08-11 Thread Nathan Stratton Treadway

On Mon, Aug 06, 2012 at 22:23:30 +1000, Subscriptions wrote:
> 
> Hi have posted this in the forum too, but have had no replies
> 
> I have now downgraded/installed 3.3.0 on Ubuntu 12.04 LTS since the
> problem has been fixed but am now getting the following error
> 
> WARNING: ml.zudiewiener.com: selfcheck request failed:
> tcpm_recv_token: invalid size: "amandad: running as user
> \"amandabackup\" instead of \"backup\"\n"
> Client check: 1 host checked in 0.045 seconds. 1 problem found.
> 
> 
> Everything is set up to use amandabackup as the user (and this
> worked for months prior to upgrading to 12.04), so not sure why it
> expects the user to be 'backup'

Debian (and thus Ubuntu) have always used the "backup" user in their
standard Amanda packages.  So if your system was in fact using
"amandabackup" before 12.04, then you must have installed Amanda some
other way back then (e.g. from source, or using the .deb file from
amanda.org).

Note, though, that there are accounts in use on both the client and the
server; it's certainly possible that your Ubuntu system was using
"backup" as the account on the client side while your server has been
using "amandabackup" all along.

However, regarding your current problem: it does appear that the
/etc/xinetd.d/amanda file included in the 12.04 amanda-common package
specifies "user=amandabackup" for some reason; I assume this is a bug in
the packaging.

(If you do a 
  $ grep inetd /var/lib/dpkg/info/amanda-common.postinst
, you'll see that the setup command used for other-than-xinetd internet
"super-server" daemons is
  update-inetd --add "amanda stream tcp nowait backup /usr/lib/amanda/amandad 
amandad -auth=bsdtcp amdump amindexd amidxtaped"
... i.e. it uses "backup" as the user.)

So, if you haven't already done so, I think updating
/etc/xinetd.d/amanda so that the user= line uses "backup" should fix
your problem.

Note that you probably don't need to edit the /etc/amandahosts file,
since that grants incoming permission to the account on the server side
(which presumably hasn't changed).

Hope that helps.

Nathan

Nathan Stratton Treadway  -  natha...@ontko.com  -  Mid-Atlantic region
Ray Ontko & Co.  -  Software consulting services  -   http://www.ontko.com/
 GPG Key: http://www.ontko.com/~nathanst/gpg_key.txt   ID: 1023D/ECFB6239
 Key fingerprint = 6AD8 485E 20B9 5C71 231C  0C32 15F3 ADCD ECFB 6239

Re: amandad running as the wrong user

2012-08-06 Thread Subscriptions

xinetd.d/amandaserver looks like



---

service amanda

{

disable = no

flags   = IPv4

socket_type = stream

protocol= tcp

wait= no

user= amandabackup

group   = disk

groups  = yes

server  = /usr/lib/amanda/amandad

server_args = -auth=bsdtcp amdump amindexd amidxtaped

}





amandahosts looks like



localhost   root amindexd amidxtaped

localhost amandabackup amdump

localhost.localdomain   root amindexd amidxtaped

localhost.localdomain amandabackup amdump

-



Note sure what you mean by

$amanda_local_account



Cheers,


On 07/08/12 01:01, Bryan Hodgson wrote:
> On Mon, Aug 06, 2012 at 10:23:30PM +1000, Subscriptions wrote:
>>  Everything is set up to use amandabackup as the user (and this
>>  worked for months prior to upgrading to 12.04), so not sure why it
>>  expects the user to be 'backup'
>>
> Doesn't matter.  Fix xinetd.d/amanda, ~$amanda_local_account/.amandahosts
> to permit it to run, and move on.
>

Re: amandad running as the wrong user

2012-08-06 Thread Jon LaBadie

On Mon, Aug 06, 2012 at 10:23:30PM +1000, Subscriptions wrote:
> 
> Hi have posted this in the forum too, but have had no replies
> 
> I have now downgraded/installed 3.3.0 on Ubuntu 12.04 LTS since the
> problem has been fixed but am now getting the following error
> 
> WARNING: ml.zudiewiener.com: selfcheck request failed:
> tcpm_recv_token: invalid size: "amandad: running as user
> \"amandabackup\" instead of \"backup\"\n"
> Client check: 1 host checked in 0.045 seconds. 1 problem found.
> 
> 
> Everything is set up to use amandabackup as the user (and this
> worked for months prior to upgrading to 12.04), so not sure why it
> expects the user to be 'backup'

Just a guess, perhaps the pre-built amanda package you are using
was compiled with a default user of "backup".

As a workaround, it might work to create a user named "backup"
with the same uid, gid, and home dir as "amandabackup".  Simplest
way is to edit /etc/passwd and /etc/shadow.  Duplicate the "amandabackup"
lines in those files and delete the leading 6 characters of the new line.

jl
-- 
Jon H. LaBadie j...@jgcomp.com
 11226 South Shore Rd.  (703) 787-0688 (H)
 Reston, VA  20190  (609) 477-8330 (C)

amandad running as the wrong user

2012-08-06 Thread Subscriptions


Hi have posted this in the forum too, but have had no replies

I have now downgraded/installed 3.3.0 on Ubuntu 12.04 LTS since the
problem has been fixed but am now getting the following error

WARNING: ml.zudiewiener.com: selfcheck request failed:
tcpm_recv_token: invalid size: "amandad: running as user
\"amandabackup\" instead of \"backup\"\n"
Client check: 1 host checked in 0.045 seconds. 1 problem found.


Everything is set up to use amandabackup as the user (and this
worked for months prior to upgrading to 12.04), so not sure why it
expects the user to be 'backup'

<https://forums.zmanda.com/editpost.php?p=14838&do=editpost><https://forums.zmanda.com/newreply.php?do=newreply&p=14838&noquote=1><https://forums.zmanda.com/newreply.php?do=newreply&p=14838>

Re: long running amandad on client

2010-10-08 Thread Dustin J. Mitchell

On Fri, Oct 8, 2010 at 1:07 PM, John Hein  wrote:
> I have a client with an amandad that has been running since Sep 23...
..
> 2000 APPLICATION 36 
> |;auth=BSD;compress-fast;index;exclude-file=.no-amanda-backup;exclude-file=.nobak;exclude-file=.noback;exclude-file=.nodump;

I would avoid BSD auth if possible.  It uses UDP, so there is a bit of
a race where amandad must wait around long enough to catch any new
packets, but must exit at some point.  Currently, this timeout is set
to 30 seconds, but there may be (or have been) a particular case in
which amandad can get "stuck" waiting for a new packet.

The file-descriptor leaking is still a problem - amandad is in awful
shape, to be honest - but zombie collection has improved somewhat
(with band-aids, not a fix, but..)

Dustin

-- 
Open Source Storage Engineer
http://www.zmanda.com

long running amandad on client

2010-10-08 Thread John Hein

I have a client with an amandad that has been running since Sep 23...

backup 97592  0.0  0.1 26780  7016  ??  Ss   23Sep10  40:30.43 amandad

Most of the backups on that client still work fine.  But two DLEs
fail nightly.  On the server, you get:

1286525084.841860: chunker: getcmd: START 20101007210002
1286525084.841877: chunker: getcmd: PORT-WRITE 00-00195 
/holding/20101007210002/someclient._somedle.0 someclient 9ffe7f 
/somedle 0 1970:1:1:0:0:0 51
2000 APPLICATION 36 
|;auth=BSD;compress-fast;index;exclude-file=.no-amanda-backup;exclude-file=.nobak;exclude-file=.noback;exclude-file=.nodump;
1286525084.842069: chunker: stream_server opening socket with family 2 
(requested family was 2)
1286525084.842086: chunker: try_socksize: receive buffer size is 65536
1286525084.844115: chunker: bind_portrange2: Try  port 11017: Available - 
Success
1286525084.844135: chunker: stream_server: waiting for connection: 0.0.0.0.11017
1286525084.844142: chunker: putresult: 23 PORT
1286525084.847225: chunker: stream_accept: connection from 127.0.0.1.11002
1286525084.847233: chunker: try_socksize: receive buffer size is 65536
1286525264.872340: chunker: putresult: 10 FAILED
1286525264.872462: chunker: pid 18935 finish time Fri Oct  8 02:07:44 2010


The amandad log on the client shows nothing at the 1286525084
timestamp (yes, the hosts in questions have good time sync).

It does show sendbackup entries after the 3 minute timeout on the
server above (1286525264 timestamp).  So the client amandad seems to
just be slow in responding.


It's not clear why this long running amandad is slow in responding for
a couple DLEs, but it's definitely abnormal to have such a long
running amandad to begin with.


lsof shows lots of open file descriptors like so:

amandad 97592 backup  609u  PIPE 0xff01dccfa00016384
->0xff01dccfa158

There may be a descriptor leak bug, but that's sort of unimportant since
_usually_ amandad runs only briefly.

The real question is: why is amandad not exiting?

Has anyone seen this before?

I plan to kill amandad on the client, but I'll leave it running for a
bit longer in case there might be something that can be learned.
Unfortunately, this is amanda-2.6.1p1 on the client, so interest in
learning about this anomaly in that code is likely low.

Re: hard coded limit REP_TIMEOUT of 6hrs in amandad-src/amandad.c

2009-09-30 Thread Jean-Louis Martineau


Jean-Francois Malouin wrote:

* Dustin J. Mitchell  [20090908 13:08]:
  

On Tue, Sep 8, 2009 at 12:03 PM, Jean-Francois
Malouin wrote:


Hmmm, more than a week now and no replies.
So I'll attempt to fix it myself: just to be on the safe side,
any adverse effect to just bump up REP_TIMEOUT to, say 10hrs?
  

Sorry, I thought I replied, but what I meant to say was essentially:
yes, that sounds like something to fix :)

As long as the client will die if the TCP connection goes away, I
would prefer to get rid of the timeout altogether.  These "long
enough" timeouts are really only relevant for UDP, where the OS
doesn't notify us of a lost connection (since there is no connection).



I'm reviving this thread as I got hit again last night on a server
running 2.6.1p1 that I reinstalled with REP_TIMEOUT=(12*60*60) ie,
12hrs. I'm still getting 6hrs timeouts on some DLEs:

planner: [disk /raid/nih, all estimate timed out] 
  

REP_TIMEOUT is a client timeout, It must be configured on the client.
You can also use "estimate server", less precise estimate but a lot faster.

Jean-Louis

Re: hard coded limit REP_TIMEOUT of 6hrs in amandad-src/amandad.c

2009-09-28 Thread Dustin J. Mitchell

On Sat, Sep 26, 2009 at 1:06 PM, Jean-Francois Malouin
 wrote:
> exactly 6 hours after amanda started its run.
> Is there a timeout on the client side as well?
> Client is at 2.5.2p1.

I think that the timeout is bidirectional.

Dustin

-- 
Open Source Storage Engineer
http://www.zmanda.com

Re: hard coded limit REP_TIMEOUT of 6hrs in amandad-src/amandad.c

2009-09-26 Thread Jean-Francois Malouin

* Dustin J. Mitchell  [20090908 13:08]:
> On Tue, Sep 8, 2009 at 12:03 PM, Jean-Francois
> Malouin wrote:
> > Hmmm, more than a week now and no replies.
> > So I'll attempt to fix it myself: just to be on the safe side,
> > any adverse effect to just bump up REP_TIMEOUT to, say 10hrs?
> 
> Sorry, I thought I replied, but what I meant to say was essentially:
> yes, that sounds like something to fix :)
> 
> As long as the client will die if the TCP connection goes away, I
> would prefer to get rid of the timeout altogether.  These "long
> enough" timeouts are really only relevant for UDP, where the OS
> doesn't notify us of a lost connection (since there is no connection).

I'm reviving this thread as I got hit again last night on a server
running 2.6.1p1 that I reinstalled with REP_TIMEOUT=(12*60*60) ie,
12hrs. I'm still getting 6hrs timeouts on some DLEs:

planner: [disk /raid/nih, all estimate timed out] 

exactly 6 hours after amanda started its run. 
Is there a timeout on the client side as well?
Client is at 2.5.2p1.

regards,
jf

> 
> Dustin
> 
> -- 
> Open Source Storage Engineer
> http://www.zmanda.com

Re: hard coded limit REP_TIMEOUT of 6hrs in amandad-src/amandad.c

2009-09-08 Thread Dustin J. Mitchell

On Tue, Sep 8, 2009 at 12:03 PM, Jean-Francois
Malouin wrote:
> Hmmm, more than a week now and no replies.
> So I'll attempt to fix it myself: just to be on the safe side,
> any adverse effect to just bump up REP_TIMEOUT to, say 10hrs?

Sorry, I thought I replied, but what I meant to say was essentially:
yes, that sounds like something to fix :)

As long as the client will die if the TCP connection goes away, I
would prefer to get rid of the timeout altogether.  These "long
enough" timeouts are really only relevant for UDP, where the OS
doesn't notify us of a lost connection (since there is no connection).

Dustin

-- 
Open Source Storage Engineer
http://www.zmanda.com

Re: hard coded limit REP_TIMEOUT of 6hrs in amandad-src/amandad.c

2009-09-08 Thread Jean-Francois Malouin

* Jean-Francois Malouin  [20090831 
13:38]:
> Hi,
> 
> I was hitting this hard coded limit a couple years ago running
> 2.5.2-20070523 and over the last month it started to show up again on
> a client having a lot of big DLEs (20 or so for a total of 11TB).
> Client is running 2.6.0p2 and server is at 2.6.1p1. Back then
> Jean-Louis provided me with a patch that reseted the timeout after
> each estimates but just perusing the source I see that amandad.c
> has been extensively rewritten. Anyone cares to have a look at this?

Hmmm, more than a week now and no replies.
So I'll attempt to fix it myself: just to be on the safe side, 
any adverse effect to just bump up REP_TIMEOUT to, say 10hrs?

jf

hard coded limit REP_TIMEOUT of 6hrs in amandad-src/amandad.c

2009-08-31 Thread Jean-Francois Malouin

Hi,

I was hitting this hard coded limit a couple years ago running
2.5.2-20070523 and over the last month it started to show up again on
a client having a lot of big DLEs (20 or so for a total of 11TB).
Client is running 2.6.0p2 and server is at 2.6.1p1. Back then
Jean-Louis provided me with a patch that reseted the timeout after
each estimates but just perusing the source I see that amandad.c
has been extensively rewritten. Anyone cares to have a look at this?

thanks,
jf
-- 
<° >< Jean-François Malouin  McConnell Brain Imaging Centre
Systems/Network Administrator   Montréal Neurological Institute
3801 Rue Université, Suite WB219  Montréal, Québec, H3A 2B4
Phone: 514-398-8924   Fax: 514-398-8948

Re: Error - tcpm_recv_token: invalid size: amandad:

2008-12-04 Thread Matt Burkhardt

Thanks Dustin!

Turns out that there was a permissions problem with /var/lib/amanda - I
had at first installed with the repos for Ubuntu, then uninstalled and
used the packages from Zmanda - so just changed the permissions to
amandabackup:disk and amcheck runs fine.

On Wed, 2008-12-03 at 21:18 -0500, Dustin J. Mitchell wrote:

> On Wed, Dec 3, 2008 at 5:32 PM, Matt Burkhardt <[EMAIL PROTECTED]> wrote:
> > WARNING: 192.168.1.102: selfcheck request failed: tcpm_recv_token: invalid
> > size: amandad:
> 
> What do you see if you telnet to that IP on port 10080?
> 
> Dustin
> 

Matt Burkhardt, MSTM
President
Impari Systems, Inc.
502 Fairview Avenue
Frederick, MD  21701
[EMAIL PROTECTED]
www.imparisystems.com
(301) 682-7901

Re: Error - tcpm_recv_token: invalid size: amandad:

2008-12-03 Thread Dustin J. Mitchell

On Wed, Dec 3, 2008 at 5:32 PM, Matt Burkhardt <[EMAIL PROTECTED]> wrote:
> WARNING: 192.168.1.102: selfcheck request failed: tcpm_recv_token: invalid
> size: amandad:

What do you see if you telnet to that IP on port 10080?

Dustin

-- 
Storage Software Engineer
http://www.zmanda.com

Re: amandad args in inetd.conf

2008-09-26 Thread John Hein

[EMAIL PROTECTED] wrote at 13:09 -0400 on Sep 26, 2008:
 > Hmm, thanks for mentioning.  If I condense down my browser window,
 > the text goes beyond the bounding box but I can still scroll over
 > to see the full line of text.  Are you not getting the same?

Exactly the same.  I was wondering if there is a wiki markup directive
to have the bounding box extend to the end of the text rather than the
border of the window.


 > > And somehow between 2.5 & 2.6, the .amandahost docs in amanda(8) lost
 > > the docs for the 'service(s)' field.
 > >
 > 
 > I'm seeing the following in 2.6.0p2's amanda(8):

Woops.  My fault... bad zgrep for the strings in the various flavors
of amanda I have installed (man pages in the 2.6* flavors moved into
/share/man instead of /man).

Re: amandad args in inetd.conf

2008-09-26 Thread pyeatman

From: "John Hein" <[EMAIL PROTECTED]>
Sent: Wednesday, September 24, 2008 11:00pm
> Paul Yeatman wrote at 16:59 -0700 on Sep 24, 2008:
 > > Online Amanda documentation for inetd.conf configuration for both
 > > server and client are found on the Amanda wiki site here
 > > 
 > > server:
 > > 
> >  
> > http://wiki.zmanda.com/index.php/Configuring_bsd/bsdudp/bsdtcp_authentication#xinetd.2Finetd_configuration_file_changes
> 
> Thanks.  Perhaps someone should add this info to the man pages.
>

Agreed.  I am thus working on having this included in the next release.

> I wonder how one might go about fixing the wikimedia markup to
> extend the bounding box in the 'inetd.conf' example (assuming
> your browser window is not really wide).

Hmm, thanks for mentioning.  If I condense down my browser window, the text 
goes beyond the bounding box but I can still scroll over to see the full line 
of text.  Are you not getting the same?

>
> And somehow between 2.5 & 2.6, the .amandahost docs in amanda(8) lost
> the docs for the 'service(s)' field.
>

I'm seeing the following in 2.6.0p2's amanda(8):

   "The format of the .amandahosts file is:

   hostname [ username [ service ]*]

   If username is ommitted, it defaults to the user running amandad,
   i.e. the user listed in the inetd or xinetd configuration file.

   The service is a list of the service the client is authorized to
   execute: amdump, noop, selfcheck, sendsize, sendbackup, amindexd,
   amidxtaped.  amdump is a shortcut for "noop selfcheck sendsize
   sendbackup"

> 
 > > client:
 > > http://wiki.zmanda.com/index.php/Quick_start#Configuring_inetd_on_the_client

> Thanks again.

No problem.

Paul

Re: amandad args in inetd.conf

2008-09-24 Thread John Hein

Paul Yeatman wrote at 16:59 -0700 on Sep 24, 2008:
 > Online Amanda documentation for inetd.conf configuration for both
 > server and client are found on the Amanda wiki site here
 > 
 > server:
 > http://wiki.zmanda.com/index.php/Configuring_bsd/bsdudp/bsdtcp_authentication#xinetd.2Finetd_configuration_file_changes

Thanks.  Perhaps someone should add this info to the man pages.

I wonder how one might go about fixing the wikimedia markup to
extend the bounding box in the 'inetd.conf' example (assuming
your browser window is not really wide).

And somehow between 2.5 & 2.6, the .amandahost docs in amanda(8) lost
the docs for the 'service(s)' field.


 > client:
 > http://wiki.zmanda.com/index.php/Quick_start#Configuring_inetd_on_the_client

Thanks again.

Re: amandad args in inetd.conf

2008-09-24 Thread Paul Yeatman

Online Amanda documentation for inetd.conf configuration for both server and 
client are found on the Amanda wiki site here

server:
http://wiki.zmanda.com/index.php/Configuring_bsd/bsdudp/bsdtcp_authentication#xinetd.2Finetd_configuration_file_changes

client:
http://wiki.zmanda.com/index.php/Quick_start#Configuring_inetd_on_the_client

Paul

Re: amandad args in inetd.conf

2008-09-23 Thread John Hein

Olivier Cherrier wrote at 18:46 -0400 on Sep 23, 2008:
 > On Tue, Sep 23, 2008 at 10:27:26AM -0600, [EMAIL PROTECTED] wrote:
 > > Where are the docs for what args need to be added to amandad in
 > > inetd.conf?
 > > 
 > > I added amindexd and amidxtaped on the backup server in order to do
 > > amrecover, but then amcheck failed (needed noop, then selfcheck).
 > > Then amdump failed (needed sendsize, ...).
 >  
 > I think you have to populate your ~amandauser/.amandahosts with
 > something like that (amandauser = operator for me) :
 > yourHost operatoramdump
 > 
 > >From amanda(8): "amdump is a shortcut for "noop selfcheck sendsize
 > sendbackup"

Ah... difference between amanda-2.5 and amanda-2.6.  At least a
documentation difference.  2.5.* seems to be missing that
documentation in the context of inetd.conf and 2.6.* doesn't have that
tidbit at all (in fact it doesn't seem to support the service entry in
.amandahosts in the 2.6 man page).  I'm not sure if that's an
oversight or deliberate.

I also wonder if that alias is valid in inetd.conf.

 > > I see the full list in amandad.c, and I think I understand why clients
 > > don't need the addition.  It defaults to all services except amindexd
 > > and amidxtaped being active.  But when you activate those two on the
 > > server, it seems it _de_activates the others.
 >  
 > I myself configured inetd.conf like that:
 > $ grep amandad /etc/inetd.conf 
 > amandadgram  udp wait   operator /usr/local/libexec/amanda/amandad 
 > amandad amindexd amidxtaped

So you don't backup your server?  Or you use .amandahosts to specify
the services that amandad can run on the server?  I guess I don't
see a big difference between editing .amandahosts vs. inetd.conf

Re: amandad args in inetd.conf

2008-09-23 Thread Olivier Cherrier

On Tue, Sep 23, 2008 at 10:27:26AM -0600, [EMAIL PROTECTED] wrote:
> Where are the docs for what args need to be added to amandad in
> inetd.conf?
> 
> I added amindexd and amidxtaped on the backup server in order to do
> amrecover, but then amcheck failed (needed noop, then selfcheck).
> Then amdump failed (needed sendsize, ...).
 
I think you have to populate your ~amandauser/.amandahosts with
something like that (amandauser = operator for me) :
yourHostoperatoramdump

>From amanda(8): "amdump is a shortcut for "noop selfcheck sendsize
sendbackup"

> I see the full list in amandad.c, and I think I understand why clients
> don't need the addition.  It defaults to all services except amindexd
> and amidxtaped being active.  But when you activate those two on the
> server, it seems it _de_activates the others.
 
I myself configured inetd.conf like that:
$ grep amandad /etc/inetd.conf 
amandadgram  udp wait   operator /usr/local/libexec/amanda/amandad amandad 
amindexd amidxtaped


Later,

-- 
Olivier Cherrier - Symacx.com
mailto:[EMAIL PROTECTED]

amandad args in inetd.conf

2008-09-23 Thread John Hein

Where are the docs for what args need to be added to amandad in
inetd.conf?

I added amindexd and amidxtaped on the backup server in order to do
amrecover, but then amcheck failed (needed noop, then selfcheck).
Then amdump failed (needed sendsize, ...).

I see the full list in amandad.c, and I think I understand why clients
don't need the addition.  It defaults to all services except amindexd
and amidxtaped being active.  But when you activate those two on the
server, it seems it _de_activates the others.

I didn't see it in the docs in the tarball and the wiki search leaves
a bit to be desired.  Where is it documented for the non-source diving
crowd?

Also, why would anyone want noop, selfcheck, sendbackup, etc.,
disabled?  Is it a use case that some people don't back up their
backup servers?

I've been using a 2.4.5 server for so long, I've missed some details.

amandad keeps dying on me...

2007-09-05 Thread Paul Lussier


Hi all,

I'm using Debian/stable, amanda 2.5.1p1 (note 2.5.1, NOT 2.5.2).

For some reason amandad keeps dying on me.  I can't find any reason in
any of my logs for this.  Currently, I still have the following
processes running:

  amanda2:/var/log/amanda/amandad# ps -ef |grep backup
  backup   25763  9089  0 05:31 pts/100:00:00 /bin/sh /usr/sbin/amdump 
offsite amanda2
  backup   25772 25763  0 05:31 pts/100:00:00 /usr/lib/amanda/planner 
offsite amanda2
  backup   25773 25763  0 05:31 pts/100:00:00 /usr/lib/amanda/driver 
offsite amanda2
  backup   25774 25773  0 05:31 pts/100:00:00 taper offsite
  backup   25775 25773  0 05:31 pts/100:00:00 dumper0 offsite
  backup   25776 25773  0 05:31 pts/100:00:00 dumper1 offsite
  backup   25777 25773  0 05:31 pts/100:00:00 dumper2 offsite
  backup   25778 25773  0 05:31 pts/100:00:00 dumper3 offsite
  backup   25779 25773  0 05:31 pts/100:00:00 dumper4 offsite
  backup   25780 25773  0 05:31 pts/100:00:00 dumper5 offsite
  backup   25781 25773  0 05:31 pts/100:00:00 dumper6 offsite
  backup   25782 25773  0 05:31 pts/100:00:00 dumper7 offsite
  backup   25783 25773  0 05:31 pts/100:00:00 dumper8 offsite
  backup   25784 25773  0 05:31 pts/100:00:00 dumper9 offsite
  backup   25785 25773  0 05:31 pts/100:00:00 dumper10 offsite
  backup   25786 25773  0 05:31 pts/100:00:00 dumper11 offsite
  backup   25787 25773  0 05:31 pts/100:00:00 dumper12 offsite
  backup   25788 25773  0 05:31 pts/100:00:00 dumper13 offsite
  backup   25789 25773  0 05:31 pts/100:00:00 dumper14 offsite
  backup   25790 25773  0 05:31 pts/100:00:00 dumper15 offsite
  backup   25791 25774  0 05:31 pts/100:00:00 taper offsite
  backup   29382 29381  0 12:47 pts/300:00:00 -sh

The client in question is 'amanda2', which is NFS mounting several
file systems from an OnStor NFS server.

amstatus reports:

   Using /var/log/amanda/offsite/amdump.1 from Wed Sep  5 05:31:03 EDT 2007

   amanda2:/00g waiting to flush
   amanda2:/00g estimate done
   amanda2:/home02g waiting to flush
   amanda2:/home02g estimate done
   amanda2:/nfs00g waiting to flush
   amanda2:/nfs00g estimate done
   amanda2:/nfs/RT 1   39g waiting to flush
   amanda2:/nfs/RT 0  582g estimate done
   amanda2:/nfs/archive0   11g waiting to flush
   amanda2:/nfs/archive0   11g estimate done
   amanda2:/nfs/backups00g waiting to flush
   amanda2:/nfs/backups00g estimate done
   amanda2:/nfs/builds 15g waiting to flush
   amanda2:/nfs/builds 0  117g estimate done
   amanda2:/nfs/debian 0   22g waiting to flush
   amanda2:/nfs/debian 0   22g estimate done
   amanda2:/nfs/patent 01g waiting to flush
   amanda2:/nfs/patent 01g estimate done
   amanda2:/nfs/release0  236g estimate done
   amanda2:/nfs/software   0   24g estimate done
   amanda2:/nfs/system 0   10g estimate done
   amanda2:/nfs/user   00g estimate done
   amanda2:/nfs/user/ad0   74g partial estimate done
   amanda2:/nfs/user/assar getting estimate
   amanda2:/nfs/user/ehgetting estimate
   amanda2:/nfs/user/ilgetting estimate
   amanda2:/nfs/user/mpgetting estimate
   amanda2:/nfs/user/qtgetting estimate
   amanda2:/nfs/user/uzgetting estimate
   amanda2:/usr 00g waiting to flush
   amanda2:/usr 00g estimate done
   amanda2:/var 01g waiting to flush
   amanda2:/var 01g estimate done

   SUMMARY  part  real  estimated
  size   size
   partition   :  33
   estimated   :  16 1085g
   flush   :  1185g
   failed  :   00g   (  0.00%)
   wait for dumping:   00g   (  0.00%)
   dumping to tape :   00g   (  0.00%)
   dumping :   0 0g 0g (  0.00%) (  0.00%)
   dumped  :   0 0g 0g (  0.00%) (  0.00%)
   wait for writing:   0 0g 0g (  0.00%) (  0.00%)
   wait to flush   :  1185g85g (100.00%) (  0.00%)
   writing to tape :   0 0g 0g (  0.00%) (  0.00%)
   failed to tape  :   0 0g 0g (  0.00%) (  0.00%)
   taped   :   0 0g 0g (  0.00%) (  0.00%)
   16 dumpers idle : not-idle
   taper idle
   network free kps:   1048576
   holding space   :  1700g (100.00%)
0 dumpers busy :  0:00:00  (  0.00%)

But there is no estimate being done.  There is no tar process running,
amandad is not runn

Re: amandad troubles

2007-09-02 Thread Glenn English

-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Glenn English wrote:

Absolutely never mind. I'm getting way too old to play this game...

FWIW, the problem this time was not looking into DNS enough.

- --
Glenn English
[EMAIL PROTECTED]

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFG2vHX04yQfZbbTLYRAl1WAJ9RB2OeJujsYf46r9nvaZYvCucfIgCgrh5w
3LjCby76F86Ut0H5lsdN83g=
=bxn4
-END PGP SIGNATURE-

amandad troubles

2007-09-01 Thread Glenn English

-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Debian etch Linux, Amanda v 2.5.1p1 from a debian package.

The 2 clients on the DMZ don't respond; on the LAN all is well. The
amanda server is on the LAN, there's a pix between the LAN and the DMZ,
and this install / configuration has worked in the past (or maybe I
should say 'a configuration pretty close to this one').


Amcheck says they time out waiting for an ACK;

tcpdump shows packets arriving at the DMZ client, but nothing leaving
(on a working host, there are several packets going back and forth);

tcpdump on the server sees many packets running around the LAN, some
going to the DMZ, and nothing coming back;

disabling the iptables packet filters on the server and the client makes
no difference;

an entry in the client's daemon log says amandad was contacted;

.amandahosts is valid, DNS works both ways, a reverse lookup of the
server IP gives exactly the same text as is in .amandahosts, and
emptying the file has no effect;

when I run amcheck on the server, ps on the client shows amandad running
for 30 seconds or so, then stopping, same as it does on a working client;

running amandad by hand as the amanda user does the same thing (nothing)
on a working or non-working client;

if I delete /tmp/amanda, the directory is re-created after an amcheck,
but it's empty;

nagios successfully checks tftp on the DMZ client with no problems, the
pix can write its config file to it, and both the pix and the border
router do ntp with the server on the DMZ, so I'm fairly certain about
UDP through the pix;

I can get to ftp on the DMZ client from the amanda server (also run by
inetd and watched over by tcpwrappers);

ps shows inetd is running; the modification dates for amandad, tcpd,
and inetd are well in the past;

reinstalling the amanda client does nothing;

rebooting makes no difference;

and the hosts on the DMZ are running the same kernel as on the LAN.


If this sounds familiar, it is. I had a very similar problem 2 or 3
weeks ago, and that turned out to have happened because I didn't check
inetd well enough. I don't think it's inetd this time.

I've tried the suggestions I could find via google (and in the responses
from the list last time). No joy.

Since the LAN works and the DMZ doesn't, it's 'obviously' the pix
between them. But there's lots of evidence of packets getting through
the pix from the server to the client (and of amandad starting up
there). If the pix were blocking the return, I'd expect tcpdump to show
packets leaving the DMZ client. But it doesn't, and iptables doesn't see
any either. It looks like amandad never tries to answer.

Any thoughts? Is there such a thing as a talking amandad?

Just thought of something, but tar's mod date is in 2006...

- --
Glenn English
[EMAIL PROTECTED]

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFG2gMw04yQfZbbTLYRAh5rAJ4uV4vR96S3d3B/q4kpPVdyRi/oYQCeKwGm
PonC1MpFAe6ZYWQxsGrfq5k=
=z1yC
-END PGP SIGNATURE-

Does amrecvoer connect to amandad ?

2007-08-30 Thread stan

Does amrecover make a connection to the port amandad is running on?

I'm doing some debugging, and I see inetd seeing a connection to that port
when I run amrecover. I thought it connected to the index server.

-- 
I'm sorry, no one here has any intentions of helping you with anything. 
I am the manager of all of Customer Service."

Regarding the script file & c file for amandad

2006-12-03 Thread silpa kala

Hi,

I need one clarification regarding the amandad script
file.

amandad.c file present in client-src directory,when
built  amandad executable file generated in .libs ;
and when we do make install then this file is moving
to /usr/local/libexec.

There is one more script file for amandad is
generated. Why this file is requrired?

Is this file is requried to create?

Please clarify.

Thanks & Regards,
Silpakala


 

Want to start your own business?
Learn how on Yahoo! Small Business.
http://smallbusiness.yahoo.com/r-index

Re: amandad on client

2006-08-21 Thread Paul Bijnens


On 2006-08-21 10:03, Natalia García Nebot wrote:
Hi all! I have a doubt: Is it possible run a daemon for amanda (amandad) 
on client instead of inetd or xinetd?




No, amandad relies on an external program to do that.  In all the
Unix-variants I know there is at least inetd or xinetd provided.

Just keep the services that you don't want disabled to avoid opening
security holes.  (verify with "netstat --natu" to see what is opened
before and after starting (x)inetd).


--
Paul Bijnens, xplanation Technology ServicesTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***

amandad on client

2006-08-21 Thread Natalia García Nebot

Hi all! I have a doubt: Is it possible run a daemon for amanda (amandad) 
on client instead of inetd or xinetd?

Re: Solaris 8 inetd killing amandad

2006-07-19 Thread Paul Bijnens


On Wed, Jul 19, 2006 at 10:46:18AM -0600, Chris Cameron wrote:

On Tue, 2006-07-18 at 17:36 -0700, Kevin Till wrote:

what is the amanda entry in /etc/inet/inetd.conf?

amanda  dgram   udp waitroot/opt/amanda/libexec/amandad  amandad


Tried to run amandad as root-user instead of Amanda?
Or is this just for the test?

If you have run once as root, some files are created as root, and
when running later as the backup user, Amanda cannot write into them.

The most obvious one is /tmp/amanda debug directory.  Leaving you
without a clue why not even the debug output is not there.

Most people notice this only when on the Amanda server the amcheck
fails.  That why a lot of client config problems are handled here:
(instead of under "amandad issues")

http://wiki.zmanda.com/index.php/Amcheck:_selfcheck_request_failed


--
Paul Bijnens, xplanation Technology ServicesTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***

Re: Solaris 8 inetd killing amandad

2006-07-19 Thread Jon LaBadie

On Wed, Jul 19, 2006 at 10:46:18AM -0600, Chris Cameron wrote:
> On Tue, 2006-07-18 at 17:36 -0700, Kevin Till wrote:
> > 
> > what is the amanda entry in /etc/inet/inetd.conf?
> 
> amanda  dgram   udp waitroot    /opt/amanda/libexec/amandad
> amandad
> 
> 
> > Try "truss /opt/amanda/libexec/amandad" and see if there is anything 
> > obviously wrong.
> 
> Only thing that doesn't look normal to me:
> 
> open("/usr/platform/SUNW,Sun-Fire-V240/lib/libc_psr.so.1", O_RDONLY) = 3
> mmap(0xFF23, 8192, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_FIXED, 3, 0)
> = 0xFF23
> close(3)= 0
> brk(0x00024098) = 0
> brk(0x00026098) = 0
> sysconfig(_CONFIG_STACK_PROT)   = 7
> fcntl(0, F_GETFD, 0x)   = 0
> fcntl(1, F_GETFD, 0x0001)   = 0
> fcntl(2, F_GETFD, 0x0001)   = 0
> close(3)Err#9 EBADF
> close(4)Err#9 EBADF
> close(5)Err#9 EBADF
> close(6)Err#9 EBADF
> close(7)Err#9 EBADF
> 
> 
> The close()'s go all the way to 1023. After that it goes on to writing
> its debug output to /tmp/amanda.
> 
> 
> I don't know what they mean, but could all those close()'s be the cause?

Shouldn't.  Amanda knows what file descriptors it will need.
As a security measure, most amanda commands, at start-up,
close all file descriptors that will not be needed.

-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)

Re: Solaris 8 inetd killing amandad

2006-07-19 Thread Peter Kunst


Hi Chris,

Chris Cameron wrote:

On Tue, 2006-07-18 at 17:36 -0700, Kevin Till wrote:


what is the amanda entry in /etc/inet/inetd.conf?



amanda  dgram   udp waitroot/opt/amanda/libexec/amandad
amandad


i would try to start amandad as "amanda" (or whatever user you have 
amanda compiled for), not as root ;-)


When using truss, i would give it the -f option to trace also child 
processes.


 Peter


Try "truss /opt/amanda/libexec/amandad" and see if there is anything obviously 
wrong.



Only thing that doesn't look normal to me:

open("/usr/platform/SUNW,Sun-Fire-V240/lib/libc_psr.so.1", O_RDONLY) = 3
mmap(0xFF23, 8192, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_FIXED, 3, 0)
= 0xFF23
close(3)= 0
brk(0x00024098) = 0
brk(0x00026098) = 0
sysconfig(_CONFIG_STACK_PROT)   = 7
fcntl(0, F_GETFD, 0x)   = 0
fcntl(1, F_GETFD, 0x0001)   = 0
fcntl(2, F_GETFD, 0x0001)   = 0
close(3)Err#9 EBADF
close(4)Err#9 EBADF
close(5)Err#9 EBADF
close(6)Err#9 EBADF
close(7)Err#9 EBADF


The close()'s go all the way to 1023. After that it goes on to writing
its debug output to /tmp/amanda.


I don't know what they mean, but could all those close()'s be the cause?


Chris

Re: Solaris 8 inetd killing amandad

2006-07-19 Thread Chris Cameron

On Tue, 2006-07-18 at 17:36 -0700, Kevin Till wrote:
> 
> what is the amanda entry in /etc/inet/inetd.conf?

amanda  dgram   udp waitroot/opt/amanda/libexec/amandad
amandad


> Try "truss /opt/amanda/libexec/amandad" and see if there is anything 
> obviously wrong.

Only thing that doesn't look normal to me:

open("/usr/platform/SUNW,Sun-Fire-V240/lib/libc_psr.so.1", O_RDONLY) = 3
mmap(0xFF23, 8192, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_FIXED, 3, 0)
= 0xFF23
close(3)= 0
brk(0x00024098) = 0
brk(0x00026098) = 0
sysconfig(_CONFIG_STACK_PROT)   = 7
fcntl(0, F_GETFD, 0x)   = 0
fcntl(1, F_GETFD, 0x0001)   = 0
fcntl(2, F_GETFD, 0x0001)   = 0
close(3)Err#9 EBADF
close(4)Err#9 EBADF
close(5)Err#9 EBADF
close(6)Err#9 EBADF
close(7)Err#9 EBADF


The close()'s go all the way to 1023. After that it goes on to writing
its debug output to /tmp/amanda.


I don't know what they mean, but could all those close()'s be the cause?


Chris

Re: Solaris 8 inetd killing amandad

2006-07-18 Thread Kevin Till


Chris Cameron wrote:

Have 2 working Solaris machines being backed up by Amanda. Trying to add
a 3rd, I just copy the amanda directory from one client to the new
client, same way I installed the other one.

Made an Amanda user, group other, add amanda entries to services, and
entry to inetd.conf

When I run amcheck I see this in my messages log on the new server:

Jul 18 16:00:12 app01 inetd[169]: [ID 858011
daemon.warning] /opt/amanda/libexec/amandad: Killed
Jul 18 16:00:50 app01 last message repeated 38 times
Jul 18 16:00:51 app01 inetd[169]: [ID 667328 daemon.error] amanda/udp
server failing (looping), service terminated


ldd on amandad doesn't show any missing libraries. I've recompiled
Amanda on the new machine, same problem. Tried different users, same
problem.



Anybody know what I've done here?


what is the amanda entry in /etc/inet/inetd.conf?
Try "truss /opt/amanda/libexec/amandad" and see if there is anything obviously 
wrong.



--
Thank you!
Kevin Till

Amanda documentation: http://wiki.zmanda.com
Amanda forums:http://forums.zmanda.com

Solaris 8 inetd killing amandad

2006-07-18 Thread Chris Cameron

Have 2 working Solaris machines being backed up by Amanda. Trying to add
a 3rd, I just copy the amanda directory from one client to the new
client, same way I installed the other one.

Made an Amanda user, group other, add amanda entries to services, and
entry to inetd.conf

When I run amcheck I see this in my messages log on the new server:

Jul 18 16:00:12 app01 inetd[169]: [ID 858011
daemon.warning] /opt/amanda/libexec/amandad: Killed
Jul 18 16:00:50 app01 last message repeated 38 times
Jul 18 16:00:51 app01 inetd[169]: [ID 667328 daemon.error] amanda/udp
server failing (looping), service terminated


ldd on amandad doesn't show any missing libraries. I've recompiled
Amanda on the new machine, same problem. Tried different users, same
problem.



Anybody know what I've done here?


Chris

Re: chunker ,amandad

2006-07-12 Thread Guy

2006/7/12, silpa kala <[EMAIL PROTECTED]>:
Hi,Please help me out in this problem.I seen the socket connection between the chunker andamandad. What is purpose of the socket?Actually I understood that backup data send by theamandad(client) and received by the
dumper(server)through the data port. Is it true?How the dumper will send the backup data to thechunker?I understood that chunker is introduced in 2.5version for new features splitsize. Suppose if i
didn't configure for this parameters, eventhoughchunker is playing as an important role. Why thedumper funda is moved to chunker.Taper is creating the Shared Memory. Why it isnecessary?Please clarify me these doubts.
Thanks & Regards,SilpakalaSorry but you are using the wrong mailing list, pal. This is amanda USERS, not HACKERS.  There is a mailing list devoted to amanda development.

Communication between the chunker and amandad

2006-07-12 Thread silpa kala

Hi,

Please help me out in this problem. 

I seen the socket connection between the chunker and
amandad. What is purpose of the socket?

Actually I understood that backup data send by the
amandad(client) and received by the
dumper(server)through the data port. Is it true?

How the dumper will send the backup data to the
chunker?

I understood that chunker is introduced in 2.5 version
for new features splitsize. Suppose if i didn't
configure for this parameters, eventhough chunker is
playing as an important role. Why the dumper funda is
moved to chunker.

Taper is creating the Shared Memory. Why it is
necessary?

Please clarify me these doubts. 

Thanks & Regards,
Silpakala







__
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com

chunker ,amandad

2006-07-11 Thread silpa kala

Hi,
 
Please help me out in this problem. 
 
I seen the socket connection between the chunker and
amandad. What is purpose of the socket?
 
Actually I understood that backup data send by the
amandad(client) and received by the
dumper(server)through the data port. Is it true?
 
How the dumper will send the backup data to the
chunker?

I understood that chunker is introduced in 2.5
version for new features splitsize. Suppose if i
didn't configure for this parameters, eventhough
chunker is playing as an important role. Why the
dumper funda is moved to chunker.

Taper is creating the Shared Memory. Why it is
necessary?

Please clarify me these doubts. 

Thanks & Regards,
Silpakala



__
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com

Please clarify the socket connection between the dumper and amandad

2006-07-11 Thread silpa kala

Hi,

Please clarify the TCP socket connection between the
dumper and amandad. Let me know which one is acting as
a server and client. If we specify the entry in the
DLE like this 222.222.222.222 /home root-tar.

chunker: stream_accept: connection from
127.0.0.1.45601
dumper: stream_client: connected to 127.0.0.1.784
chunker: try_socksize: receive buffer size is 32768
dumper: stream_client: our side is 0.0.0.0.45601
dumper: try_socksize: send buffer size is 65536
dumper: bind_portrange2: trying port=778
dumper: dgram_bind: socket bound to 0.0.0.0.778
dumper: stream_client: connected to 192.61.224.88.778
dumper: stream_client: our side is 0.0.0.0.35926
dumper: try_socksize: send buffer size is 65536
dumper: try_socksize: receive buffer size is 65536


Why the chunker and dumper is using three IP addresses
in the above log data?

The dumper is creating the previlgied prot no like 778
for data and one for mesg and one for index.
what is the other port no 
35926.

Why the chunker is trying to connect to the port no?
chunker: try_socksize: receive buffer size is 65536
chunker: bind_portrange2: trying port=784
chunker: stream_server: waiting for connection:
0.0.0.0.784
driver: result time 4.778 from chunker0: PORT 784


Please clarify me the TCP communication between
amandad, dumper and chunker processes ?

I am totally confused, even debugging is also
difficult.

Thanks & Regards,
Silpakala



__
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com

dumper, chunker and amandad communication

2006-07-10 Thread silpa kala

Hi,

I need some clarification on Amanda Internals.

Driver will initiate the chunker process and driver
will provide some information to the chunker process
like name of the file for holding disk, hostname,
level, datastamp etc.,
How the driver will send the information to the
chunker? either through pipes ?I copied some lines
from the amdump logfile.

driver: send-cmd time 4.768 to chunker0: PORT-WRITE
00-1
/home/20060705194408/hostname._home_amanda-2.5.0p1-20060428_server-src.1
hostname feff9ffe07
/home/amanda-2.5.0p1-20060428/ server-src 1
2006:7:6:0:42:13 1048576 GNUTAR 6624
|;auth=BSD;no-record;index;


chunker is trying to connect to one privileged port
no. Driver will initiate the dumper process and driver
will provide some information to dumper port no to
connect(what chunker initiated for the particular
port), hostname etc.,

driver: send-cmd time 4.778 to dumper0: PORT-DUMP
00-1 784 hostname feff9ffe07
/home/amanda-2.5.0p1-20060428/server-src NODEVICE 1
2006:7:6:0:42:13 GNUTAR |;auth=BSD;no-record;index;


Is there any socket communication is required between
the chunker and driver ? I got so much of confusion
with the amdump.1 log file. The contents i am sending
here from the log file

chunker: stream_accept: connection from
127.0.0.1.45601
dumper: stream_client: connected to 127.0.0.1.784
chunker: try_socksize: receive buffer size is 32768
dumper: stream_client: our side is 0.0.0.0.45601
dumper: try_socksize: send buffer size is 65536

What is the significance of this. Please explain?

Amandad log file is also having the information below:

amandad: try_socksize: send buffer size is 65536
amandad: try_socksize: receive buffer size is 65536

 I think the above two log information is
interconnected, what kind of information is sending
and receiving by dumper or chunker or amandad?

I need these clarifications. Please help me out to
understand these funda.

Thanks & Regards,
Silpakala


 







__
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com

Linux + ipv6 + inetd == non-working amandad (solution)

2006-07-05 Thread Christopher DeMarco

Hi all, I'm posting here for the benefit of the next person who runs
into this problem:

If amandad is running through inetd on a Linux system with the ipv6
kernel module loaded, it will try to talk to 0.0.0.0 instead of
whoever's on the other side of port 10080.  The fix is to either
remove the module (unverified) or to run in e.g. xinetd instead
(verified).

A helpful individual on #amanda suggested that setting the inetd
"service type" field to "tcp46" may also work.


Here's the symptom in amandad.debug:

amandad: time 0.000: dgram_send_addr: sendto(0.0.0.0.566) failed:
Invalid argument
amandad: time 0.003: sending REP packet:

Amanda 2.4 REP HANDLE 001-A8850608 SEQ 1152114891
ERROR [addr 0.0.0.0: hostname lookup failed


Hope this helps somebody in the future...


-- 
Christopher DeMarco <[EMAIL PROTECTED]>
Alephant Systems (http://alephant.net)
PGP public key at http://pgp.alephant.net
+1-412-708-9660

Re: amandad process defunct

2006-06-22 Thread Joe Donner (sent by Nabble.com)


Hi and thanks for your replies.

See below for the log file contents (21.06.06).  It also shows the version
(which I used simply because Red Hat had those rpms available on their web
site - I didn't want to add the 'complexity' of having to compile the source
myself).  In my stupidity I don't see anything glaringly obvious that can be
wrong.  Amanda is configured to use gnu-tar, and there's no hardware
compression or encryption involved.

I don't see why it should still show it's busy at 9am when the backup runs
at 24:45 every morning, and only backs up about 100mb (this is just for
testing purposes using virtual tapes).  But this has happened twice now, and
I simply don't know what to do or how to fix it without rebooting the test
lab machine.

I'm very new to Linux and Amanda has been making my head hurt, so please
accept my apologies if I sound silly.

# cat amandad.20060621004501.debug
amandad: debug 1 pid 8021 ruid 33 euid 33: start at Wed Jun 21 00:45:01 2006
amandad: version 2.4.4p1
amandad: build: VERSION="Amanda-2.4.4p1"
amandad:    BUILT_DATE="Mon Jun 20 14:39:39 EDT 2005"
amandad:BUILT_MACH="Linux porky.build.redhat.com 2.4.21-25.ELsmp #1
SMP Fri Nov 12 21:34:51 EST 2004 i686 i686 i386 GNU/Linux"
amandad:CC="i386-redhat-linux-gcc"
amandad:CONFIGURE_COMMAND="'./configure' '--host=i386-redhat-linux'
'--build=i386-redhat-linux' '--target=i386-redhat-linux-gnu'
'--program-prefix=' '--prefix=/usr' '--exec-prefix=/usr' '--bindir=/usr/bin'
'--sbindir=/usr/sbin' '--sysconfdir=/etc' '--datadir=/usr/share'
'--includedir=/usr/include' '--libdir=/usr/lib'
'--libexecdir=/usr/lib/amanda' '--localstatedir=/var/lib'
'--sharedstatedir=/usr/com' '--mandir=/usr/share/man'
'--infodir=/usr/share/info' '--enable-shared'
'--with-index-server=localhost'
'--with-gnutar-listdir=/var/lib/amanda/gnutar-lists'
'--with-smbclient=/usr/bin/smbclient' '--with-amandahosts'
'--with-user=amanda' '--with-group=disk' '--with-tmpdir=/var/log/amanda'
'--with-gnutar=/bin/tar'"
amandad: paths: bindir="/usr/bin" sbindir="/usr/sbin"
amandad:libexecdir="/usr/lib/amanda" mandir="/usr/share/man"
amandad:    AMANDA_TMPDIR="/var/log/amanda"
amandad:AMANDA_DBGDIR="/var/log/amanda" CONFIG_DIR="/etc/amanda"
amandad:DEV_PREFIX="/dev/" RDEV_PREFIX="/dev/r"
amandad:DUMP="/sbin/dump" RESTORE="/sbin/restore"
amandad:SAMBA_CLIENT="/usr/bin/smbclient" GNUTAR="/bin/tar"
amandad:COMPRESS_PATH="/usr/bin/gzip"
amandad:    UNCOMPRESS_PATH="/usr/bin/gzip" MAILER="/usr/bin/Mail"
amandad:listed_incr_dir="/var/lib/amanda/gnutar-lists"
amandad: defs:  DEFAULT_SERVER="localhost" DEFAULT_CONFIG="DailySet1"
amandad:DEFAULT_TAPE_SERVER="localhost"
amandad:DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
amandad:    LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE
amandad:AMANDA_DEBUG_DAYS=4 BSD_SECURITY USE_AMANDAHOSTS
amandad:CLIENT_LOGIN="amanda" FORCE_USERID HAVE_GZIP
amandad:    COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
amandad:COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
amandad: time 0.000: got packet:

Amanda 2.4 REQ HANDLE 001-D8690608 SEQ 1150847102
SECURITY USER amanda
SERVICE noop
OPTIONS features=feff9ffe0f;


amandad: time 0.000: sending ack:

Amanda 2.4 ACK HANDLE 001-D8690608 SEQ 1150847102


amandad: time 0.005: bsd security: remote host hades.matrix-data.local user
amanda local user amanda
amandad: time 0.005: amandahosts security check passed
amandad: time 0.005: running service "noop"
amandad: time 0.005: sending REP packet:

Amanda 2.4 REP HANDLE 001-D8690608 SEQ 1150847102
OPTIONS features=feff9ffe0f;


amandad: time 0.021: got packet:

Amanda 2.4 ACK HANDLE 001-D8690608 SEQ 1150847102


amandad: time 0.024: pid 8021 finish time Wed Jun 21 00:45:01 2006
--
View this message in context: 
http://www.nabble.com/amandad-process-defunct-t1825006.html#a4991274
Sent from the Amanda - Users forum at Nabble.com.

Re: amandad process defunct

2006-06-21 Thread Paul Bijnens


Joe Donner (sent by Nabble.com) schreef:

Hi,

I'm running Amanda in a test lab, and every several days or so the backup
job to virtual tape fails because a process called amandad becomes
unresponsive on the backup client.  When doing an amcheck from the Amanda
server it tells me that amandad on the client is "busy".  When having a look
at the client with the ps command, it lists two diferent amandad processes,
with one showing as being "defunct".  I cannot kill this process, but when I
kill the other amandad process I can get rid of both, but then have to
reboot the client to get the backup job to work again.

Do you know how I can avoid doing this reboot, or what may be going on here?


Reboot is a pretty drastic remedy :-)
When the amandad process is killed, doesn't xinetd listen on
port 10080/udp again?

Do you have any idea what is "failing"?  Usually in the debug files
in /tmp/amanda on the client there should be more clues about what is 
going on.


The error "amandad" busy actually indicates to me that the amandad is
still active on that client: that is the response you get from the first
amandad when you want to start another dump or check.


--
Paul Bijnens, XplanationTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, F6, *
* quit,  ZZ, :q, :q!,  M-Z, ^X^C,  logoff, logout, close, bye,  /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* kill -9 1,  Alt-F4,  Ctrl-Alt-Del,  AltGr-NumLock,  Stop-A,  ...*
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***

Re: amandad process defunct

2006-06-21 Thread Paddy Sreenivasan


Joe,

Which version of Amanda are you using?

Please take a look at the log files in /tmp/amanda/ on the client
system. You might
have an idea which process exited and was not "waited" for (look for timestamps
in the logs).

Are you using compression or client side encryption?

Thanks,
Paddy

On 6/21/06, Joe Donner (sent by Nabble.com) <[EMAIL PROTECTED]> wrote:


Hi,

I'm running Amanda in a test lab, and every several days or so the backup
job to virtual tape fails because a process called amandad becomes
unresponsive on the backup client.  When doing an amcheck from the Amanda
server it tells me that amandad on the client is "busy".  When having a look
at the client with the ps command, it lists two diferent amandad processes,
with one showing as being "defunct".  I cannot kill this process, but when I
kill the other amandad process I can get rid of both, but then have to
reboot the client to get the backup job to work again.

Do you know how I can avoid doing this reboot, or what may be going on here?

I'd appreciate your help as I'm pretty new to Amanda and Linux, but have
been asked to look into Amanda as an alternative backup solution for our 5
Linux servers.

Thanks in advance!

Joe
--
View this message in context: 
http://www.nabble.com/amandad-process-defunct-t1825006.html#a4977961
Sent from the Amanda - Users forum at Nabble.com.





--

Amanda documentation: http://wiki.zmanda.com
Amanda forums: http://forums.zmanda.com

amandad process defunct

2006-06-21 Thread Joe Donner (sent by Nabble.com)


Hi,

I'm running Amanda in a test lab, and every several days or so the backup
job to virtual tape fails because a process called amandad becomes
unresponsive on the backup client.  When doing an amcheck from the Amanda
server it tells me that amandad on the client is "busy".  When having a look
at the client with the ps command, it lists two diferent amandad processes,
with one showing as being "defunct".  I cannot kill this process, but when I
kill the other amandad process I can get rid of both, but then have to
reboot the client to get the backup job to work again.

Do you know how I can avoid doing this reboot, or what may be going on here?

I'd appreciate your help as I'm pretty new to Amanda and Linux, but have
been asked to look into Amanda as an alternative backup solution for our 5
Linux servers.

Thanks in advance!

Joe
--
View this message in context: 
http://www.nabble.com/amandad-process-defunct-t1825006.html#a4977961
Sent from the Amanda - Users forum at Nabble.com.

Re: (subject change) Possible to run scripts just before an amandad?

2006-06-01 Thread Paul Bijnens


On 2006-06-01 15:18, Francis Galiegue wrote:


In fact I'm not asking for such a fine grained setup. I'd be perfectly happy 
with only a way to have amandad run scripts before and after backing up only, 
whatever the number of DLEs. At least, that's my need: a wrapper over 
amandad, not gtar.


For instance, you could have two directories /etc/amandad/before (note the 
"d") and /etc/amandad/after with scripts named 00stoporacle, 01stopmysql, 
whatever.


I have seen people add this shellscript to crontab to
run the amanda dumps:

   #!/bin/sh
   ssh [EMAIL PROTECTED] /usr/local/bin/stopdb
   amdump daily
   ssh [EMAIL PROTECTED] /usr/local/bin/startdb

With an approriate set of private/public keys to avoid password prompts
of course.

But I agree in general that before/after scripts more built into
amanda would be welcome.


--
Paul Bijnens, xplanation Technology ServicesTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***

(subject change) Possible to run scripts just before an amandad?

2006-06-01 Thread Francis Galiegue

(sorry for the misformulation of my first mail, I really meant that amandad, 
not amdump, was to run scripts before/after backup)

Le Jeudi 01 Juin 2006 14:46, Jon LaBadie a écrit :
> >
> > IMHO, I think this is a quite common problem.
> > Moreover, a "tar wrapper" is not practical when there are several DLEs
> > for the same host.
>
> Sure it is.  Though the wrapper is quite customized then.
> The wrapper can examine the arguments passed to tar to
> determine which DLE is the target and whether it is for
> estimate or dump purposes.

In fact I'm not asking for such a fine grained setup. I'd be perfectly happy 
with only a way to have amandad run scripts before and after backing up only, 
whatever the number of DLEs. At least, that's my need: a wrapper over 
amandad, not gtar.

For instance, you could have two directories /etc/amandad/before (note the 
"d") and /etc/amandad/after with scripts named 00stoporacle, 01stopmysql, 
whatever.

-- 
Francis Galiegue, [EMAIL PROTECTED]
One2team - 12bis rue de la Pierre Levée, 75011 PARIS - 0143381980

Re: 2.4.2p2 client, 2.4.4p3 server: timeout from amandad...

2006-05-03 Thread Francis Galiegue

Le Mercredi 03 Mai 2006 16:39, Paul Bijnens a écrit :
> On 2006-05-03 15:53, Francis Galiegue wrote:
>
> "... definitely not..." resembles to some famous last words...
> But I could be wrong of course.   The test above will prove that.

I did the test but did not wait long enough :p

It was indeed the problem. Now I see why this conntrack module was created in 
the first place...

Problem solved. Thanks very much for the link!

-- 
Francis Galiegue, [EMAIL PROTECTED]
One2team - 12bis rue de la Pierre Levée, 75011 PARIS - 0143381980

Re: 2.4.2p2 client, 2.4.4p3 server: timeout from amandad...

2006-05-03 Thread Paul Bijnens


On 2006-05-03 15:53, Francis Galiegue wrote:

Le Mercredi 03 Mai 2006 14:48, vous avez écrit :

On 2006-05-03 13:26, Francis Galiegue wrote:



You could run tcpdump or ethereal on the server and client and verify
if indeed the packet is arriving there and with the correct IP-number
(considering the aliases for eth0 can have messed that up).



Well, if it did, I wouldn't have been able to do "small" backups either, would 
I?



Is there a firewall inbetween, or on one/both of the servers?
You may need to increase the UDP-reply timeout on the firewall (or
disable the firewall).  I believe many firewalls timeout UDP packets
after 180 seconds.



There's a firewall on both. On the client machine, UDP/10080 is let in from 
the server machine only. On the server machine, UDP/10082 and 10083 are let 
in from the client only. As to the timeout, I don't see this as a problem 
since I have an iptables rules that catches "INVALID" packets (as per 
conntrack definition) and it captures nothing at all...


You're missing the UDP/10080 from client to server...  (after timeout
it is not related or tracked with conntrack).

Actually iptables is one that times out UDP packets after 180 seconds.
That why the ip_conntrack_amanda kernel module has the option to set
that larger with the master_timeout parameter.

Did you try this simple manual test:

Start on the Amanda client:

 $ nc -vv -u -l -p 10080

Next on the Amanda server:

 $ nc -vv -u theclient.example.com 10080

Type something on the server (you should see that text appear on the
client).  Type something on the client (we simulate the ACK that the
clients sends for the firewall).
Now wait 5 minutes and type something on the client again (the final
results packet).  Is it received on the server?  If not, do you notice
a log entry for an INVALID packet ?

(Because port 10080 is probably already in use by (x)inetd, you probably
have to disable amanda udp/10080 on the client temporarily.)





There are other possibilities, solutions. See:

http://wiki.zmanda.com/index.php/Amdump:_results_missing

in amanda 2.4.2p2 the "calcsize" was not yet implemented (it did
exist, but was experimental, I believe).
if the server is 2.4.4xx then you can use "estimate server", even
if the client is old.



From what I've read on the zmanda wiki, estimate server uses data from the 

previous runs. But there's no previous run...

BTW I've tried the nc stuff on the wiki page as well: it's definitely not a 
firewall problem.


"... definitely not..." resembles to some famous last words...
But I could be wrong of course.   The test above will prove that.


--
Paul Bijnens, xplanation Technology ServicesTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***

Re: 2.4.2p2 client, 2.4.4p3 server: timeout from amandad...

2006-05-03 Thread Francis Galiegue

Le Mercredi 03 Mai 2006 15:53, vous avez écrit :

>
> There's a firewall on both. On the client machine, UDP/10080 is let in from
> the server machine only. On the server machine, UDP/10082 and 10083 are let
> in from the client only. As to the timeout, I don't see this as a problem
> since I have an iptables rules that catches "INVALID" packets (as per
> conntrack definition) and it captures nothing at all...
>

OK well, I stand corrected. The zmanda Wiki link did provide me with an 
insight. I did NOT have the ip_conntrack_amanda module inserted. As such, the 
UDP timeout was the default, ie 30 seconds, ie far from being enough.

I've therefore inserted this module with master_timeout=3600, that should do 
it...

-- 
Francis Galiegue, [EMAIL PROTECTED]
One2team - 12bis rue de la Pierre Levée, 75011 PARIS - 0143381980

Re: 2.4.2p2 client, 2.4.4p3 server: timeout from amandad...

2006-05-03 Thread Francis Galiegue

Le Mercredi 03 Mai 2006 14:48, vous avez écrit :
> On 2006-05-03 13:26, Francis Galiegue wrote:

>
> You could run tcpdump or ethereal on the server and client and verify
> if indeed the packet is arriving there and with the correct IP-number
> (considering the aliases for eth0 can have messed that up).
>

Well, if it did, I wouldn't have been able to do "small" backups either, would 
I?

> Is there a firewall inbetween, or on one/both of the servers?
> You may need to increase the UDP-reply timeout on the firewall (or
> disable the firewall).  I believe many firewalls timeout UDP packets
> after 180 seconds.
>

There's a firewall on both. On the client machine, UDP/10080 is let in from 
the server machine only. On the server machine, UDP/10082 and 10083 are let 
in from the client only. As to the timeout, I don't see this as a problem 
since I have an iptables rules that catches "INVALID" packets (as per 
conntrack definition) and it captures nothing at all...

> There are other possibilities, solutions. See:
>
> http://wiki.zmanda.com/index.php/Amdump:_results_missing
>
> in amanda 2.4.2p2 the "calcsize" was not yet implemented (it did
> exist, but was experimental, I believe).
> if the server is 2.4.4xx then you can use "estimate server", even
> if the client is old.
>

>From what I've read on the zmanda wiki, estimate server uses data from the 
previous runs. But there's no previous run...

BTW I've tried the nc stuff on the wiki page as well: it's definitely not a 
firewall problem.

-- 
Francis Galiegue, [EMAIL PROTECTED]
One2team - 12bis rue de la Pierre Levée, 75011 PARIS - 0143381980

Re: 2.4.2p2 client, 2.4.4p3 server: timeout from amandad...

2006-05-03 Thread Paul Bijnens


On 2006-05-03 13:26, Francis Galiegue wrote:
The list of filesystems represent 24 Gb total (compressed with gzip). The 
problem is this: it works fine when I try and backup every directory but one 
of the two largest (which are resp. 8.4 Gb and 10 Gb uncompressed on disk), 
and fails when I try to include either of these because _amandad_, not 
amdump, times out. I get this in the amandad logfile:



amandad: debug 1 pid 6636 ruid 33 euid 33 start time Wed May  3 12:15:04 2006
amandad: version 2.4.2p2

[...]

amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, giving up!
amandad: pid 6636 finish time Wed May  3 12:20:06 2006


Reproducible at will: amandad always times out after 5 minutes. Meanwhile, 
amdump stays there waiting for... Well, I don't know, frankly, but I have to 
C-c it and amcleanup afterwards.


What I've already done is increase the etimeout parameter on the server side: 
I put 1200 instead of the default value, 300. But that didn't help. Out of 
despair I even tried and changed this value in the old server config files, 
in case amandad would try and read them :p But no.



You could run tcpdump or ethereal on the server and client and verify
if indeed the packet is arriving there and with the correct IP-number
(considering the aliases for eth0 can have messed that up).

Is there a firewall inbetween, or on one/both of the servers?
You may need to increase the UDP-reply timeout on the firewall (or
disable the firewall).  I believe many firewalls timeout UDP packets
after 180 seconds.

There are other possibilities, solutions. See:

http://wiki.zmanda.com/index.php/Amdump:_results_missing

in amanda 2.4.2p2 the "calcsize" was not yet implemented (it did
exist, but was experimental, I believe).
if the server is 2.4.4xx then you can use "estimate server", even
if the client is old.




It should also be noted that the client machine is such a mess that my 
predecessor of a sysadmin created 6 aliases for interface eth0... I had to 
bind amandad specifically to the address I wanted so that dumps could work in 
the first place. But I don't see this having an influence here, since smaller 
backups work perfectly...


I'd appreciate any hint on this one!






--
Paul Bijnens, xplanation Technology ServicesTel  +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUMFax  +32 16 397.512
http://www.xplanation.com/  email:  [EMAIL PROTECTED]
***
* I think I've got the hang of it now:  exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt,  abort,  hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e,  kill -1 $$,  shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ...  "Are you sure?"  ...   YES   ...   Phew ...   I'm out  *
***

2.4.2p2 client, 2.4.4p3 server: timeout from amandad...

2006-05-03 Thread Francis Galiegue

Hello everyone,

I'm in the process of migrating a full backup configuration from one machine 
to another. The problem with the old one is threefold:

- it's a mess, which happens to be our main internal LAN server,
- it's obsolete software wise,
- the DAT changer attached to it now has insufficient capacity.

Such a mess it is that I cannot afford to install a new Amanda version on it 
and see if that would cure the problem - some dumps are left on the holding 
disk but then I can copy them around.

So, the new machine has a brand new DAT72x6 HP changer (DAT24x6 on the old 
one), I've copied the old Amanda configuration to the new one and adapted 
some settings (tapetype, in essence). Now the configuration looks like this:


org "One2team"  
mailto "[EMAIL PROTECTED]"
dumpuser "amanda"   
inparallel 4
netusage  5000 Kbps 
dumpcycle 0 days
runspercycle 1 days 
tapecycle 2 tapes   
bumpsize 20 Mb  
bumpdays 1  
bumpmult 4  
etimeout 1200   
runtapes 1  
tpchanger "/usr/lib/amanda/chg-zd-mtx"
tapedev "/dev/nst0"
changerfile "changer.conf"
changerdev "/dev/sg1"
tapetype HP-DAT72  
labelstr "^full-[0-9][0-9]*$"  
holdingdisk hd1 {
comment "main holding disk"
directory "/var/lib/amanda/full/dumps"
use -1024 Mb   
}
reserve 30 
infofile "/var/lib/amanda/full/info" 
logdir   "/var/lib/amanda/full/logs" 
indexdir "/var/lib/amanda/full/index"
define tapetype HP-DAT72 {
comment "Produced by tapetype prog (hardware compression off)"
length 37511 mbytes
filemark 625 kbytes
speed 1758 kps
}


The dump types are defined like this (relevant settings only AFAICS):


define dumptype global {
comment "Global definitions"
exclude "./tmp"
}
define dumptype root-tar {
global
program "GNUTAR"
comment "root partitions dumped with tar"
compress none
index
exclude list "/etc/amanda/exclude.gtar"
priority low
}
define dumptype comp-root-tar {
root-tar
comment "Root partitions with compression"
compress server fast
}



The list of filesystems represent 24 Gb total (compressed with gzip). The 
problem is this: it works fine when I try and backup every directory but one 
of the two largest (which are resp. 8.4 Gb and 10 Gb uncompressed on disk), 
and fails when I try to include either of these because _amandad_, not 
amdump, times out. I get this in the amandad logfile:


amandad: debug 1 pid 6636 ruid 33 euid 33 start time Wed May  3 12:15:04 2006
amandad: version 2.4.2p2
amandad: build: VERSION="Amanda-2.4.2p2"
[blah blah]
Amanda 2.4 REQ HANDLE 000-9006B109 SEQ 1146651283
SECURITY USER amanda
SERVICE sendsize
OPTIONS features=feff9ffe0f;maxdumps=1;hostname=crios.olympe.o2t;
GNUTAR /usr/local 0 1970:1:1:0:0:0 -1 exclude-file=./tmp
[more blah]
sending ack:

Amanda 2.4 ACK HANDLE 000-9006B109 SEQ 1146651283


bsd security: remote host circe.olympe.o2t user amanda local user amanda
amandahosts security check passed
amandad: running service "/usr/lib/amanda/sendsize"
amandad: sending REP packet:

Amanda 2.4 REP HANDLE 000-9006B109 SEQ 1146651283
OPTIONS maxdumps=1;
/etc 0 SIZE 5990
/var/named 0 SIZE 40
[goes on and reports the rest]


amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, retrying
amandad: dgram_recv: timeout after 10 seconds
amandad: waiting for ack: timeout, giving up!
amandad: pid 6636 finish time Wed May  3 12:20:06 2006


Reproducible at will: amandad always times out after 5 minutes. Meanwhile, 
amdump stays there waiting for... Well, I don't know, frankly, but I have to 
C-c it and amcleanup afterwards.

What I've already done is increase the etimeout parameter on the server side: 
I put 1200 instead of the default value, 300. But that didn't help. Out of 
despair I even tried and changed this value in the old server config files, 
in case amandad would try and read them :p But no.

It should also be noted that the client machine is such a mess that my 
predecessor of a sysadmin created 6 aliases for interface eth0... I had to 
bind amandad specifically to the address I wanted so that dumps could work in 
the first place. But I don't see this having an influence here, since smaller 
backups work perfectly...

I'd appreciate any hint on this one!

Thanks,
-- 
Francis Galiegue, [EMAIL PROTECTED]
One2team - 12bis rue de la Pierre Levée, 75011 PARIS - 0143381980

Re: amandad can't be started

2006-02-23 Thread Patrick Mania


Jon LaBadie wrote:


On Thu, Feb 23, 2006 at 04:34:19PM +0100, Patrick Mania wrote:
 


Dear group,



The problem is, that the amandad canŽt be started.

I execute $Prefix/libexec/amandad, this takes a while (30 secs) and breaks.


   


amandad is not intended to run continually nor to be run from the command line.
It starts as needed by {x}inetd and communicates over the network.
You are just seeing its timeout and exit when no one is talking to it
over the network.

 


Hi Jon,

thanks for your reply. I´ve just started the amandad manually because on 
"amcheck" from the amanda server the result is "request timed out" for 
this machine.


Greetings,
Patrick

Re: amandad can't be started

2006-02-23 Thread Jon LaBadie

On Thu, Feb 23, 2006 at 04:34:19PM +0100, Patrick Mania wrote:
> Dear group,
> 
>  
> 
> The problem is, that the amandad canŽt be started.
> 
> I execute $Prefix/libexec/amandad, this takes a while (30 secs) and breaks.
> 
>  
amandad is not intended to run continually nor to be run from the command line.
It starts as needed by {x}inetd and communicates over the network.
You are just seeing its timeout and exit when no one is talking to it
over the network.

-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)

amandad can't be started

2006-02-23 Thread Patrick Mania









Dear group,

 

i´am trying to install the amanda client from the
current version.

I´ve configured it with user=amanda group=disk and
client installation only

The prefix is /usr/amanda.

 

The problem is, that the amandad can´t be started.

I execute $Prefix/libexec/amandad, this takes a while
(30 secs) and breaks.

 

The Logfile says:

 

serv:/usr/amanda/libexec# cat
/tmp/amanda/amandad.20060220015348.debug 

amandad: debug 1 pid 2500 ruid 1000 euid 1000: start
at Mon Feb 20 01:53:48 2006

amandad: version 2.4.5p1

 

 Building Infos 

 

amandad: time 29.992: dgram_recv: timeout after 30
seconds

amandad: error receiving message: timeout

amandad: time 29.993: error receiving message:
timeout

amandad: time 29.993: pid 2500 finish time Mon Feb 20
01:54:18 2006

 

This is my inetd.conf:

 

serv:/usr/amanda# cat /etc/inetd.conf 

# /etc/inetd.conf:  see inetd(8) for further
informations.

#

# Internet server configuration database

#

#

# Lines starting with "#:LABEL:" or
"##" should not

# be changed unless you know what you are doing!

#

# If you want to disable an entry so it isn't touched
during

# package updates just comment it out with a single
'#' character.

#

# Packages should modify this file by using
update-inetd(8)

#

#  


#

#:INTERNAL: Internal services

#echo   stream  tcp nowait  root   
internal

#echo   dgram   udp wait    root   
internal

#chargen    stream  tcp nowait  root   
internal

#chargen    dgram   udp wait    root   
internal

#discard    stream  tcp nowait  root   
internal

#discard    dgram   udp wait    root   
internal

#daytime    stream  tcp nowait  root   
internal

#daytime    dgram   udp wait    root   
internal

#time   stream  tcp nowait  root   
internal

#time   dgram   udp wait    root   
internal

 

#:STANDARD: These are standard services.

 

#:BSD: Shell, login, exec and talk are BSD protocols.

shell   stream  tcp nowait  root   
/usr/sbin/tcpd  /usr/sbin/in.rshd

login   stream  tcp nowait  root   
/usr/sbin/tcpd  /usr/sbin/in.rlogind

exec    stream  tcp nowait  root    /usr/sbin/tcpd 
/usr/sbin/in.rexecd

amanda  dgram   udp wait    amanda 
/usr/amanda/libexec/amandad amandad

 

The /etc/services file:

hitchhiker:/usr/amanda/libexec# cat /etc/services |
grep amanda

amanda  10080/udp   #
Amanda backup services

kamanda 10081/tcp   #
amanda backup services (Kerberos)

kamanda 10081/udp

amandaidx   10082/tcp   #
amanda backup services

amidxtape   10083/tcp   #
amanda backup services

 

 

The inetd seems to work:

serv:/usr/amanda/libexec# netstat -a | grep amanda

udp    0  0 *:amanda    *:*    

 

   

The /etc/hosts.deny and /etc/hosts.allow files are
empty.

The Items in
http://wiki.zmanda.com/index.php/Amcheck:_selfcheck_request_timed_out seems to
be correct.

 

Any suggestions?

Greetings,
Patrick

Re: amandad busy (solved)

2006-02-06 Thread shoaib r

f.y.i.     The reboot of the client did indeed fix the problem without me having to change any of the timeout settings. I only refrained from doing this because my timeouts were more than high enough already and as the backup size hadn't suddenly changed, the backups should have worked with their current settings.     Would love to know what the reboot does to sort the problem. Prior to the reboot I had killed off all the amanda owned processes but still the backups failed.  The reboot suggestion from Gene sorted it however.     Thank you for your input Gene and amanda-users!     Shabs.Gene Heskett <[EMAIL PROTECTED]> wrote:  On Friday 27 January 2006 06:05, shoaib r wrote:>I've had backups working for over 2 mon!
 ths
 now.>> A system reboot started daily failures on the remote machine:>> Report says:> ==> FAILURE AND STRANGE DUMP SUMMARY:> planner: ERROR host1NAK : amandad busy> host01 /export RESULTS MISSING>> etc...>> amandad.debug says:> > amandad: time 30.009: dgram_recv: timeout after 30 seconds>amandad: error receiving message: timeout>amandad: time 30.010: error receiving message: timeout>> No changes made to amanda.conf or any other configuration.>> The tape server is backing its filesystems up sucessfully. However> the one and only remote server is not.>> Does anyone have any ideas?>> Thank youISTR I've had that happen once or twice, not recently, because a zombie was left on the client, possibly due to a timeout error. A reboot of the client should fix that. And you !
 might add
 a bit to the timeout values in your amanda.conf just for insurance. An additional 25% should be enough, if indeed that was the cause.-- Cheers, GenePeople having trouble with vz bouncing email to me should add the word'online' between the 'verizon', and the dot which bypasses vz'sstupid bounce rules. I do use spamassassin too. :-)Yahoo.com and AOL/TW attorneys please note, additions to the abovemessage by Gene Heskett are:Copyright 2006 by Maurice Eugene Heskett, all rights reserved.
		Yahoo! Photos  NEW, now offering a quality print service from just 8p a photo.

Re: amandad busy

2006-01-30 Thread shoaib r

sorry I should have mentioned, I'm using Solaris 9.     My etimeout is set to -8000 (which has been ok for months) but i still think its way to much. I didnt change it as this is the error i'm getting now:   FAILURE AND STRANGE DUMP SUMMARY:  planner: ERROR Estimate timeout from host01  host01 /u01 lev 0 FAILED [disk /u01, all estimate timed out]  host01 /opt lev 0 FAILED [disk /opt, all estimate timed out]  host01 /var lev 0 FAILED [disk /var, all estimate timed out]  host01 /usr lev 0 FAILED [disk /usr, all estimate timed out]  host01 / lev 0 FAILED [disk /, all estimate timed out]  host01 /export lev 0 FAILED 20060124 [too many dumper retry]     Will schedule a reboot in and see how it goes.     The above error would indicate to me that the etimeout isn't high eno!
 ugh but
 its already at 8000secs (133mins)!! Dont understand that.     Thanks for input so far guys.     shabs  Salvatore Enrico Indiogine <[EMAIL PROTECTED]> wrote:  2006/1/27, shoaib r <[EMAIL PROTECTED]>:> I've had backups working for over 2 months now.> A system reboot started daily failures on the remote machine:> The tape server is backing its filesystems up sucessfully. However the one> and only remote server is not.>> Does anyone have any ideas?> Thank youJust a shot in the dark, but a reboot will usually boot the newlyinstalled kernel if you have something like yum update set onautomatic.Check e.g. /var/log/messages, uname -r, dmesgI am assuming you run Linux and use yum. Simila!
 r things
 could havehappened with automatic apt-get upgrade.--Enrico IndiogineParasol LaboratoryTexas A&M University[EMAIL PROTECTED][EMAIL PROTECTED]979-845-3937  
		To help you stay safe and secure online, we've developed the all new Yahoo! Security Centre.

Re: amandad busy

2006-01-27 Thread Salvatore Enrico Indiogine

2006/1/27, shoaib r <[EMAIL PROTECTED]>:
> I've had backups working for over 2 months now.
> A system reboot started daily failures on the remote machine:
> The tape server is backing its filesystems up sucessfully. However the one
> and only remote server is not.
>
> Does anyone have any ideas?
>   Thank you

Just a shot in the dark, but a reboot will usually boot the newly
installed kernel if you have something like yum update set on
automatic.

Check e.g. /var/log/messages, uname -r, dmesg

I am assuming you run Linux and use yum.   Similar things could have
happened with automatic apt-get upgrade.

--
Enrico Indiogine
Parasol Laboratory
Texas A&M University

[EMAIL PROTECTED]
[EMAIL PROTECTED]
979-845-3937

Re: amandad busy

2006-01-27 Thread Gene Heskett

On Friday 27 January 2006 06:05, shoaib r wrote:
>I've had backups working for over 2 months now.
>
>  A system reboot started daily failures on the remote machine:
>
>  Report says:
>  ==
>FAILURE AND STRANGE DUMP SUMMARY:
>  planner: ERROR host1NAK : amandad busy
>  host01 /export RESULTS MISSING
>
>  etc...
>
>  amandad.debug says:
>  
>  amandad: time 30.009: dgram_recv: timeout after 30 seconds
>amandad: error receiving message: timeout
>amandad: time 30.010: error receiving message: timeout
>
>  No changes made to amanda.conf or any other configuration.
>
>  The tape server is backing its filesystems up sucessfully. However
> the one and only remote server is not.
>
>  Does anyone have any ideas?
>
>  Thank you

ISTR I've had that happen once or twice, not recently, because a zombie 
was left on the client, possibly due to a timeout error.  A reboot of 
the client should fix that. And you might add a bit to the timeout 
values in your amanda.conf just for insurance.  An additional 25% 
should be enough, if indeed that was the cause.

-- 
Cheers, Gene
People having trouble with vz bouncing email to me should add the word
'online' between the 'verizon', and the dot which bypasses vz's
stupid bounce rules.  I do use spamassassin too. :-)
Yahoo.com and AOL/TW attorneys please note, additions to the above
message by Gene Heskett are:
Copyright 2006 by Maurice Eugene Heskett, all rights reserved.

amandad busy

2006-01-27 Thread shoaib r

I've had backups working for over 2 months now.     A system reboot started daily failures on the remote machine:     Report says:  ==FAILURE AND STRANGE DUMP SUMMARY:  planner: ERROR host1NAK : amandad busy  host01 /export RESULTS MISSING  etc...     amandad.debug says:  ====  amandad: time 30.009: dgram_recv: timeout after 30 secondsamandad: error receiving message: timeoutamandad: time 30.010: error receiving message: timeout     No changes made to amanda.conf or any other configuration.     The tape server is backing its filesystems up sucessfully. However the one and only remote server is not.     Does anyone have any ideas?     Thank
 you
		Yahoo! Photos  NEW, now offering a quality print service from just 8p a photo.

amandad still running/ RESULTS MISSING

2005-05-27 Thread Vicki Stanfield

I am having a problem with amanda whereby some of the dumps (DLE's on 
the amanda server itself) are showing as RESULTS MISSING in the logs. 
There is an amandad process which is still running after quite a long 
time (24 hours) and which has a defunct child process:


backup3318  0.0  0.1  3044 1076 ?SMay26   0:00 amandad
backup3320  0.0  0.0 00 ?ZMay26   0:00 [amandad 
]


I googled the RESULTS MISSING error and checked that the appropriate 
files have SUID set and that the entries in inetd are right (they have 
not been changed). The only change that I have made in the last couple 
days is to add a DLE on a remote system (which already had other DLE's 
being backed up). Anyone have a clue for me?


TIA,
Vicki

Amcheck issue - i run amandad as amanda user and here is the blurb

2005-05-18 Thread Chuck Amadi Systems Administrator

amandad: debug 1 pid 2758 ruid 2003 euid 2003: start at Wed May 18
17:13:31 2005
amandad: version 2.4.4p2
amandad: build: VERSION="Amanda-2.4.4p2"
amandad:BUILT_DATE="Wed Jun 30 23:45:53 UTC 2004"
amandad:BUILT_MACH="Linux eisenstein 2.6.5 #1 Thu Nov 14
12:14:04 UTC 2002 i686 i686 i386 GNU/Linux"
amandad:CC="gcc"
amandad:CONFIGURE_COMMAND="'./configure'
'--mandir=/usr/share/man' '--prefix=/usr' '--infodir=/usr/share/info'
'--sysconfdir=/etc' '--libdir=/usr/lib' '--libexecdir=/usr/lib/amanda'
'--localstatedir=/var/lib' '--with-index-server=localhost'
'--with-gnutar-listdir=/var/lib/amanda/gnutar-lists'
'--with-smbclient=/usr/bin/smbclient' '--with-amandahosts'
'--with-user=amanda' '--with-group=disk' '--with-gnutar=/bin/tar'
'--disable-libtool' '--disable-shared' '--disable-static'"
amandad: paths: bindir="/usr/bin" sbindir="/usr/sbin"
amandad:    libexecdir="/usr/lib/amanda" mandir="/usr/share/man"
amandad:AMANDA_TMPDIR="/tmp/amanda" AMANDA_DBGDIR="/tmp/amanda"
amandad:CONFIG_DIR="/etc/amanda" DEV_PREFIX="/dev/"
amandad:    RDEV_PREFIX="/dev/r" DUMP="/sbin/dump"
amandad:RESTORE="/sbin/restore" VDUMP=UNDEF VRESTORE=UNDEF
amandad:XFSDUMP=UNDEF XFSRESTORE=UNDEF VXDUMP=UNDEF
VXRESTORE=UNDEF
amandad:SAMBA_CLIENT="/usr/bin/smbclient" GNUTAR="/bin/tar"
amandad:COMPRESS_PATH="/usr/bin/gzip"
amandad:UNCOMPRESS_PATH="/usr/bin/gzip" LPRCMD="/usr/bin/lpr"
amandad:MAILER="/usr/bin/Mail"
amandad:listed_incr_dir="/var/lib/amanda/gnutar-lists"
amandad: defs:  DEFAULT_SERVER="localhost" DEFAULT_CONFIG="DailySet1"
amandad:    DEFAULT_TAPE_SERVER="localhost"
amandad:DEFAULT_TAPE_DEVICE="/dev/null" HAVE_MMAP HAVE_SYSVSHM
amandad:LOCKING=POSIX_FCNTL SETPGRP_VOID DEBUG_CODE
amandad:AMANDA_DEBUG_DAYS=4 BSD_SECURITY USE_AMANDAHOSTS
amandad:CLIENT_LOGIN="amanda" FORCE_USERID HAVE_GZIP
amandad:COMPRESS_SUFFIX=".gz" COMPRESS_FAST_OPT="--fast"
amandad:COMPRESS_BEST_OPT="--best" UNCOMPRESS_OPT="-dc"
amandad: time 29.995: dgram_recv: timeout after 30 seconds
amandad: error receiving message: timeout
amandad: time 29.995: error receiving message: timeout
amandad: time 29.995: pid 2758 finish time Wed May 18 17:14:01 2005
~
Has anyone any idea and is there info realting to my Client hist always
been down!.

Re: localhost amandad busy

2005-03-25 Thread Stefan G. Weichinger

Hi, Bert,

on Freitag, 25. März 2005 at 14:52 you wrote to amanda-users:

Bpb> WARNING: Usage of fully qualified hostname recommended for Client 
localhost.
Bpb> WARNING: Usage of fully qualified hostname recommended for Client 
localhost.
Bpb> ERROR: NAK localhost: amandad busy
Bpb> Client check: 3 hosts checked in 20.062 seconds, 1 problem found

Bpb> Googling around didn't solve too much,

The mentioned WARNING is pretty new to AMANDA.

Bpb> Simple solution : do what Amanda asks you to do : don't use localhost   
 :-)
Bpb> So, after changing the localhost for smb-shares to
Bpb> goscinny; the problem went away.

Good.

And if someone finds this thread after googling now, he may also refer
to http://www.amanda.org/docs/topten.html#id2519209 for more on this
good old topic.
-- 
best regards,
Stefan

Stefan G. Weichinger
mailto:[EMAIL PROTECTED]

localhost amandad busy

2005-03-25 Thread Bert_De_Ridder


Not a question, just something I encountered
today; 

I had a disklist containing some local
directories and some nt server's directories; something like this :

# Local directory
goscinny /home server-compress-low
goscinny /root server-compress
goscinny /var server-compress

# NT-machine
localhost //CMS/CMS server-compress-low
localhost //CMS/wwwroot server-compress-low
localhost //CMS/IIS_NR server-compress-low

When doing an amcheck, I got this error
:

WARNING: Usage of fully qualified hostname
recommended for Client localhost.
WARNING: Usage of fully qualified hostname
recommended for Client localhost.
ERROR: NAK localhost: amandad busy
Client check: 3 hosts checked in 20.062
seconds, 1 problem found

I should also mention that xinetd was
being used on site, not inetd.

Googling around didn't solve too much,
some suggestions about changing the amanda configuration in xinetd.d directory.
But all to no avail.

Simple solution : do what Amanda asks
you to do : don't use localhost    :-)
So, after changing the localhost for
smb-shares to goscinny; the problem went away.




Regards,

Bert De Ridder

PeopleWare NV - Head Office
Cdt.Weynsstraat 85 
B-2660 Hoboken 
Tel: +32 3 448.33.38 
Fax: +32 3 448.32.66 

PeopleWare NV - Branch Office Geel
Kleinhoefstraat 5
B-2440 Geel
Tel: +32 14 57.00.90
Fax: +32 14 58.13.25

http://www.peopleware.be

http://www.mobileware.be

Re: How to run amandad without inetd

2005-01-08 Thread Mathias Koerber

-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
David Newman said the following on 1/9/2005 1:49 PM:
|>I configured all host the same using amanda as user name and group name at
|>compile time. I checked and rechecked. When I run amcheck on 11.00 server
|>to connect to itself there is a debug entry in /tmp/amanda for amcheck but
|>nothing for amandad. If I do this on a working machine there is an entry
|>in /tmp/amanda for amcheck and anandad.
|
|
| Yes, after running amcheck you should see at least four files in
| /tmp/amanda: two from amandad, one from amcheck, and one from selfcheck.
|
| It sounds like this is indeed some problem with amandad not running on the
| 11.00 box. From the packets below, it doesn't look like amandad ever gets
| invoked.
|
Could this be a problem with
~  a) a firewall blocking access to (one of) the port(s)
on Linux i would say iptables. I have no idea what
the equivalent on HP-UX is...
~  b) tcp-wrappers not allowing access (even to localhost)
(check /etc/hosts.* files)?
just a guess
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.5 (MingW32)
iD8DBQFB4MpQRWbA1HjQuHERAo/KAJwL9qkMFhQRcIHkIrzEXCpQARayLQCgt5lT
021rdjxhMxp2CYBP6qWBTkU=
=rTrU
-END PGP SIGNATURE-

1 2 3 4 >

1 - 100 of 304 matches

Mail list logo