Thank you, Mr Troll,  for complaining about our posting techniques  (is this 
better?)  but
giving no actual help.    Seann,  any luck yet?    Top posting because this is 
not a REPLY
but a query.
Deb 



>>>> On Aug 17, 2015, at 2:33 PM, Seann <nombran...@tsukinokage.net> wrote:
>>>> 
>>>>> All,
>>>>> 
>>>>> I am looking for a little direction on a problem that has cropped up for 
>>>>> me recently.
>>>>> 
>>>>>  I have a backup set, that was created using Amanda 2.5 (default on 
>>>>> CentOS 5.11) and ran very well, both manually and from the cron job I had 
>>>>> set for it.
>>>>> It has approximately 13 hosts to backup, from as simple as backing up a 
>>>>> single directory, to backing up the full system, and it ran with no 
>>>>> issues on CentOS 5.11.
>>>>> The basic setup is using hard drives as the backup media, compressing the 
>>>>> backups to save space, using server compression, these also use GNU-TAR 
>>>>> as the archive format.
>>>>> 
>>>>>  Fast forward to today, I have the system upgraded to CentOS 7, which 
>>>>> also upgraded to Amanda 3.3.3-13, and after some configuration file 
>>>>> re-writing, I got most of the backups to work.
>>>>> Two systems, one backing up the web directory, the other backing up the 
>>>>> full disk, fail constantly.
>>>>> When these two disklist statements are removed, the backup runs, and 
>>>>> takes approximately 2 and a half hours to run on the 8 other hosts (the 
>>>>> other 3 hosts are currently offline and not in scope).
>>>>> 
>>>>>  When the CRON job kicks off at midnight, it runs for over 12 hours (I 
>>>>> have the etimeout set to one day, as the planner kept dying saying to 
>>>>> timed out).
>>>>> This is the same basic error that I get with the two above mentioned 
>>>>> failing backups.
>>>>>  
>>>>> When the hung backup job is running, I see the dumpers and main dump 
>>>>> process running on the backup server, but nothing in the logs outside of 
>>>>> the "We started the backup job" type of log messages.
>>>>> On all of the hosts, I don't see the client running, nor to I see any TAR 
>>>>> processes running.
>>>>> 
>>>>> There are also no clues in the logs on which host the server is waiting 
>>>>> on, and checking all the hosts in scope show they are all in the same 
>>>>> state, that is they have sent the estimate to the backup server and are 
>>>>> waiting on the next phase.
>>>>> 
>>>>> Any help on this would be appreciated, and also is there a better way of 
>>>>> making sense of the logs (such as using something like Graylog2?), and on 
>>>>> reporting for issues with Amanda 3.3?
>>>>> 
>>>>> Regards,
>>>>> Seann
>>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> On 8/17/2015 3:05 PM, Debra S Baddorf wrote:
>>>> 
>>>> Does this include the two clients who fail — do THEY also say that their 
>>>> estimates are complete?   Or are they still working on estimates, and thus 
>>>> holding up the whole works?  All of the estimates seem to need to finish,  
>>>> before anybody gets to start.
>>>> 
>>>> Deb Baddorf
>>>> Fermilab
>>>> 
>>>>  
>>> On Aug 17, 2015, at 3:25 PM, Seann <nombran...@tsukinokage.net> wrote:
>>> 
>>> Deb,
>>> 
>>>  The CRON issue happens with, and without the two clients who fail.
>>> 
>>> All of my normal testing, and manual runs are done with the two problem 
>>> hosts disabled, because they are known problems, and I figure it is due to 
>>> large amounts of files on both of them causing a timeout.
>>> 
>>> I am working on a separate solution  to see if I can get a better method of 
>>> backup on those two hosts.
>>> They are both VM's and one is notoriously slow in disk reads, in addition 
>>> to a large amount of files in the directory that is backed up.
>>> Two other similar VM's backup properly without issue.
>>> 
>>>  Regards,
>>> Seann
>>> 
>>  
>> 
>> On 8/17/2015 3:33 PM, Debra S Baddorf wrote:
>> 
>> Ahh.   So are you saying that the other nodes work when the backup is run 
>> manually,   but hang forever when it is run from a CRON job?
>> That might imply permissions when run as a cron job vs live….
>> 
>> Deb
>> 
>  On Aug 17, 2015, at 4:00 PM, Seann <nombran...@tsukinokage.net> wrote:
> 
> Deb,
> 
> That would make sense.
> On the new install, the process is using amandabackup as the user.
> 
>  When I run the process manually, I run it as follows:
> 
>  sudo -u amandabackup amdump DailySet1
> 
>  I have the CRON scheduled in the amandabackup user CRON
>  Editing it using this command:
> 
> crontab -u amandabackup -e
>  
> I will check permissions around the filesystem and Amanda related files to 
> clear those out as possible suspects.
> 
> Also, no selinux is running on the system, as that was already seen as a 
> possible issue to be resolved at a later date, after everything is working 
> normally.
> 
> Regards,
> Seann
> 
>  
On Aug 17, 2015, at 4:09 PM, Debra S Baddorf <badd...@fnal.gov> wrote:

Check that the parts in  /etc/xinetd.d    (or similar … I’m on linux)   are 
ALSO  using  amandabackup
and not your previous backup name.   The services  have  a    user=XXXXX   line 
  in them.
Grep a bit  (or maybe somebody else will join the conversation!)

 
My  files to start the services look like this  (I start several flavors of the 
AMANDAD service,  for several
generations of clients,  so look mostly at the  USER=   line here) :

 service amanda

{
       socket_type             = stream 
       protocol                = tcp
       wait                    = no
       user                    = operator
       group                   = root
       server                  = /usr/local/libexec/amanda/amandad
       server_args             = -auth=bsdtcp amdump amindexd amidxtaped
       disable                 = no
       groups                  = yes
}

Deb

 

 

 
 

>>>>> Regards,
>>>>> Seann
>>>>> 
>>>>>  
>>>>> 
>>>>> This message is confidential. It may also be privileged or otherwise 
>>>>> protected by work product immunity or other legal rules. If you have 
>>>>> received it by mistake, please let us know by e-mail reply and delete it 
>>>>> from your system; you may not copy this message or disclose its contents 
>>>>> to anyone. Please send us by fax any message containing deadlines as 
>>>>> incoming e-mails are not screened for response deadlines. The integrity 
>>>>> and security of this message cannot be guaranteed on the Internet.
>>>>> 
>>>>> _____________________________________________________________________
>>>>> 
>>>>>  
>>>>> 
>>>>> This email and any files transmitted with it are confidential and 
>>>>> intended solely for the use of the individual or entity to whom they are 
>>>>> addressed. If you have received this email in error please notify the 
>>>>> system manager. Please note that any views or opinions presented in this 
>>>>> email are solely those of the author and do not necessarily represent 
>>>>> those of the company. Finally, the recipient should check this email and 
>>>>> any attachments for the presence of viruses. The company accepts no 
>>>>> liability for any damage caused by any virus transmitted by this email.
>>>>> 
>>>>>  
>>>>> 
>>>>> Tsukinokage.net Omaha, Nebraska
>>>>> 
>>>>>  

Reply via email to