hosts entry not present or 'After=network.target' not present in Unit file)

deoren Sat, 08 Jul 2017 20:19:44 -0700

On 7/8/17 9:23 PM, David Lang wrote:

On Sat, 8 Jul 2017, deoren wrote:
Looking around I learned of these two directives:

$DebugLevel 2
$DebugFile /var/log/rsyslog-debug.log
I added those, rebooted the VM and quickly had lots of debug info towork with. In the file I found these entries:
5676.682567045:sendToLogserver queue:Reg/w0: error 111 in getaddrinfo
5676.682583062:sendToLogserver queue:Reg/w0: end relpSessConnect, iRet100145676.682588226:sendToLogserver queue:Reg/w0: Action 1 transitioned tostate: rtry5676.682594836:sendToLogserver queue:Reg/w0: action 'sendToLogserver':is transactional - executing in commit phase5676.682597400:sendToLogserver queue:Reg/w0: actionDoRetry:sendToLogserver enter loop, iRetries=0, ResumeInRow 1
5676.682684496:sendToLogserver queue:Reg/w0: error 111 in getaddrinfo
5676.682688779:sendToLogserver queue:Reg/w0: end relpSessConnect, iRet100145676.682691652:sendToLogserver queue:Reg/w0: actionDoRetry:sendToLogserver action->tryResume returned -20075676.682693866:sendToLogserver queue:Reg/w0: actionDoRetry:sendToLogserver check for max retries, iResumeRetryCount -1, iRetries 0
I believe that getaddrinfo is attempting to lookup the IP for thegiven FQDN, but it's failing with whatever error 111 is. Looking atthe counts given by way of the impstats module, it appears that thequeue is only growing and even if the system appears to be fullyfunctional and other daemons are accessing the network without issue,rsyslog still refuses to send messages to the remote system.
If rsyslog is unable to resolve the name, it cannot send the message. Ialways put the log destinations in /etc/hosts or configure it to send toan IP address.

I can see the advantage to that approach and have been considering bothapproaches (the second approach the more appealing)

rsyslog will suspend sending logs when the attempt to connect fails, andwill only retry periodically (with a back-off to keep from probing toofrequently as the probes themselves can be a problem)

I see lots of retries within the log file, and it's not necessarily thetime it waits that I see as a problem (at least in this initialconfiguration), but that once rsyslog tries to resolve the name itdoesn't seem (looking at this from the outside, just by the "feel" ofthings) to ever make another attempt at resolving the name to an IP oncethe network is fully established. It feels like rsyslog is continuing touse a cached result (whatever that may be) with future retry attempts. Ileft the test system running overnight and it was still hung up the nextday.

we've had other people report that the backoff gets unreasonably long,we should put in a limit to how long it will wait.

I'm not one to argue with adding more knobs/buttons to fine tunebehavior. :)

In your debug logs, look for the initial suspend message, it should saywhen it will try again (you can also configure rsyslog to log suspendsand resumes as well)

Thank you for those tips. I'm fairly new to anything close to 'advanced'with rsyslog, but I believe I have logging of suspends and resumesenabled globally and via the specific queue for omrelp. My hope was thatit would assist with determining whether anything at all was happening.

For what it is worth, I'm booting this test Ubuntu system from a SSD.Once I move it onto slower storage I see a 3x slower startup time:

root@vmclone:/var/log# systemd-analyze critical-chain rsyslog.service |grep rsyslog.service

rsyslog.service +616ms

Running the same command on the SSD copy of that VM I see about 220msstartup time. I'm also new to systemd, so I might be misinterpreting thevalues, but it appears that the slower load time for rsyslog is givingthe system sufficient time to load all required networking support sothat the remote server's name resolves to an IP properly.

As indicated before, regardless of the boot speed, if I enter the IPmapping in /etc/hosts, the bare address within the omrelp action(target) or if I add 'After=network.target' to the/lib/systemd/system/rsyslog.service file ([Unit] section) I get positiveresults. It is when I don't do one of those things and boot the VM fromthe SSD that I'm seeing these results.

Any thoughts/tips/tricks re the repeat getaddrinfo failures, continuing(according to the debug log file) even after the system has been up fora while? Other applications, such as remote_rsyslog2, are able to sendmessages to the remote syslog server (same version of ryslog as theclient) using the FQDN without issue.


Thanks again for your help with this.
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE 
THAT.

Re: [rsyslog] Log messages held when using FQDN as omrelp target and (/etc/hosts entry not present or 'After=network.target' not present in Unit file)

Reply via email to