2016-03-17 22:15 GMT+01:00 David Lang <[email protected]>:
> I have been running 8.17 from the repo combined with a copy of liblognorm
> 2.0.0 that I compiled during the 8.16 timeframe. Since the upgrade to 8.17
> I've been getting a few coredumps. Since I enabled async writing to
> dynamically generated files, rsyslog is using 1500+ threads (up from ~17
> threads prior to this change) and has been running a 32G box OOM repeatedly.

Just a side-note: async writing is very ressource intense. Among
other, it requires one background thread per file. It should be turned
on only if there is sufficient slow processing, e.g zip writing with
larger io buffers. If io buffers are small, there is also probably
lots of locking contention between forground action processing and
background writer.

>
> Figuring that there is a reasonable chance that these problems are due to
> the mixing of versions, I compiled from today's git tree and deployed that
> to a server that's receiving a flood of logs from queues that are flushing
> to it. When I deployed the new version, the throughput dropped noticably
> (~30% drop from handling ~300K messages/min to handling 200K messages/min)

One thing that triggers my mind is the change of hash function. In
8.17, we use a different hash function inside libfastjson. This has
prooven to be much faster in experiments we carried out, but maybe
it's not the case for you. You can comment out line 1640

https://github.com/rsyslog/rsyslog/blob/master/tools/rsyslogd.c#L1640

here and see what happens.

I need to go through the rest of the thread in more detail. However, I
have reviewed the ChangeLog and neither it nor my memory points into
any changes in 8.17 that would explain what you see (also refering to
your later mail). So we probably need to track things down. A good
start would be if you could run tests on a dedicated test system.
Running under valgrind to see the potential leak and doing a git
bisect to find a potential culprit sounds like good next steps to me.

Rainer
>
> This is with no config changes, just changing the binary packages.
>
> This is with a rather complex ruleset (11 queues, 75+ actions, at least 4
> mmnormalize calls, one with a 1500 line ruleset)
>
> Since pushing the new version to this machine, no coredumps and no OOM (not
> definitive given that it's only been about 4 hours, but highly suggestive)
>
> David Lang
> _______________________________________________
> rsyslog mailing list
> http://lists.adiscon.net/mailman/listinfo/rsyslog
> http://www.rsyslog.com/professional-services/
> What's up with rsyslog? Follow https://twitter.com/rgerhards
> NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of
> sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T
> LIKE THAT.
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE 
THAT.

Reply via email to