Hi,
On 12/9/20 3:48 PM, iono sphere wrote:
I am not sure who could I ask this, but I would like to try here.
Currently, I'm seeing something weird in my server. Thanks to Prometheus
and Node-Exporter, I have seen that node_filesystem_avail_bytes has been
fluctuating up and down for hundreds of Megabytes once a day or two. So
I am trying to hunt down the root cause of this. I have been asking
everyone already that at the time of
node_filesystem_avail_bytes increasing/decreasing for hundreds of
Megabytes nobody has access the server. I have already checked the
auth.log file as well to confirm this. Our server only has a PHP Larvel
project being used as a development server, MySQL, Vue.js Front-End
Server, and Docker to containerize these. But none of these should
create or remove hundreds of Megabytes...
My current guess right now is the swap file might be at work but then I
am not sure.
Swap often has its own partition. If it's really a file, it is usually
pre-allocated, I think. So this sounds unlikely to me.
Are there other possible systematic process in Ubuntu that
creates/removes hundreds of Megabytes once in a day or two?
Cron jobs or systemd timers come to mind. Log rotation might behave that
way for large logs. MySQL might perform some reorganization. Prometheus
itself runs a process called compaction which temporarily needs more space:
https://prometheus.io/docs/prometheus/latest/storage/#compaction
Checking the Prometheus logs and maybe increasing the log level can
help. There are also metrics around compaction (e.g.
prometheus_tsdb_compactions_total) which are worth checking.
In fact, is there a way to subtract all the values of expected
systematic processes from node_filesystem_avail_bytes?
I don't think so. node_exporter takes the value directly from the
relevant kernel interfaces. It's hard to decide what's systematic and
what not. ;)
If you cannot find the cause after checking the items from above, you
can try hunting it down using systematic tracing (perf trace, systemtap,
iosnoop [1]).
[1] http://www.brendangregg.com/blog/2014-07-16/iosnoop-for-linux.html
Kind regards,
Christian
--
You received this message because you are subscribed to the Google Groups
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to prometheus-users+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/prometheus-users/8c1b329e-7821-5413-f073-7b01ce453591%40hoffmann-christian.info.