Thanks for your insight. We also work with Elasticsearch and I appreciate the
easy analysis (once one understands Kibana's logic). Do you use the job
completion plugin as is? Or did you modify it to account for SSL or
additional metrics?
From a central location, we poll data from each cluster - including sacct
output, but also KPI-like measures (node status, partitions, accounts). These
are just streams of JSON that flow through Logstash.
Mainly this is because we need a detailed, global view across all clusters,
but also partly for historic reasons (pre-existing systems expect job data
in a different, ad-hoc format), and partly to keep systems loosely coupled.
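
For illustration only, here is a minimal sketch of what the sacct part of such
a poller could look like. It is not the actual code: the field list, the
function names (parse_sacct, poll_cluster, emit) and the extra "cluster" key
are all assumptions made for the example. It relies only on standard sacct
options (--allusers, --parsable2, --noheader, --clusters, --starttime,
--format) and writes one JSON object per line.

```python
import json
import subprocess

# Assumed field list, chosen for illustration; any valid sacct
# --format fields could be used instead.
FIELDS = ["JobID", "User", "Account", "Partition", "State", "Elapsed"]


def parse_sacct(text, cluster, fields=FIELDS):
    """Turn pipe-delimited sacct output (--parsable2 --noheader) into dicts."""
    records = []
    for line in text.splitlines():
        if not line.strip():
            continue
        values = line.split("|")
        if len(values) != len(fields):
            continue  # skip lines that do not match the requested fields
        record = dict(zip(fields, values))
        record["cluster"] = cluster  # hypothetical key, tags the source cluster
        records.append(record)
    return records


def poll_cluster(cluster, start):
    """Run sacct for one cluster; `start` is passed to --starttime verbatim,
    e.g. "2024-01-01T00:00:00"."""
    cmd = [
        "sacct", "--allusers", "--parsable2", "--noheader",
        "--clusters", cluster,
        "--starttime", start,
        "--format", ",".join(fields for fields in FIELDS),
    ]
    result = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return parse_sacct(result.stdout, cluster)


def emit(records, stream):
    """Write one JSON object per line to `stream`."""
    for record in records:
        stream.write(json.dumps(record, sort_keys=True) + "\n")
```

A caller would do something like emit(poll_cluster("clusterA",
"2024-01-01T00:00:00"), sys.stdout) per cluster. Newline-delimited JSON of
this kind is the sort of stream a Logstash input with a JSON-lines codec can
consume, though how it is actually shipped is left open here.
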
regards, mark hahn
--
operator may differ from spokesperson. h...@mcmaster.ca