Hi Janne,
The configuration is correct. Please try the command :
$ ipmi-sensors --non-abbreviated-units | grep Watts
If you have nothing, then the IPMI plugin cannot be used on this hardware.
We plan to add an option to used the DCMI power to support more hardwares.
The undocumented options "EnergyIPMIxxxx" are wrappers for
"ipmi-sensors" options.
I never have reason to use them.
Except "timeout" and "reflush", when I used BMC in unstable "dev" state,
but in this case I had troubles with ipmi-sensors too.
Thomas
Le 03/09/2014 11:44, Janne Blomqvist a écrit :
Hi,
has anyone got the acct_gather_energy/ipmi plugin to work correctly?
In acct_gather.conf I have the lines
EnergyIPMIFrequency=30
EnergyIPMICalcAdjustment=yes
and in slurm.conf
DebugFlags=Profile
AcctGatherNodeFreq=30
AcctGatherEnergyType=acct_gather_energy/ipmi
However, the end result is that in the slurmd logs when starting
slurmd a line like
[2014-08-28T10:44:52.179] Power sensor not found.
appears.
I suspect that the reason is related to the fact that I cannot
retrieve the power readings with the "ipmi-sensors" command. With
"ipmi-sensors -W discretereading" I can get a reading for the power
supplies, but it seems to be the nameplate capacity rather than the
current consumption. Same for using ipmitool and ipmiutil rather than
ipmi-sensors.
However, using "ipmi-dcmi --get-system-power-statistics" (part of
freeipmi) does appear to work.
So my question, I guess, is that is there some way to configure the
acct_gather_energy/ipmi plugin to retrieve these DCMI power values
instead of whatever it tries to do now? I looked briefly into the
source code and there is a big bunch of undocumented "EnergyIPMIxxxx"
configuration parameters, but I didn't figure out if any of those
could be used to use DCMI.
(The hardware in question is various HP Proliant servers somewhere
between 1 and 4 years old)