Re: [ovirt-users] ovirt-ha-agent service errors

2017-08-22 Thread Vadim
Hi, All

after upgrade to 4.1.5 no any error messages ;-)

Thanks to TEAM

Пнд 21 Авг 2017 14:29:49 +0300, Vadim  написал:
> Hi, All
> 
> ovirt 4.1.4 fresh install on two hosts with hosted-engine on both.
> gluster volume is replica 3 with two vdsm hosts and one VM under esxi
> 
> working only one vm for HE 
> 
> sometimes have such errors in ha-agent
> 
> # service ovirt-ha-agent status
> Redirecting to /bin/systemctl status  ovirt-ha-agent.service
> ● ovirt-ha-agent.service - oVirt Hosted Engine High Availability Monitoring 
> Agent
>Loaded: loaded (/usr/lib/systemd/system/ovirt-ha-agent.service; enabled; 
> vendor preset: disabled)
>Active: active (running) since Thu 2017-08-17 11:43:44 MSK; 3 days ago
>  Main PID: 2534 (ovirt-ha-agent)
>CGroup: /system.slice/ovirt-ha-agent.service
>└─2534 /usr/bin/python 
> /usr/share/ovirt-hosted-engine-ha/ovirt-ha-agent --no-daemon
> 
> Aug 21 00:29:11 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 00:48:32 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 01:12:05 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 02:12:09 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 03:55:08 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 08:14:05 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 08:25:06 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 08:46:05 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 09:20:06 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> Aug 21 09:21:40 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has 
> bad health status, timeout in 300 seconds
> 
> 
> in agent log:
> 
> MainThread::INFO::2017-08-21 
> 09:21:40,314::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
>  Found OVF_STORE: imgUUID:c7b33006-e7c7-4e39-8d80-2301149ac8f9, 
> volUUID:184f9e45-ab1b-44b8-8a68-238042dba1a7
> MainThread::INFO::2017-08-21 
> 09:21:40,594::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
>  Found OVF_STORE: imgUUID:e7a3c173-9f87-4f6c-a807-63118b9b7cb2, 
> volUUID:92317b81-1bb0-43e6-b029-8931aa5d0af0
> MainThread::INFO::2017-08-21 
> 09:21:40,716::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>  Extracting Engine VM OVF from the OVF_STORE
> MainThread::INFO::2017-08-21 
> 09:21:40,749::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>  OVF_STORE volume path: 
> /rhev/data-center/mnt/glusterSD/localhost:_ovha/7b6badfb-4986-4983-9f62-ae55da33d15e/images/e7a3c173-9f87-4f6c-a807-63118b9b7cb2/92317b81-1bb0-43e6-b029-8931aa5d0af0
> MainThread::INFO::2017-08-21 
> 09:21:40,787::config::431::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
>  Found an OVF for HE VM, trying to convert
> MainThread::INFO::2017-08-21 
> 09:21:40,792::config::436::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
>  Got vm.conf from OVF_STORE
> MainThread::ERROR::2017-08-21 
> 09:21:40,792::states::606::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(consume)
>  Engine VM has bad health status, timeout in 300 seconds
> MainThread::INFO::2017-08-21 
> 09:21:40,792::states::430::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(consume)
>  Engine vm running on localhost
> MainThread::INFO::2017-08-21 
> 09:21:40,796::state_decorators::88::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(check)
>  Timeout cleared while transitioning  'ovirt_hosted_engine_ha.agent.states.EngineUpBadHealth'> ->  'ovirt_hosted_engine_ha.agent.states.EngineUp'>
> MainThread::INFO::2017-08-21 
> 09:21:40,800::brokerlink::111::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(notify)
>  Trying: notify time=1503296500.8 type=state_transition 
> detail=EngineUp

[ovirt-users] ovirt-ha-agent service errors

2017-08-21 Thread Vadim
Hi, All

ovirt 4.1.4 fresh install on two hosts with hosted-engine on both.
gluster volume is replica 3 with two vdsm hosts and one VM under esxi

working only one vm for HE 

sometimes have such errors in ha-agent

# service ovirt-ha-agent status
Redirecting to /bin/systemctl status  ovirt-ha-agent.service
● ovirt-ha-agent.service - oVirt Hosted Engine High Availability Monitoring 
Agent
   Loaded: loaded (/usr/lib/systemd/system/ovirt-ha-agent.service; enabled; 
vendor preset: disabled)
   Active: active (running) since Thu 2017-08-17 11:43:44 MSK; 3 days ago
 Main PID: 2534 (ovirt-ha-agent)
   CGroup: /system.slice/ovirt-ha-agent.service
   └─2534 /usr/bin/python 
/usr/share/ovirt-hosted-engine-ha/ovirt-ha-agent --no-daemon

Aug 21 00:29:11 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 00:48:32 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 01:12:05 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 02:12:09 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 03:55:08 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 08:14:05 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 08:25:06 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 08:46:05 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 09:20:06 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds
Aug 21 09:21:40 kvm03 ovirt-ha-agent[2534]: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM has bad 
health status, timeout in 300 seconds


in agent log:

MainThread::INFO::2017-08-21 
09:21:40,314::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
 Found OVF_STORE: imgUUID:c7b33006-e7c7-4e39-8d80-2301149ac8f9, 
volUUID:184f9e45-ab1b-44b8-8a68-238042dba1a7
MainThread::INFO::2017-08-21 
09:21:40,594::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
 Found OVF_STORE: imgUUID:e7a3c173-9f87-4f6c-a807-63118b9b7cb2, 
volUUID:92317b81-1bb0-43e6-b029-8931aa5d0af0
MainThread::INFO::2017-08-21 
09:21:40,716::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
 Extracting Engine VM OVF from the OVF_STORE
MainThread::INFO::2017-08-21 
09:21:40,749::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
 OVF_STORE volume path: 
/rhev/data-center/mnt/glusterSD/localhost:_ovha/7b6badfb-4986-4983-9f62-ae55da33d15e/images/e7a3c173-9f87-4f6c-a807-63118b9b7cb2/92317b81-1bb0-43e6-b029-8931aa5d0af0
MainThread::INFO::2017-08-21 
09:21:40,787::config::431::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
 Found an OVF for HE VM, trying to convert
MainThread::INFO::2017-08-21 
09:21:40,792::config::436::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
 Got vm.conf from OVF_STORE
MainThread::ERROR::2017-08-21 
09:21:40,792::states::606::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(consume)
 Engine VM has bad health status, timeout in 300 seconds
MainThread::INFO::2017-08-21 
09:21:40,792::states::430::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(consume)
 Engine vm running on localhost
MainThread::INFO::2017-08-21 
09:21:40,796::state_decorators::88::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(check)
 Timeout cleared while transitioning  -> 
MainThread::INFO::2017-08-21 
09:21:40,800::brokerlink::111::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(notify)
 Trying: notify time=1503296500.8 type=state_transition 
detail=EngineUpBadHealth-EngineUp hostname='kvm03'
MainThread::INFO::2017-08-21 
09:21:40,853::brokerlink::121::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(notify)
 Success, was notification of state_transition (EngineUpBadHealth-EngineUp) 
sent? sent
MainThread::INFO::2017-08-21 
09:21:40,853::hosted_engine::604::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_vdsm)
 Initializing VDSM


T