Re: [ovirt-users] Is a dedicated oVirt mgmt VLAN still needed for oVirt host nodes?

2015-09-15 Thread Christopher Cox

On 09/15/2015 03:28 PM, Gianluca Cecchi wrote:


Il 15/Set/2015 05:10, mailto:c...@endlessnow.com>> ha 
scritto:
 >
 > We have an oVirt environment that I inherited.  One is running 3.4.0 and
 > one is running 3.5.0.
 >
 > Seem in both cases the prior administrator stated that a dedicated VLAN
 > was necessary for oVirt mgmt.  That is, we could not run multiple tagged
 > VLANs on a nic for a given oVirt host node.
 >
 > Does any of this make sense?  Is this true?  Is it still true for more
 > contemporary versions of oVirt?
 >
 > My problem is that our nodes are blades and I only have two physical nics
 > per blade.  In our network for redundancy we need to have the two nics
 > have the same VLANs so that things failover ok.  Which means we have to
 > share the oVirt mgmt network on the same wire.  That's the ideal.

Hy, My opinion:

ovirt-engine supports configuration of the ovirtmgmt as a non-vm network,
untagged. And on top of that nic (bond in your possible specific case) configure
all of the VLANs.
If you configure it as a vm network and want to put on the bond other networks
too, then you have to configure both the ovirtmgmt and the other ones as tagged
VLANs, you cannot mix tagged and untagged in this case.
Or at least it was so that I remained in terms of configuration.
Valid for 3.4 and 3.5, i think.
Can anyone of the net part maintainers confirm?


The import oVirt setup is 3.4 based.. so it's important that we can run all the 
vlans on the same nic for that version.

Thanks to any who can verify or confirm.




___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Extension aaa: No search for principal

2015-09-15 Thread Alon Bar-Lev


- Original Message -
> From: "Daniel Helgenberger" 
> To: "Alon Bar-Lev" 
> Cc: Users@ovirt.org
> Sent: Tuesday, September 15, 2015 11:09:45 PM
> Subject: Re: [ovirt-users] Extension aaa: No search for principal
> 
> I think I did find the issue here;
> 
> my domain is named int.corp.com
> 
> I have defined several UPN aliases and our real world users do use the UPN
> @corp.com.
> 
> Using some internal user with UPN int.corp.com the authentication works as
> expected; while my real world users fail.
> 
> I tried to create a new profile for that; but it fails to load off course
> because the domain corp.com cannot be connected.
> 

the user is upn, users should specify their full upn if this non default domain 
suffix.

you do not need a new profile.

in your case it would probably be us...@corp.com for user1.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Is a dedicated oVirt mgmt VLAN still needed for oVirt host nodes?

2015-09-15 Thread Gianluca Cecchi
Il 15/Set/2015 05:10,  ha scritto:
>
> We have an oVirt environment that I inherited.  One is running 3.4.0 and
> one is running 3.5.0.
>
> Seem in both cases the prior administrator stated that a dedicated VLAN
> was necessary for oVirt mgmt.  That is, we could not run multiple tagged
> VLANs on a nic for a given oVirt host node.
>
> Does any of this make sense?  Is this true?  Is it still true for more
> contemporary versions of oVirt?
>
> My problem is that our nodes are blades and I only have two physical nics
> per blade.  In our network for redundancy we need to have the two nics
> have the same VLANs so that things failover ok.  Which means we have to
> share the oVirt mgmt network on the same wire.  That's the ideal.

Hy, My opinion:

ovirt-engine supports configuration of the ovirtmgmt as a non-vm network,
untagged. And on top of that nic (bond in your possible specific case)
configure all of the VLANs.
If you configure it as a vm network and want to put on the bond other
networks too, then you have to configure both the ovirtmgmt and the other
ones as tagged VLANs, you cannot mix tagged and untagged in this case.
Or at least it was so that I remained in terms of configuration.
Valid for 3.4 and 3.5, i think.
Can anyone of the net part maintainers confirm?

Other useful configurations, in term of bonding and related considerations
(mtu, ecc):
http://www.ovirt.org/Bonding_VLAN_Bridge
https://access.redhat.com/solutions/43070

Gianluca
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Some VMs in status "not responding" in oVirt interface

2015-09-15 Thread Daniel Helgenberger


On 15.09.2015 21:31, Markus Stockhausen wrote:
> Hi Christian,
> 
> I think of a package similar like this:
> 
> qemu-debuginfo.x86_64   2:2.1.3-10.fc21
> 
> That allows gdb to show information about backtrace symbols. See
> comment 12 of https://bugzilla.redhat.com/show_bug.cgi?id=1262251
> Makes error search much simpler - especially if qemu hangs.

Markus, thanks for the BZ. I think I do see the same issue. Actually my VM is 
currently the only with a live snapshot and (puppetmaster)
does a lot of I/O.

Christian, maybe this BZ1262251 also applicable?

I'll go ahead and delete the live snapshot. If I see this issue again I will 
submit the trace to your BZ.


> 
> Markus
> 
> **
> 
> Von: Christian Hailer [christ...@hailer.eu]
> 
> Gesendet: Dienstag, 15. September 2015 21:24
> 
> An: Markus Stockhausen; 'Daniel Helgenberger'
> 
> Cc: yd...@redhat.com; users@ovirt.org
> 
> Betreff: AW: [ovirt-users] Some VMs in status "not responding" in oVirt 
> interface
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> Hi Markus,
>  
> gdb is available on CentOS 7, but what do you mean by qemu-debug? I Installed 
> qemu-kvm-tools, maybe this is the pendant for CentOS?
>  
> qemu-kvm-tools.x86_64 : KVM debugging and diagnostics tools
> qemu-kvm-tools-ev.x86_64 : KVM debugging and diagnostics tools
> qemu-kvm-tools-rhev.x86_64 : KVM debugging and diagnostics tools
>  
> Regards, Christian
>  
> 
> 
> 
> 
> Von: Markus Stockhausen [mailto:stockhau...@collogia.de]
> 
> 
> Gesendet: Dienstag, 15. September 2015 20:40
> 
> An: Daniel Helgenberger 
> 
> Cc: Christian Hailer ; yd...@redhat.com; users@ovirt.org
> 
> Betreff: Re: [ovirt-users] Some VMs in status "not responding" in oVirt 
> interface
> 
> 
>  
> Do you have a chance to install qemu-debug? If yes I would try a backtrace.
> gdb -p 
> 
> # bt
> Markus
> 
> 
> Am 15.09.2015 4:15 nachm. schrieb Daniel Helgenberger 
> :
> 
> 
> 
> 
> 
> Hello,
> 
> 
> 
> I do not want to hijack the thread but maybe my issue is related?
> 
> 
> 
> It might have started with ovirt 3.5.3; but I cannot tell for sure.
> 
> 
> 
> For me, one vm (foreman) is affected; the second time in 14 days. I can 
> confirm this as I also loose any network connection to the VM and
> 
> the ability to connect a console.
> 
> Also, the only thing witch 'fixes' the issue is right now 'kill -9  qemu-kvm process>'
> 
> 
> 
> As far as I can tell the VM became unresponsive at around Sep 15 12:30:01; 
> engine logged this at 12:34. Nothing obvious in VDSM logs (see
> 
> attached).
> 
> 
> 
> Below the engine.log part.
> 
> 
> 
> Versions:
> 
> ovirt-engine-3.5.4.2-1.el7.centos.noarch
> 
> 
> 
> vdsm-4.16.26-0.el7.centos
> 
> libvirt-1.2.8-16.el7_1.3
> 
> 
> 
> engine.log (1200 - 1300:
> 
> 2015-09-15 12:03:47,949 INFO  
> [org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
> (DefaultQuartzScheduler_Worker-56) [264d502a] HA
> 
> reservation status for cluster Default is OK
> 
> 2015-09-15 12:08:02,708 INFO  [org.ovirt.engine.core.bll.OvfDataUpdater] 
> (DefaultQuartzScheduler_Worker-89) [2e7bf56e] Attempting to update
> 
> VMs/Templates Ovf.
> 
> 2015-09-15 12:08:02,709 INFO  
> [org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
> (DefaultQuartzScheduler_Worker-89)
> 
> [5e9f4ba6] Running command: ProcessOvfUpdateForStoragePoolCommand internal: 
> true. Entities affected :  ID:
> 
> 0002-0002-0002-0002-0088 Type: l
> 
> 2015-09-15 12:08:02,780 INFO  
> [org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
> (DefaultQuartzScheduler_Worker-89)
> 
> [5e9f4ba6] Lock freed to object EngineLock [exclusiveLocks= key: 
> 0002-0002-0002-0002-0088 value: OVF_UPDATE
> 
> 2015-09-15 12:08:47,997 INFO  
> [org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
> (DefaultQuartzScheduler_Worker-21) [3fc854a2] HA
> 
> reservation status for cluster Default is OK
> 
> 2015-09-15 12:13:06,998 INFO  
> [org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
> (org.ovirt.thread.pool-8-thread-48)
> 
> [50221cdc] START, GetFileStatsVDSCommand( storagePoolId = 
> 0002-0002-0002-0002-0088, ignoreFailoverLimit = false), log id: 
> 1503968
> 
> 2015-09-15 12:13:07,137 INFO  
> [org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
> (org.ovirt.thread.pool-8-thread-48)
> 
> [50221cdc] FINISH, GetFileStatsVDSCommand, return: 
> {pfSense-2.0-RELEASE-i386.iso={status=0, ctime=1432286887.0, size=115709952},
> 
> Fedora-15-i686-Live8
> 
> 2015-09-15 12:13:07,178 INFO  
> [org.ovirt.engine.core.bll.IsoDomainListSyncronizer] 
> (org.ovirt.thread.pool-8-thread-48) [50221cdc] Finished
> 
> automatic refresh process for ISO file type with success, for storage domain 
> id 84dcb2fc-fb63-442f-aa77-3e84dc7d5a72.
> 
> 2015-09-15 12:13:48,043 INFO  
> [org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
> (DefaultQuartzScheduler_Worker-87) [4fa1bb16] HA
> 
> reservation status for cluster Default is OK
> 
> 2015-09-15

Re: [ovirt-users] Extension aaa: No search for principal

2015-09-15 Thread Daniel Helgenberger


On 15.09.2015 19:23, Alon Bar-Lev wrote:
> 
> 
> - Original Message -
>> From: "Daniel Helgenberger" 
>> To: "Alon Bar-Lev" 
>> Cc: Users@ovirt.org
>> Sent: Tuesday, September 15, 2015 2:41:02 PM
>> Subject: Re: [ovirt-users] Extension aaa: No search for principal
>>
>>
>>
>> On 11.09.2015 17:00, Alon Bar-Lev wrote:
>>>
>>>
>>> - Original Message -
 From: "Daniel Helgenberger" 
 To: "Alon Bar-Lev" 
 Cc: Users@ovirt.org
 Sent: Friday, September 11, 2015 5:33:21 PM
 Subject: Re: [ovirt-users] Extension aaa: No search for principal

 sorry, forgot one:

 On 11.09.2015 12:48, Alon Bar-Lev wrote:
> Hi!
>
> Thank you for the information, for some reason the administrator user
> cannot be resolved to userPrincipalName during login, is it specific for
> Administrator or any user?
 This is the default domain administrator account witch exits in any
 forest. But just in case I created a new domain user just for the
 purpose; same outcome
>>>
>> Sorry for the delay, Alon.
>>
>>> I am unsure what actually happens...
>> I might have an idea, at least from the commands you supplied.
>>
>>> Something in global catalog is out of sync.
>>> Usually - you do not add domain administrator to external application...
>>> there is no need to expose it.
>>> By default Administrator does not have "login from network" and "user
>>> principal suffix".
>>>
>>> Also in my environment I do not get result for administrator, but I do get
>>> one for regular user that has upn suffix in user record, you can see these
>>> fields in user and domain manager.
>>>
>>> So please use regular unprivileged users which belongs to "Domain Users"
>>> from now on.
>>>
>>> To test if user has userPrincipalName use the following command (assuming
>>> we search for u...@int.corp.de):
>>>
>>> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
>>> ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w
>>> PASSWORD -b '' '(userPrincipalName=u...@int.corp.de)' cn userPrincipalName
>> It seams with Active Directory (at least) the search base cannot be
>> empty (-b '') but needs to be provided.
>>
>> In my case, the above command fails with:
>>> # search result
>>> search: 2
>>> result: 32 No such object
>>> text: 208D: NameErr: DSID-03100213, problem 2001 (NO_OBJECT), data 0,
>>> best match of:
>>
>> While adding the most basic search path it succeeds:
>>
>> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
>> ldap://int.corp.de:389/ -x -D 'b...@int.corp.de' -w PASSWORD -b
>> 'dc=int,dc=corp,dc=de' '(userPrincipalName=administra...@int.corp.de)'
>> cn userPrincipalName
>>> # search reference
>>> ref:
>>> ldap://ForestDnsZones.int.corp.de/DC=ForestDnsZones,DC=int,DC=corp,DC=de
>>>
>>> # search reference
>>> ref:
>>> ldap://DomainDnsZones.int.corp.de/DC=DomainDnsZones,DC=int,DC=corp,DC=de
>>>
>>> # search reference
>>> ref: ldap://int.corp.de/CN=Configuration,DC=int,DC=corp,DC=de
>>>
>>> # search result
>>> search: 2
>>> result: 0 Success
>>> control: 1.2.840.113556.1.4.319 false DDDSSSDDMM=
>>> pagedresults: cookie=
>>>
>>> # numResponses: 4
>>> # numReferences: 3
> 
> But I asked to query a specific port... the global catalog, port 3268, see my 
> command above.
> 
>>
>> It succeeds with every user I tried.
> 
> what we see is not a success... :(
> I also asked not to use administrator as a reference user, please create a 
> standard non privileged user for these tests, so skip oddness of builtin 
> administrator for now.

Ok, sorry; thought this was for me change as part of the ldap URL.

> 
> 
>> I would set the search base; but i am not sure where to do so.
>>
>>>
>>> This should find the user (return one result), if not, please checkout user
>>> in Users and Domains manager for the domain suffix, maybe it is empty.
>>>
>>> To find user without userPrincipalName such as Administrator use the
>>> following command:
>>>
>>> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
>>> ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w
>>> PASSWORD -b '' '(sAMAccountName=user)' cn userPrincipalName
>>>
>>> For example, the above will work for Administrator, but for kerberos to
>>> work properly user principal name must be defined, so these users will not
>>> work.
>>>
>>> You can dump entire GC and send me a user record if no result so I can
>>> determine what is different from expectations:
>>>
>>> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
>>> ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w
>>> PASSWORD -b '' > /tmp/dump.out
>>
>> If you still require a dump (its even a small one..) please drop a mail.
> 
> I will be happy to receive a complete dump of your gc, please send me 
> privately, so we can progress.
> Please use this exact command just replace qa1.qa.lab.tlv.redhat.com with 
> your dc, b...@int.corp.de with your bind user and PASSWORD with bind user 
> password.

I did; this now works as expected using GC

[ovirt-users] oVirt + gluster + selfhosted + bonding

2015-09-15 Thread Joachim Tingvold

Hi,

First-time user of oVirt, so bear with me.

Trying to get redundant oVirt + gluster set up. Have four hosts;

  gluster1 (CentOS7)
  gluster2 (CentOS7)
  ovirt1 (CentOS7)
  ovirt2 (CentOS7)

Using replica 3 volume with arbiter node (new in 3.7.0). Got that part 
up and running (using ovirt1 as the arbiter node), and it works fine.


Initial goal (before reading up on both gluster and oVirt) was to have 
everything v6-only, but found out quickly enough that we had to scratch 
that plan for now (I see that there are some activity on both gluster 
and oVirt on this, which is nice).


Anyways. We wanted to use the "self hosted engine gluster"-feature 
(which, by the looks of it, is only present in 3.6). We installed 3.6b4 
(3.6.0.1-0.1.20150821.gitc8ddcd8.el7.centos).


I already had the network set up (couldn't find any specifics on this in 
the somewhat lacking oVirt-documentation?), something along these lines;


 * eth0 + eth1, bonded in bond0 (LACP)
 * vlan110 on top of bond0: v6-only for mgmt of host
 * vlan111 on top of bond0: v4 for gluster + ovirt

We then ran the 'hosted-engine --deploy' command, filling out the 
information as best as we could (some of these options seemed to lack 
documentation, or at least we had trouble finding it). The end-result 
was like this[1].


Accepting this, we suddenly found ourselves without connectivity to the 
host. Logged in via KVM, and this[2] was the last part of the log.


All of the interfaces we had before (bond0, vlan110, vlan111) was "wiped 
clean" for it's configuration, and VDSM seems to have taken control on 
that part (however, since the script failed, we seem to have ended up in 
some kind of "limbo mode"). Rebooting didn't help bring things up again, 
and we're currently looking into manually configuring things via VDSM.


Thought I'd post here meanwhile, seeing if we've missed something 
obvious, or if oVirt should've handled this any different?


If relevant, the content of the answer-file referenced in [2] can be 
found here[3].


[1] 
[2] 
[3] 

--
Joachim
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Upgrade problem

2015-09-15 Thread Koen Vanoppen
Super!! Thanks man!! Worked like a charm!

Kind regards,

Koen

On 15 September 2015 at 10:06, Simone Tiraboschi 
wrote:

>
>
> On Mon, Sep 14, 2015 at 8:27 PM, Koen Vanoppen 
> wrote:
>
>> Yes,
>> That will indeed be the issue I think. But... Can you explain to me what
>> steps I need to take? I rather not change my certs...
>>
>
> The issue is that the setup is going to extract the cert to verify its
> compatibility with latest browser but to do that it assumes that it was
> created with our internal CA with the default password.
> So it fails if the user substituted the apache cert with a certificate
> signed by a different CA and protected with a different password.
>
> We'll fix it for the next release but I can suggest you a workaround:
> - Backup the current /etc/pki/ovirt-engine/keys/apache.p12
> - Recreate the apache.p12 with the password 'mypass':
>   openssl pkcs12 -export -out /etc/pki/ovirt-engine/keys/apache.p12 -inkey
> /etc/pki/ovirt-engine/keys/apache.key.nopass -in
> /etc/pki/ovirt-engine/certs/apache.cer
> When prompted for password use 'mypass' without the quotation marks
> - execute engine-setup to upgrade
> - if you prefer, restore your original
> /etc/pki/ovirt-engine/keys/apache.p12 and restart the Apache server.
>
>
>> Kind regards and thanks for your help,
>>
>> Koen
>>
>> On 11 September 2015 at 19:25, Yaniv Kaul  wrote:
>>
>>> On 11/09/15 19:18, Koen Vanoppen wrote:
>>>
>>> Hi everybody,
>>>
>>> I'm trying to upgrade ovirt to the ovirt 3.5.4 from my current version
>>> 3.5.3.
>>> But when I try to run engine-setup I got this:
>>>
>>> [ INFO  ] Checking for product updates...
>>>   Setup has found updates for some packages, do you wish to
>>> update them now? (Yes, No) [Yes]:
>>> [ INFO  ] Checking for an update for Setup...
>>>
>>>   --== ALL IN ONE CONFIGURATION ==--
>>>
>>>
>>>   --== NETWORK CONFIGURATION ==--
>>>
>>>   Setup can automatically configure the firewall on this system.
>>>   Note: automatic configuration of the firewall may overwrite
>>> current settings.
>>>   Do you want Setup to configure the firewall? (Yes, No) [Yes]:
>>> [ INFO  ] firewalld will be configured as firewall manager.
>>>
>>>   --== DATABASE CONFIGURATION ==--
>>>
>>>   The detected DWH database size is 198 MB.
>>>   Setup can backup the existing database. The time and space
>>> required for the database backup depend on its size. This process takes
>>> time, and in some cases (for instance, when the size is few GBs) may take
>>> several hours to complete.
>>>   If you choose to not back up the database, and Setup later
>>> fails for some reason, it will not be able to restore the database and all
>>> DWH data will be lost.
>>>   Would you like to backup the existing database before
>>> upgrading it? (Yes, No) [Yes]:
>>>
>>>   --== OVIRT ENGINE CONFIGURATION ==--
>>>
>>>   Skipping storing options as database already prepared
>>>
>>>   --== PKI CONFIGURATION ==--
>>>
>>> [ ERROR ] Failed to execute stage 'Environment customization': Command
>>> '/bin/openssl' failed to execute
>>> [ INFO  ] Stage: Clean up
>>>   Log file is located at
>>> /var/log/ovirt-engine/setup/ovirt-engine-setup-20150911181604-1gft67.log
>>> [ INFO  ] Generating answer file
>>> '/var/lib/ovirt-engine/setup/answers/20150911181626-setup.conf'
>>> [ INFO  ] Stage: Pre-termination
>>> [ INFO  ] Stage: Termination
>>> [ ERROR ] Execution of setup failed
>>>
>>>
>>> Any idea?
>>>
>>>
>>> Looks like https://bugzilla.redhat.com/show_bug.cgi?id=1260752 .
>>> Y.
>>>
>>>
>>> Kind regards,
>>>
>>> Koen
>>>
>>>
>>> ___
>>> Users mailing 
>>> listUsers@ovirt.orghttp://lists.ovirt.org/mailman/listinfo/users
>>>
>>>
>>>
>>
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/users
>>
>>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Some VMs in status "not responding" in oVirt interface

2015-09-15 Thread Markus Stockhausen
Hi Christian,

I think of a package similar like this:

qemu-debuginfo.x86_64   2:2.1.3-10.fc21

That allows gdb to show information about backtrace symbols. See
comment 12 of https://bugzilla.redhat.com/show_bug.cgi?id=1262251
Makes error search much simpler - especially if qemu hangs.

Markus

**

Von: Christian Hailer [christ...@hailer.eu]

Gesendet: Dienstag, 15. September 2015 21:24

An: Markus Stockhausen; 'Daniel Helgenberger'

Cc: yd...@redhat.com; users@ovirt.org

Betreff: AW: [ovirt-users] Some VMs in status "not responding" in oVirt 
interface










Hi Markus,
 
gdb is available on CentOS 7, but what do you mean by qemu-debug? I Installed 
qemu-kvm-tools, maybe this is the pendant for CentOS?
 
qemu-kvm-tools.x86_64 : KVM debugging and diagnostics tools
qemu-kvm-tools-ev.x86_64 : KVM debugging and diagnostics tools
qemu-kvm-tools-rhev.x86_64 : KVM debugging and diagnostics tools
 
Regards, Christian
 




Von: Markus Stockhausen [mailto:stockhau...@collogia.de]


Gesendet: Dienstag, 15. September 2015 20:40

An: Daniel Helgenberger 

Cc: Christian Hailer ; yd...@redhat.com; users@ovirt.org

Betreff: Re: [ovirt-users] Some VMs in status "not responding" in oVirt 
interface


 
Do you have a chance to install qemu-debug? If yes I would try a backtrace.
gdb -p 

# bt
Markus


Am 15.09.2015 4:15 nachm. schrieb Daniel Helgenberger 
:





Hello,



I do not want to hijack the thread but maybe my issue is related?



It might have started with ovirt 3.5.3; but I cannot tell for sure.



For me, one vm (foreman) is affected; the second time in 14 days. I can confirm 
this as I also loose any network connection to the VM and

the ability to connect a console.

Also, the only thing witch 'fixes' the issue is right now 'kill -9 '



As far as I can tell the VM became unresponsive at around Sep 15 12:30:01; 
engine logged this at 12:34. Nothing obvious in VDSM logs (see

attached).



Below the engine.log part.



Versions:

ovirt-engine-3.5.4.2-1.el7.centos.noarch



vdsm-4.16.26-0.el7.centos

libvirt-1.2.8-16.el7_1.3



engine.log (1200 - 1300:

2015-09-15 12:03:47,949 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-56) [264d502a] HA

reservation status for cluster Default is OK

2015-09-15 12:08:02,708 INFO  [org.ovirt.engine.core.bll.OvfDataUpdater] 
(DefaultQuartzScheduler_Worker-89) [2e7bf56e] Attempting to update

VMs/Templates Ovf.

2015-09-15 12:08:02,709 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)

[5e9f4ba6] Running command: ProcessOvfUpdateForStoragePoolCommand internal: 
true. Entities affected :  ID:

0002-0002-0002-0002-0088 Type: l

2015-09-15 12:08:02,780 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)

[5e9f4ba6] Lock freed to object EngineLock [exclusiveLocks= key: 
0002-0002-0002-0002-0088 value: OVF_UPDATE

2015-09-15 12:08:47,997 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-21) [3fc854a2] HA

reservation status for cluster Default is OK

2015-09-15 12:13:06,998 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)

[50221cdc] START, GetFileStatsVDSCommand( storagePoolId = 
0002-0002-0002-0002-0088, ignoreFailoverLimit = false), log id: 
1503968

2015-09-15 12:13:07,137 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)

[50221cdc] FINISH, GetFileStatsVDSCommand, return: 
{pfSense-2.0-RELEASE-i386.iso={status=0, ctime=1432286887.0, size=115709952},

Fedora-15-i686-Live8

2015-09-15 12:13:07,178 INFO  
[org.ovirt.engine.core.bll.IsoDomainListSyncronizer] 
(org.ovirt.thread.pool-8-thread-48) [50221cdc] Finished

automatic refresh process for ISO file type with success, for storage domain id 
84dcb2fc-fb63-442f-aa77-3e84dc7d5a72.

2015-09-15 12:13:48,043 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-87) [4fa1bb16] HA

reservation status for cluster Default is OK

2015-09-15 12:18:48,088 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-44) [6345e698] HA

reservation status for cluster Default is OK

2015-09-15 12:23:48,137 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-13) HA reservation

status for cluster Default is OK

2015-09-15 12:28:48,183 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-76) [154c91d5] HA

reservation status for cluster Default is OK

2015-09-15 12:33:48,229 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-36) [27c73ac6] HA

reservation status for cluster Default is OK

2015-09-15 12:34:49,432 

Re: [ovirt-users] Some VMs in status "not responding" in oVirt interface

2015-09-15 Thread Christian Hailer
Hi Markus,

 

gdb is available on CentOS 7, but what do you mean by qemu-debug? I Installed 
qemu-kvm-tools, maybe this is the pendant for CentOS?

 

qemu-kvm-tools.x86_64 : KVM debugging and diagnostics tools

qemu-kvm-tools-ev.x86_64 : KVM debugging and diagnostics tools

qemu-kvm-tools-rhev.x86_64 : KVM debugging and diagnostics tools

 

Regards, Christian

 

Von: Markus Stockhausen [mailto:stockhau...@collogia.de] 
Gesendet: Dienstag, 15. September 2015 20:40
An: Daniel Helgenberger 
Cc: Christian Hailer ; yd...@redhat.com; users@ovirt.org
Betreff: Re: [ovirt-users] Some VMs in status "not responding" in oVirt 
interface

 

Do you have a chance to install qemu-debug? If yes I would try a backtrace.

gdb -p 
# bt

Markus

Am 15.09.2015 4:15 nachm. schrieb Daniel Helgenberger 
mailto:daniel.helgenber...@m-box.de> >:

Hello,

I do not want to hijack the thread but maybe my issue is related?

It might have started with ovirt 3.5.3; but I cannot tell for sure.

For me, one vm (foreman) is affected; the second time in 14 days. I can confirm 
this as I also loose any network connection to the VM and
the ability to connect a console.
Also, the only thing witch 'fixes' the issue is right now 'kill -9 '

As far as I can tell the VM became unresponsive at around Sep 15 12:30:01; 
engine logged this at 12:34. Nothing obvious in VDSM logs (see
attached).

Below the engine.log part.

Versions:
ovirt-engine-3.5.4.2-1.el7.centos.noarch

vdsm-4.16.26-0.el7.centos
libvirt-1.2.8-16.el7_1.3

engine.log (1200 - 1300:
2015-09-15 12:03:47,949 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-56) [264d502a] HA
reservation status for cluster Default is OK
2015-09-15 12:08:02,708 INFO  [org.ovirt.engine.core.bll.OvfDataUpdater] 
(DefaultQuartzScheduler_Worker-89) [2e7bf56e] Attempting to update
VMs/Templates Ovf.
2015-09-15 12:08:02,709 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)
[5e9f4ba6] Running command: ProcessOvfUpdateForStoragePoolCommand internal: 
true. Entities affected :  ID:
0002-0002-0002-0002-0088 Type: l
2015-09-15 12:08:02,780 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)
[5e9f4ba6] Lock freed to object EngineLock [exclusiveLocks= key: 
0002-0002-0002-0002-0088 value: OVF_UPDATE
2015-09-15 12:08:47,997 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-21) [3fc854a2] HA
reservation status for cluster Default is OK
2015-09-15 12:13:06,998 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)
[50221cdc] START, GetFileStatsVDSCommand( storagePoolId = 
0002-0002-0002-0002-0088, ignoreFailoverLimit = false), log id: 
1503968
2015-09-15 12:13:07,137 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)
[50221cdc] FINISH, GetFileStatsVDSCommand, return: 
{pfSense-2.0-RELEASE-i386.iso={status=0, ctime=1432286887.0, size=115709952},
Fedora-15-i686-Live8
2015-09-15 12:13:07,178 INFO  
[org.ovirt.engine.core.bll.IsoDomainListSyncronizer] 
(org.ovirt.thread.pool-8-thread-48) [50221cdc] Finished
automatic refresh process for ISO file type with success, for storage domain id 
84dcb2fc-fb63-442f-aa77-3e84dc7d5a72.
2015-09-15 12:13:48,043 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-87) [4fa1bb16] HA
reservation status for cluster Default is OK
2015-09-15 12:18:48,088 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-44) [6345e698] HA
reservation status for cluster Default is OK
2015-09-15 12:23:48,137 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-13) HA reservation
status for cluster Default is OK
2015-09-15 12:28:48,183 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-76) [154c91d5] HA
reservation status for cluster Default is OK
2015-09-15 12:33:48,229 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-36) [27c73ac6] HA
reservation status for cluster Default is OK
2015-09-15 12:34:49,432 INFO  
[org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] 
(DefaultQuartzScheduler_Worker-41) [5f2a4b68] VM
foreman 8b57ff1d-2800-48ad-b267-fd8e9e2f6fb2 moved from Up --> NotResponding
2015-09-15 12:34:49,578 WARN  
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(DefaultQuartzScheduler_Worker-41)
[5f2a4b68] Correlation ID: null, Call Stack: null, Custom Event ID: -1, 
Message: VM foreman is not responding.
2015-09-15 12:38:48,273 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-10) [7a800766] HA
reservation status for cluster Default i

Re: [ovirt-users] Some VMs in status "not responding" in oVirt interface

2015-09-15 Thread Markus Stockhausen
Do you have a chance to install qemu-debug? If yes I would try a backtrace.

gdb -p 
# bt

Markus

Am 15.09.2015 4:15 nachm. schrieb Daniel Helgenberger 
:
Hello,

I do not want to hijack the thread but maybe my issue is related?

It might have started with ovirt 3.5.3; but I cannot tell for sure.

For me, one vm (foreman) is affected; the second time in 14 days. I can confirm 
this as I also loose any network connection to the VM and
the ability to connect a console.
Also, the only thing witch 'fixes' the issue is right now 'kill -9 '

As far as I can tell the VM became unresponsive at around Sep 15 12:30:01; 
engine logged this at 12:34. Nothing obvious in VDSM logs (see
attached).

Below the engine.log part.

Versions:
ovirt-engine-3.5.4.2-1.el7.centos.noarch

vdsm-4.16.26-0.el7.centos
libvirt-1.2.8-16.el7_1.3

engine.log (1200 - 1300:
2015-09-15 12:03:47,949 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-56) [264d502a] HA
reservation status for cluster Default is OK
2015-09-15 12:08:02,708 INFO  [org.ovirt.engine.core.bll.OvfDataUpdater] 
(DefaultQuartzScheduler_Worker-89) [2e7bf56e] Attempting to update
VMs/Templates Ovf.
2015-09-15 12:08:02,709 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)
[5e9f4ba6] Running command: ProcessOvfUpdateForStoragePoolCommand internal: 
true. Entities affected :  ID:
0002-0002-0002-0002-0088 Type: l
2015-09-15 12:08:02,780 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)
[5e9f4ba6] Lock freed to object EngineLock [exclusiveLocks= key: 
0002-0002-0002-0002-0088 value: OVF_UPDATE
2015-09-15 12:08:47,997 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-21) [3fc854a2] HA
reservation status for cluster Default is OK
2015-09-15 12:13:06,998 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)
[50221cdc] START, GetFileStatsVDSCommand( storagePoolId = 
0002-0002-0002-0002-0088, ignoreFailoverLimit = false), log id: 
1503968
2015-09-15 12:13:07,137 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)
[50221cdc] FINISH, GetFileStatsVDSCommand, return: 
{pfSense-2.0-RELEASE-i386.iso={status=0, ctime=1432286887.0, size=115709952},
Fedora-15-i686-Live8
2015-09-15 12:13:07,178 INFO  
[org.ovirt.engine.core.bll.IsoDomainListSyncronizer] 
(org.ovirt.thread.pool-8-thread-48) [50221cdc] Finished
automatic refresh process for ISO file type with success, for storage domain id 
84dcb2fc-fb63-442f-aa77-3e84dc7d5a72.
2015-09-15 12:13:48,043 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-87) [4fa1bb16] HA
reservation status for cluster Default is OK
2015-09-15 12:18:48,088 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-44) [6345e698] HA
reservation status for cluster Default is OK
2015-09-15 12:23:48,137 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-13) HA reservation
status for cluster Default is OK
2015-09-15 12:28:48,183 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-76) [154c91d5] HA
reservation status for cluster Default is OK
2015-09-15 12:33:48,229 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-36) [27c73ac6] HA
reservation status for cluster Default is OK
2015-09-15 12:34:49,432 INFO  
[org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] 
(DefaultQuartzScheduler_Worker-41) [5f2a4b68] VM
foreman 8b57ff1d-2800-48ad-b267-fd8e9e2f6fb2 moved from Up --> NotResponding
2015-09-15 12:34:49,578 WARN  
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(DefaultQuartzScheduler_Worker-41)
[5f2a4b68] Correlation ID: null, Call Stack: null, Custom Event ID: -1, 
Message: VM foreman is not responding.
2015-09-15 12:38:48,273 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-10) [7a800766] HA
reservation status for cluster Default is OK
2015-09-15 12:43:48,320 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-42) [440f1c40] HA
reservation status for cluster Default is OK
2015-09-15 12:48:48,366 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-70) HA reservation
status for cluster Default is OK
2015-09-15 12:53:48,412 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-12) [50221cdc] HA
reservation status for cluster Default is OK
2015-09-15 12:58:48,459 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-3

Re: [ovirt-users] Extension aaa: No search for principal

2015-09-15 Thread Alon Bar-Lev


- Original Message -
> From: "Daniel Helgenberger" 
> To: "Alon Bar-Lev" 
> Cc: Users@ovirt.org
> Sent: Tuesday, September 15, 2015 2:41:02 PM
> Subject: Re: [ovirt-users] Extension aaa: No search for principal
> 
> 
> 
> On 11.09.2015 17:00, Alon Bar-Lev wrote:
> >
> >
> > - Original Message -
> >> From: "Daniel Helgenberger" 
> >> To: "Alon Bar-Lev" 
> >> Cc: Users@ovirt.org
> >> Sent: Friday, September 11, 2015 5:33:21 PM
> >> Subject: Re: [ovirt-users] Extension aaa: No search for principal
> >>
> >> sorry, forgot one:
> >>
> >> On 11.09.2015 12:48, Alon Bar-Lev wrote:
> >>> Hi!
> >>>
> >>> Thank you for the information, for some reason the administrator user
> >>> cannot be resolved to userPrincipalName during login, is it specific for
> >>> Administrator or any user?
> >> This is the default domain administrator account witch exits in any
> >> forest. But just in case I created a new domain user just for the
> >> purpose; same outcome
> >
> Sorry for the delay, Alon.
> 
> > I am unsure what actually happens...
> I might have an idea, at least from the commands you supplied.
> 
> > Something in global catalog is out of sync.
> > Usually - you do not add domain administrator to external application...
> > there is no need to expose it.
> > By default Administrator does not have "login from network" and "user
> > principal suffix".
> >
> > Also in my environment I do not get result for administrator, but I do get
> > one for regular user that has upn suffix in user record, you can see these
> > fields in user and domain manager.
> >
> > So please use regular unprivileged users which belongs to "Domain Users"
> > from now on.
> >
> > To test if user has userPrincipalName use the following command (assuming
> > we search for u...@int.corp.de):
> >
> > $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
> > ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w
> > PASSWORD -b '' '(userPrincipalName=u...@int.corp.de)' cn userPrincipalName
> It seams with Active Directory (at least) the search base cannot be
> empty (-b '') but needs to be provided.
> 
> In my case, the above command fails with:
> > # search result
> > search: 2
> > result: 32 No such object
> > text: 208D: NameErr: DSID-03100213, problem 2001 (NO_OBJECT), data 0,
> > best match of:
> 
> While adding the most basic search path it succeeds:
> 
> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
> ldap://int.corp.de:389/ -x -D 'b...@int.corp.de' -w PASSWORD -b
> 'dc=int,dc=corp,dc=de' '(userPrincipalName=administra...@int.corp.de)'
> cn userPrincipalName
> > # search reference
> > ref:
> > ldap://ForestDnsZones.int.corp.de/DC=ForestDnsZones,DC=int,DC=corp,DC=de
> >
> > # search reference
> > ref:
> > ldap://DomainDnsZones.int.corp.de/DC=DomainDnsZones,DC=int,DC=corp,DC=de
> >
> > # search reference
> > ref: ldap://int.corp.de/CN=Configuration,DC=int,DC=corp,DC=de
> >
> > # search result
> > search: 2
> > result: 0 Success
> > control: 1.2.840.113556.1.4.319 false DDDSSSDDMM=
> > pagedresults: cookie=
> >
> > # numResponses: 4
> > # numReferences: 3

But I asked to query a specific port... the global catalog, port 3268, see my 
command above.

> 
> It succeeds with every user I tried.

what we see is not a success... :(
I also asked not to use administrator as a reference user, please create a 
standard non privileged user for these tests, so skip oddness of builtin 
administrator for now.


> I would set the search base; but i am not sure where to do so.
> 
> >
> > This should find the user (return one result), if not, please checkout user
> > in Users and Domains manager for the domain suffix, maybe it is empty.
> >
> > To find user without userPrincipalName such as Administrator use the
> > following command:
> >
> > $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
> > ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w
> > PASSWORD -b '' '(sAMAccountName=user)' cn userPrincipalName
> >
> > For example, the above will work for Administrator, but for kerberos to
> > work properly user principal name must be defined, so these users will not
> > work.
> >
> > You can dump entire GC and send me a user record if no result so I can
> > determine what is different from expectations:
> >
> > $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H
> > ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w
> > PASSWORD -b '' > /tmp/dump.out
> 
> If you still require a dump (its even a small one..) please drop a mail.

I will be happy to receive a complete dump of your gc, please send me 
privately, so we can progress.
Please use this exact command just replace qa1.qa.lab.tlv.redhat.com with your 
dc, b...@int.corp.de with your bind user and PASSWORD with bind user password.

Thanks!
 
> >
> > Regards,
> > Alon
> >
> 
> --
> Daniel Helgenberger
> m box bewegtbild GmbH
> 
> P: +49/30/2408781-22
> F: +49/30/2408781-10
> 
> ACKERSTR. 19
> D-10115 BERLIN
> 
> 
> www.m-box.de  www.monkeymen.

Re: [ovirt-users] Some VMs in status "not responding" in oVirt interface

2015-09-15 Thread Christian Hailer
Hello Daniel,

this is exactly what I experienced in the past weeks. 
I switched the NIC and the HDD from e1000 and IDE to VirtIO NIC and VirtIO disk 
for all Windows Server 2012 R2 VMs, they are running for 2 days now without 
problems.
Additionally 2 of my CentOS VMs stopped responding today, and this was a bit 
scary: the VM itself was running and I could connect to the console (I 
intentionally logged in as root yesterday and didn't log out, so I could have a 
look today what was the problem) . Network was down (pinging anything was 
unsuccessful), and every action concerning reading from or writing to the 
harddisk immediately hung. So I reset the VM (oVirt wasn't able to shut it 
down, so I had to kill the process) and had a look at /var/log/messages after 
booting up.
The last entry was this night at 03:01, cron.daily. Nothing else until I 
rebooted this morning at 08:30.

The VM is and always was configured with both VirtIO NIC and HDD.

So what could be the reason that the VM couldn't access the harddisk anymore?

Best regards, Christian 

-Ursprüngliche Nachricht-
Von: Daniel Helgenberger [mailto:daniel.helgenber...@m-box.de] 
Gesendet: Dienstag, 15. September 2015 16:15
An: Christian Hailer ; users@ovirt.org; yd...@redhat.com
Betreff: Re: [ovirt-users] Some VMs in status "not responding" in oVirt 
interface

Hello,

I do not want to hijack the thread but maybe my issue is related?

It might have started with ovirt 3.5.3; but I cannot tell for sure.

For me, one vm (foreman) is affected; the second time in 14 days. I can confirm 
this as I also loose any network connection to the VM and the ability to 
connect a console.
Also, the only thing witch 'fixes' the issue is right now 'kill -9 '

As far as I can tell the VM became unresponsive at around Sep 15 12:30:01; 
engine logged this at 12:34. Nothing obvious in VDSM logs (see attached).

Below the engine.log part.

Versions:
ovirt-engine-3.5.4.2-1.el7.centos.noarch

vdsm-4.16.26-0.el7.centos
libvirt-1.2.8-16.el7_1.3

engine.log (1200 - 1300:
2015-09-15 12:03:47,949 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-56) [264d502a] HA reservation status for cluster 
Default is OK
2015-09-15 12:08:02,708 INFO  [org.ovirt.engine.core.bll.OvfDataUpdater] 
(DefaultQuartzScheduler_Worker-89) [2e7bf56e] Attempting to update 
VMs/Templates Ovf.
2015-09-15 12:08:02,709 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)
[5e9f4ba6] Running command: ProcessOvfUpdateForStoragePoolCommand internal: 
true. Entities affected :  ID:
0002-0002-0002-0002-0088 Type: l
2015-09-15 12:08:02,780 INFO  
[org.ovirt.engine.core.bll.ProcessOvfUpdateForStoragePoolCommand] 
(DefaultQuartzScheduler_Worker-89)
[5e9f4ba6] Lock freed to object EngineLock [exclusiveLocks= key: 
0002-0002-0002-0002-0088 value: OVF_UPDATE
2015-09-15 12:08:47,997 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-21) [3fc854a2] HA reservation status for cluster 
Default is OK
2015-09-15 12:13:06,998 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)
[50221cdc] START, GetFileStatsVDSCommand( storagePoolId = 
0002-0002-0002-0002-0088, ignoreFailoverLimit = false), log id: 
1503968
2015-09-15 12:13:07,137 INFO  
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetFileStatsVDSCommand] 
(org.ovirt.thread.pool-8-thread-48)
[50221cdc] FINISH, GetFileStatsVDSCommand, return: 
{pfSense-2.0-RELEASE-i386.iso={status=0, ctime=1432286887.0, size=115709952},
Fedora-15-i686-Live8
2015-09-15 12:13:07,178 INFO  
[org.ovirt.engine.core.bll.IsoDomainListSyncronizer] 
(org.ovirt.thread.pool-8-thread-48) [50221cdc] Finished automatic refresh 
process for ISO file type with success, for storage domain id 
84dcb2fc-fb63-442f-aa77-3e84dc7d5a72.
2015-09-15 12:13:48,043 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-87) [4fa1bb16] HA reservation status for cluster 
Default is OK
2015-09-15 12:18:48,088 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-44) [6345e698] HA reservation status for cluster 
Default is OK
2015-09-15 12:23:48,137 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-13) HA reservation status for cluster Default is 
OK
2015-09-15 12:28:48,183 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-76) [154c91d5] HA reservation status for cluster 
Default is OK
2015-09-15 12:33:48,229 INFO  
[org.ovirt.engine.core.bll.scheduling.HaReservationHandling] 
(DefaultQuartzScheduler_Worker-36) [27c73ac6] HA reservation status for cluster 
Default is OK
2015-09-15 12:34:49,432 INFO  
[org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] 
(DefaultQuartzScheduler_Worker-41) 

Re: [ovirt-users] Is a dedicated oVirt mgmt VLAN still needed for oVirt host nodes?

2015-09-15 Thread Patrick Russell
It will all work using different VLAN tags on the same physical nics. At least 
in 3.5.x that’s the case, we don’t have a 3.4.x install so I can’t speak to 
that. You’ll want to watch your NFS and migration traffic though. Make sure you 
don’t overrun the bandwidth for management traffic or you’re going to have a 
bad day.

-Patrick
 
> On Sep 14, 2015, at 10:08 AM, c...@endlessnow.com wrote:
> 
> We have an oVirt environment that I inherited.  One is running 3.4.0 and
> one is running 3.5.0.
> 
> Seem in both cases the prior administrator stated that a dedicated VLAN
> was necessary for oVirt mgmt.  That is, we could not run multiple tagged
> VLANs on a nic for a given oVirt host node.
> 
> Does any of this make sense?  Is this true?  Is it still true for more
> contemporary versions of oVirt?
> 
> My problem is that our nodes are blades and I only have two physical nics
> per blade.  In our network for redundancy we need to have the two nics
> have the same VLANs so that things failover ok.  Which means we have to
> share the oVirt mgmt network on the same wire.  That's the ideal.
> 
> Currently we have a whole nic on the blade just for oVirt management.  Is
> this a requirement?
> 
> 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] moving storage and importing vms issue

2015-09-15 Thread Jiří Sléžka

Hello,

any hope and/or hint for me?

Before I moved storage I (partly live) migrated disks to this storage 
(we have about 5 LUNS). Probably there could be some issues. Just a 
guess - could it mean that some disks would stay on original storage as 
orphaned images?


It would be useful to have some low-level util to display all images 
(also orphaned) and their properties and correlations with vms.


Kind regards,

Jiri



Dne 10.9.2015 v 16:31 Eli Mesika napsal(a):

Adding Allon M

- Original Message -

From: "Jiří Sléžka" 
To: "Eli Mesika" 
Cc: users@ovirt.org, "Omer Frenkel" 
Sent: Thursday, September 10, 2015 4:07:48 PM
Subject: Re: [ovirt-users] moving storage and importing vms issue

Hello,


- Original Message -

From: "Jiří Sléžka" 
To: emes...@redhat.com
Cc: users@ovirt.org
Sent: Thursday, September 10, 2015 1:50:14 PM
Subject: Re: [ovirt-users] moving storage and importing vms issue

Hello,


- Original Message -

From: "Jiří Sléžka" 
To: users@ovirt.org
Sent: Thursday, September 10, 2015 1:30:29 AM
Subject: [ovirt-users] moving storage and importing vms issue

Hello,

I am working on some consolidation of our RHEV/oVirt servers and I moved
one storage to new oVirt datacenter (put it into maintenance, detached
it from old and imported into new datacenter) which worked pretty good.

Then I tried to import all the vms which worked also great except for
three of them.

These vms are stucked in VM Import sub-tab and are quietly failing
import attempts (I can only see failed task "Importing VM clavius-winxp
from configuration to Cluster CIT-oVirt" but no related event and/or
explanation)

There is only one host in this datacenter/cluster which is SPM. I can't
find anything interesting in vdsm.log (short span of import time is in
attachment).


Can you please attach also engine.log ?


sure

well, here I can see an error... it looks like some db and/or snapshot
issue.


Yes, seems as ImportVmFromConfigurationCommand tries to add snapshots with
the empty GUID (000..0)
This cause violation of the primary key of the snapshots table
CCing Omer F on that



well and it looks like I lost also one secondary disk from one correctly
imported vm.

is there a way to show all images on some storage domain?

I found that my storage is this

[root@ovirt04 ~]# vdsClient -s 0 getStorageDomainInfo
088e7ed9-84c7-4fbd-a570-f37fa986a772
uuid = 088e7ed9-84c7-4fbd-a570-f37fa986a772
vguuid = MkMpr6-o9c1-LBUq-rZ0E-ZRSg-X31T-2aU1PV
state = OK
version = 3
role = Master
type = FCP
class = Data
pool = ['0002-0002-0002-0002-02b9']
name = oVirt-SlowStorage

but I have no luck with finding how to display all images on it.


try

# vdsClient -s getImagesList "088e7ed9-84c7-4fbd-a570-f37fa986a772"


yes, it works :-)

now I have list of imgUUIDs on this storage. When I compare it against
Disks tab in oVirt manager a see 5 images that are not visible in manager.

346ad5af-9db8-46eb-9a45-172ce3213496
45493042-67f5-4dcd-8dae-5b2c213aa95a
fb8f3165-5976-4094-9d37-ea0b09124547
e15288bc-30ec-4a77-837b-bdc7de37a08b
be5c56de-6a22-4d1a-8579-f0f5d501d90c

now I tried to find anything about these images

[root@ovirt04 ~]# vdsClient -s 0 getVolumesList
"088e7ed9-84c7-4fbd-a570-f37fa986a772"
"0002-0002-0002-0002-02b9"
"346ad5af-9db8-46eb-9a45-172ce3213496"
eeca0e49-ba6d-4b4b-9eb4-731b90b48091 : Exported by virt-v2v.
da00feb8-991d-4b91-b424-6931daf00c83 : Parent is
eeca0e49-ba6d-4b4b-9eb4-731b90b48091



[root@ovirt04 ~]# vdsClient -s 0 getVolumesList
"088e7ed9-84c7-4fbd-a570-f37fa986a772"
"0002-0002-0002-0002-02b9"
"45493042-67f5-4dcd-8dae-5b2c213aa95a"

d2916b5d-50e4-482c-aa6b-e26d2c78ef46 : Exported by virt-v2v.



[root@ovirt04 ~]# vdsClient -s 0 getVolumesList
"088e7ed9-84c7-4fbd-a570-f37fa986a772"
"0002-0002-0002-0002-02b9"
"fb8f3165-5976-4094-9d37-ea0b09124547"
cc83caa4-e366-4fd6-94b7-d16089aa29d6 : Parent is
53c5003d-80de-4dfd-b5d8-50537a3a54d6

53c5003d-80de-4dfd-b5d8-50537a3a54d6 : imported by virt-v2v.



[root@ovirt04 ~]# vdsClient -s 0 getVolumesList
"088e7ed9-84c7-4fbd-a570-f37fa986a772"
"0002-0002-0002-0002-02b9"
"e15288bc-30ec-4a77-837b-bdc7de37a08b"

2f2c2a1c-6dcc-436c-962c-00e4e074a39a :
{"DiskAlias":"polymatheia1.slu.cz_Disk1","DiskDescription":""}.



[root@ovirt04 ~]# vdsClient -s 0 getVolumesList
"088e7ed9-84c7-4fbd-a570-f37fa986a772"
"0002-0002-0002-0002-02b9"
"be5c56de-6a22-4d1a-8579-f0f5d501d90c"

0681822f-3ac8-473b-95ce-380f8ab4de06 :



when I look on last case

[root@ovirt04 ~]# vdsClient -s 0 getVolumeInfo
"088e7ed9-84c7-4fbd-a570-f37fa986a772"
"0002-0002-0002-0002-02b9"
"be5c56de-6a22-4d1a-8579-f0f5d501d90c"
"0681822f-3ac8-473b-95ce-380f8ab4de06"
status = OK
domain = 088e7ed9-84c7-4fbd-a570-f37fa986a772
capacity = 322122547200
voltype = LEAF
description =
parent = 0

Re: [ovirt-users] vdsm high mem usage

2015-09-15 Thread Michael Kleinpaste
Thanks all.  I restarted it and that fixed the issue temporarily freeing up
memory but it continued the leak process.  I updated the vdsm package and
that fixed it overall.

On Tue, Sep 15, 2015 at 12:57 AM Michal Skrivanek <
michal.skriva...@redhat.com> wrote:

>
> On Sep 10, 2015, at 20:35 , Michael Kleinpaste <
> michael.kleinpa...@sharperlending.com> wrote:
>
> > Hi everybody.
> >
> > So I ran into that high mem usage thing. The problem I have with
> patching is that this is a live system so I can't do it mid day.  Can
> anybody tell me if it is possible to just restart the vdsm service or does
> the host have to be in "maintenance mode" before restarting it?  It is
> using gluster storage, if that makes a difference as well.
>
> Hi,
> you can restart vdsm without any effect on running VMs. Other than short
> interruption of communication between engine and host. It can cause a short
> CPU spike on startup, so do that with caution when you run tend or hundreds
> of VMs on a same overloaded host.
> Obviously, while vdsm is not running, the system is a bit more vulnerable
> to failures, but as long as you don't do that in the middle of a migration
> or a power failure you're good:)
>
> Thanks,
> michal
> >
> > Thanks,
> > --
> > Michael Kleinpaste
> > Senior Systems Administrator
> > SharperLending, LLC.
> > www.SharperLending.com
> > michael.kleinpa...@sharperlending.com
> > (509) 324-1230   Fax: (509) 324-1234
> > ___
> > Users mailing list
> > Users@ovirt.org
> > http://lists.ovirt.org/mailman/listinfo/users
>
> --
*Michael Kleinpaste*
Senior Systems Administrator
SharperLending, LLC.
www.SharperLending.com
michael.kleinpa...@sharperlending.com
(509) 324-1230   Fax: (509) 324-1234
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] VLAN with one NIC

2015-09-15 Thread gregor
Thanks for the information's. I attached a second NIC to solve this,
then it was easy to configure ovirt to use the tagged and untagged network.

Regards
gregor

On 2015-09-06 08:31, Genadi Chereshnya wrote:
> Hi,
> Indeed at this stage it is not supported to have VM and VLAN networks on the 
> same physical interface.
> I know it is supposed to be changed in the future (if I understand correctly 
> it is already supported on the VDSM side).
> 
> For now you can have 2 VLAN networks on the same interface ( so if it works 
> for your specific scenario you can tag the management network)
> Or you can configure the Management network to be non-VM as was proposed 
> earlier and then you can add VLAN networks to the same interface (for such 
> scenario you can't have management network to be configured on your VMs)
> Of course if you can have or add additional NICs to you host device, and then 
> you can configure Management network on one of the interfaces and VLAN on 
> another.
> 
> Regards,
>Genadi
> 
> - Original Message -
> From: "Chris Liebman" 
> To: "gregor" 
> Cc: "users" 
> Sent: Friday, September 4, 2015 7:58:43 PM
> Subject: Re: [ovirt-users] VLAN with one NIC
> 
> 
> 
> If your sharing a physical interface with both tagged and untagged traffic 
> they don't allow VM's on the untagged link. I believe this is because some 
> earlier versions of the bridging code in the kernel would allow pass the 
> tagged traffic to VM's located on the untagged interface. That being a 
> security issue. 
> 
> 
> On Fri, Sep 4, 2015 at 9:47 AM, gregor < gregor_fo...@catrix.at > wrote: 
> 
> 
> Thanks, now I can use the eth0 to connect to the web-interface and the 
> logical vlan network to use in a VM but I can't use the untagged VLAN1 
> in the VM. When I add a new logical network without VLAN tagging the 
> web-interface returns "Cannot have more than one non-VLAN network on one 
> interface." 
> 
> When I create a logical network with taggged ID 1 I can't connect from a 
> VM to my VLAN1. 
> 
> So I have to find a way to created the following networks: 
> - ovirtmgmt: without tagged vlan for managing -> works 
> - vlan1: without tagged vlan for the VM's to connect to my default 
> network -> currently no solution 
> - vlan10: tagged vlan with id 10 -> works 
> 
> cheers 
> gregor 
> 
> On 2015-09-04 18:05, Chris Liebman wrote: 
>> You have to edit the ovirtgmt network and un-check the "VM Network" box: 
>>
>> Inline image 1 
>>
>> On Fri, Sep 4, 2015 at 8:47 AM, gregor < gregor_fo...@catrix.at 
>> > wrote: 
>>
>> Hi, 
>>
>> is it possible to use different VLAN with one NIC in ovirt? 
>>
>> I can not add a logical network configured as VLAN to my ovirtmgmt 
>> interface, I get "Cannot have a non-VLAN VM network and VLAN-tagged 
>> networks on one interface." 
>>
>> The setup for the Port is a trunk of different tagged VLAN's and the 
>> default VLAN1 is untagged. Normally this work under centos where I give 
>> the nic eth0 an IP and create a nic eth0.10 for the VLAN with ID 10 and 
>> set an IP for it. 
>>
>> cheers 
>> gregor 
>>
>> ___ 
>> Users mailing list 
>> Users@ovirt.org  
>> http://lists.ovirt.org/mailman/listinfo/users 
>>
>>
> 
> 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
> 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Python: Clone snapshot into VM

2015-09-15 Thread gregor
Hi,

I write currently a little backup tool in Python which use the following
workflow:
- create a snapshot -> works
- clone snapshot into VM -> help needed
- delete the snapshot -> works
- export VM to NFS share -> works
- delete cloned VM -> TODO

Is it possible to clone a snapshot into a VM like from the web-interface?

The above workflow is a little bit resource expensive but it will when
it is finished make Online-Full-backups of VM's.

cheers
gregor
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Disappearing vm.conf

2015-09-15 Thread noc
On 15-9-2015 15:26, Simone Tiraboschi wrote:
>
>
> On Tue, Sep 15, 2015 at 2:31 PM, noc  > wrote:
>
> On 15-9-2015 9:31, Simone Tiraboschi wrote:
>>
>>
>> Yes, can you please attach the logs
>> from  /var/log/ovirt-hosted-engine-ha/ ?
> Attached.
>
>
>
> Now the agent correctly reads the VM configuration file from the
> shared domain so no issue there but then it exits cause VDSM is not up:
>
> MainThread::INFO::2015-09-14
> 20:48:58,007::hosted_engine::667::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
> Reloading vm.conf from the shared storage domain
> MainThread::INFO::2015-09-14
> 20:48:58,163::hosted_engine::460::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(start_monitoring)
> Current state EngineUp (score: 3400)
> MainThread::INFO::2015-09-14
> 20:49:08,201::hosted_engine::610::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_vdsm)
> Initializing VDSM
> MainThread::INFO::2015-09-14
> 20:49:08,317::hosted_engine::655::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
> Connecting the storage
> MainThread::INFO::2015-09-14
> 20:49:08,318::storage_server::110::ovirt_hosted_engine_ha.lib.storage_server.StorageServer::(connect_storage_server)
> Connecting storage server
> MainThread::INFO::2015-09-14
> 20:49:08,318::storage_server::135::ovirt_hosted_engine_ha.lib.storage_server.StorageServer::(connect_storage_server)
> Connecting storage server
> MainThread::INFO::2015-09-14
> 20:49:08,349::hosted_engine::659::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
> Preparing images
> MainThread::INFO::2015-09-14
> 20:49:08,349::image::61::ovirt_hosted_engine_ha.lib.image.Image::(prepare_images)
> Preparing images
> MainThread::INFO::2015-09-14
> 20:49:08,787::hosted_engine::667::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
> Reloading vm.conf from the shared storage domain
> MainThread::INFO::2015-09-14
> 20:49:08,941::hosted_engine::460::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(start_monitoring)
> Current state EngineUp (score: 3400)
> MainThread::INFO::2015-09-14
> 20:49:15,104::agent::143::ovirt_hosted_engine_ha.agent.agent.Agent::(run)
> Agent shutting down
> MainThread::INFO::2015-09-14
> 21:06:07,601::agent::78::ovirt_hosted_engine_ha.agent.agent.Agent::(run)
> ovirt-hosted-engine-ha agent 1.3.0-0.0.master.20150909150500 started
> MainThread::INFO::2015-09-14
> 21:06:07,632::hosted_engine::246::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_get_hostname)
> Found certificate common name: lichthuis.puzzle-it.nu
> 
> MainThread::INFO::2015-09-14
> 21:06:07,633::hosted_engine::610::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_vdsm)
> Initializing VDSM
> MainThread::ERROR::2015-09-14
> 21:06:18,209::hosted_engine::418::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(start_monitoring)
> Service vdsmd is not running and the admin is responsible for starting
> it. Shutting down.
> MainThread::INFO::2015-09-14
> 21:06:18,210::agent::143::ovirt_hosted_engine_ha.agent.agent.Agent::(run)
> Agent shutting down
>
> Martin, any hint there?
>
> Previously you had an issue there:
> MainThread::ERROR::2015-09-14
> 17:34:19,972::agent::201::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
> Error: '[Errno 13] Permission denied:
> '/var/run/ovirt-hosted-engine-ha/vm.conf'' - trying to restart agent
>
> Did you manually created /var/run/ovirt-hosted-engine-ha/vm.conf being
> readable only by root or something similar? ovirt-ha-agent is running
> as vdsm user and it should be able to refresh
>  /var/run/ovirt-hosted-engine-ha/vm.conf  with the master copy on the
> shared storage.
>
Yes, from the log of setup and forgot to chmod to 36:36

> And previously than that it was failing with:
> MainThread::ERROR::2015-09-12
> 13:53:23,727::agent::201::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
> Error: ''StorageServer' object has no attribute 'storageType'' -
> trying to restart agent
> cause you were using ovirt-hosted-engine-ha agent
> 1.3.0-0.0.master.20150819082341 which was unfortunately bugged on
> that. ovirt-hosted-engine-ha agent 1.3.0-0.0.master.20150909150500 you
> are using now should be OK.
>
>
>
Trying a yum upgrade barfs with a depency problem trying to update 2 or
3 ovirt related packages, one of them ovirt-hosted-engine, that was
yesterday, maybe today it will work. I'll check

So what is the sanlock error about?

Joop

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Disappearing vm.conf

2015-09-15 Thread Simone Tiraboschi
On Mon, Sep 14, 2015 at 8:45 AM, noc  wrote:
>
> Next morning the hosted-engine didn't start. Running 'hosted-engine
> --vm-status' tells me I need to deploy first.


This one is indeed an issue: it shouldn't complain there or at least it
should complain that way.
I'm opening a bug on that:
https://bugzilla.redhat.com/1263295
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Disappearing vm.conf

2015-09-15 Thread Simone Tiraboschi
On Tue, Sep 15, 2015 at 2:31 PM, noc  wrote:

> On 15-9-2015 9:31, Simone Tiraboschi wrote:
>
>
>
> Yes, can you please attach the logs from  /var/log/ovirt-hosted-engine-ha/
> ?
>
> Attached.
>


Now the agent correctly reads the VM configuration file from the shared
domain so no issue there but then it exits cause VDSM is not up:

MainThread::INFO::2015-09-14
20:48:58,007::hosted_engine::667::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
Reloading vm.conf from the shared storage domain
MainThread::INFO::2015-09-14
20:48:58,163::hosted_engine::460::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(start_monitoring)
Current state EngineUp (score: 3400)
MainThread::INFO::2015-09-14
20:49:08,201::hosted_engine::610::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_vdsm)
Initializing VDSM
MainThread::INFO::2015-09-14
20:49:08,317::hosted_engine::655::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
Connecting the storage
MainThread::INFO::2015-09-14
20:49:08,318::storage_server::110::ovirt_hosted_engine_ha.lib.storage_server.StorageServer::(connect_storage_server)
Connecting storage server
MainThread::INFO::2015-09-14
20:49:08,318::storage_server::135::ovirt_hosted_engine_ha.lib.storage_server.StorageServer::(connect_storage_server)
Connecting storage server
MainThread::INFO::2015-09-14
20:49:08,349::hosted_engine::659::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
Preparing images
MainThread::INFO::2015-09-14
20:49:08,349::image::61::ovirt_hosted_engine_ha.lib.image.Image::(prepare_images)
Preparing images
MainThread::INFO::2015-09-14
20:49:08,787::hosted_engine::667::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_storage_images)
Reloading vm.conf from the shared storage domain
MainThread::INFO::2015-09-14
20:49:08,941::hosted_engine::460::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(start_monitoring)
Current state EngineUp (score: 3400)
MainThread::INFO::2015-09-14
20:49:15,104::agent::143::ovirt_hosted_engine_ha.agent.agent.Agent::(run)
Agent shutting down
MainThread::INFO::2015-09-14
21:06:07,601::agent::78::ovirt_hosted_engine_ha.agent.agent.Agent::(run)
ovirt-hosted-engine-ha agent 1.3.0-0.0.master.20150909150500 started
MainThread::INFO::2015-09-14
21:06:07,632::hosted_engine::246::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_get_hostname)
Found certificate common name: lichthuis.puzzle-it.nu
MainThread::INFO::2015-09-14
21:06:07,633::hosted_engine::610::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_vdsm)
Initializing VDSM
MainThread::ERROR::2015-09-14
21:06:18,209::hosted_engine::418::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(start_monitoring)
Service vdsmd is not running and the admin is responsible for starting it.
Shutting down.
MainThread::INFO::2015-09-14
21:06:18,210::agent::143::ovirt_hosted_engine_ha.agent.agent.Agent::(run)
Agent shutting down

Martin, any hint there?

Previously you had an issue there:
MainThread::ERROR::2015-09-14
17:34:19,972::agent::201::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Error: '[Errno 13] Permission denied:
'/var/run/ovirt-hosted-engine-ha/vm.conf'' - trying to restart agent

Did you manually created /var/run/ovirt-hosted-engine-ha/vm.conf being
readable only by root or something similar? ovirt-ha-agent is running as
vdsm user and it should be able to refresh
 /var/run/ovirt-hosted-engine-ha/vm.conf  with the master copy on the
shared storage.

And previously than that it was failing with:
MainThread::ERROR::2015-09-12
13:53:23,727::agent::201::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Error: ''StorageServer' object has no attribute 'storageType'' - trying to
restart agent
cause you were using ovirt-hosted-engine-ha agent
1.3.0-0.0.master.20150819082341 which was unfortunately bugged on
that. ovirt-hosted-engine-ha agent 1.3.0-0.0.master.20150909150500 you are
using now should be OK.






>
>
> To me it looks like I better start over but I don't want to wipe my F22
>> install. What is the preferred way to get a 'clean' system so that I can
>> try again? (concerned about sanlock complaining about name conflicts)
>>
>
> I'm quite confident you can save your setup.
>
> Great :-)
>
> Joop
>
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Running an ovirt phyton scrit on a newly RHEL7 installed system

2015-09-15 Thread Jean-Pierre Ribeauville
Hi,

You are totally right.

It's O.K. now.

Thanks for help.

J.P. Ribeauville

-Message d'origine-
De : Juan Hernández [mailto:jhern...@redhat.com] 
Envoyé : mardi 15 septembre 2015 14:04
À : Jean-Pierre Ribeauville; users@ovirt.org
Objet : Re: [ovirt-users] Running an ovirt phyton scrit on a newly RHEL7 
installed system

On 09/15/2015 11:02 AM, Jean-Pierre Ribeauville wrote:
> Hi,
> 
>  
> 
> I'm trying to retrieve datacenters clusters hosts and vms  by using 
> ovirt python script needing some imports :
> 
>  
> 
> /import sys/
> 
> /from ovirtsdk.api import API/
> 
> /from ovirtsdk.xml import params/
> 
> /from threading import Thread/
> 
> /import time/
> 
> /import logging/
> 
>  
> 
>  
> 
> Then I  got following error :
> 
>  
> 
> Ø  python test2.py
> 
> /Traceback (most recent call last):/
> 
> /  File "test2.py", line 32, in /
> 
> /from ovirtsdk.api import API/
> 
> /ImportError: No module named ovirtsdk.api/
> 
>  
> 
> As I run it on a "fresh install" rhel7 server ( python has been 
> added), I'm wondering what's the minimum packages list I've to install 
> on this machine to be able to run this script ?  ( I run it correctly 
> on the system where the ovirt engine is installed)
> 

As you are using a fresh install of RHEL 7 you won't probably have the oVirt 
repositories configured yet, so you will need to configure them:

  # yum install http://resources.ovirt.org/pub/yum-repo/ovirt-release35.rpm

The you need to install the ovirt-engine-sdk-python package:

  # yum install ovirt-engine-sdk-python

--
Dirección Comercial: C/Jose Bardasano Baos, 9, Edif. Gorbea 3, planta 3ºD, 
28016 Madrid, Spain Inscrita en el Reg. Mercantil de Madrid - C.I.F. B82657941 
- Red Hat S.L.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Running an ovirt phyton scrit on a newly RHEL7 installed system

2015-09-15 Thread Juan Hernández
On 09/15/2015 11:02 AM, Jean-Pierre Ribeauville wrote:
> Hi,
> 
>  
> 
> I’m trying to retrieve datacenters clusters hosts and vms  by using
> ovirt python script needing some imports :
> 
>  
> 
> /import sys/
> 
> /from ovirtsdk.api import API/
> 
> /from ovirtsdk.xml import params/
> 
> /from threading import Thread/
> 
> /import time/
> 
> /import logging/
> 
>  
> 
>  
> 
> Then I  got following error :
> 
>  
> 
> Ø  python test2.py
> 
> /Traceback (most recent call last):/
> 
> /  File "test2.py", line 32, in /
> 
> /from ovirtsdk.api import API/
> 
> /ImportError: No module named ovirtsdk.api/
> 
>  
> 
> As I run it on a “fresh install” rhel7 server ( python has been added),
> I’m wondering what’s the minimum packages list I’ve to install
> on this machine to be able to run this script ?  ( I run it correctly on
> the system where the ovirt engine is installed)
> 

As you are using a fresh install of RHEL 7 you won't probably have the
oVirt repositories configured yet, so you will need to configure them:

  # yum install http://resources.ovirt.org/pub/yum-repo/ovirt-release35.rpm

The you need to install the ovirt-engine-sdk-python package:

  # yum install ovirt-engine-sdk-python

-- 
Dirección Comercial: C/Jose Bardasano Baos, 9, Edif. Gorbea 3, planta
3ºD, 28016 Madrid, Spain
Inscrita en el Reg. Mercantil de Madrid – C.I.F. B82657941 - Red Hat S.L.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Extension aaa: No search for principal

2015-09-15 Thread Daniel Helgenberger


On 11.09.2015 17:00, Alon Bar-Lev wrote:
>
>
> - Original Message -
>> From: "Daniel Helgenberger" 
>> To: "Alon Bar-Lev" 
>> Cc: Users@ovirt.org
>> Sent: Friday, September 11, 2015 5:33:21 PM
>> Subject: Re: [ovirt-users] Extension aaa: No search for principal
>>
>> sorry, forgot one:
>>
>> On 11.09.2015 12:48, Alon Bar-Lev wrote:
>>> Hi!
>>>
>>> Thank you for the information, for some reason the administrator user
>>> cannot be resolved to userPrincipalName during login, is it specific for
>>> Administrator or any user?
>> This is the default domain administrator account witch exits in any
>> forest. But just in case I created a new domain user just for the
>> purpose; same outcome
>
Sorry for the delay, Alon.

> I am unsure what actually happens...
I might have an idea, at least from the commands you supplied.

> Something in global catalog is out of sync.
> Usually - you do not add domain administrator to external application... 
> there is no need to expose it.
> By default Administrator does not have "login from network" and "user 
> principal suffix".
>
> Also in my environment I do not get result for administrator, but I do get 
> one for regular user that has upn suffix in user record, you can see these 
> fields in user and domain manager.
>
> So please use regular unprivileged users which belongs to "Domain Users" from 
> now on.
>
> To test if user has userPrincipalName use the following command (assuming we 
> search for u...@int.corp.de):
>
> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H 
> ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w PASSWORD 
> -b '' '(userPrincipalName=u...@int.corp.de)' cn userPrincipalName
It seams with Active Directory (at least) the search base cannot be 
empty (-b '') but needs to be provided.

In my case, the above command fails with:
> # search result
> search: 2
> result: 32 No such object
> text: 208D: NameErr: DSID-03100213, problem 2001 (NO_OBJECT), data 0, 
> best match of:

While adding the most basic search path it succeeds:

$ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H 
ldap://int.corp.de:389/ -x -D 'b...@int.corp.de' -w PASSWORD -b 
'dc=int,dc=corp,dc=de' '(userPrincipalName=administra...@int.corp.de)' 
cn userPrincipalName
> # search reference
> ref: ldap://ForestDnsZones.int.corp.de/DC=ForestDnsZones,DC=int,DC=corp,DC=de
>
> # search reference
> ref: ldap://DomainDnsZones.int.corp.de/DC=DomainDnsZones,DC=int,DC=corp,DC=de
>
> # search reference
> ref: ldap://int.corp.de/CN=Configuration,DC=int,DC=corp,DC=de
>
> # search result
> search: 2
> result: 0 Success
> control: 1.2.840.113556.1.4.319 false DDDSSSDDMM=
> pagedresults: cookie=
>
> # numResponses: 4
> # numReferences: 3

It succeeds with every user I tried.

I would set the search base; but i am not sure where to do so.

>
> This should find the user (return one result), if not, please checkout user 
> in Users and Domains manager for the domain suffix, maybe it is empty.
>
> To find user without userPrincipalName such as Administrator use the 
> following command:
>
> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H 
> ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w PASSWORD 
> -b '' '(sAMAccountName=user)' cn userPrincipalName
>
> For example, the above will work for Administrator, but for kerberos to work 
> properly user principal name must be defined, so these users will not work.
>
> You can dump entire GC and send me a user record if no result so I can 
> determine what is different from expectations:
>
> $ ldapsearch -E pr=1024/noprompt -o ldif-wrap=no -H 
> ldap://qa1.qa.lab.tlv.redhat.com:3268/ -x -D 'b...@int.corp.de' -w PASSWORD 
> -b '' > /tmp/dump.out

If you still require a dump (its even a small one..) please drop a mail.

>
> Regards,
> Alon
>

-- 
Daniel Helgenberger
m box bewegtbild GmbH

P: +49/30/2408781-22
F: +49/30/2408781-10

ACKERSTR. 19
D-10115 BERLIN


www.m-box.de  www.monkeymen.tv

Geschäftsführer: Martin Retschitzegger / Michaela Göllner
Handeslregister: Amtsgericht Charlottenburg / HRB 112767
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] ovirt 3.6 webadmin vm attributes empty

2015-09-15 Thread Marc Werner
Hi
If I try to edit a vm most of dialog is empty and I cant chos e anything.
Is that a known bug?


Best regards
Marc
[cid:image001.png@01D0EFB1.17A0FA20]

ovirt-engine.noarch 
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-backend.noarch 
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-cli.noarch 
   3.6.0.1-0.1.20150821.gitac5082d.el6  
  
@ovirt-3.6
ovirt-engine-dbscripts.noarch   
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-extension-aaa-jdbc.noarch  
   
1.0.0-0.0.master.20150831142838.git4d9c713.el6  
   @ovirt-3.6
ovirt-engine-extensions-api-impl.noarch 
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-jboss-as.x86_64
   7.1.1-1.el6  
  
@ovirt-3.5-pre
ovirt-engine-lib.noarch 
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-restapi.noarch 
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-sdk-python.noarch  
   3.6.0.1-0.1.20150821.gitc8ddcd8.el6  
  
@ovirt-3.6
ovirt-engine-setup.noarch   
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-setup-base.noarch  
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-setup-plugin-ovirt-engine.noarch   
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-setup-plugin-ovirt-engine-common.noarch
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-setup-plugin-vmconsole-proxy-helper.noarch 
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-setup-plugin-websocket-proxy.noarch
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-tools.noarch   
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-userportal.noarch  
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-vmconsole-proxy-helper.noarch  
   
3.6.0-0.0.master.20150909083445.gitbcc44ff.el6  
   @ovirt-3.6
ovirt-engine-webadmin-portal.noarch

Re: [ovirt-users] using RHEL6/7 server as host ?

2015-09-15 Thread Arsène Gschwind

we run RHEL 7 as oVirt host since a while and it is supported.

rgs,
Arsène

On 09/10/2015 01:19 PM, Jean-Pierre Ribeauville wrote:


Hi,

Ovirt quick start guide mentions this :

*/Install Hosts /*

/After you have installed the oVirt Engine, install the hosts to run 
your virtual machines. In oVirt, you can use either oVirt Node, Fedora 
or CentOS as hosts. /


Is it O.K. to use a RHEL6 or 7 server as  host to run Guest VMs  ?

I assume that’s ovirt engine is not installed on one of these hosts  ( 
it’s better to use a dedicated machine) ?


Thx

**

*J.P. Ribeauville*

P: +33.(0).1.47.17.27.87

Puteaux 3 Etage 5  Bureau 4

_jpribeauvi...@axway.com __
_http://www.axway.com 

P Pensez à l’environnement avant d’imprimer.



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Running an ovirt phyton scrit on a newly RHEL7 installed system

2015-09-15 Thread Jean-Pierre Ribeauville
Hi,

I'm trying to retrieve datacenters clusters hosts and vms  by using ovirt 
python script needing some imports :

import sys
from ovirtsdk.api import API
from ovirtsdk.xml import params
from threading import Thread
import time
import logging


Then I  got following error :


Ø  python test2.py
Traceback (most recent call last):
  File "test2.py", line 32, in 
from ovirtsdk.api import API
ImportError: No module named ovirtsdk.api

As I run it on a "fresh install" rhel7 server ( python has been added), I'm 
wondering what's the minimum packages list I've to install
on this machine to be able to run this script ?  ( I run it correctly on the 
system where the ovirt engine is installed)

Thanks.

Regards,



J.P. Ribeauville


P: +33.(0).1.47.17.27.87


Puteaux 3 Etage 5  Bureau 4

jpribeauvi...@axway.com
http://www.axway.com



P Pensez à l'environnement avant d'imprimer.



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Upgrade problem

2015-09-15 Thread Simone Tiraboschi
On Mon, Sep 14, 2015 at 8:27 PM, Koen Vanoppen 
wrote:

> Yes,
> That will indeed be the issue I think. But... Can you explain to me what
> steps I need to take? I rather not change my certs...
>

The issue is that the setup is going to extract the cert to verify its
compatibility with latest browser but to do that it assumes that it was
created with our internal CA with the default password.
So it fails if the user substituted the apache cert with a certificate
signed by a different CA and protected with a different password.

We'll fix it for the next release but I can suggest you a workaround:
- Backup the current /etc/pki/ovirt-engine/keys/apache.p12
- Recreate the apache.p12 with the password 'mypass':
  openssl pkcs12 -export -out /etc/pki/ovirt-engine/keys/apache.p12 -inkey
/etc/pki/ovirt-engine/keys/apache.key.nopass -in
/etc/pki/ovirt-engine/certs/apache.cer
When prompted for password use 'mypass' without the quotation marks
- execute engine-setup to upgrade
- if you prefer, restore your original
/etc/pki/ovirt-engine/keys/apache.p12 and restart the Apache server.


> Kind regards and thanks for your help,
>
> Koen
>
> On 11 September 2015 at 19:25, Yaniv Kaul  wrote:
>
>> On 11/09/15 19:18, Koen Vanoppen wrote:
>>
>> Hi everybody,
>>
>> I'm trying to upgrade ovirt to the ovirt 3.5.4 from my current version
>> 3.5.3.
>> But when I try to run engine-setup I got this:
>>
>> [ INFO  ] Checking for product updates...
>>   Setup has found updates for some packages, do you wish to
>> update them now? (Yes, No) [Yes]:
>> [ INFO  ] Checking for an update for Setup...
>>
>>   --== ALL IN ONE CONFIGURATION ==--
>>
>>
>>   --== NETWORK CONFIGURATION ==--
>>
>>   Setup can automatically configure the firewall on this system.
>>   Note: automatic configuration of the firewall may overwrite
>> current settings.
>>   Do you want Setup to configure the firewall? (Yes, No) [Yes]:
>> [ INFO  ] firewalld will be configured as firewall manager.
>>
>>   --== DATABASE CONFIGURATION ==--
>>
>>   The detected DWH database size is 198 MB.
>>   Setup can backup the existing database. The time and space
>> required for the database backup depend on its size. This process takes
>> time, and in some cases (for instance, when the size is few GBs) may take
>> several hours to complete.
>>   If you choose to not back up the database, and Setup later
>> fails for some reason, it will not be able to restore the database and all
>> DWH data will be lost.
>>   Would you like to backup the existing database before upgrading
>> it? (Yes, No) [Yes]:
>>
>>   --== OVIRT ENGINE CONFIGURATION ==--
>>
>>   Skipping storing options as database already prepared
>>
>>   --== PKI CONFIGURATION ==--
>>
>> [ ERROR ] Failed to execute stage 'Environment customization': Command
>> '/bin/openssl' failed to execute
>> [ INFO  ] Stage: Clean up
>>   Log file is located at
>> /var/log/ovirt-engine/setup/ovirt-engine-setup-20150911181604-1gft67.log
>> [ INFO  ] Generating answer file
>> '/var/lib/ovirt-engine/setup/answers/20150911181626-setup.conf'
>> [ INFO  ] Stage: Pre-termination
>> [ INFO  ] Stage: Termination
>> [ ERROR ] Execution of setup failed
>>
>>
>> Any idea?
>>
>>
>> Looks like https://bugzilla.redhat.com/show_bug.cgi?id=1260752 .
>> Y.
>>
>>
>> Kind regards,
>>
>> Koen
>>
>>
>> ___
>> Users mailing 
>> listUsers@ovirt.orghttp://lists.ovirt.org/mailman/listinfo/users
>>
>>
>>
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] vdsm high mem usage

2015-09-15 Thread Michal Skrivanek

On Sep 10, 2015, at 20:35 , Michael Kleinpaste 
 wrote:

> Hi everybody.
> 
> So I ran into that high mem usage thing. The problem I have with patching is 
> that this is a live system so I can't do it mid day.  Can anybody tell me if 
> it is possible to just restart the vdsm service or does the host have to be 
> in "maintenance mode" before restarting it?  It is using gluster storage, if 
> that makes a difference as well.

Hi,
you can restart vdsm without any effect on running VMs. Other than short 
interruption of communication between engine and host. It can cause a short CPU 
spike on startup, so do that with caution when you run tend or hundreds of VMs 
on a same overloaded host.
Obviously, while vdsm is not running, the system is a bit more vulnerable to 
failures, but as long as you don't do that in the middle of a migration or a 
power failure you're good:)

Thanks,
michal
> 
> Thanks,
> -- 
> Michael Kleinpaste
> Senior Systems Administrator
> SharperLending, LLC.
> www.SharperLending.com
> michael.kleinpa...@sharperlending.com
> (509) 324-1230   Fax: (509) 324-1234
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Strange fencing behaviour 3.5.3

2015-09-15 Thread Martin Perina
Hi,

sorry for late reponse I somehow missed your email :-(

I cannot completely understand you exact issue from the description,
but the situation when engine loses connection to all hypervisors
is always bad. Fortunately we made a few improvements in 3.5, which
should help in those scenarios. Please take a look at "Fencing policy"
tab in "Edit cluster" dialog:

1. Skip fencing if host has live lease on storage
 - when host is connected to storage it has to renew its
   storage lease at least every 60 secs
 - so if the option is enabled and engine tries to fence the host
   host using fence proxy (another host in cluster/DC which has
   good connection), fence proxy checks if non responsive host
   renewed its storage lease in the last 90 secs. And if lease
   was renewed, fencing is aborted

2. Skip fencing on cluster connectivity issues
 - if this options is enabled, engine test prior to fencing
   how many of the hosts in the cluster has connectivity
   issues. And if number of hosts with connectivity issues
   is higher than the specified percentage, fencing is aborted
 - of course this option is useless in clusters with less than
   3 hosts

3. Enable fencing
 - by disabling this option you can completely disable fencing
   for hosts in the cluster
 - this is usable in the situation when you expect connectivity
   issues between engine and hosts (for example during switch
   replacement), so you can disable fencing, replace the switch
   and when connection is restored, enable fencing again
 - however if you disable fencing completely, your HA VMs won't
   be restarted on different hosts, so please use this option
   with caution

Please let me known if have any other issues/questions with fencing.

Thanks

Martin Perina


- Original Message -
> From: "Martin Breault" 
> To: users@ovirt.org
> Sent: Friday, September 11, 2015 9:14:23 PM
> Subject: [ovirt-users] Strange fencing behaviour 3.5.3
> 
> Hello,
> 
> I manage 2 oVirt clusters that are not associated in any way, they each
> have their own management engine running ovirt-engine-3.5.3.1-1.  The
> servers are Dell 6xx series and the power-management is configured using
> idrac5 settings and each cluster is a pair of hypervisors.
> 
> The engines are both in a datacenter that had an electrical issue, each
> cluster is at a different unrelated location.  The problem I had was
> caused by a downed switch causing the individual engines to continue to
> function, however no longer have connectivity to their respective
> clusters.  Once the switch was replaced (about 30 minutes of downtime) ,
> when connectivity was resumed, both engines chose to fence one of the
> two "unresponsive hypervisors" by sending an iDrac command to power down.
> 
> The downed hypervisor Cluster1 for some reason, 8 minutes later, got a
> iDrac command to power-up again.  When I logged into the engine, the
> guests that were running on the powered-down host were in "off" state.
> I simply powered them back on.
> 
> The downed hypervisor on Cluster2 stayed off, and was unresponsive
> according to the engine, however the VMs that were running on it were in
> an unknown state.  I had to power on the host and click the "host has
> been rebooted" dialog for the cluster to free these guests to be booted
> again.
> 
> My question is, is it normal for the engine to fence one or more hosts
> when it loses connectivity to all thehypervisors in the cluster?  Is
> there a minimum of 3 hosts in a cluster for it to not fall into this
> mode?I'd like to know what I can troubleshoot or how I can avoid an
> issue like this should the engine be disconnected from the hypervisors
> temporarily and then resume connectivity only to kill the well-running
> guests.
> 
> Thanks in advance,
> 
> Marty
> 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
> 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Disappearing vm.conf

2015-09-15 Thread Simone Tiraboschi
On Mon, Sep 14, 2015 at 8:16 PM, Joop  wrote:

> On 14-9-2015 9:19, Joop wrote:
>
> On 14-9-2015 9:17, Simone Tiraboschi wrote:
>
>
>
> On Mon, Sep 14, 2015 at 9:02 AM, Sandro Bonazzola < 
> sbona...@redhat.com> wrote:
>
>>
>>
>> On Mon, Sep 14, 2015 at 8:45 AM, noc < n...@nieuwland.nl
>> > wrote:
>>
>>> Hi All,
>>>
>>> I'm reinstalling hosted-engine on my F22 laptop and I'm making progress.
>>> With beta3 I couldn't get it installed, with beta5 it installed and I
>>> could go into the webui but I had to shutdown my laptop and get some
>>> sleep :-)
>>> Next morning the hosted-engine didn't start. Running 'hosted-engine
>>> --vm-status' tells me I need to deploy first. Looks like its a virgin
>>> system. Looking at what hosted-engine does I found out that
>>> /var/run/ovirt-hosted-engine-ha/vm.conf is missing. In another thread
>>> its mentioned that it now lives on the storage domain but I couldn't
>>> find it there, nothing resembles it.
>>> Question is: how did engine get started and why is that vm.conf now
>>> missing.
>>>
>>
>> vm.conf moved to shared storage as part of
>> 
>> http://www.ovirt.org/Features/Hosted_Engine_configuration_on_shared_storage
>> I guess you hit a bug in this area, adding Simone.
>>
>
> What kind of hared storage are you using?
>
> NFSv3
>
> Could you please attach logs from /var/log/ovirt-hosted-engine-ha/
>
> Yes, later today.
>
> Log attached of the last try with beta5
>
> I managed to extract vm.conf from the log and when copied to
> /var/run/ovirt-hosted-engine-ha the engine will startup up but I get an
> error when I try to attach the hosted_engine storage domain.
> Sanlock.log attached as well.
> Anymore logs you need?
>

Yes, can you please attach the logs from  /var/log/ovirt-hosted-engine-ha/ ?


> To me it looks like I better start over but I don't want to wipe my F22
> install. What is the preferred way to get a 'clean' system so that I can
> try again? (concerned about sanlock complaining about name conflicts)
>

I'm quite confident you can save your setup.


>
> Regards,
>
> Joop
>
> Joop
>
>
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users