Re: [ovirt-users] log out event but not log in?

2017-02-22 Thread Yura Poltoratskiy

I have the same issue since oVirt 4.0.x.

21.02.2017 10:28, Gianluca Cecchi пишет:

Hello,
In oVirt 4.1 web admin gui I see events about users logging out (they
have been created on internal domain with ovirt-aaa-jdbc-tool command),
but I don't see the corresponding log in event.
The same is true for the default admin@internal user.
Is there any reason or is it a bug?
Thanks,
Gianluca


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] ovirt 4.1.0.4 error..

2017-02-22 Thread Thomas Davis
I am getting this error message:

Unexpected character ('<' (code 60)): expected a valid value (number,
String, array, object, 'true', 'false' or 'null') at [Source:
java.io.StringReader@4c65dbb1; line: 1, column: 2]

in both screen after login on ovirt, and in the engine.log.

Any ideas one what it is looking for this?

This ovirt setup has gone from 3.5 -> 3.6 -> 4.0 -> 4.1.

I am currently blocked on doing any VM management in ovirt due to this..

thomas
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Guest Agent Running unconfined on Centos 7

2017-02-22 Thread Simone Tiraboschi
On Wed, Feb 22, 2017 at 10:05 PM, Michal Skrivanek 
wrote:

> > On 22 Feb 2017, at 16:46, Jiri Belka  wrote:
> >
> > - Original Message -
> >> From: "Alan Griffiths" 
> >> To: "Ovirt Users" 
> >> Sent: Friday, February 10, 2017 4:25:28 PM
> >> Subject: [ovirt-users] Guest Agent Running unconfined on Centos 7
> >>
> >> Hi,
> >>
> >> I'm running ovirt-guest-agent from Centos 7 EPEL and I notice that it's
> >> running unconfined rather than within its own domain.
> >>
> >> I see there is a rhev_agentd_exec_t
>
> That sound suspicious on its own. Are you sure you haven't mixed rhev
> and ovirt agents in the same guest at some point? Restoring selinux
> context doesn't help?
>
>
Here the same:
[root@c72he20170222h1 ~]# yum list installed | grep rhev
fence-agents-rhevm.x86_64 4.0.11-47.el7_3.2
 @updates
[root@c72he20170222h1 ~]# yum list installed | grep ovirt-guest-agent
ovirt-guest-agent-common.noarch   1.0.12-4.el7
@epel
[root@c72he20170222h1 ~]# ps auxZ  | grep guest-agent
system_u:system_r:unconfined_service_t:s0 ovirtag+ 732 0.2  0.6 441796
36036 ? Ssl  16:59   0:46 /usr/bin/python
/usr/share/ovirt-guest-agent/ovirt-guest-agent.py
unconfined_u:unconfined_r:unconfined_t:s0-s0:c0.c1023 root 6938 0.0  0.0
112648 964 pts/0 S+ 22:31   0:00 grep --color=auto guest-agent
[root@c72he20170222h1 ~]# semanage fcontext -l | grep rhev_agentd
/var/log/rhev-agent(/.*)?  all files
 system_u:object_r:rhev_agentd_log_t:s0
/var/log/ovirt-guest-agent(/.*)?   all files
 system_u:object_r:rhev_agentd_log_t:s0
/usr/lib/systemd/system/ovirt-guest-agent.*regular file
system_u:object_r:rhev_agentd_unit_file_t:s0
/var/run/rhev-agentd\.pid  regular file
system_u:object_r:rhev_agentd_var_run_t:s0
/usr/share/ovirt-guest-agent   regular file
system_u:object_r:rhev_agentd_exec_t:s0
/var/run/ovirt-guest-agent\.pidregular file
system_u:object_r:rhev_agentd_var_run_t:s0
/usr/share/rhev-agent/rhev-agentd\.py  regular file
system_u:object_r:rhev_agentd_exec_t:s0
/usr/share/rhev-agent/LockActiveSession\.pyregular file
system_u:object_r:rhev_agentd_exec_t:s0
/usr/share/ovirt-guest-agent/LockActiveSession\.py regular file
system_u:object_r:rhev_agentd_exec_t:s0




> >> type, which I attempted to assign to
> >> ovirt-guest-agent.py but it still starts up as unconfined. Is there a
> >> supported process for getting ovirt-guest into its own domain? Or a
> reason
> >> why it's not possible?
> >>
> >> Thanks,
> >>
> >> Alan
> >
> > Hm, it seems many ovirt services run unconfined. For ovirt GA, it seems
> > there's missing glue between systemd -> python -> GA script.
> >
> > Vinzenz, any idea?
> >
> > j.
> > ___
> > Users mailing list
> > Users@ovirt.org
> > http://lists.ovirt.org/mailman/listinfo/users
> >
> >
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Ovirt engine disk size

2017-02-22 Thread rightkicktech.gmail.com
Hi,

it is ok to define it during deploy. I don't recall providing a value, but I do 
see it in the answer file. 
I guess I can edit answer file and deploy it that way.

Thanx

On February 22, 2017 9:38:33 AM EET, Sandro Bonazzola  
wrote:
>On Wed, Feb 22, 2017 at 12:52 AM, rightkicktech.gmail.com <
>rightkickt...@gmail.com> wrote:
>
>> Hi all,
>>
>> I have setup self hosted engine using ovirt v4.1 on two hosts for
>testing.
>> The default engine image downloaded has a disk of 10G. How I can
>increase
>> the disk size of the engine?
>>
>
>Hi,
>you could have set a different size for Hosted Engine disk during its
>deployment. Once the Hosted Engine is deployed there's no way to extend
>its
>disk as far as I know.
>Issue was raised in bug
>https://bugzilla.redhat.com/show_bug.cgi?id=1347731
>which was closed wontfix asking to open a RFE if needed.
>Looks like you need it, can you please open a RFE?
>
>
>
>>
>> Thanx,
>> Alex
>>
>>
>> --
>> Sent from my Android device with K-9 Mail. Please excuse my brevity.
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/users
>>
>>
>
>
>-- 
>Sandro Bonazzola
>Better technology. Faster innovation. Powered by community
>collaboration.
>See how it works at redhat.com

-- 
Sent from my Android device with K-9 Mail. Please excuse my brevity.___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Attaching ISO to hosted engine for OS upgrade

2017-02-22 Thread Simone Tiraboschi
On Wed, Feb 22, 2017 at 8:04 PM, Chris Adams  wrote:

> Once upon a time, Simone Tiraboschi  said:
> > Then ee have a specific helper utility for 3.6/el6 -> 4.0/el7:
> > https://www.ovirt.org/develop/release-management/features/
> hosted-engine-migration-to-4-0/
>
> Ahh, that looks better.  I was looking at this:
>
> https://www.ovirt.org/documentation/migration-engine-36-to-40/
>
> which just kind of glosses over how to upgrade the OS. :)
>
> I do usually use my custom CentOS install (rather than the appliance);
> is there a way to do that?
>

No, not with that flow and not in 4.1 at all.
ovirt-engine-appliance is basically just a centos VM plus ovirt-engine.
Once deployed you can tune it as you need.


>
> Also, is it normally recommended to upgrade one major release at a time?
>

For the engine it's not just recommended, it's mandatory!


> In other words, aside from the engine CentOS6->7 step, would upgrading
> from 3.5 to 4.1 need to go through 3.6 and 4.0 along the way?
> --
> Chris Adams 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Guest Agent Running unconfined on Centos 7

2017-02-22 Thread Michal Skrivanek
> On 22 Feb 2017, at 16:46, Jiri Belka  wrote:
>
> - Original Message -
>> From: "Alan Griffiths" 
>> To: "Ovirt Users" 
>> Sent: Friday, February 10, 2017 4:25:28 PM
>> Subject: [ovirt-users] Guest Agent Running unconfined on Centos 7
>>
>> Hi,
>>
>> I'm running ovirt-guest-agent from Centos 7 EPEL and I notice that it's
>> running unconfined rather than within its own domain.
>>
>> I see there is a rhev_agentd_exec_t

That sound suspicious on its own. Are you sure you haven't mixed rhev
and ovirt agents in the same guest at some point? Restoring selinux
context doesn't help?

>> type, which I attempted to assign to
>> ovirt-guest-agent.py but it still starts up as unconfined. Is there a
>> supported process for getting ovirt-guest into its own domain? Or a reason
>> why it's not possible?
>>
>> Thanks,
>>
>> Alan
>
> Hm, it seems many ovirt services run unconfined. For ovirt GA, it seems
> there's missing glue between systemd -> python -> GA script.
>
> Vinzenz, any idea?
>
> j.
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Attaching ISO to hosted engine for OS upgrade

2017-02-22 Thread Chris Adams
Once upon a time, Simone Tiraboschi  said:
> Then ee have a specific helper utility for 3.6/el6 -> 4.0/el7:
> https://www.ovirt.org/develop/release-management/features/hosted-engine-migration-to-4-0/

Ahh, that looks better.  I was looking at this:

https://www.ovirt.org/documentation/migration-engine-36-to-40/

which just kind of glosses over how to upgrade the OS. :)

I do usually use my custom CentOS install (rather than the appliance);
is there a way to do that?

Also, is it normally recommended to upgrade one major release at a time?
In other words, aside from the engine CentOS6->7 step, would upgrading
from 3.5 to 4.1 need to go through 3.6 and 4.0 along the way?
-- 
Chris Adams 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Attaching ISO to hosted engine for OS upgrade

2017-02-22 Thread Simone Tiraboschi
On Wed, Feb 22, 2017 at 5:33 PM, Chris Adams  wrote:

> I'm working on upgrading an oVirt 3.5 setup.  The physical hosts are
> running CentOS 7, but the hosted engine is CentOS 6.  The upgrade notes
> are "back up the engine, upgrade/reinstall the OS, then restore", but I
> can't see how to actually install CentOS 7 on the engine.
>
> Am I supposed to re-run "hosted-engine --deploy"?  Wouldn't that try to
> re-register the physical hosts, or can I interrupt it to restore the
> backup?
>
> Or, is there a way to just attach an install ISO to the engine VM and
> boot from that?
>

On 3.6 we release the engine also for el6 so for 3.5/el6 to 3.6/el6 you
have just to:
- set global maintenance mode,
- add the 3.6 repo to the engine VM,
- run yum update ovirt-*-setup* on the engine VM
- run engine-setup
- upgrade the host to 3.6 setting maintenance mode one host at a time
- raise the cluster compatibility level

Then ee have a specific helper utility for 3.6/el6 -> 4.0/el7:
https://www.ovirt.org/develop/release-management/features/hosted-engine-migration-to-4-0/



> --
> Chris Adams 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Attaching ISO to hosted engine for OS upgrade

2017-02-22 Thread Chris Adams
I'm working on upgrading an oVirt 3.5 setup.  The physical hosts are
running CentOS 7, but the hosted engine is CentOS 6.  The upgrade notes
are "back up the engine, upgrade/reinstall the OS, then restore", but I
can't see how to actually install CentOS 7 on the engine.

Am I supposed to re-run "hosted-engine --deploy"?  Wouldn't that try to
re-register the physical hosts, or can I interrupt it to restore the
backup?

Or, is there a way to just attach an install ISO to the engine VM and
boot from that?
-- 
Chris Adams 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Removing all the Users Permission on Data Center

2017-02-22 Thread Manuel Luis Aznar
Hello there,

Thanks for yours answer.

I will try your note.

We have an oVirt test installation (very litle users and few permissions,
so you can count permissions) and if I try to get the permissions with the
users entry point, I got a wrong number of them.

user_service = users_service.user_service(id=user.id)

user_permissions_service = user_service.permissions_service()

list = user_permissions_service.list()

The list "list" have wrong number of permissions...Do not know if this is a
bug or what, I will double check later on. In case I am wrong I will be
writing to you again.

Thanks again
Manuel



2017-02-22 12:01 GMT+00:00 Ondra Machacek :

> On Wed, Feb 22, 2017 at 10:16 AM, Manuel Luis Aznar
>  wrote:
> > Hello there,
> >
> > I need to remove the specific Users Permission a user have in a
> DataCenter.
> >
> > I manage to do it in this way:
> >
> > username = "..."
> > dc = "..."
> >
> > system_service = connection.system_service()
> > users_service = system_service.users_service()
> > data_centers_service = system_service.data_centers_service()
> >
> > # Getting the User object
> > for user in users_service.list():
> > if username in user.user_name:
> > break
> >
> > # Getting the DataCenter object
> > for data_center in data_centers_service.list():
> > if dc in data_center.name:
> > break
>
> Just a note that you can use:
>
>data_center = data_centers_service.list(search='name=dc')[0]
>
>
> >
> > # Getting DataCenter service and its Permission Service
> > data_center_service =
> > data_centers_service.data_center_service(id=data_center.id)
> > data_center_permissions_service = data_center_service.
> permissions_service()
> >
> > # Getting the Permission for the User in the DataCenter
> > for data_center_permission in data_center_permissions_service.list():
> > data_center_permission_user = data_center_permission.user
> > if data_center_permission_user:
> > if data_center_permission_user.id == user.id:
> > data_center_permission_user.remove()
> >
> > As you can see I can do it, but, I am using the permissions of the
> > DataCenter and this list could be very long.
> >
> > In the old ovirtsdk (version 3) this was done by the following:
> >
> > u = kvm.users.get(id=userid)
> > for perm in u.permissions.list():
> >udc = perm.get_data_center()
> >   if udc:
> >  globaldc = kvm.datacenters.get(id=udc.get_id())
> >  if globaldc.get_name() == dc:
> > perm.delete()
> >
> > that last piece of code iterate by the user permission list and delete
> the
> > specific Data Center permission. I have been trying this doing the
> > following:
> >
> > user_service = users_service.user_service(id=user.id)
> >
> > user_permissions_service = user_service.permissions_service()
> >
> > list = user_permissions_service.list()
> >
> > Is that last variable, list: the permissions list for the specified
> user, I
> > ask that because if I print the size of the list for an specific user,
> the
> > number I get is not correct...
>
> It should work, what do you expect to see and what do you actually see?
>
> >
> > Thanks for all in advance to all
> > Manuel
> >
> > ___
> > Users mailing list
> > Users@ovirt.org
> > http://lists.ovirt.org/mailman/listinfo/users
> >
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Guest Agent Running unconfined on Centos 7

2017-02-22 Thread Jiri Belka
- Original Message -
> From: "Alan Griffiths" 
> To: "Ovirt Users" 
> Sent: Friday, February 10, 2017 4:25:28 PM
> Subject: [ovirt-users] Guest Agent Running unconfined on Centos 7
> 
> Hi,
> 
> I'm running ovirt-guest-agent from Centos 7 EPEL and I notice that it's
> running unconfined rather than within its own domain.
> 
> I see there is a rhev_agentd_exec_t type, which I attempted to assign to
> ovirt-guest-agent.py but it still starts up as unconfined. Is there a
> supported process for getting ovirt-guest into its own domain? Or a reason
> why it's not possible?
> 
> Thanks,
> 
> Alan

Hm, it seems many ovirt services run unconfined. For ovirt GA, it seems
there's missing glue between systemd -> python -> GA script.

Vinzenz, any idea?

j.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] vm display effect

2017-02-22 Thread Jiri Belka
Do you have SPICE console and SPICE agents installed?

- Original Message -
From: "qinglong dong" 
To: "users" 
Sent: Thursday, February 16, 2017 4:18:22 AM
Subject: [ovirt-users] vm display effect

Hi, all 
I have used windows vm created by ovirt for a long time. I found that the vm 
display effect was not very good in some cases. For example, The codes came out 
word by word when using some c ode browsing tools. And sometimes online video 
playing was not very fluent. 
Anyone coiuld give some advices to improve this? Thanks! 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] moVirt 1.7 RC 1

2017-02-22 Thread Tomas Jelinek
Hey All,

the first public RC of moVirt 1.7 has been released! [3]

It can be downloaded only using direct link [1] - the play store will be
upgraded once it is considered stable.

This release has many enhancements, most notably:
- enhanced dashboard + a documentation written explaining how the values
are actually calculated [2]
- support for signed certificates for communication with oVirt API (e.g.
not self signed ones)
- support for SPICE + VNC and made SPICE connection more stable (thanx
Iordan!).
  To enjoy the fixes, please update aSPICE to 4.2
+ many smaller enhancements/fixes

Big thanx to everyone who have contributed to this RC!

Would you also like to help/contribute?
Sure, for example you can:
- download RC [1], test it and report bugs (here to thread or to [4])
- patches are always welcome :)

have a nice day,
Tomas

[1]: https://github.com/matobet/moVirt/blob/master/moVirt/
moVirt-release.apk?raw=true
[2]: https://github.com/matobet/moVirt/wiki/User%27s-guide#dashboard
[3]: https://github.com/matobet/moVirt
[4]: https://github.com/matobet/moVirt/issues
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Removing all the Users Permission on Data Center

2017-02-22 Thread Ondra Machacek
On Wed, Feb 22, 2017 at 10:16 AM, Manuel Luis Aznar
 wrote:
> Hello there,
>
> I need to remove the specific Users Permission a user have in a DataCenter.
>
> I manage to do it in this way:
>
> username = "..."
> dc = "..."
>
> system_service = connection.system_service()
> users_service = system_service.users_service()
> data_centers_service = system_service.data_centers_service()
>
> # Getting the User object
> for user in users_service.list():
> if username in user.user_name:
> break
>
> # Getting the DataCenter object
> for data_center in data_centers_service.list():
> if dc in data_center.name:
> break

Just a note that you can use:

   data_center = data_centers_service.list(search='name=dc')[0]


>
> # Getting DataCenter service and its Permission Service
> data_center_service =
> data_centers_service.data_center_service(id=data_center.id)
> data_center_permissions_service = data_center_service.permissions_service()
>
> # Getting the Permission for the User in the DataCenter
> for data_center_permission in data_center_permissions_service.list():
> data_center_permission_user = data_center_permission.user
> if data_center_permission_user:
> if data_center_permission_user.id == user.id:
> data_center_permission_user.remove()
>
> As you can see I can do it, but, I am using the permissions of the
> DataCenter and this list could be very long.
>
> In the old ovirtsdk (version 3) this was done by the following:
>
> u = kvm.users.get(id=userid)
> for perm in u.permissions.list():
>udc = perm.get_data_center()
>   if udc:
>  globaldc = kvm.datacenters.get(id=udc.get_id())
>  if globaldc.get_name() == dc:
> perm.delete()
>
> that last piece of code iterate by the user permission list and delete the
> specific Data Center permission. I have been trying this doing the
> following:
>
> user_service = users_service.user_service(id=user.id)
>
> user_permissions_service = user_service.permissions_service()
>
> list = user_permissions_service.list()
>
> Is that last variable, list: the permissions list for the specified user, I
> ask that because if I print the size of the list for an specific user, the
> number I get is not correct...

It should work, what do you expect to see and what do you actually see?

>
> Thanks for all in advance to all
> Manuel
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Fwd: Need documentation

2017-02-22 Thread Yaniv Kaul
On Wed, Feb 22, 2017 at 11:50 AM Александр Пивушков  wrote:

>
>
>
>  Пересылаемое сообщение 
> От кого: Александр Пивушков 
> Кому: users@ovirt.org
> Дата: Вторник, 21 февраля 2017, 12:29 +03:00
> Тема: Need documentation
>
>
> Hello, tell me where to find the official documentation for installing
> and configuring availability and fault tolerance oVirt?
>

The Engine itself -
http://www.ovirt.org/documentation/self-hosted/Self-Hosted_Engine_Guide/


>
> Especially interested in the possibility of physical disconnection from
> the host power supply. fencing.
>

Configuring hosts -
http://www.ovirt.org/documentation/admin-guide/chap-Hosts/
Virtual machine high availability -
http://www.ovirt.org/documentation/vmm-guide/chap-Administrative_Tasks/


>
> We plan to buy equipment for the switch-off - PDU.
> Where can I find a list of equipment (PDU), with which oVirt works?
>

Same link as above, for configuring hosts, has this list:
apc - APC MasterSwitch network power switch. Not for use with APC 5.x power
switch devices.
apc_snmp - Use with APC 5.x power switch devices.
bladecenter - IBM Bladecenter Remote Supervisor Adapter.
cisco_ucs - Cisco Unified Computing System.
drac5 - Dell Remote Access Controller for Dell computers.
drac7 - Dell Remote Access Controller for Dell computers.
eps - ePowerSwitch 8M+ network power switch.
hpblade - HP BladeSystem.
ilo, ilo2, ilo3, ilo4 - HP Integrated Lights-Out.
ipmilan - Intelligent Platform Management Interface and Sun Integrated
Lights Out Management devices.
rsa - IBM Remote Supervisor Adapter.
rsb - Fujitsu-Siemens RSB management interface.
wti - WTI Network Power Switch.

Others can be added as well - generally via the fence-agents package.
Y.


>
>
> Александр Пивушков
>
> --
>
> Александр Пивушков
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Mirgration issues

2017-02-22 Thread Yanir Quinn
I can spot in engine.log that ovirt-node03.mgmt.lan.company.lan (if it is
your host 3)  is being filtered out when trying to migrate the VM :

2017-02-21 04:32:33,618-06 INFO
[org.ovirt.engine.core.bll.MigrateVmCommand]
(org.ovirt.thread.pool-7-thread-44) [8ff8601b-238b-4565-b3bf-de6211cb4685]
Running command: MigrateVmCommand internal: false. Entitie
s affected :  ID: e051b38c-fd63-40f0-8d64-26c12ff7b880 Type: VMAction group
MIGRATE_VM with role type USER
2017-02-21 04:32:33,627-06 INFO
[org.ovirt.engine.core.bll.scheduling.SchedulingManager]
(org.ovirt.thread.pool-7-thread-44) [8ff8601b-238b-4565-b3bf-de6211cb4685]
Candidate host 'ovirt-node03.mgmt.lan.company.
lan' ('9b0feba5-d9a0-491e-b2c2-0742d30af304') was filtered out by
'VAR__FILTERTYPE__INTERNAL' filter 'HA' (correlation id:
8ff8601b-238b-4565-b3bf-de6211cb4685)

I suggest you first check if the host is functioning correctly.

Regards,
Yanir Quinn


On Tue, Feb 21, 2017 at 3:04 PM, Sven Achtelik 
wrote:

> Hi,
>
>
>
> there is a VM running, but not that one.
>
>
>
> [root@ovirt-node02 log]#  vdsClient -s localhost list table
>
> 2e0e0da8-eaa5-44ee-8f11-f1297d149be3  14551  NAME
> Up   10.6.0.181
>
>
>
> I even tried that after restarting host 2 and at this point I’m sure there
> were no VMs running.
>
>
>
>
>
> *Von:* Fred Rolland [mailto:froll...@redhat.com]
> *Gesendet:* Dienstag, 21. Februar 2017 13:59
> *An:* Sven Achtelik 
> *Cc:* users 
> *Betreff:* Re: [ovirt-users] Mirgration issues
>
>
>
> I see the following in the source VDSM log :
>
> 2017-02-21 05:53:28,067 INFO  (migsrc/8733d4a6) [virt.vm]
> (vmId='8733d4a6-0844-4955-804f-6b919e93e076') starting migration to
> qemu+tls://ovirt-node02.mgmt.lan.company.lan/system with miguri tcp://
> 172.16.4.19 (migration:453)
> 2017-02-21 05:53:28,262 ERROR (migsrc/8733d4a6) [virt.vm]
> (vmId='8733d4a6-0844-4955-804f-6b919e93e076') operation failed: domain
> 'DATA_p' is already defined with uuid 8733d4a6-0844-4955-804f-6b919e93e076
> (migration:265)
>
> libvirtError: operation failed: domain 'DATA_p' is already defined with
> uuid 8733d4a6-0844-4955-804f-6b919e93e076
>
> Can you check on host 2 if you have any VM already running there ?
>
> You can use :virsh list
>
>
>
> On Tue, Feb 21, 2017 at 2:15 PM, Sven Achtelik 
> wrote:
>
> Hi All,
>
>
>
> I’m having issues with migrating a VM. I have a 3 Host cluster and the VM
> is able to migrate between host 1 and 3, but not to host 2.  I don’t know
> why and I tried figuring this out with the log files and had no luck. All
> other VMs migrate to the host 2 without any issues.
>
>
>
> If you have some advice for me that would help a lot.
>
>
>
>
>
> Thank you,
>
>
>
> Sven
>
>
>
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] moving disk failed.. remained locked

2017-02-22 Thread Gianluca Cecchi
On Wed, Feb 22, 2017 at 10:59 AM, Nir Soffer  wrote:

>
>
> Lesson, use only storage without problems ;-)
>

hopefully... ;-)


> >> Can you share the output of:
> >>
> >> sanlock client renewal -s 900b1853-e192-4661-a0f9-7c7c396f6f49
> >
> >
> > No, the storage domain has been removed
>
> Next time when you have storage issues, please remember to grab
> the output of this command.
>
> Nir
>



For example, on a currently active storage domain I get:

[g.cecchi@ovmsrv05 ~]$ sudo sanlock client renewal -s
922b5269-ab56-4c4d-838f-49d33427e2ab
timestamp=1207533 read_ms=2 write_ms=0 next_timeouts=0 next_errors=0
timestamp=1207554 read_ms=2 write_ms=0 next_timeouts=0 next_errors=0
...
timestamp=1211163 read_ms=2 write_ms=0 next_timeouts=0 next_errors=0
timestamp=1211183 read_ms=2 write_ms=0 next_timeouts=0 next_errors=0
timestamp=1211204 read_ms=2 write_ms=0 next_timeouts=0 next_errors=0

How do I translate this output above? What would be the difference in case
of problems?
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] moving disk failed.. remained locked

2017-02-22 Thread Nir Soffer
On Wed, Feb 22, 2017 at 11:45 AM, Gianluca Cecchi
 wrote:
> On Wed, Feb 22, 2017 at 9:56 AM, Nir Soffer  wrote:
>>
>>
>>
>> Gianluca, what is domain 900b1853-e192-4661-a0f9-7c7c396f6f49?
>>
>> is this the domain you are migrating to in the same time?
>
>
> That was the id of the storage domain created on the LUN with problems at
> storage array level.

This explains sanlock issues with this domain.

> It only contained one disk of a VM. I was able to previously move other 2
> disks I had on it to another storage domain
>
> The disk was a data disk of a VM; its system disk was on another storage
> domain without problems
>
> The order of my operations yesterday was:
> - try move disk to another storge domain-> failure in auto snapshot
> - try snapshot of VM selecting both disks --> failure

The first step in moving disk to another domain when the vm is online,
is creating a snapshot on old storage.

Then we start mirroring process of the active (empty) snapshot
to the destination storage domain.

Then we copy the rest of the chain (readonly) to the destination
storage domain.

Finally we switch the active layer to the snapshot on the destination
storage domain, and delete the old chain on the source domain.

If the source storage is broken you have to stop the vm to move the
disk. This is can also fail if we cannot read the disk from this storage.

Lesson, use only storage without problems ;-)

> - try snapshot of VM selecting only the system disk (the good one) --> ok
> and also snapshot deletion ok
> - try snapshot of VM selecting only the data disk --> failure
> - hot add disk (in a good storage domain) to the VM --> OK
> - try pvmove at VM OS level from problematic disk to new disk --> failure:
> VM paused at 47% of pvmove and not able to continue
> - power off VM --> OK
> - remove disk from VM and delete --> OK
>
> Only at this point, with storage domain empty, I started to work on storage
> domain itself, putting it to maintenance and removing it without problems;
> and then the related LUN removal at host level with the notes described in
> other thread
>
>>
>>
>> Can you share the output of:
>>
>> sanlock client renewal -s 900b1853-e192-4661-a0f9-7c7c396f6f49
>
>
> No, the storage domain has been removed

Next time when you have storage issues, please remember to grab
the output of this command.

Nir
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Fwd: Need documentation

2017-02-22 Thread Александр Пивушков



 Пересылаемое сообщение 
От кого: Александр Пивушков 
Кому: users@ovirt.org
Дата: Вторник, 21 февраля 2017, 12:29 +03:00
Тема: Need documentation

Hello, tell me where to find the official documentation for installing and 
configuring availability and fault tolerance oVirt?
Especially interested in the possibility of physical disconnection from the 
host power supply . fencing.
We plan to buy equipment for the switch-off - PDU.
Where can I find a list of equipment (PDU),   with which oVirt works?


Александр Пивушков
--

Александр Пивушков
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] moving disk failed.. remained locked

2017-02-22 Thread Gianluca Cecchi
On Wed, Feb 22, 2017 at 9:56 AM, Nir Soffer  wrote:

>
>
> Gianluca, what is domain 900b1853-e192-4661-a0f9-7c7c396f6f49?
>
> is this the domain you are migrating to in the same time?
>

That was the id of the storage domain created on the LUN with problems at
storage array level.
It only contained one disk of a VM. I was able to previously move other 2
disks I had on it to another storage domain

The disk was a data disk of a VM; its system disk was on another storage
domain without problems

The order of my operations yesterday was:
- try move disk to another storge domain-> failure in auto snapshot
- try snapshot of VM selecting both disks --> failure
- try snapshot of VM selecting only the system disk (the good one) --> ok
and also snapshot deletion ok
- try snapshot of VM selecting only the data disk --> failure
- hot add disk (in a good storage domain) to the VM --> OK
- try pvmove at VM OS level from problematic disk to new disk --> failure:
VM paused at 47% of pvmove and not able to continue
- power off VM --> OK
- remove disk from VM and delete --> OK

Only at this point, with storage domain empty, I started to work on storage
domain itself, putting it to maintenance and removing it without problems;
and then the related LUN removal at host level with the notes described in
other thread.


>
> Can you share the output of:
>
> sanlock client renewal -s 900b1853-e192-4661-a0f9-7c7c396f6f49
>

No, the storage domain has been removed

Gianluca
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Removing all the Users Permission on Data Center

2017-02-22 Thread Manuel Luis Aznar
Hello,

Sorry but there is a errata in this piece:

u = kvm.users.get(id=userid)
for perm in u.permissions.list():
   udc = perm.get_data_center()
  if udc:
 globaldc = kvm.datacenters.get(id=udc.get_id())
 if globaldc.get_name() == dc:
perm.delete()

The perm.delete() call is not correct and you need to put the following two
lines:

   permission_service = permissions_service.permission_service(id=
data_center_permission.id)
   permission_service.remove()

Of course you need:

 permissions_service = system_service.permissions_service()

Thanks and sorry for the errata
Manuel
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Questions about network implementation

2017-02-22 Thread Edward Haas
Hello Ben,

It will help if you provide a diagram of what you would like to achieve,
it may help understand your need better.

With oVirt the VM networks are implemented on the hosts as bridges
with one port bind to a nic/bond/vlan for external access.
The connectivity itself on the switch is a given.

Recently work is under way to use external networks using OVN.
OVN uses OVS to create bridges on the hosts and connects them
using tunnels for interconnectivity.
See https://www.ovirt.org/blog/2016/11/ovirt-provider-ovn for details.

Thanks,
Edy.


On Tue, Feb 21, 2017 at 7:31 PM, Ben Kincaid  wrote:

> Hi List,
>
> I am currently in the process of evaluating several KVM management
> packages in order to replace some old Vsphere.
>
> I have been running up test labs with the following:
>
> * oVirt
> * OpenNebula
> * Cloudstack
> * Proxmox
> * Ganeti
>
> While I am more than happy to script up a few CLI tools and interact
> with Libvirt directly, that isn’t going to work for most end users of
> this infrastructure.
>
> I am running in to a bit of a challenge around the networking aspect
> of what I am trying to create here, since most of these management
> tools assume you are building an all-in-one box deployment, or you
> manage your own network infrastructure in your datacenter.
>
> We have 6 soon to be 8 boxes in a remote DC, and we don’t have any
> flexibility around the networking.
>
> Each box has one NIC, with a public IP, and we have a couple of /27
> address ranges to use.  We need to specify which port on the switch
> that /27 will be going to.
>
> So what I would like to do is build a private address network across
> all 8 boxes, either using something like PeerVPN / Tinc or the new VPN
> features built in to Vswitch, I would then put that on vmbr1 for
> example.
>
> Vmbr0 would be the public internet port on each box, and on one box I
> would run up an instance of PFSense or similar in a VM which I would
> route the /27’s to and then forward them to IPs on the internal IP
> pool.
>
> Does oVirt have the ability to manage a network structure such as
> this, and if so how might I go about it? I've had a read through the
> docs but couldn't work out how I'd build this type of structure in
> oVirt.
>
> Thanks for such a great peace of software, and I appreciate any
> suggestions or advice anyone can offer on this issue.
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Removing all the Users Permission on Data Center

2017-02-22 Thread Manuel Luis Aznar
Hello there,

I need to remove the specific Users Permission a user have in a DataCenter.

I manage to do it in this way:

username = "..."
dc = "..."

system_service = connection.system_service()
users_service = system_service.users_service()
data_centers_service = system_service.data_centers_service()

# Getting the User object
for user in users_service.list():
if username in user.user_name:
break

# Getting the DataCenter object
for data_center in data_centers_service.list():
if dc in data_center.name:
break

# Getting DataCenter service and its Permission Service
data_center_service = data_centers_service.data_center_service(id=
data_center.id)
data_center_permissions_service = data_center_service.permissions_service()

# Getting the Permission for the User in the DataCenter
for data_center_permission in data_center_permissions_service.list():
data_center_permission_user = data_center_permission.user
if data_center_permission_user:
if data_center_permission_user.id == user.id:
data_center_permission_user.remove()

As you can see I can do it, but, I am using the permissions of the
DataCenter and this list could be very long.

In the old ovirtsdk (version 3) this was done by the following:

u = kvm.users.get(id=userid)
for perm in u.permissions.list():
   udc = perm.get_data_center()
  if udc:
 globaldc = kvm.datacenters.get(id=udc.get_id())
 if globaldc.get_name() == dc:
perm.delete()

that last piece of code iterate by the user permission list and delete the
specific Data Center permission. I have been trying this doing the
following:

user_service = users_service.user_service(id=user.id)

user_permissions_service = user_service.permissions_service()

list = user_permissions_service.list()

Is that last variable, list: the permissions list for the specified user, I
ask that because if I print the size of the list for an specific user, the
number I get is not correct...

Thanks for all in advance to all
Manuel
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] oVirt 4.1 - the VDSM host was found in a failed state - iptables related ?

2017-02-22 Thread Joseph Kelly
Hello,

We've just deployed the ovirt-4.1 hosted_engine and I have noticed 2 things 
that it would be great if people can comment on:

1) It seems that iptables is a hard requirement:

> iptables was detected on your computer, do you wish setup to configure it? 
> (Yes, No)[Yes]: no

[ INFO  ] Connecting to the Engine
[ INFO  ] Waiting for the host to become operational in the engine. This may 
take several minutes...
[ ERROR ] The VDSM host was found in a failed state. Please check engine and 
bootstrap installation logs.
[ ERROR ] Unable to add example-host.tradingscreen.com to the manager
[ INFO  ] Saving hosted-engine configuration on the shared storage domain
[ INFO  ] Shutting down the engine VM

Here is a bug report from 3.5.0 with this exact error:

Bug 1167074 - 
[hosted-engine] Deployment fails with "The VDSM host was found in a failed 
state. Please check engine and bootstrap installation logs."

Previously, iptables-services was not a required package during hosted-engine 
deployment, so hosted-engine was unable to configure iptables. Now, 
iptables-services is
required, so hosted-engine can configure iptables correctly.

2) We have to add new storage to the default domain before the hosted_engine 
storage domain is imported. So is this happening because of the iptables issue 
in # 1 above ?
Or is adding this new storage to the default domain also a hard requirement ? 
Ideally we'd like the hosted_storage to be the Data Master and not have to 
create stub storage for this.

Here is a bug report from 3.6.1 with this exact error:

Bug 1293928  Cause: 
importing the nfs domain could interleave with the engine restart as a result 
of installation.

Consequence: the hosted engine domain won't be imported or activated on the 
engine setup, and the engine vm wouldn't be imported and displayed in the 
webadmin or REST.

Fix: Exclusively lock the import attempt so interleaving of 2 imports is 
impossible and prevent starting the import if there is no DATA domain and an 
ACTIVE DC.
Note: this means that in order to see the engine VM a user must first import a 
DATA domain and activate the DC.

Result: the hosted engine domain is imported and the hosted engine VM.

Thanks,
Joe.

--


J. Kelly
Infrastructure Engineer
TradingScreen
www.tradingscreen.com

Follow TradingScreen on Twitter, Facebook, or our blog, Trading Smarter

This message is intended only for the recipient(s) named above and may
contain confidential information. If you are not an intended recipient,
you should not review, distribute or copy this message. Please notify
the sender immediately by e-mail if you have received this message in
error and delete it from your system.



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] moving disk failed.. remained locked

2017-02-22 Thread Nir Soffer
On Wed, Feb 22, 2017 at 10:32 AM, Nir Soffer  wrote:
> On Wed, Feb 22, 2017 at 10:31 AM, Nir Soffer  wrote:
>> On Mon, Feb 20, 2017 at 4:49 PM, Gianluca Cecchi
>>  wrote:
>>> Hello,
>>> I'm trying to move a disk from one storage domain A to another B in oVirt
>>> 4.1
>>> The corresponding VM is powered on in the mean time
>>>
>>> When executing the action, there was already in place a disk move from
>>> storage domain C to A (this move was for a disk of a powered off VM and then
>>> completed ok)
>>> I got this in events of webadmin gui for the failed move A -> B:
>>>
>>> Feb 20, 2017 2:42:00 PM Failed to complete snapshot 'Auto-generated for Live
>>> Storage Migration' creation for VM 'dbatest6'.
>>> Feb 20, 2017 2:40:51 PM VDSM ovmsrv06 command HSMGetAllTasksStatusesVDS
>>> failed: Error creating a new volume
>>> Feb 20, 2017 2:40:51 PM Snapshot 'Auto-generated for Live Storage Migration'
>>> creation for VM 'dbatest6' was initiated by admin@internal-authz.
>>>
>>>
>>> And in relevant vdsm.log of referred host ovmsrv06
>>>
>>> 2017-02-20 14:41:44,899 ERROR (tasks/8) [storage.Volume] Unexpected error
>>> (volume:1087)
>>> Traceback (most recent call last):
>>>   File "/usr/share/vdsm/storage/volume.py", line 1081, in create
>>> cls.newVolumeLease(metaId, sdUUID, volUUID)
>>>   File "/usr/share/vdsm/storage/volume.py", line 1361, in newVolumeLease
>>> return cls.manifestClass.newVolumeLease(metaId, sdUUID, volUUID)
>>>   File "/usr/share/vdsm/storage/blockVolume.py", line 310, in newVolumeLease
>>> sanlock.init_resource(sdUUID, volUUID, [(leasePath, leaseOffset)])
>>> SanlockException: (-202, 'Sanlock resource init failure', 'Sanlock
>>> exception')
>>
>> This means that sanlock could not initialize a lease in the new volume 
>> created
>> for the snapshot.

David, looking in sanlock log - we don't see any error matching this failure,
but the domain 900b1853-e192-4661-a0f9-7c7c396f6f49 has renewal errors.

I guess because sanlock_init_resource is implemented in the library,
not going trough sanlock deamon?

2017-02-20 14:30:09+0100 1050804 [11738]: 900b1853 aio timeout RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 ioto 10 to_count 1
2017-02-20 14:30:09+0100 1050804 [11738]: s3 delta_renew read timeout
10 sec offset 0 /dev/900b1853-e192-4661-a0f9-7c7c396f6f49/ids
2017-02-20 14:30:09+0100 1050804 [11738]: s3 renewal error -202
delta_length 10 last_success 1050773
2017-02-20 14:30:11+0100 1050806 [11738]: 900b1853 aio collect RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 result 1048576:0 match
reap
2017-02-20 14:35:58+0100 1051153 [11738]: 900b1853 aio timeout RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 ioto 10 to_count 2
2017-02-20 14:35:58+0100 1051153 [11738]: s3 delta_renew read timeout
10 sec offset 0 /dev/900b1853-e192-4661-a0f9-7c7c396f6f49/ids
2017-02-20 14:35:58+0100 1051153 [11738]: s3 renewal error -202
delta_length 10 last_success 1051122
2017-02-20 14:36:01+0100 1051156 [11738]: 900b1853 aio collect RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 result 1048576:0 match
reap
2017-02-20 14:44:36+0100 1051671 [11738]: 900b1853 aio timeout RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 ioto 10 to_count 3
2017-02-20 14:44:36+0100 1051671 [11738]: s3 delta_renew read timeout
10 sec offset 0 /dev/900b1853-e192-4661-a0f9-7c7c396f6f49/ids
2017-02-20 14:44:36+0100 1051671 [11738]: s3 renewal error -202
delta_length 10 last_success 1051641
2017-02-20 14:44:37+0100 1051672 [11738]: 900b1853 aio collect RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 result 1048576:0 match
reap
2017-02-20 14:48:02+0100 1051877 [11738]: 900b1853 aio timeout RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 ioto 10 to_count 4
2017-02-20 14:48:02+0100 1051877 [11738]: s3 delta_renew read timeout
10 sec offset 0 /dev/900b1853-e192-4661-a0f9-7c7c396f6f49/ids
2017-02-20 14:48:02+0100 1051877 [11738]: s3 renewal error -202
delta_length 10 last_success 1051846
2017-02-20 14:48:02+0100 1051877 [11738]: 900b1853 aio collect RD
0x7f41d8c0:0x7f41d8d0:0x7f41e2afa000 result 1048576:0 match
reap

Gianluca, what is domain 900b1853-e192-4661-a0f9-7c7c396f6f49?

is this the domain you are migrating to in the same time?

Can you share the output of:

sanlock client renewal -s 900b1853-e192-4661-a0f9-7c7c396f6f49

>>> 2017-02-20 14:41:44,900 ERROR (tasks/8) [storage.TaskManager.Task]
>>> (Task='d694b892-b078-4d86-a035-427ee4fb3b13') Unexpected error (task:870)
>>> Traceback (most recent call last):
>>>   File "/usr/share/vdsm/storage/task.py", line 877, in _run
>>> return fn(*args, **kargs)
>>>   File "/usr/share/vdsm/storage/task.py", line 333, in run
>>> return self.cmd(*self.argslist, **self.argsdict)
>>>   File "/usr/lib/python2.7/site-packages/vdsm/storage/securable.py", line
>>> 79, in wrapper
>>> return method(self, *args, **kwargs)
>>>   File "/usr/share/vdsm/storage/sp.py", line 1929, in createVolume
>>> initialSize=initialSize)
>>>   File "/usr/share/vdsm/storage/sd.py", line 762, in creat

Re: [ovirt-users] moving disk failed.. remained locked

2017-02-22 Thread Gianluca Cecchi
On Wed, Feb 22, 2017 at 9:32 AM, Nir Soffer  wrote:

> On Wed, Feb 22, 2017 at 10:31 AM, Nir Soffer  wrote:
>
> >
> > This means that sanlock could not initialize a lease in the new volume
> created
> > for the snapshot.
> >
> > Can you attach sanlock.log?
>
> Found it in your next message
>
>
OK.
Just to recap what happened from a physical point of view:

- apparently I had an array of disks with no more spare disks and on this
array was the LUN composing the disk storage domain.
So I was in involved in moving disks of the impacted storage domain and
then removal of storage domain itself, so that we can remove the logical
array on storage
This is a test storage system without support so at the moment I had no
more spare disks on it

- actually there was another disk problem with the array, generating loss
of data because of no more spare available at that time

- No evidence of error at VM OS level and at storage domain level

- But probably the 2 operations:
1) move disk
2) create snapshot of the VM containing the disk
could not complete due to this low level problem

It should be nice to find an evidence to this. Storage domain didn't go
offline BTW

- I got confirmation of the loss of data this way:
The original disk of the VM, inside the VM, was a PV of a VG
I added a disk (on another storage domain) to the VM, made it a PV and
added to the original VG
Tried pvmove from source disk to new disk, but it reached about 47% and
then stopped/failed, pausing the VM.
I could start again the VM but as soon as the pvmove continued, the VM came
back to paused state.
So I powered off the VM and was able to detach/delete the corrupted disk
and then remove the storage domain (see other thread opened yesterday)

I then managed to recover the now corrupted VG and restore from backup the
data contained in original fs.

So the original problem was low level error of storage.
If can be of help to narrow down oVirt behavior in this case scenario I can
provide further logs from VM OS or from hosts/engine.
Let me know.

Some questions:
- how is it managed the reaction of putting VM in paused mode due to I/O
error as in this case? Can I in some way manage to keep VM on a ndlet it
generate errors as in real physical server or not?
- Why I didn't get any message at storage domain level but only at VM disk
level?

Thanks for the given help
Gianluca
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] best way to remove SAN lun

2017-02-22 Thread Yaniv Kaul
On Wed, Feb 22, 2017 at 9:27 AM Nir Soffer  wrote:

> On Wed, Feb 22, 2017 at 9:03 AM, Nelson Lameiras
>  wrote:
> > Hello,
> >
> > Not sure it is the same issue, but we have had a "major" issue recently
> in our production system when removing a ISCSI volume from oVirt, and then
> removing it from SAN.
>
> What version? OS version?
>
> The order must be:
>
> 1. remove the LUN from storage domain
> will be available in next 4.1 release. in older versions you have
> to remove the storage domain
>
> 2. unzone the LUN on the server
>
> 3. remove the multipath devices and the paths on the nodes
>
> > The issue being that each host was still trying to access regularly to
> the SAN volume in spite of not being completely removed from oVirt.
>
> What do you mean by "not being completely removed"?
>
> Who was accessing the volume?
>
> > This led to an massive increase of error logs, which filled completely
> /var/log partition,
>
> Which log was full with errors?
>
> > which snowballed into crashing vdsm and other nasty consequences.
>
> You should have big enough /var/log to avoid such issues.
>

- Log rotation should be set better not to consume excessive amounts of
space.
I'm seeing /etc/vdsm/logrotate/vdsm - not sure why it's not under
/etc/logrotate.d . Looking at the file, seems like there's a 15M limit and
100 files, which translates to 1.5GB - and it is supposed to be compressed
(not sure XZ is a good choice - it's very CPU intensive).

Others (Gluster?) do not seem to have a size limit, just weekly. Need to
look at other components as well.
- At least on ovirt-node, we'd like to separate some directories to
different partitions. So for example core dumps (which should be limited as
well) on /var/core do not fill the same partition as /var/log is and thus
render the host unusable.
And again, looking at file, we have a 'size 0' on /var/log/core/*.dump -
and 'rotate 1' - not sure what it means - but it should not be in
/var/log/core, but /var/core, I reckon.
Y.


> >
> > Anyway, the solution was to manually logout from SAN (in each host) with
> iscsiadm and manually remove iscsi targets (again in each host). It was not
> difficult once the problem was found because currently we only have 3 hosts
> in this cluster, but I'm wondering what would happen if we had hundreds of
> hosts ?
> >
> > Maybe I'm being naive but shouldn't this be "oVirt job" ? Is there a RFE
> still waiting to be included on this subject or should I write one ?
>
> We have RFE for this here:
> https://bugzilla.redhat.com/1310330
>
> But you must understand that ovirt does not control your storage server,
> you are responsible to add devices on the storage server, and remove
> them. We are only consuming the devices.
>
> Even we we provide a way to remove devices on all hosts, you will have
> to remove the device on the storage server before removing it from
> hosts. If not, ovirt will find the removed devices again in the next
> scsi rescan,
> and we do lot of these to support automatic discovery of new devices
> or resized devices.
>
> Nir
>
> >
> > cordialement, regards,
> >
> >
> > Nelson LAMEIRAS
> > Ingénieur Systèmes et Réseaux / Systems and Networks engineer
> > Tel: +33 5 32 09 09 70 <+33%205%2032%2009%2009%2070>
> > nelson.lamei...@lyra-network.com
> >
> > www.lyra-network.com | www.payzen.eu
> >
> >
> >
> >
> >
> > Lyra Network, 109 rue de l'innovation, 31670 Labège, FRANCE
> >
> > - Original Message -
> > From: "Nir Soffer" 
> > To: "Gianluca Cecchi" , "Adam Litke" <
> ali...@redhat.com>
> > Cc: "users" 
> > Sent: Tuesday, February 21, 2017 6:32:18 PM
> > Subject: Re: [ovirt-users] best way to remove SAN lun
> >
> > On Tue, Feb 21, 2017 at 7:25 PM, Gianluca Cecchi
> >  wrote:
> >> On Tue, Feb 21, 2017 at 6:10 PM, Nir Soffer  wrote:
> >>>
> >>> This is caused by active lvs on the remove storage domains that were
> not
> >>> deactivated during the removal. This is a very old known issue.
> >>>
> >>> You have remove the remove device mapper entries - you can see the
> devices
> >>> using:
> >>>
> >>> dmsetup status
> >>>
> >>> Then you can remove the mapping using:
> >>>
> >>> dmsetup remove device-name
> >>>
> >>> Once you removed the stale lvs, you will be able to remove the
> multipath
> >>> device and the underlying paths, and lvm will not complain about read
> >>> errors.
> >>>
> >>> Nir
> >>
> >>
> >> OK Nir, thanks for advising.
> >>
> >> So what I run with success on the 2 hosts
> >>
> >> [root@ovmsrv05 vdsm]# for dev in $(dmsetup status | grep
> >> 900b1853--e192--4661--a0f9--7c7c396f6f49 | cut -d ":" -f 1)
> >> do
> >>dmsetup remove $dev
> >> done
> >> [root@ovmsrv05 vdsm]#
> >>
> >> and now I can run
> >>
> >> [root@ovmsrv05 vdsm]# multipath -f 3600a0b8000299902cd3c5501458f
> >> [root@ovmsrv05 vdsm]#
> >>
> >> Also, with related names depending on host,
> >>
> >> previous maps to single devices were for example in ovmsrv05:
> >>
> >> 3600a0b8000299902cd3c5501458f dm-4 IBM ,1814  FA

Re: [ovirt-users] moving disk failed.. remained locked

2017-02-22 Thread Nir Soffer
On Wed, Feb 22, 2017 at 10:31 AM, Nir Soffer  wrote:
> On Mon, Feb 20, 2017 at 4:49 PM, Gianluca Cecchi
>  wrote:
>> Hello,
>> I'm trying to move a disk from one storage domain A to another B in oVirt
>> 4.1
>> The corresponding VM is powered on in the mean time
>>
>> When executing the action, there was already in place a disk move from
>> storage domain C to A (this move was for a disk of a powered off VM and then
>> completed ok)
>> I got this in events of webadmin gui for the failed move A -> B:
>>
>> Feb 20, 2017 2:42:00 PM Failed to complete snapshot 'Auto-generated for Live
>> Storage Migration' creation for VM 'dbatest6'.
>> Feb 20, 2017 2:40:51 PM VDSM ovmsrv06 command HSMGetAllTasksStatusesVDS
>> failed: Error creating a new volume
>> Feb 20, 2017 2:40:51 PM Snapshot 'Auto-generated for Live Storage Migration'
>> creation for VM 'dbatest6' was initiated by admin@internal-authz.
>>
>>
>> And in relevant vdsm.log of referred host ovmsrv06
>>
>> 2017-02-20 14:41:44,899 ERROR (tasks/8) [storage.Volume] Unexpected error
>> (volume:1087)
>> Traceback (most recent call last):
>>   File "/usr/share/vdsm/storage/volume.py", line 1081, in create
>> cls.newVolumeLease(metaId, sdUUID, volUUID)
>>   File "/usr/share/vdsm/storage/volume.py", line 1361, in newVolumeLease
>> return cls.manifestClass.newVolumeLease(metaId, sdUUID, volUUID)
>>   File "/usr/share/vdsm/storage/blockVolume.py", line 310, in newVolumeLease
>> sanlock.init_resource(sdUUID, volUUID, [(leasePath, leaseOffset)])
>> SanlockException: (-202, 'Sanlock resource init failure', 'Sanlock
>> exception')
>
> This means that sanlock could not initialize a lease in the new volume created
> for the snapshot.
>
> Can you attach sanlock.log?

Found it in your next message

>
>> 2017-02-20 14:41:44,900 ERROR (tasks/8) [storage.TaskManager.Task]
>> (Task='d694b892-b078-4d86-a035-427ee4fb3b13') Unexpected error (task:870)
>> Traceback (most recent call last):
>>   File "/usr/share/vdsm/storage/task.py", line 877, in _run
>> return fn(*args, **kargs)
>>   File "/usr/share/vdsm/storage/task.py", line 333, in run
>> return self.cmd(*self.argslist, **self.argsdict)
>>   File "/usr/lib/python2.7/site-packages/vdsm/storage/securable.py", line
>> 79, in wrapper
>> return method(self, *args, **kwargs)
>>   File "/usr/share/vdsm/storage/sp.py", line 1929, in createVolume
>> initialSize=initialSize)
>>   File "/usr/share/vdsm/storage/sd.py", line 762, in createVolume
>> initialSize=initialSize)
>>   File "/usr/share/vdsm/storage/volume.py", line 1089, in create
>> (volUUID, e))
>> VolumeCreationError: Error creating a new volume: (u"Volume creation
>> d0d938bd-1479-49cb-93fb-85b6a32d6cb4 failed: (-202, 'Sanlock resource init
>> failure', 'Sanlock exception')",)
>> 2017-02-20 14:41:44,941 INFO  (tasks/8) [storage.Volume] Metadata rollback
>> for sdUUID=900b1853-e192-4661-a0f9-7c7c396f6f49 offs=8 (blockVolume:448)
>>
>>
>> Was the error generated due to the other migration still in progress?
>> Is there a limit of concurrent migrations from/to a particular storage
>> domain?
>
> No, maybe your network was overloaded by the concurrent migrations?
>
>>
>> Now I would like to retry, but I see that the disk is in state locked with
>> hourglass.
>> The autogenerated snapshot of the failed action was apparently removed with
>> success as I don't see it.
>>
>> How can I proceed to move the disk?
>>
>> Thanks in advance,
>> Gianluca
>>
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/users
>>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] moving disk failed.. remained locked

2017-02-22 Thread Nir Soffer
On Mon, Feb 20, 2017 at 4:49 PM, Gianluca Cecchi
 wrote:
> Hello,
> I'm trying to move a disk from one storage domain A to another B in oVirt
> 4.1
> The corresponding VM is powered on in the mean time
>
> When executing the action, there was already in place a disk move from
> storage domain C to A (this move was for a disk of a powered off VM and then
> completed ok)
> I got this in events of webadmin gui for the failed move A -> B:
>
> Feb 20, 2017 2:42:00 PM Failed to complete snapshot 'Auto-generated for Live
> Storage Migration' creation for VM 'dbatest6'.
> Feb 20, 2017 2:40:51 PM VDSM ovmsrv06 command HSMGetAllTasksStatusesVDS
> failed: Error creating a new volume
> Feb 20, 2017 2:40:51 PM Snapshot 'Auto-generated for Live Storage Migration'
> creation for VM 'dbatest6' was initiated by admin@internal-authz.
>
>
> And in relevant vdsm.log of referred host ovmsrv06
>
> 2017-02-20 14:41:44,899 ERROR (tasks/8) [storage.Volume] Unexpected error
> (volume:1087)
> Traceback (most recent call last):
>   File "/usr/share/vdsm/storage/volume.py", line 1081, in create
> cls.newVolumeLease(metaId, sdUUID, volUUID)
>   File "/usr/share/vdsm/storage/volume.py", line 1361, in newVolumeLease
> return cls.manifestClass.newVolumeLease(metaId, sdUUID, volUUID)
>   File "/usr/share/vdsm/storage/blockVolume.py", line 310, in newVolumeLease
> sanlock.init_resource(sdUUID, volUUID, [(leasePath, leaseOffset)])
> SanlockException: (-202, 'Sanlock resource init failure', 'Sanlock
> exception')

This means that sanlock could not initialize a lease in the new volume created
for the snapshot.

Can you attach sanlock.log?

> 2017-02-20 14:41:44,900 ERROR (tasks/8) [storage.TaskManager.Task]
> (Task='d694b892-b078-4d86-a035-427ee4fb3b13') Unexpected error (task:870)
> Traceback (most recent call last):
>   File "/usr/share/vdsm/storage/task.py", line 877, in _run
> return fn(*args, **kargs)
>   File "/usr/share/vdsm/storage/task.py", line 333, in run
> return self.cmd(*self.argslist, **self.argsdict)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/securable.py", line
> 79, in wrapper
> return method(self, *args, **kwargs)
>   File "/usr/share/vdsm/storage/sp.py", line 1929, in createVolume
> initialSize=initialSize)
>   File "/usr/share/vdsm/storage/sd.py", line 762, in createVolume
> initialSize=initialSize)
>   File "/usr/share/vdsm/storage/volume.py", line 1089, in create
> (volUUID, e))
> VolumeCreationError: Error creating a new volume: (u"Volume creation
> d0d938bd-1479-49cb-93fb-85b6a32d6cb4 failed: (-202, 'Sanlock resource init
> failure', 'Sanlock exception')",)
> 2017-02-20 14:41:44,941 INFO  (tasks/8) [storage.Volume] Metadata rollback
> for sdUUID=900b1853-e192-4661-a0f9-7c7c396f6f49 offs=8 (blockVolume:448)
>
>
> Was the error generated due to the other migration still in progress?
> Is there a limit of concurrent migrations from/to a particular storage
> domain?

No, maybe your network was overloaded by the concurrent migrations?

>
> Now I would like to retry, but I see that the disk is in state locked with
> hourglass.
> The autogenerated snapshot of the failed action was apparently removed with
> success as I don't see it.
>
> How can I proceed to move the disk?
>
> Thanks in advance,
> Gianluca
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] ovirt network setup hangs

2017-02-22 Thread Yedidyah Bar David
On Wed, Feb 22, 2017 at 2:41 AM, Bill James  wrote:
> We recently upgraded our QA ovirt installation from ovirt3.6 to 4.0.3.

Why not latest 4.0 or 4.1?

> All seems to work fine except when I try to setup network configuration.

There were several changes/fixes in setup network since 4.0.3.

>
> Even if I take an already configured host and change nothing when I select
> "Setup Host Networks" and press "ok" the window just hangs (spinning wheel)
> and task never finishes.
> I can Delete the job and restart ovirt-engine and all is clean, till I try
> to setup a network, on any host, again.
>
> I haven't found any log entries that seem to apply, but here's the start of
> the task:
>
> 2017-02-21 16:15:35,557 INFO
> [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> (default
> task-40) [77a44650] START, HostSetupNetworksVDSCommand(HostName =
> webdev3.dev.j2noc.com,
> HostSetupNetworksVdsCommandParameters:{runAsync='true',
> hostId='85f43ad0-bba8-46af-97db-062651d23a84',
> vds='Host[webdev3.dev.j2noc.com,85f43ad0-bba8-46af-97db-062651d23a84]',
> rollbackOnFailure='true', connectivityTimeout='120',
> networks='[HostNetwork:{defaultRoute='false', bonding='false',
> networkName='V102_dev', nicName='enp4s0f0', vlan='482', mtu='0',
>  vmNetwork='true', stp='false', properties='[]', ipv4BootProtocol='NONE',
> ipv4Address='null', ipv4Netmask='null
> ', ipv4Gateway='null', ipv6BootProtocol='NONE', ipv6Address='null',
> ipv6Prefix='null', ipv6Gateway='null', swit
> chType='LEGACY'},
>
>
> Where would I look for more info?

Usually on the host, in /var/log/vdsm .

>
> Another error that is probably related is when I open Setup Host Network and
> click on the edit pencil I get an error: (ui.log)
>
> 2017-02-21 16:33:09,486 ERROR
> [org.ovirt.engine.ui.frontend.server.gwt.OvirtRemoteLoggingService] (default
> task-1) [] Permutation name: EBD94055CA35ABB08F35A7613B98C193
> 2017-02-21 16:33:09,486 ERROR
> [org.ovirt.engine.ui.frontend.server.gwt.OvirtRemoteLoggingService] (default
> task-1) [] Uncaught exception:
> com.google.gwt.event.shared.UmbrellaException: Exception caught: (TypeError)
>  __gwt$exception: : Cannot read property 'c' of null
> at Unknown.Ev(webadmin-0.js@25078)
> at Unknown.Mv(webadmin-0.js@41)
> at Unknown.X7(webadmin-0.js@19)
> at Unknown.$7(webadmin-0.js@19)
> at Unknown.i7(webadmin-0.js@117)
> ...
> Caused by: com.google.gwt.core.client.JavaScriptException: (TypeError)
>  __gwt$exception: : Cannot read property 'c' of null

No idea, but I think you can find out more by de-obfuscating the log [1].
If/when you do, please make sure to install debuginfo packages of the exact
same version that you currently have.

But I'd personally not spend time debugging this before upgrading to latest
4.0 at least.

[1] 
https://www.ovirt.org/develop/developer-guide/engine/engine-debug-obfuscated-ui/

>
>
> Did I miss something in the upgrade?

No idea, did you run engine-setup and it finished successfully?

Best,

>
>
> ovirt-engine-4.0.3-1.el7.centos.noarch
> vdsm-4.18.11-1.el7.centos.x86_64
>
> Thanks.
>
> --
>
> 'ovirt-engine' and component 'Frontend.WebAdmin'
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users



-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Troubles after resizing the iscsi storage.

2017-02-22 Thread Fred Rolland
Here is the feature page :
http://www.ovirt.org/develop/release-management/features/storage/lun-resize/

On Tue, Feb 21, 2017 at 6:32 PM, Nir Soffer  wrote:

> On Tue, Feb 21, 2017 at 5:57 PM, Gianluca Cecchi
>  wrote:
> > On Tue, Feb 21, 2017 at 4:46 PM, Nir Soffer  wrote:
> >>
> >>
> >> The resize button should be in the "Additional size" column when a LUN
> >> can be resized.
> >>
> >> Vdsm reports the size of each LUN and the size of the PV. If the LUN
> >> and the PV can be resized you should see the button.
> >>
> >> Nir
> >>
> >
> > Perhaps he can select the storage domain in storage tab and "Scan Disks"
> to
> > induce an iscsi rescan on hosts and then in manage domain see this
> > additional column?
>
> I don't know what "scan disks", does, but it is not required.
>
> When you open the "new/manage domain" dialog we fetch the device
> list from vdsm (getDeviceList) and this perform all the steps needed to
> detect resized devices.
>
> Nir
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users