[ovirt-devel] Re: OST fails, cannot connect to repo

2019-07-18 Thread Sahina Bose
+Sac as it's the repo maintained by him, but I doubt if it is this repo
specific

On Thu, Jul 18, 2019 at 2:11 PM Vojtech Juranek  wrote:

> Hi,
> OST fails with
>
> 09:47:03
> https://copr-be.cloud.fedoraproject.org/results/sac/gluster-ansible/
> epel-7-x86_64/repodata/repomd.xml
> :
> [Errno 14] curl#7 - "Failed connect to
> copr-be.cloud.fedoraproject.org:443; Connection refused"
>
> see e.g. [1] for full log. Stared to fail this morning.
> Can anyone take a look and fix it?
>
> Thanks in advance.
> Vojta
>
> [1] https://jenkins.ovirt.org/job/ovirt-system-tests_manual/5132/console
> ___
> Devel mailing list -- devel@ovirt.org
> To unsubscribe send an email to devel-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/devel@ovirt.org/message/DXDNFFNWE3DC2IJTOH7CMXN7FD4PO4HU/
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/MOTEJVTFNYBDK32RG3SSNZH4S5W7B6G2/


[ovirt-devel] Re: [ovirt-users] Feature Request: oVirt to warn when VDO is getting full

2019-06-04 Thread Sahina Bose
On Tue, Jun 4, 2019 at 3:26 PM Strahil  wrote:

> Hello All,
>
> I would like to ask how many of you use VDO  before asking the oVirt Devs
> to assess a feature in oVirt for monitoring the size of the VDOs on
> hyperconverged systems.
>
> I think such warning, will save a lot of headaches, but it will not be
> usefull if most of the community is not using VDO at all.
>

We do have a feature that monitors the space usage of VDO volumes. If this
is not working as expected, can you raise a bug.
Is the storage domain linked to the gluster volume using the VDO devices?

Best Regards,
> Strahil Nikolov
> ___
> Users mailing list -- us...@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/us...@ovirt.org/message/R2VAHSMRPJQA5P6O5IAX5UZFRXRCIJWO/
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/X6ZSQU3NTNPQWBROGB77QHQNU5ELIN5H/


[ovirt-devel] Re: 4.2 HC suite - missing sanlock-lib 3.7.1 dependency

2019-04-13 Thread Sahina Bose
On Sat, Apr 13, 2019 at 11:41 AM Sahina Bose  wrote:
>
> The 4.2 HC suite is failing on missing dependencies
>
>  Error: Package: python2-sanlock-3.7.1-1.el7.x86_64 (alocalsync)
>Requires: sanlock-lib = 3.7.1-1.el7
>Installing: sanlock-lib-3.6.0-1.el7.x86_64 (alocalsync)
>sanlock-lib = 3.6.0-1.el7
>
> Any recent changes here?

Same issues with 4.3 and master as well

>
>
> On Sat, Apr 13, 2019 at 7:10 AM  wrote:
> >
> > Project: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-4.2/
> > Build: 
> > http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-4.2/852/
> > Build Number: 852
> > Build Status:  Still Failing
> > Triggered By: Started by timer
> >
> > -
> > Changes Since Last Success:
> > -
> > Changes for Build #850
> > [Dominik Holler] network-suite-4.2: Skip test to for vnic profile live 
> > update
> >
> >
> > Changes for Build #851
> > [Martin Perina] common: Really enable debug logging for engine
> >
> > [Simone Tiraboschi] Skip he-basic-role-remote-suite-4.2
> >
> > [Daniel Belenky] mock_configs: update rawhide releasever
> >
> > [Daniel Belenky] stdci_slaves: add timeouts to loader node
> >
> >
> > Changes for Build #852
> > [Ondra Machacek] ansible_suite: Update datacenter version
> >
> >
> >
> >
> > -
> > Failed Tests:
> > -
> > No tests ran.
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/MUKPH7EA3JQM6KBBL245ISBLSOU7YGPJ/


[ovirt-devel] 4.2 HC suite - missing sanlock-lib 3.7.1 dependency

2019-04-13 Thread Sahina Bose
The 4.2 HC suite is failing on missing dependencies

 Error: Package: python2-sanlock-3.7.1-1.el7.x86_64 (alocalsync)
   Requires: sanlock-lib = 3.7.1-1.el7
   Installing: sanlock-lib-3.6.0-1.el7.x86_64 (alocalsync)
   sanlock-lib = 3.6.0-1.el7

Any recent changes here?


On Sat, Apr 13, 2019 at 7:10 AM  wrote:
>
> Project: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-4.2/
> Build: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-4.2/852/
> Build Number: 852
> Build Status:  Still Failing
> Triggered By: Started by timer
>
> -
> Changes Since Last Success:
> -
> Changes for Build #850
> [Dominik Holler] network-suite-4.2: Skip test to for vnic profile live update
>
>
> Changes for Build #851
> [Martin Perina] common: Really enable debug logging for engine
>
> [Simone Tiraboschi] Skip he-basic-role-remote-suite-4.2
>
> [Daniel Belenky] mock_configs: update rawhide releasever
>
> [Daniel Belenky] stdci_slaves: add timeouts to loader node
>
>
> Changes for Build #852
> [Ondra Machacek] ansible_suite: Update datacenter version
>
>
>
>
> -
> Failed Tests:
> -
> No tests ran.
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/VRNPTAS5DLA6OUIYRCLLMH5ONLPD74X5/


[ovirt-devel] Re: OST failures (Update to use gluster 5)

2019-03-19 Thread Sahina Bose
On Wed, Mar 20, 2019 at 10:46 AM Galit Rosenthal 
wrote:

> HI Sahina,
>
> The glusterfs 5, should be updated as well on 4.3?
>
Also on 4.3

or only on master?
>
> Regards,
> Galit
>
> On Wed, Mar 20, 2019 at 6:24 AM Sahina Bose  wrote:
>
>>
>>
>> On Wed, Mar 20, 2019 at 12:31 AM Greg Sheremeta 
>> wrote:
>>
>>> Hey,
>>>
>>> Is someone looking at all the OST failures?
>>> he, hc, and network are all failing with the 409 error. Started after
>>> the merge of
>>> "Update to use gluster 5"
>>> I'm not positive that's the problem, but seems likely.
>>>
>>>
>> Do you mean https://gerrit.ovirt.org/#/c/98470/ ? this should have
>> affected only the hc-master suite
>>
>> Both hc and he suites are failing with this error..this is probably not
>> related to gluster, as he suite does not use gluster. Adding Simone and
>> devel ml for insights
>>
>>  "tar: ebaaaefb-5d8c-4059-a59e-b81cdb09e001.ovf: Not found in archive\ntar: 
>> Exiting with failure status due to previous errors\n20+0 records in\n20+0 
>> records out\n10240 bytes (10 kB) copied, 0.00144612 s, 7.1 MB/s",
>> "stderr_lines": ["tar: ebaaaefb-5d8c-4059-a59e-b81cdb09e001.ovf: Not found 
>> in archive"
>>
>>
>>
>>> Best wishes,
>>> Greg
>>>
>>> --
>>>
>>> GREG SHEREMETA
>>>
>>> SENIOR SOFTWARE ENGINEER - TEAM LEAD - RHV UX
>>>
>>> Red Hat NA
>>>
>>> <https://www.redhat.com/>
>>>
>>> gsher...@redhat.comIRC: gshereme
>>> <https://red.ht/sig>
>>>
>> ___
>> Devel mailing list -- devel@ovirt.org
>> To unsubscribe send an email to devel-le...@ovirt.org
>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
>> oVirt Code of Conduct:
>> https://www.ovirt.org/community/about/community-guidelines/
>> List Archives:
>> https://lists.ovirt.org/archives/list/devel@ovirt.org/message/HT4POQ4RV5NQK6VCSWWEOIRWMN52OGZD/
>>
>
>
> --
>
> GALIT ROSENTHAL
>
> SOFTWARE ENGINEER
>
> Red Hat
>
> <https://www.redhat.com/>
>
> ga...@gmail.comT: 972-9-7692230
> <https://red.ht/sig>
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/VGVXXWNQ25VO6GHDE7WMBMGAPVYK3NNJ/


[ovirt-devel] Re: OST failures (Update to use gluster 5)

2019-03-19 Thread Sahina Bose
On Wed, Mar 20, 2019 at 12:31 AM Greg Sheremeta  wrote:

> Hey,
>
> Is someone looking at all the OST failures?
> he, hc, and network are all failing with the 409 error. Started after the
> merge of
> "Update to use gluster 5"
> I'm not positive that's the problem, but seems likely.
>
>
Do you mean https://gerrit.ovirt.org/#/c/98470/ ? this should have affected
only the hc-master suite

Both hc and he suites are failing with this error..this is probably not
related to gluster, as he suite does not use gluster. Adding Simone and
devel ml for insights

 "tar: ebaaaefb-5d8c-4059-a59e-b81cdb09e001.ovf: Not found in
archive\ntar: Exiting with failure status due to previous errors\n20+0
records in\n20+0 records out\n10240 bytes (10 kB) copied, 0.00144612
s, 7.1 MB/s",
"stderr_lines": ["tar: ebaaaefb-5d8c-4059-a59e-b81cdb09e001.ovf: Not
found in archive"



> Best wishes,
> Greg
>
> --
>
> GREG SHEREMETA
>
> SENIOR SOFTWARE ENGINEER - TEAM LEAD - RHV UX
>
> Red Hat NA
>
> 
>
> gsher...@redhat.comIRC: gshereme
> 
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/HT4POQ4RV5NQK6VCSWWEOIRWMN52OGZD/


[ovirt-devel] Re: [TEST NEEDED] Please test oVirt with Gluster 5

2018-12-10 Thread Sahina Bose
+Kaustav

On Tue, 11 Dec 2018 at 12:55 PM, Sandro Bonazzola 
wrote:

> Hi Sahina,
> can your team please test oVirt with Gluster 5?
> Waiting for test results in order to merge
> https://gerrit.ovirt.org/#/c/95833/
>
> Thanks,
>
>
> --
>
> SANDRO BONAZZOLA
>
> MANAGER, SOFTWARE ENGINEERING, EMEA R RHV
>
> Red Hat EMEA 
>
> sbona...@redhat.com
> 
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/AFICYGXAZ6AHXKPMZUJUT5YMCMQMPVGF/


[ovirt-devel] Re: GlusterFS rebase for oVirt 4.3

2018-11-19 Thread Sahina Bose
On Tue, Nov 13, 2018 at 3:48 PM Niels de Vos  wrote:
>
> On Tue, Nov 13, 2018 at 10:01:15AM +0100, Sandro Bonazzola wrote:
> > According to Sahina we dropped dependency on gluster-gnfs and it's safe to
> > move on from the unsupported 3.12 version to a newer one 4.1 / 5.0.
> >
> > This is for getting an agreement on what we should require in oVirt 4.3:
> > CentOS Storage SIG provides bot 4.1 and 5 for x86_64[1] and ppc64le[2]
> >
> > If I understood correctly, 5 is shipping glusterd2 which requires a
> > significant effort to get support for while 4.1 is still on glusterd which
> > should work with current oVirt code.
>
> Also Gluster 5 still offers the traditional glusterd service. glusterd2
> is available for both 4.1 and 5, but it is still an opt-in.
>
> HTH,
> Niels
>
>
> PS: Gluster 5 is not announced for CentOS yet, the
> centos-release-gluster5 package is not yet publicly available

I think we should move to latest version, i.e Gluster 5.
Niels, is there any ETA for when the package will be available in CentOS?

Also, for oVirt 4.2 due to dependency on gluster-gnfs we are still
dependent on gluster 3.12 which is EOL. What's our best option here?
Shyam, Amar, looking for your inputs.
>
>
> > Sahina, can you give directions on what we should move to?
> >
> > Thanks.
> >
> > [1] http://mirror.centos.org/centos/7/storage/x86_64/
> > [2] http://mirror.centos.org/altarch/7/storage/ppc64le/
> >
> > --
> >
> > SANDRO BONAZZOLA
> >
> > MANAGER, SOFTWARE ENGINEERING, EMEA R RHV
> >
> > Red Hat EMEA 
> >
> > sbona...@redhat.com
> > 
>
> > ___
> > Devel mailing list -- devel@ovirt.org
> > To unsubscribe send an email to devel-le...@ovirt.org
> > Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> > oVirt Code of Conduct: 
> > https://www.ovirt.org/community/about/community-guidelines/
> > List Archives: 
> > https://lists.ovirt.org/archives/list/devel@ovirt.org/message/IQYAWSN5KLO5OQJ4TT75BYB3GZFECVE2/
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/SLC6IX7DME4QINQKFMCSPA6LW72YD3J4/


[ovirt-devel] Re: Setting Type and action Variables in Ovirt Validation messages

2018-11-15 Thread Sahina Bose
On Thu, Nov 15, 2018 at 3:25 PM Kaustav Majumder  wrote:
>
> Hi all,
> I am working on a bug where the error messages comes as non interpolated 
> [https://bugzilla.redhat.com/show_bug.cgi?id=1369319].I am trying to figure 
> out where the 'type' and 'action' variables are set in ovirt for validation 
> error messages
> One such example from AppErrors.properties is :-
>
>  ACTION_TYPE_FAILED_GENERATOR_NOT_SUPPORTED_BY_CLUSTER=Cannot ${action} 
> ${type}. The random number generator is not supported by the Cluster.

There used to be a page that explains this -
http://www.ovirt.org/Engine_Adding_Messages. Not sure where it resides
now. A quick search did not reveal it to me.

These are set in the setActionMessageParameters() that you would
override in your Command class.
See, for instance,
https://github.com/oVirt/ovirt-engine/blob/master/backend/manager/modules/bll/src/main/java/org/ovirt/engine/core/bll/gluster/CreateGlusterVolumeCommand.java

>
>
> Thanks,
> Kaustav
> ___
> Devel mailing list -- devel@ovirt.org
> To unsubscribe send an email to devel-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/devel@ovirt.org/message/RJFCG6GIAWWB6MEAUVMRT5MU7LYVHRE7/
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/YDV3HGCKNDLMG4NYROJMUBHMI7BZVMIX/


[ovirt-devel] Re: [VDSM] Proposing Denis as vdsm gluster maintainer

2018-11-12 Thread Sahina Bose
On Mon, Nov 12, 2018 at 6:03 PM Nir Soffer  wrote:
>
> Hi all,
>
> Denis is practically maintaining vdsm gluster code in the recent years,
> and it is time to make this official.
>
> Please ack,
> Nir

+1
Denis has been adding features and reviewing code related to gluster
in vdsm. An ack from me
-sahina
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/VGDS7FHVLHN25BGEAVNQX6YG5WUVLJZK/


[ovirt-devel] Re: [OST][hc-basic-suite-master] Fails on HE deployment

2018-09-18 Thread Sahina Bose
On Tue, Sep 11, 2018 at 5:24 PM Sahina Bose  wrote:

>
>
> On Mon, Sep 3, 2018 at 7:00 PM, Sahina Bose  wrote:
>
>> xargs -I{} sudo -u vdsm dd if={} | tar -tvf -
>> 772a3dfe-aee8-45d9-9df5-beddcbf92010.ovf\", u'removes': None, u'creates':
>> None, u'chdir': None, u'stdin': None}}, u'stdout_lines': [], u'stderr':
>> u\"vdsm-client: Command Image.prepare with args {'storagepoolID':
>> 'e90984b6-af79-11e8-a2dd-00163e24d363', 'storagedomainID':
>> 'b4c84ead-fb7f-4478-acd2-0309d569b9aa', 'volumeID':
>> 'b18f30b1-5625-4581-b815-94291b226c79', 'imageID':
>> '[u7894c850-665f-47dd-8376-e4da5ae7807f]'} failed:\\n(code=201,
>> message=Volume does not exist:
>> (u'b18f30b1-5625-4581-b815-94291b226c79',))\\ntar: This does not look like
>> a tar archive
>>
>> Is this a known issue?
>>
>
> vdsm log contains
>
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/task.py", line 882, in 
> _run
> return fn(*args, **kargs)
>   File "", line 2, in prepareImage
>   File "/usr/lib/python2.7/site-packages/vdsm/common/api.py", line 49, in 
> method
> ret = func(*args, **kwargs)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/hsm.py", line 3176, in 
> prepareImage
> raise se.VolumeDoesNotExist(leafUUID)
> VolumeDoesNotExist: Volume does not exist: 
> (u'10730bab-8b24-4001-afde-a83659063c00',)
>
>
> See
> https://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-master/684/artifact/exported-artifacts/test_logs/hc-basic-suite-master/post-002_bootstrap.py/lago-hc-basic-suite-master-host-0/_var_log/vdsm/vdsm.log
>
> Any clues as to what's going wrong? (Failing for the last 13 days)
>
>
Turned out to be issue with ansible version - the play that filters out the
disks returning wrong results with older ansible?
CI passes with https://gerrit.ovirt.org/#/c/94415/. Can someone please
merge?

thanks!


>
>> thanks!
>> sahina
>>
>
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/BBAHLXX4QUSDCLE5DHNFV6KOYEODRHLY/


[ovirt-devel] Re: [OST][hc-basic-suite-master] Fails on HE deployment

2018-09-11 Thread Sahina Bose
On Mon, Sep 3, 2018 at 7:00 PM, Sahina Bose  wrote:

> xargs -I{} sudo -u vdsm dd if={} | tar -tvf - 
> 772a3dfe-aee8-45d9-9df5-beddcbf92010.ovf\",
> u'removes': None, u'creates': None, u'chdir': None, u'stdin': None}},
> u'stdout_lines': [], u'stderr': u\"vdsm-client: Command Image.prepare with
> args {'storagepoolID': 'e90984b6-af79-11e8-a2dd-00163e24d363',
> 'storagedomainID': 'b4c84ead-fb7f-4478-acd2-0309d569b9aa', 'volumeID':
> 'b18f30b1-5625-4581-b815-94291b226c79', 'imageID':
> '[u7894c850-665f-47dd-8376-e4da5ae7807f]'} failed:\\n(code=201,
> message=Volume does not exist: 
> (u'b18f30b1-5625-4581-b815-94291b226c79',))\\ntar:
> This does not look like a tar archive
>
> Is this a known issue?
>

vdsm log contains

Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/vdsm/storage/task.py", line
882, in _run
return fn(*args, **kargs)
  File "", line 2, in prepareImage
  File "/usr/lib/python2.7/site-packages/vdsm/common/api.py", line 49, in method
ret = func(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/vdsm/storage/hsm.py", line
3176, in prepareImage
raise se.VolumeDoesNotExist(leafUUID)
VolumeDoesNotExist: Volume does not exist:
(u'10730bab-8b24-4001-afde-a83659063c00',)


See
https://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-master/684/artifact/exported-artifacts/test_logs/hc-basic-suite-master/post-002_bootstrap.py/lago-hc-basic-suite-master-host-0/_var_log/vdsm/vdsm.log

Any clues as to what's going wrong? (Failing for the last 13 days)



> thanks!
> sahina
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/KVKSWHZHIQ7ZDNDVSMOK3U6EBYQBSWOJ/


[ovirt-devel] [OST][hc-basic-suite-master] Fails on HE deployment

2018-09-03 Thread Sahina Bose
xargs -I{} sudo -u vdsm dd if={} | tar -tvf -
772a3dfe-aee8-45d9-9df5-beddcbf92010.ovf\", u'removes': None, u'creates':
None, u'chdir': None, u'stdin': None}}, u'stdout_lines': [], u'stderr':
u\"vdsm-client: Command Image.prepare with args {'storagepoolID':
'e90984b6-af79-11e8-a2dd-00163e24d363', 'storagedomainID':
'b4c84ead-fb7f-4478-acd2-0309d569b9aa', 'volumeID':
'b18f30b1-5625-4581-b815-94291b226c79', 'imageID':
'[u7894c850-665f-47dd-8376-e4da5ae7807f]'} failed:\\n(code=201,
message=Volume does not exist:
(u'b18f30b1-5625-4581-b815-94291b226c79',))\\ntar: This does not look like
a tar archive

Is this a known issue?

thanks!
sahina
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/427NEKBIIJQD5E52NRTBPH6WXMQTMECJ/


[ovirt-devel] Re: [CentOS-announce] Announcing the release of Gluster 4.1 on CentOS Linux 7 x86_64

2018-07-17 Thread Sahina Bose
On Fri, Jul 6, 2018 at 4:38 PM, Sandro Bonazzola 
wrote:

> Hi,
> as you can read below Gluster 4.1 LTM release is available in CentOS.
> We are currently using 3.12 in oVirt 4.2 but this release is going EOL in
> 6 months.
> I would suggest to move 4.3 / Master to Gluster 4.1 now. Any objection?
> If no objection I will start pushing patches for it on Monday, July 16th.
>

Gluster 4.1 has removed the gluster-gnfs package. We cannot move master to
it until we fix the vdsm dependencies and provide a migration path for
existing gluster-nfs users to nfs-ganesha.


>
> -- Forwarded message --
> From: Niels de Vos 
> Date: 2018-06-27 16:42 GMT+02:00
> Subject: [CentOS-announce] Announcing the release of Gluster 4.1 on CentOS
> Linux 7 x86_64
> To: centos-annou...@centos.org
>
>
> I am happy to announce the General Availability of Gluster 4.1 for
> CentOS 7 on x86_64. These packages are following the upstream Gluster
> Community releases, and will receive monthly bugfix updates.
>
> Gluster 4.1 is a Long-Term-Maintenance release, and will receive
> updates for approximately 18 months. The difference between
> Long-Term-Maintenance and Short-Term-Maintenance releases is explained
> on the Gluster release schedule page:
>   https://www.gluster.org/community/release-schedule/
>
> Users of CentOS 7 can now simply install Gluster 4.1 with only these two
> commands:
>
>   # yum install centos-release-gluster
>   # yum install glusterfs-server
>
> The centos-release-gluster package is delivered via CentOS Extras repos.
> This contains all the metadata and dependency information, needed to
> install Gluster 4.1. The actual package that will get installed is
> centos-release-gluster41. Users of the now End-Of-Life
> Short-Term-Maintenance Gluster 4.0 will automatically get the update to
> Gluster 4.1, whereas users of Gluster 3.12 can stay on that
> Long-Term-Maintenance release for an other six months.
>
> Users of Gluster 3.10 will need to manually upgrade by uninstalling the
> centos-release-gluster310 package, and replacing it with either the
> Gluster 4.1 or 3.12 version. Additional details about the upgrade
> process are linked in the announcement from the Gluster Community:
>   https://lists.gluster.org/pipermail/announce/2018-June/000102.html
>
> We have a quickstart guide specifically built around the packages are
> available, it makes for a good introduction to Gluster and will help get
> you started in just a few simple steps, this quick start is available at
>   https://wiki.centos.org/SpecialInterestGroup/Storage/gluster-Quickstart
>
> More details about the packages that the Gluster project provides in the
> Storage SIG is available in the documentation:
>   https://wiki.centos.org/SpecialInterestGroup/Storage/Gluster
>
> The centos-release-gluster* repositories offer additional packages that
> enhance the usability of Gluster itself. Utilities and tools that were
> working with previous versions of Gluster are expected to stay working
> fine. If there are any problems, or requests for additional tools and
> applications to be provided, just send us an email with your
> suggestions. The current list of packages that is (planned to become)
> available can be found here:
>   https://wiki.centos.org/SpecialInterestGroup/Storage/Gluster
> /Ecosystem-pkgs
>
> We welcome all feedback, comments and contributions. You can get in
> touch with the CentOS Storage SIG on the centos-devel mailing list
> (https://lists.centos.org ) and with the Gluster developer and user
> communities at https://www.gluster.org/mailman/listinfo , we are also
> available on irc at #gluster on irc.freenode.net, and on twitter at
> @gluster .
>
> Cheers,
> Niels de Vos
> Storage SIG member & Gluster maintainer
>
> ___
> CentOS-announce mailing list
> centos-annou...@centos.org
> https://lists.centos.org/mailman/listinfo/centos-announce
>
>
>
>
> --
>
> SANDRO BONAZZOLA
>
> MANAGER, SOFTWARE ENGINEERING, EMEA R RHV
>
> Red Hat EMEA 
>
> sbona...@redhat.com
> 
>
___
Devel mailing list -- devel@ovirt.org
To unsubscribe send an email to devel-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/devel@ovirt.org/message/U2IO4U5TJE6JIZ6QIPT5AG7KMU5XWRQX/


[ovirt-devel] Re: ovirt-system-tests_hc-basic-suite failing due to host not in cluster and incorrect content served by the engine to SDK.

2018-07-09 Thread Sahina Bose
On Sun, Jul 8, 2018 at 12:23 PM, Yaniv Kaul  wrote:

>
>
> On Fri, Jul 6, 2018 at 1:01 PM, Sandro Bonazzola 
> wrote:
>
>> https://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-4.2/326
>>
>> fails on add host test with:
>>
>> Error: The response content type 'text/html; charset=iso-8859-1' isn't the 
>> expected XML
>>
>>
>> Something bad happened during the deployment because the engine complains
>> about an host not included in the cluster:
>>
>> 2018-07-05 21:34:47,768-04 WARN  
>> [org.ovirt.engine.core.vdsbroker.gluster.GlusterVolumesListReturn] 
>> (DefaultQuartzScheduler6) [3009952a] Could not add brick 
>> 'lago-hc-basic-suite-4-2-host1:/rhs/brick1/engine' to volume 
>> 'c1146520-3bf7-4b81-b31a-7cc5475b6438' - server uuid 
>> '50e37ed8-86f3-4b50-9258-f516169025ea' not found in cluster 
>> '3125aa60-80bb-11e8-a143-00163e24d363'
>>
>>
> In[2] we can see:
> 2018-07-05 22:03:42,975-0400 ERROR (monitor/f6c4ab4) [storage.Monitor]
> Error checking domain f6c4ab4a-005d-4ab7-acda-03810014c841 (monitor:424)
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/monitor.py", line
> 405, in _checkDomainStatus
> self.domain.selftest()
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/sdc.py", line 48,
> in __getattr__
> return getattr(self.getRealDomain(), attrName)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/sdc.py", line 51,
> in getRealDomain
> return self._cache._realProduce(self._sdUUID)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/sdc.py", line 134,
> in _realProduce
> domain = self._findDomain(sdUUID)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/sdc.py", line 151,
> in _findDomain
> return findMethod(sdUUID)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/glusterSD.py", line
> 55, in findDomain
> return GlusterStorageDomain(GlusterStorageDomain.
> findDomainPath(sdUUID))
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/fileSD.py", line
> 391, in __init__
> validateFileSystemFeatures(manifest.sdUUID, manifest.mountpoint)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/fileSD.py", line
> 104, in validateFileSystemFeatures
> oop.getProcessPool(sdUUID).directTouch(testFilePath)
>   File "/usr/lib/python2.7/site-packages/vdsm/storage/outOfProcess.py",
> line 320, in directTouch
> ioproc.touch(path, flags, mode)
>   File "/usr/lib/python2.7/site-packages/ioprocess/__init__.py", line
> 567, in touch
> self.timeout)
>   File "/usr/lib/python2.7/site-packages/ioprocess/__init__.py", line
> 451, in _sendCommand
> raise OSError(errcode, errstr)
> OSError: [Errno 30] Read-only file system
>
> And just before that:
>
> 2018-07-05 22:03:33,214-0400 INFO  (libvirt/events) [virt.vm] 
> (vmId='a2f514e6-81ca-4d41-acf9-77cc910f6eaf') abnormal vm stop device 
> ua-c0592bd6-20e6-4dbf-9610-9a35e3f566ab error eother (vm:5116)
> 2018-07-05 22:03:33,214-0400 INFO  (libvirt/events) [virt.vm] 
> (vmId='a2f514e6-81ca-4d41-acf9-77cc910f6eaf') CPU stopped: onIOError (vm:6157)
> 2018-07-05 22:03:33,222-0400 INFO  (libvirt/events) [virt.vm] 
> (vmId='a2f514e6-81ca-4d41-acf9-77cc910f6eaf') CPU stopped: onSuspend (vm:6157)
> 2018-07-05 22:03:33,225-0400 WARN  (libvirt/events) [virt.vm] 
> (vmId='a2f514e6-81ca-4d41-acf9-77cc910f6eaf') device vda reported I/O error 
> (vm:4065)
>
>
> And indeed, @[3]:
>
> [2018-07-05 22:04:38,936] WARNING [utils - 298:publish_to_webhook] - Event 
> push failed to URL: http://hc-engine:80/ovirt-engine/services/glusterevents, 
> Event: {"event": "QUORUM_LOST", "message": {"volume": "vmstore"}, "nodeid": 
> "59bf7956-60a4-4152-9cf9-99fcdccb211f", "ts": 1530842614}, Status: 
> ('Connection aborted.', error(113, 'No route to host'))
>
>
> And we can also see https://bugzilla.redhat.com/show_bug.cgi?id=1595436
> there as well.
>
>
> Sahina, Gobinda, can you please investigate?
>>
>> Ondra, no idea why the engine is returning text/html instead of xml here,
>> can you please check?
>>
>
> Because of the exception[1].
> Y.
>

Thanks Yaniv!

The failure to add hosts is because engine was down due to quorum loss.
I see that HC suite has failed in the past due to similar errors, and even
in the runs that pass there are quorum loss messages (as glusterd is
restarted whenever the host is added). I need to dig into the reason for
quorum loss - if it's the parallel addition of hosts causing it, or
something else. Will update this thread.


> [1] https://jenkins.ovirt.org/job/ovirt-system-tests_hc-
> basic-suite-4.2/326/artifact/exported-artifacts/test_logs/
> hc-basic-suite-4.2/post-002_bootstrap.py/lago-hc-basic-
> suite-4-2-engine/_var_log/ovirt-engine/server.log
> [2] https://jenkins.ovirt.org/job/ovirt-system-tests_hc-
> basic-suite-4.2/326/artifact/exported-artifacts/test_logs/
> hc-basic-suite-4.2/post-002_bootstrap.py/lago-hc-basic-
> suite-4-2-host0/_var_log/vdsm/vdsm.log
> [3] https://jenkins.ovirt.org/job/ovirt-system-tests_hc-
> 

Re: [ovirt-devel] Update: HC suites failing for 3 weeks ( was: [OST][HC] HE fails to deploy )

2018-04-30 Thread Sahina Bose
On Mon, Apr 30, 2018 at 5:45 PM, Eyal Edri <ee...@redhat.com> wrote:

>
>
> On Wed, Apr 25, 2018 at 1:53 PM, Sahina Bose <sab...@redhat.com> wrote:
>
>>
>>
>> On Wed, Apr 25, 2018 at 3:54 PM, Sahina Bose <sab...@redhat.com> wrote:
>>
>>>
>>>
>>> On Mon, Apr 23, 2018 at 6:28 PM, Sahina Bose <sab...@redhat.com> wrote:
>>>
>>>>
>>>> On Mon, Apr 23, 2018 at 5:41 PM, Eyal Edri <ee...@redhat.com> wrote:
>>>>
>>>>> Sahina,
>>>>> Any update on this?
>>>>>
>>>>
>>>> Sorry, haven't been able to spend any time on this. The last I checked
>>>> the  HE install was failing at task - Get Local VM IP.
>>>> and there were no logs from HE VM to debug.
>>>>
>>>> Will spend sometime on this tomorrow
>>>>
>>>
>>> https://gerrit.ovirt.org/#/c/89953/ - fixes the issue, atleast when I
>>> tried this on my local setup.
>>>
>>
>>
>> The CI however still fails in the HE install with :
>>
>> TASK [Get local VM IP]", "[ ERROR ] fatal: [localhost]: FAILED! => 
>> {\"attempts\": 50, \"changed\": true, \"cmd\": \"virsh -r net-dhcp-leases 
>> default | grep -i 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 -d'/'\", 
>> \"delta\": \"0:00:00.043961\", \"end\": \"2018-04-25 05:51:34.226374\", 
>> \"rc\": 0, \"start\": \"2018-04-25 05:51:34.182413\", \"stderr\": \"\", 
>> \"stderr_lines\": [], \"stdout\": \"\", \"stdout_lines\": []}"
>>
>>
>>
>> FWIW, my local setup , ost repo was at I3fc2976ab2400e5908760aadc3258
>> 329c0ffdf4d
>>
>
>
> Any update? suites are still failing.
>

The suites work locally, on the CI systems they fail with above error.
Unfortunately, no clue as to why this is so.
With the current run, are we able to get logs from the engine VM?


>
>>
>>
>>>
>>>
>>>>
>>>>> On Wed, Apr 18, 2018 at 3:40 PM, Sandro Bonazzola <sbona...@redhat.com
>>>>> > wrote:
>>>>>
>>>>>>
>>>>>>
>>>>>> 2018-04-18 9:37 GMT+02:00 Eyal Edri <ee...@redhat.com>:
>>>>>>
>>>>>>> FYI,
>>>>>>>
>>>>>>> I've disabled the 4.2 and master HC suites nightly run on CI as they
>>>>>>> are constantly failing for almost 3 weeks and spamming the mailing 
>>>>>>> lists.
>>>>>>>
>>>>>>
>>>>>>
>>>>>> HC uses gdeploy 2.0.6 which was released in December and was based on
>>>>>> ansible 2.4.
>>>>>> ansible-2.5 landed 3 weeks ago in EPEL, my guess is that gdeploy is
>>>>>> not supporting ansible-2.5 properly.
>>>>>> I had no time to validate my guess with proof, so please Sahina cross
>>>>>> check this.
>>>>>>
>>>>>>
>>>>>>
>>>>>>>
>>>>>>> I think this should get higher priority for a fix if we want it to
>>>>>>> provide any value,
>>>>>>> Work can continue using the manual jobs or via check-patch.
>>>>>>>
>>>>>>>
>>>>>>> On Mon, Apr 16, 2018 at 10:56 AM, Gal Ben Haim <gbenh...@redhat.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Any update on https://gerrit.ovirt.org/#/c/7/ ?
>>>>>>>> The HC suites still failing and it's hard to understand why without
>>>>>>>> the logs from the engine VM.
>>>>>>>>
>>>>>>>> On Sat, Apr 7, 2018 at 7:19 AM, Sahina Bose <sab...@redhat.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Fri, Apr 6, 2018 at 1:10 PM, Simone Tiraboschi <
>>>>>>>>> stira...@redhat.com> wrote:
>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> On Fri, Apr 6, 2018 at 9:28 AM, Sahina Bose <sab...@redhat.com>
>>>>>>>>>> wrote:
>>>>>>>>>>
&g

Re: [ovirt-devel] Update: HC suites failing for 3 weeks ( was: [OST][HC] HE fails to deploy )

2018-04-25 Thread Sahina Bose
On Wed, Apr 25, 2018 at 3:54 PM, Sahina Bose <sab...@redhat.com> wrote:

>
>
> On Mon, Apr 23, 2018 at 6:28 PM, Sahina Bose <sab...@redhat.com> wrote:
>
>>
>> On Mon, Apr 23, 2018 at 5:41 PM, Eyal Edri <ee...@redhat.com> wrote:
>>
>>> Sahina,
>>> Any update on this?
>>>
>>
>> Sorry, haven't been able to spend any time on this. The last I checked
>> the  HE install was failing at task - Get Local VM IP.
>> and there were no logs from HE VM to debug.
>>
>> Will spend sometime on this tomorrow
>>
>
> https://gerrit.ovirt.org/#/c/89953/ - fixes the issue, atleast when I
> tried this on my local setup.
>


The CI however still fails in the HE install with :

TASK [Get local VM IP]", "[ ERROR ] fatal: [localhost]: FAILED! =>
{\"attempts\": 50, \"changed\": true, \"cmd\": \"virsh -r
net-dhcp-leases default | grep -i 00:16:3e:24:d3:63 | awk '{ print $5
}' | cut -f1 -d'/'\", \"delta\": \"0:00:00.043961\", \"end\":
\"2018-04-25 05:51:34.226374\", \"rc\": 0, \"start\": \"2018-04-25
05:51:34.182413\", \"stderr\": \"\", \"stderr_lines\": [], \"stdout\":
\"\", \"stdout_lines\": []}"



FWIW, my local setup , ost repo was at
I3fc2976ab2400e5908760aadc3258329c0ffdf4d


>
>
>>
>>> On Wed, Apr 18, 2018 at 3:40 PM, Sandro Bonazzola <sbona...@redhat.com>
>>> wrote:
>>>
>>>>
>>>>
>>>> 2018-04-18 9:37 GMT+02:00 Eyal Edri <ee...@redhat.com>:
>>>>
>>>>> FYI,
>>>>>
>>>>> I've disabled the 4.2 and master HC suites nightly run on CI as they
>>>>> are constantly failing for almost 3 weeks and spamming the mailing lists.
>>>>>
>>>>
>>>>
>>>> HC uses gdeploy 2.0.6 which was released in December and was based on
>>>> ansible 2.4.
>>>> ansible-2.5 landed 3 weeks ago in EPEL, my guess is that gdeploy is not
>>>> supporting ansible-2.5 properly.
>>>> I had no time to validate my guess with proof, so please Sahina cross
>>>> check this.
>>>>
>>>>
>>>>
>>>>>
>>>>> I think this should get higher priority for a fix if we want it to
>>>>> provide any value,
>>>>> Work can continue using the manual jobs or via check-patch.
>>>>>
>>>>>
>>>>> On Mon, Apr 16, 2018 at 10:56 AM, Gal Ben Haim <gbenh...@redhat.com>
>>>>> wrote:
>>>>>
>>>>>> Any update on https://gerrit.ovirt.org/#/c/7/ ?
>>>>>> The HC suites still failing and it's hard to understand why without
>>>>>> the logs from the engine VM.
>>>>>>
>>>>>> On Sat, Apr 7, 2018 at 7:19 AM, Sahina Bose <sab...@redhat.com>
>>>>>> wrote:
>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Fri, Apr 6, 2018 at 1:10 PM, Simone Tiraboschi <
>>>>>>> stira...@redhat.com> wrote:
>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Fri, Apr 6, 2018 at 9:28 AM, Sahina Bose <sab...@redhat.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> 2018-04-05 20:46:52,773-0400 INFO 
>>>>>>>>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>>>>>>>>> ansible_utils._process_output:100 TASK [Get local VM IP]
>>>>>>>>> 2018-04-05 20:55:28,217-0400 DEBUG 
>>>>>>>>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>>>>>>>>> ansible_utils._process_output:94 {u'_ansible_parsed': True, 
>>>>>>>>> u'stderr_lines': [], u'cmd': u"virsh -r net-dhcp-leases default | 
>>>>>>>>> grep -i 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 -d'/'", 
>>>>>>>>> u'end': u'2018-04-05 20:55:28.046320', u'_ansible_no_log': False, 
>>>>>>>>> u'stdout': u'', u'changed': True, u'invocation': {u'module_args': 
>>>>>>>>> {u'warn': True, u'executable': None, u'_uses_shell': True, 
>>>>>>>>> u'_raw_params': u"virsh -r net-dhcp-leases default | grep -i 
>>>>>>>>> 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 -d'/'", u'rem

Re: [ovirt-devel] Update: HC suites failing for 3 weeks ( was: [OST][HC] HE fails to deploy )

2018-04-23 Thread Sahina Bose
On Mon, Apr 23, 2018 at 5:41 PM, Eyal Edri <ee...@redhat.com> wrote:

> Sahina,
> Any update on this?
>

Sorry, haven't been able to spend any time on this. The last I checked the
HE install was failing at task - Get Local VM IP.
and there were no logs from HE VM to debug.

Will spend sometime on this tomorrow


> On Wed, Apr 18, 2018 at 3:40 PM, Sandro Bonazzola <sbona...@redhat.com>
> wrote:
>
>>
>>
>> 2018-04-18 9:37 GMT+02:00 Eyal Edri <ee...@redhat.com>:
>>
>>> FYI,
>>>
>>> I've disabled the 4.2 and master HC suites nightly run on CI as they are
>>> constantly failing for almost 3 weeks and spamming the mailing lists.
>>>
>>
>>
>> HC uses gdeploy 2.0.6 which was released in December and was based on
>> ansible 2.4.
>> ansible-2.5 landed 3 weeks ago in EPEL, my guess is that gdeploy is not
>> supporting ansible-2.5 properly.
>> I had no time to validate my guess with proof, so please Sahina cross
>> check this.
>>
>>
>>
>>>
>>> I think this should get higher priority for a fix if we want it to
>>> provide any value,
>>> Work can continue using the manual jobs or via check-patch.
>>>
>>>
>>> On Mon, Apr 16, 2018 at 10:56 AM, Gal Ben Haim <gbenh...@redhat.com>
>>> wrote:
>>>
>>>> Any update on https://gerrit.ovirt.org/#/c/7/ ?
>>>> The HC suites still failing and it's hard to understand why without the
>>>> logs from the engine VM.
>>>>
>>>> On Sat, Apr 7, 2018 at 7:19 AM, Sahina Bose <sab...@redhat.com> wrote:
>>>>
>>>>>
>>>>>
>>>>> On Fri, Apr 6, 2018 at 1:10 PM, Simone Tiraboschi <stira...@redhat.com
>>>>> > wrote:
>>>>>
>>>>>>
>>>>>>
>>>>>> On Fri, Apr 6, 2018 at 9:28 AM, Sahina Bose <sab...@redhat.com>
>>>>>> wrote:
>>>>>>
>>>>>>> 2018-04-05 20:46:52,773-0400 INFO 
>>>>>>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>>>>>>> ansible_utils._process_output:100 TASK [Get local VM IP]
>>>>>>> 2018-04-05 20:55:28,217-0400 DEBUG 
>>>>>>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>>>>>>> ansible_utils._process_output:94 {u'_ansible_parsed': True, 
>>>>>>> u'stderr_lines': [], u'cmd': u"virsh -r net-dhcp-leases default | grep 
>>>>>>> -i 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 -d'/'", u'end': 
>>>>>>> u'2018-04-05 20:55:28.046320', u'_ansible_no_log': False, u'stdout': 
>>>>>>> u'', u'changed': True, u'invocation': {u'module_args': {u'warn': True, 
>>>>>>> u'executable': None, u'_uses_shell': True, u'_raw_params': u"virsh -r 
>>>>>>> net-dhcp-leases default | grep -i 00:16:3e:24:d3:63 | awk '{ print $5 
>>>>>>> }' | cut -f1 -d'/'", u'removes': None, u'creates': None, u'chdir': 
>>>>>>> None, u'stdin': None}}, u'start': u'2018-04-05 20:55:28.000470', 
>>>>>>> u'attempts': 50, u'stderr': u'', u'rc': 0, u'delta': u'0:00:00.045850', 
>>>>>>> u'stdout_lines': []}
>>>>>>> 2018-04-05 20:55:28,318-0400 ERROR 
>>>>>>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>>>>>>> ansible_utils._process_output:98 fatal: [localhost]: FAILED! => 
>>>>>>> {"attempts": 50, "changed": true, "cmd": "virsh -r net-dhcp-leases 
>>>>>>> default | grep -i 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 
>>>>>>> -d'/'", "delta": "0:00:00.045850", "end": "2018-04-05 20:55:28.046320", 
>>>>>>> "rc": 0, "start": "2018-04-05 20:55:28.000470", "stderr": "", 
>>>>>>> "stderr_lines": [], "stdout": "", "stdout_lines": []}
>>>>>>>
>>>>>>> Both the 4.2 and master suites are failing on getting local VM IP.
>>>>>>> Any idea what changed or if I have to change the test?
>>>>>>>
>>>>>>> thanks!
>>>>>>>
>>>>>>
>>>>>> Hi Sahina,
>>>>>> 4.2 and master suite non HC are correctly running this morning.
>>>>>> http://jenkins.ovirt.org/view/oVirt%20system%

Re: [ovirt-devel] [OST][HC] HE fails to deploy

2018-04-06 Thread Sahina Bose
On Fri, Apr 6, 2018 at 1:10 PM, Simone Tiraboschi <stira...@redhat.com>
wrote:

>
>
> On Fri, Apr 6, 2018 at 9:28 AM, Sahina Bose <sab...@redhat.com> wrote:
>
>> 2018-04-05 20:46:52,773-0400 INFO 
>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>> ansible_utils._process_output:100 TASK [Get local VM IP]
>> 2018-04-05 20:55:28,217-0400 DEBUG 
>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>> ansible_utils._process_output:94 {u'_ansible_parsed': True, u'stderr_lines': 
>> [], u'cmd': u"virsh -r net-dhcp-leases default | grep -i 00:16:3e:24:d3:63 | 
>> awk '{ print $5 }' | cut -f1 -d'/'", u'end': u'2018-04-05 20:55:28.046320', 
>> u'_ansible_no_log': False, u'stdout': u'', u'changed': True, u'invocation': 
>> {u'module_args': {u'warn': True, u'executable': None, u'_uses_shell': True, 
>> u'_raw_params': u"virsh -r net-dhcp-leases default | grep -i 
>> 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 -d'/'", u'removes': None, 
>> u'creates': None, u'chdir': None, u'stdin': None}}, u'start': u'2018-04-05 
>> 20:55:28.000470', u'attempts': 50, u'stderr': u'', u'rc': 0, u'delta': 
>> u'0:00:00.045850', u'stdout_lines': []}
>> 2018-04-05 20:55:28,318-0400 ERROR 
>> otopi.ovirt_hosted_engine_setup.ansible_utils 
>> ansible_utils._process_output:98 fatal: [localhost]: FAILED! => {"attempts": 
>> 50, "changed": true, "cmd": "virsh -r net-dhcp-leases default | grep -i 
>> 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 -d'/'", "delta": 
>> "0:00:00.045850", "end": "2018-04-05 20:55:28.046320", "rc": 0, "start": 
>> "2018-04-05 20:55:28.000470", "stderr": "", "stderr_lines": [], "stdout": 
>> "", "stdout_lines": []}
>>
>> Both the 4.2 and master suites are failing on getting local VM IP.
>> Any idea what changed or if I have to change the test?
>>
>> thanks!
>>
>
> Hi Sahina,
> 4.2 and master suite non HC are correctly running this morning.
> http://jenkins.ovirt.org/view/oVirt%20system%20tests/job/
> ovirt-system-tests_he-basic-ansible-suite-master/146/
> http://jenkins.ovirt.org/view/oVirt%20system%20tests/job/
> ovirt-system-tests_he-basic-ansible-suite-4.2/76/
>
> I'll try to check the difference with HC suites.
>
> Are you using more than one subnet in the HC suites?
>

No, I'm not. And we havent's changed anything related to network in the
test suite.
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [OST][HC] HE fails to deploy

2018-04-06 Thread Sahina Bose
2018-04-05 20:46:52,773-0400 INFO
otopi.ovirt_hosted_engine_setup.ansible_utils
ansible_utils._process_output:100 TASK [Get local VM IP]
2018-04-05 20:55:28,217-0400 DEBUG
otopi.ovirt_hosted_engine_setup.ansible_utils
ansible_utils._process_output:94 {u'_ansible_parsed': True,
u'stderr_lines': [], u'cmd': u"virsh -r net-dhcp-leases default | grep
-i 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1 -d'/'", u'end':
u'2018-04-05 20:55:28.046320', u'_ansible_no_log': False, u'stdout':
u'', u'changed': True, u'invocation': {u'module_args': {u'warn': True,
u'executable': None, u'_uses_shell': True, u'_raw_params': u"virsh -r
net-dhcp-leases default | grep -i 00:16:3e:24:d3:63 | awk '{ print $5
}' | cut -f1 -d'/'", u'removes': None, u'creates': None, u'chdir':
None, u'stdin': None}}, u'start': u'2018-04-05 20:55:28.000470',
u'attempts': 50, u'stderr': u'', u'rc': 0, u'delta':
u'0:00:00.045850', u'stdout_lines': []}
2018-04-05 20:55:28,318-0400 ERROR
otopi.ovirt_hosted_engine_setup.ansible_utils
ansible_utils._process_output:98 fatal: [localhost]: FAILED! =>
{"attempts": 50, "changed": true, "cmd": "virsh -r net-dhcp-leases
default | grep -i 00:16:3e:24:d3:63 | awk '{ print $5 }' | cut -f1
-d'/'", "delta": "0:00:00.045850", "end": "2018-04-05
20:55:28.046320", "rc": 0, "start": "2018-04-05 20:55:28.000470",
"stderr": "", "stderr_lines": [], "stdout": "", "stdout_lines": []}

Both the 4.2 and master suites are failing on getting local VM IP.
Any idea what changed or if I have to change the test?

thanks!
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [OST][HC] HE fails to deploy

2018-04-04 Thread Sahina Bose
On Tue, Apr 3, 2018 at 1:50 PM, Simone Tiraboschi <stira...@redhat.com>
wrote:

>
>
> On Tue, Apr 3, 2018 at 10:14 AM, Simone Tiraboschi <stira...@redhat.com>
> wrote:
>
>>
>>
>> On Mon, Apr 2, 2018 at 4:44 PM, Sahina Bose <sab...@redhat.com> wrote:
>>
>>> HE fails to deploy at waiting for host to be up in the local HE VM.
>>> The setup logs does not indicate why it failed - atleast I couldn't find
>>> anything
>>>
>>
>> I see:
>>
>> "status": "install_failed"
>>
>> So I think that something went wrong with host-deploy on that host but we
>> definitively need host-deploy logs for that and they are just on the engine
>> VM.
>>
>
> According to the timestamps it could be related to:
> Apr  2 09:58:13 lago-hc-basic-suite-master-host-0 systemd: Starting Open
> vSwitch Database Unit...
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 ovs-ctl: runuser:
> System error
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 ovs-ctl:
> /etc/openvswitch/conf.db does not exist ... (warning).
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 ovs-ctl: Creating empty
> database /etc/openvswitch/conf.db runuser: System error
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 ovs-ctl: [FAILED]
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd:
> ovsdb-server.service: control process exited, code=exited status=1
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: Failed to
> start Open vSwitch Database Unit.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: Unit
> ovsdb-server.service entered failed state.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd:
> ovsdb-server.service failed.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: Cannot add
> dependency job for unit lvm2-lvmetad.socket, ignoring: Invalid request
> descriptor
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: Assertion
> failed for Open vSwitch Delete Transient Ports.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd:
> ovsdb-server.service holdoff time over, scheduling restart.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: Cannot add
> dependency job for unit lvm2-lvmetad.socket, ignoring: Unit is masked.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: start request
> repeated too quickly for ovsdb-server.service
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: Failed to
> start Open vSwitch Database Unit.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd: Unit
> ovsdb-server.service entered failed state.
> Apr  2 09:58:14 lago-hc-basic-suite-master-host-0 systemd:
> ovsdb-server.service failed.
>

Does this require an update to openvswitch rpms used in suite?
Are the HE suites passing?


>
>>
>>
>>>
>>> -- Forwarded message --
>>> From: <jenk...@jenkins.phx.ovirt.org>
>>> Date: Mon, Apr 2, 2018 at 7:50 PM
>>> Subject: [oVirt Jenkins] ovirt-system-tests_hc-basic-suite-master -
>>> Build # 276 - Still Failing!
>>> To: in...@ovirt.org, sab...@redhat.com
>>>
>>>
>>> Project: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-sui
>>> te-master/
>>> Build: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-sui
>>> te-master/276/
>>> Build Number: 276
>>> Build Status:  Still Failing
>>> Triggered By: Started by timer
>>>
>>> -
>>> Changes Since Last Success:
>>> -
>>> Changes for Build #265
>>> [Gal Ben Haim] Check if the prefix exists before printing its size
>>>
>>> [Sandro Bonazzola] ovirt-engine: add jobs for 4.1.10 async
>>>
>>>
>>> Changes for Build #266
>>> [Gal Ben Haim] Check if the prefix exists before printing its size
>>>
>>>
>>> Changes for Build #267
>>> [Gal Ben Haim] Check if the prefix exists before printing its size
>>>
>>> [Daniel Belenky] ppc repos: Use qemu EV release instead of test
>>>
>>> [Daniel Belenky] global_setup: Add generic package remove function
>>>
>>> [Daniel Belenky] Fix package verification in verify_packages
>>>
>>>
>>> Changes for Build #268
>>> [Gal Ben Haim] Check if the prefix exists before printing its size
>>>
>>>
>>> Changes for Build #269
>>> [Gal Ben Haim] Check if the prefix exists before printing its size
>>>
>>>
>>> Changes for Build #270
>>> [Gal Ben 

[ovirt-devel] [OST][HC] HE fails to deploy

2018-04-02 Thread Sahina Bose
HE fails to deploy at waiting for host to be up in the local HE VM.
The setup logs does not indicate why it failed - atleast I couldn't find
anything

-- Forwarded message --
From: 
Date: Mon, Apr 2, 2018 at 7:50 PM
Subject: [oVirt Jenkins] ovirt-system-tests_hc-basic-suite-master - Build #
276 - Still Failing!
To: in...@ovirt.org, sab...@redhat.com


Project: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-
suite-master/
Build: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-
suite-master/276/
Build Number: 276
Build Status:  Still Failing
Triggered By: Started by timer

-
Changes Since Last Success:
-
Changes for Build #265
[Gal Ben Haim] Check if the prefix exists before printing its size

[Sandro Bonazzola] ovirt-engine: add jobs for 4.1.10 async


Changes for Build #266
[Gal Ben Haim] Check if the prefix exists before printing its size


Changes for Build #267
[Gal Ben Haim] Check if the prefix exists before printing its size

[Daniel Belenky] ppc repos: Use qemu EV release instead of test

[Daniel Belenky] global_setup: Add generic package remove function

[Daniel Belenky] Fix package verification in verify_packages


Changes for Build #268
[Gal Ben Haim] Check if the prefix exists before printing its size


Changes for Build #269
[Gal Ben Haim] Check if the prefix exists before printing its size


Changes for Build #270
[Gal Ben Haim] Check if the prefix exists before printing its size


Changes for Build #271
[Gal Ben Haim] Check if the prefix exists before printing its size


Changes for Build #272
[Gal Ben Haim] Check if the prefix exists before printing its size


Changes for Build #273
[Eitan Raviv] network: macpool: test disallowing dups while dups exist

[Daniel Belenky] docker cleanup:Fix edge case for unamed containers

[Daniel Belenky] nested_config: Count nesting level of options

[Daniel Belenky] Introduce conditional execution in STDCI DSL

[Daniel Belenky] Add OST STDCI V2 jobs


Changes for Build #274
[Gal Ben Haim] he-iscsi-master: Temporarily exclude in check-patch


Changes for Build #275
[Gal Ben Haim] he-iscsi-master: Temporarily exclude in check-patch


Changes for Build #276
[Barak Korren] Force STDCI V2 job to use physical host

[Daniel Belenky] Build container on changes to docker_cleanup




-
Failed Tests:
-
No tests ran.
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] [OST][HC] Failed to deploy HE

2018-03-07 Thread Sahina Bose
Error is - The host has been set in non_operational status, please check
engine logs, fix accordingly and re-deploy.

However there are no engine logs available. Is this error seen in HE suite
as well?

On Thu, Mar 8, 2018 at 8:58 AM,  wrote:

> Project: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-
> suite-master/
> Build: http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-
> suite-master/213/
> Build Number: 213
> Build Status:  Failure
> Triggered By: Started by timer
>
> -
> Changes Since Last Success:
> -
> Changes for Build #213
> [Sandro Bonazzola] image-ng-suite-4.2: sync 001_initialize_engine
>
> [Barak Korren] Enable mock to use bootstrap chroot for FC27/RAW
>
> [Sandro Bonazzola] gerrit-admin: move to el7
>
> [Daniel Belenky] Add symlinks resolving capabilities to usrc
>
> [Barak Korren] Fix auto-merge in downstream instances
>
> [Dafna Ron] sync_mirror: removing fc24
>
>
>
>
> -
> Failed Tests:
> -
> No tests ran.
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [OST][HC] HE fails to deploy

2018-02-06 Thread Sahina Bose
On Tue, Feb 6, 2018 at 12:35 PM, Yaniv Kaul <yk...@redhat.com> wrote:

>
>
> On Feb 6, 2018 7:53 AM, "Sahina Bose" <sab...@redhat.com> wrote:
>
>
>
> On Mon, Feb 5, 2018 at 2:59 PM, Sahina Bose <sab...@redhat.com> wrote:
>
>> Hi all,
>>
>> I see the HE fails to deploy after task in running ansible playbook 
>> create_target_vm :
>>
>> TASK [Wait for the engine to come up on the target VM]",
>>
>> with Error engine state=EngineUnexpectedlyDown
>>
>> Is this a known issue that you are working on?
>>
>>
> This does seem like a race, because I see that the HC suite again failed
> with the same error after a successful run yesterday.
> Do I need to open a bug or do we have one tracking this already?
>
>
> Please open a bug.
> I kind of remember we've had some (infra?) issue where Engine timed out on
> HE setup from time to time. Not sure it was solved.
> Please attach server.log and engine.log and let's have a look.
>

Simone already identified the issue as
https://bugzilla.redhat.com/show_bug.cgi?id=1541328 in another thread,
updating here too.



> Y.
>
>
>
>> thanks!
>>
>> sahina
>>
>>
>> Full HE setup log at 
>> http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/3875/artifact/exported-artifacts/hc-basic-suite-master__logs/test_logs/hc-basic-suite-master/post-002_bootstrap.py/lago-hc-basic-suite-master-host0/_var_log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20180205033809-ybwdxp.log
>>
>>
>
> ___
> Devel mailing list
> Devel@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/devel
>
>
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [OST][HC] HE fails to deploy

2018-02-05 Thread Sahina Bose
On Mon, Feb 5, 2018 at 2:59 PM, Sahina Bose <sab...@redhat.com> wrote:

> Hi all,
>
> I see the HE fails to deploy after task in running ansible playbook 
> create_target_vm :
>
> TASK [Wait for the engine to come up on the target VM]",
>
> with Error engine state=EngineUnexpectedlyDown
>
> Is this a known issue that you are working on?
>
>
This does seem like a race, because I see that the HC suite again failed
with the same error after a successful run yesterday.
Do I need to open a bug or do we have one tracking this already?


> thanks!
>
> sahina
>
>
> Full HE setup log at 
> http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/3875/artifact/exported-artifacts/hc-basic-suite-master__logs/test_logs/hc-basic-suite-master/post-002_bootstrap.py/lago-hc-basic-suite-master-host0/_var_log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20180205033809-ybwdxp.log
>
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] [OST][HC] HE fails to deploy

2018-02-05 Thread Sahina Bose
Hi all,

I see the HE fails to deploy after task in running ansible playbook
create_target_vm :

TASK [Wait for the engine to come up on the target VM]",

with Error engine state=EngineUnexpectedlyDown

Is this a known issue that you are working on?

thanks!

sahina


Full HE setup log at
http://jenkins.ovirt.org/job/ovirt-system-tests_master_check-patch-el7-x86_64/3875/artifact/exported-artifacts/hc-basic-suite-master__logs/test_logs/hc-basic-suite-master/post-002_bootstrap.py/lago-hc-basic-suite-master-host0/_var_log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20180205033809-ybwdxp.log
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [ OST Failure Report ] [ oVirt hc master ] [ 19-01-2018 ] [ 002_bootstrap.add_hosts ]

2018-01-22 Thread Sahina Bose
On Sat, Jan 20, 2018 at 1:08 AM, Yaniv Kaul  wrote:

>
>
> On Fri, Jan 19, 2018 at 5:06 PM, Dafna Ron  wrote:
>
>> Hi,
>>
>> we are failing hc master basic suite on test: 002_bootstrap.add_hosts
>>
>>
>>
>>
>>
>>
>>
>> *Link and headline of suspected patches: Link to
>> Job:http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-master/163/
>> Link
>> to all
>> logs:http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-master/163/artifact/
>> (Relevant)
>> error snippet from the log: *2018-01-18 22:30:56,141-05 ERROR
>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>> (VdsDeploy) [3e58f8ce] EVENT_ID: VDS_INSTALL_IN_PROGRESS_ERROR(511), An
>> error has occurred during installation of Host lago_basic_suite_hc_host0:
>> Failed to execute stage 'Closing up': 'Plugin' object has no attribute
>> 'exist'
>>
>>
>> **
>>
>
> Dafna,
> The relevant log is[1], which shows:
>
> 2018-01-18 22:49:25,385-0500 DEBUG otopi.plugins.otopi.services.systemd
> plugin.execute:921 execute-output: ('/usr/bin/systemctl', 'start',
> 'glusterd.service') stdout:
> 2018-01-18 22:49:25,385-0500 DEBUG otopi.plugins.otopi.services.systemd
> plugin.execute:926 execute-output: ('/usr/bin/systemctl', 'start',
> 'glusterd.service') stderr:
> 2018-01-18 22:49:25,385-0500 DEBUG otopi.context
> context._executeMethod:143 method exception
> Traceback (most recent call last):
>   File "/tmp/ovirt-xJomKMYufQ/pythonlib/otopi/context.py", line 133, in
> _executeMethod
> method['method']()
>   File 
> "/tmp/ovirt-xJomKMYufQ/otopi-plugins/ovirt-host-deploy/gluster/packages.py",
> line 95, in _closeup
> if self.services.exist('glustereventsd'):
> AttributeError: 'Plugin' object has no attribute 'exist'
>

https://gerrit.ovirt.org/86529 to fix this


>
> Y.
>
> [1] http://jenkins.ovirt.org/job/ovirt-system-tests_hc-
> basic-suite-master/163/artifact/exported-artifacts/
> test_logs/hc-basic-suite-master/post-002_bootstrap.py/
> lago-hc-basic-suite-master-engine/_var_log/ovirt-engine/
> host-deploy/ovirt-host-deploy-20180118224925-192.168.200.4-7bbdac84.log
>
>
>>
>> ___
>> Devel mailing list
>> Devel@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/devel
>>
>
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [OST Failure] [oVirt Master] [HC] Hosted engine fails to install

2017-12-04 Thread Sahina Bose
This failure is erratic. Looking at the jenkins runs, it seems like the HC
suite fails with this error on every other run.

On Thu, Nov 30, 2017 at 2:25 PM, Yaniv Kaul <yk...@redhat.com> wrote:

>
>
> On Thu, Nov 30, 2017 at 8:58 AM, Sahina Bose <sab...@redhat.com> wrote:
>
>> Hi,
>>
>> The error with HE install is :  Starting vdsmd", "[ ERROR ] Failed to
>> execute stage 'Misc configuration': Couldn't  connect to VDSM within 15
>> seconds".
>>
>> Is there a configuration parameter that needs to be set to change the
>> timeout, or is this a bug?
>>
>
> From the log it doesn't seem it even waited 15 secs:
> 2017-11-29 21:45:03,031-0500 DEBUG otopi.plugins.otopi.services.systemd
> plugin.executeRaw:813 execute: ('/usr/bin/systemctl', 'start',
> 'vdsmd.service'), executable='None', cwd='None', env=None
> 2017-11-29 21:45:05,881-0500 DEBUG otopi.plugins.otopi.services.systemd
> plugin.executeRaw:863 execute-result: ('/usr/bin/systemctl', 'start',
> 'vdsmd.service'), rc=0
> 2017-11-29 21:45:05,882-0500 DEBUG otopi.plugins.otopi.services.systemd
> plugin.execute:921 execute-output: ('/usr/bin/systemctl', 'start',
> 'vdsmd.service') stdout:
>
>
> 2017-11-29 21:45:05,882-0500 DEBUG otopi.plugins.otopi.services.systemd
> plugin.execute:926 execute-output: ('/usr/bin/systemctl', 'start',
> 'vdsmd.service') stderr:
>
>
> 2017-11-29 21:45:07,001-0500 DEBUG otopi.plugins.gr_he_setup.system.vdsmenv
> util.__log_debug:374 VDSM jsonrpc connection is not ready
> 2017-11-29 21:45:07,002-0500 DEBUG otopi.plugins.gr_he_setup.system.vdsmenv
> util.__log_debug:374 Creating a new json-rpc connection to VDSM
> 2017-11-29 21:45:07,202-0500 DEBUG otopi.plugins.gr_he_setup.system.vdsmenv
> util.__log_debug:374 VDSM jsonrpc connection is not ready
> 2017-11-29 21:45:07,203-0500 DEBUG otopi.context
> context._executeMethod:143 method exception
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/otopi/context.py", line 133, in
> _executeMethod
> method['method']()
>   File "/usr/share/ovirt-hosted-engine-setup/scripts/../
> plugins/gr-he-setup/system/vdsmenv.py", line 158, in _misc
> timeout=ohostedcons.Const.VDSCLI_SSL_TIMEOUT,
>   File "/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/util.py",
> line 442, in connect_vdsm_json_rpc
> __vdsm_json_rpc_connect(logger, timeout)
>   File "/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/util.py",
> line 398, in __vdsm_json_rpc_connect
> timeout=VDSM_MAX_RETRY * VDSM_DELAY
> RuntimeError: Couldn't  connect to VDSM within 15 seconds
> 2017-11-29 21:45:07,204-0500 ERROR otopi.context
> context._executeMethod:152 Failed to execute stage 'Misc configuration':
> Couldn't  connect to VDSM within 15 seconds
> 2017-11-29 21:45:07,205-0500 DEBUG otopi.transaction transaction.abort:119
> aborting 'Yum Transaction'
> 2017-11-29 21:45:07,205-0500 INFO otopi.plugins.otopi.packagers.yumpackager
> yumpackager.info:80 Yum Performing yum transaction rollback
>
>
>
>> Logs at : http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-sui
>> te-master/107/artifact/exported-artifacts/test_logs/hc-
>> basic-suite-master/post-002_bootstrap.py/lago-hc-basic-
>> suite-master-host0/_var_log/ovirt-hosted-engine-setup/ovir
>> t-hosted-engine-setup-20171129214218-7vns3t.log
>>
>> thanks
>> sahina
>>
>> ___
>> Devel mailing list
>> Devel@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/devel
>>
>
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] [OST Failure] [oVirt Master] [HC] Hosted engine fails to install

2017-11-29 Thread Sahina Bose
Hi,

The error with HE install is :  Starting vdsmd", "[ ERROR ] Failed to
execute stage 'Misc configuration': Couldn't  connect to VDSM within 15
seconds".

Is there a configuration parameter that needs to be set to change the
timeout, or is this a bug?

Logs at :
http://jenkins.ovirt.org/job/ovirt-system-tests_hc-basic-suite-master/107/artifact/exported-artifacts/test_logs/hc-basic-suite-master/post-002_bootstrap.py/lago-hc-basic-suite-master-host0/_var_log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20171129214218-7vns3t.log

thanks
sahina
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] [Lago] [HC] Failed to run HE VM - gfapi related?

2017-08-16 Thread Sahina Bose
The VM fails to start with error. Denis, could you take a look?

2017-08-16 22:18:31,900-0400 ERROR (vm/260cb6e0) [virt.vm]
(vmId='260cb6e0-441c-4b3f-9f05-a710ab4158f9') The vm start process
failed (vm:853)
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 782,
in _startUnderlyingVm
self._run()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2550, in _run
self._domDependentInit()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2264,
in _domDependentInit
self._vmDependentInit()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2274,
in _vmDependentInit
self._save_legacy_disk_conf_to_metadata()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2342,
in _save_legacy_disk_conf_to_metadata
self._sync_metadata()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 4696,
in _sync_metadata
self._md_desc.dump(self._dom)
  File "/usr/lib/python2.7/site-packages/vdsm/virt/metadata.py", line
382, in dump
md_xml = self._build_xml()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/metadata.py", line
552, in _build_xml
dev_elem = _dump_device(metadata_obj, data)
  File "/usr/lib/python2.7/site-packages/vdsm/virt/metadata.py", line
609, in _dump_device
elems.append(md_obj.dump(key, **value))
  File "/usr/lib/python2.7/site-packages/vdsm/virt/metadata.py", line
198, in dump
_keyvalue_to_elem(self._add_ns(key), value, elem)
  File "/usr/lib/python2.7/site-packages/vdsm/virt/metadata.py", line
700, in _keyvalue_to_elem
raise UnsupportedType(key, value)
UnsupportedType: Unsupported [{'port': '0', 'transport': 'tcp',
'name': 'lago-hc-basic-suite-master-host0'}, {'port': '0',
'transport': 'tcp', 'name': 'lago-hc-basic-suite-master-host1'},
{'port': '0', 'transport': 'tcp', 'name':
'lago-hc-basic-suite-master-host2'}] for hosts



On Thu, Aug 17, 2017 at 8:01 AM,  wrote:

> Project: http://jenkins.ovirt.org/job/system-tests_hc-basic-suite-master/
> Build: http://jenkins.ovirt.org/job/system-tests_hc-basic-suite-master/23/
> Build Number: 23
> Build Status:  Still Failing
> Triggered By: Started by timer
>
> -
> Changes Since Last Success:
> -
> Changes for Build #16
> [Yaniv Kaul] Added missing dependencies to allow offline installation
>
> [Daniel Belenky] Append ansible suite to OST's manual job
>
>
> Changes for Build #17
> [Shani Leviim] Change '004_basic_sanity#hotunplug_disk' test to use sdk4
>
> [Gil Shinar] Mount upstream sources folder in mock
>
> [Gil Shinar] Ability to pass U/S cache folder path
>
>
> Changes for Build #18
> [Shani Leviim] Change '004_basic_sanity#hotunplug_disk' test to use sdk4
>
>
> Changes for Build #19
> [Shani Leviim] Change '004_basic_sanity#hotunplug_disk' test to use sdk4
>
>
> Changes for Build #20
> [Gal Ben Haim] check-patch: Don't fail on missing logs
>
> [Barak Korren] Adding injection of mirrors to slaves
>
> [Barak Korren] Add repo configuration for FC24 slaves
>
> [Barak Korren] Add repo configuration for FC25 and FC26 slaves
>
>
> Changes for Build #21
> [Gal Ben Haim] he_3.6: Remove suite
>
>
> Changes for Build #22
> [Gal Ben Haim] docs: Use mkdocs insted of sphinx
>
>
> Changes for Build #23
> [Eyal Shenitzky] basic-suite-master: add cold_storage_migration test
>
> [Martin Perina] Add ovirt-engine-wildfly builds
>
> [Martin Perina] Add ovirt-engine-wildfly-overlay builds
>
> [Gil Shinar] Removed timed triggers
>
> [Martin Perina] Remove jobs to build WildFly from releng-tools
>
> [Martin Perina] Remove jobs to build WildFly overlay from releng-tools
>
>
>
>
> -
> Failed Tests:
> -
> 1 tests failed.
> FAILED:  002_bootstrap.wait_engine
>
> Error Message:
> None != True after 600 seconds
>
> Stack Trace:
> Traceback (most recent call last):
>   File "/usr/lib64/python2.7/unittest/case.py", line 369, in run
> testMethod()
>   File "/usr/lib/python2.7/site-packages/nose/case.py", line 197, in
> runTest
> self.test(*self.arg)
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 129,
> in wrapped_test
> test()
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 59,
> in wrapper
> return func(get_test_prefix(), *args, **kwargs)
>   File "/home/jenkins/workspace/system-tests_hc-basic-suite-
> master/ovirt-system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
> line 102, in wait_engine
> testlib.assert_true_within(_engine_is_up, timeout=10 * 60)
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 256,
> in assert_true_within
> assert_equals_within(func, True, timeout, allowed_exceptions)
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 230,
> in assert_equals_within
> '%s != %s after %s seconds' % (res, value, timeout)
> AssertionError: None != True after 600 seconds

[ovirt-devel] [Lago] [hc-basic-suite-master] - Error starting HE VM

2017-08-16 Thread Sahina Bose
Hosted-engine setup logs indicate
  File
"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_setup/mixins.py",
line 315, in _create_vm
'The VM is not powering up: please check VDSM logs'
RuntimeError: The VM is not powering up: please check VDSM logs

And the vdsm logs:

2017-08-15 22:20:46,953-0400 ERROR (vm/55fd65fe) [virt.vm]
(vmId='55fd65fe-b10e-4510-8210-37acc35a207a') The vm start process failed
(vm:853)
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 782, in
_startUnderlyingVm
self._run()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2538, in
_run
dom = self._connection.createXML(domxml, flags)
  File "/usr/lib/python2.7/site-packages/vdsm/libvirtconnection.py", line
125, in wrapper
ret = f(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 586, in
wrapper
return func(inst, *args, **kwargs)
  File "/usr/lib64/python2.7/site-packages/libvirt.py", line 3782, in
createXML
if ret is None:raise libvirtError('virDomainCreateXML() failed',
conn=self)
libvirtError: invalid argument: could not find capabilities for arch=x86_64
domaintype=kvm


Isn't nested VT enabled for hosts running these tests?


On Wed, Aug 16, 2017 at 8:04 AM,  wrote:

> Project: http://jenkins.ovirt.org/job/system-tests_hc-basic-suite-master/
> Build: http://jenkins.ovirt.org/job/system-tests_hc-basic-suite-master/22/
> Build Number: 22
> Build Status:  Still Failing
> Triggered By: Started by timer
>
> -
> Changes Since Last Success:
> -
> Changes for Build #16
> [Yaniv Kaul] Added missing dependencies to allow offline installation
>
> [Daniel Belenky] Append ansible suite to OST's manual job
>
>
> Changes for Build #17
> [Shani Leviim] Change '004_basic_sanity#hotunplug_disk' test to use sdk4
>
> [Gil Shinar] Mount upstream sources folder in mock
>
> [Gil Shinar] Ability to pass U/S cache folder path
>
>
> Changes for Build #18
> [Shani Leviim] Change '004_basic_sanity#hotunplug_disk' test to use sdk4
>
>
> Changes for Build #19
> [Shani Leviim] Change '004_basic_sanity#hotunplug_disk' test to use sdk4
>
>
> Changes for Build #20
> [Gal Ben Haim] check-patch: Don't fail on missing logs
>
> [Barak Korren] Adding injection of mirrors to slaves
>
> [Barak Korren] Add repo configuration for FC24 slaves
>
> [Barak Korren] Add repo configuration for FC25 and FC26 slaves
>
>
> Changes for Build #21
> [Gal Ben Haim] he_3.6: Remove suite
>
>
> Changes for Build #22
> [Gal Ben Haim] docs: Use mkdocs insted of sphinx
>
>
>
>
> -
> Failed Tests:
> -
> 1 tests failed.
> FAILED:  002_bootstrap.py.junit.xml.[empty]
>
> Error Message:
>
>
> Stack Trace:
> Test report file /home/jenkins/workspace/system-tests_hc-basic-suite-
> master/exported-artifacts/002_bootstrap.py.junit.xml was length 0
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [Lago][HC] tuned service fails to start on one of HC hosts

2017-06-29 Thread Sahina Bose
On Wed, Jun 28, 2017 at 12:20 PM, Yaniv Kaul <yk...@redhat.com> wrote:

>
>
> On Wed, Jun 28, 2017 at 8:22 AM, Sahina Bose <sab...@redhat.com> wrote:
>
>> One of the host fails to install with
>>
>> Failed to install Host lago-hc-basic-suite-master-host2. Failed to execute 
>> stage 'Misc configuration': Failed to start service 'tuned'
>>
>> The installation on other 2 hosts have been successful - all hosts are
>> created from same template/repo. So this is strange.
>>
>
> https://bugzilla.redhat.com/show_bug.cgi?id=1258868#c5
>
> Please escalate if need, so we'll get it fixed.
>

Is this a race? The HC OST test ran successfully in the last build.


> Y.
>
>
>>
>>
>>
>>
>> On Wed, Jun 28, 2017 at 8:58 AM, <jenk...@jenkins.phx.ovirt.org> wrote:
>>
>>> Project: http://jenkins.ovirt.org/job/ovirt_master_hc-system-tests/
>>> Build: http://jenkins.ovirt.org/job/ovirt_master_hc-system-tests/154/
>>> Build Number: 154
>>> Build Status:  Failure
>>> Triggered By: Started by timer
>>>
>>> -
>>> Changes Since Last Success:
>>> -
>>> Changes for Build #154
>>> [Daniel Belenky] Exclude packages from ovirt-master.repo
>>>
>>> [Barak Korren] Filter builds sent to change queues by version
>>>
>>> [Barak Korren] Make change queue invoke OST basic suit
>>>
>>> [Barak Korren] Various UX improvements in change-queue jobs
>>>
>>> [Barak Korren] Add a job for direct deployments to 'tested'
>>>
>>> [Barak Korren] Use new job to update 'tested' from experimental
>>>
>>> [Barak Korren] Use new tested deploy job in timed builders
>>>
>>> [Eyal Edri] update build retention policy for big artifacts
>>>
>>>
>>>
>>>
>>> -
>>> Failed Tests:
>>> -
>>> 1 tests failed.
>>> FAILED:  002_bootstrap.add_hosts
>>>
>>> Error Message:
>>> Host lago-hc-basic-suite-master-host2 failed to install
>>>  >> begin captured logging << 
>>> ovirtlago.testlib: ERROR: * Unhandled exception in >> _host_is_up at 0x3f11cf8>
>>> Traceback (most recent call last):
>>>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line
>>> 217, in assert_equals_within
>>> res = func()
>>>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/ovirt-
>>> system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
>>> line 151, in _host_is_up
>>> raise RuntimeError('Host %s failed to install' % host.name())
>>> RuntimeError: Host lago-hc-basic-suite-master-host2 failed to install
>>> - >> end captured logging << -
>>>
>>> Stack Trace:
>>>   File "/usr/lib64/python2.7/unittest/case.py", line 369, in run
>>> testMethod()
>>>   File "/usr/lib/python2.7/site-packages/nose/case.py", line 197, in
>>> runTest
>>> self.test(*self.arg)
>>>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line
>>> 129, in wrapped_test
>>> test()
>>>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line
>>> 59, in wrapper
>>> return func(get_test_prefix(), *args, **kwargs)
>>>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/ovirt-
>>> system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
>>> line 164, in add_hosts
>>> testlib.assert_true_within(_host_is_up, timeout=15 * 60)
>>>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line
>>> 256, in assert_true_within
>>> assert_equals_within(func, True, timeout, allowed_exceptions)
>>>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line
>>> 217, in assert_equals_within
>>> res = func()
>>>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/ovirt-
>>> system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
>>> line 151, in _host_is_up
>>> raise RuntimeError('Host %s failed to install' % host.name())
>>> Host lago-hc-basic-suite-master-host2 failed to install
>>>  >> begin captured logging << 
>>> ovirtlago.testlib: ERROR: * Unhandled exception in >> _host_is_up at 0x3f11cf8>
>>> Traceback (most recent call last):
>>>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line
>>> 217, in assert_equals_within
>>> res = func()
>>>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/ovirt-
>>> system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
>>> line 151, in _host_is_up
>>> raise RuntimeError('Host %s failed to install' % host.name())
>>> RuntimeError: Host lago-hc-basic-suite-master-host2 failed to install
>>> - >> end captured logging << -
>>
>>
>>
>> ___
>> Devel mailing list
>> Devel@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/devel
>>
>
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] [Lago][HC] tuned service fails to start on one of HC hosts

2017-06-27 Thread Sahina Bose
One of the host fails to install with

Failed to install Host lago-hc-basic-suite-master-host2. Failed to
execute stage 'Misc configuration': Failed to start service 'tuned'

The installation on other 2 hosts have been successful - all hosts are
created from same template/repo. So this is strange.




On Wed, Jun 28, 2017 at 8:58 AM,  wrote:

> Project: http://jenkins.ovirt.org/job/ovirt_master_hc-system-tests/
> Build: http://jenkins.ovirt.org/job/ovirt_master_hc-system-tests/154/
> Build Number: 154
> Build Status:  Failure
> Triggered By: Started by timer
>
> -
> Changes Since Last Success:
> -
> Changes for Build #154
> [Daniel Belenky] Exclude packages from ovirt-master.repo
>
> [Barak Korren] Filter builds sent to change queues by version
>
> [Barak Korren] Make change queue invoke OST basic suit
>
> [Barak Korren] Various UX improvements in change-queue jobs
>
> [Barak Korren] Add a job for direct deployments to 'tested'
>
> [Barak Korren] Use new job to update 'tested' from experimental
>
> [Barak Korren] Use new tested deploy job in timed builders
>
> [Eyal Edri] update build retention policy for big artifacts
>
>
>
>
> -
> Failed Tests:
> -
> 1 tests failed.
> FAILED:  002_bootstrap.add_hosts
>
> Error Message:
> Host lago-hc-basic-suite-master-host2 failed to install
>  >> begin captured logging << 
> ovirtlago.testlib: ERROR: * Unhandled exception in  _host_is_up at 0x3f11cf8>
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 217,
> in assert_equals_within
> res = func()
>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/
> ovirt-system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
> line 151, in _host_is_up
> raise RuntimeError('Host %s failed to install' % host.name())
> RuntimeError: Host lago-hc-basic-suite-master-host2 failed to install
> - >> end captured logging << -
>
> Stack Trace:
>   File "/usr/lib64/python2.7/unittest/case.py", line 369, in run
> testMethod()
>   File "/usr/lib/python2.7/site-packages/nose/case.py", line 197, in
> runTest
> self.test(*self.arg)
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 129,
> in wrapped_test
> test()
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 59,
> in wrapper
> return func(get_test_prefix(), *args, **kwargs)
>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/
> ovirt-system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
> line 164, in add_hosts
> testlib.assert_true_within(_host_is_up, timeout=15 * 60)
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 256,
> in assert_true_within
> assert_equals_within(func, True, timeout, allowed_exceptions)
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 217,
> in assert_equals_within
> res = func()
>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/
> ovirt-system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
> line 151, in _host_is_up
> raise RuntimeError('Host %s failed to install' % host.name())
> Host lago-hc-basic-suite-master-host2 failed to install
>  >> begin captured logging << 
> ovirtlago.testlib: ERROR: * Unhandled exception in  _host_is_up at 0x3f11cf8>
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/ovirtlago/testlib.py", line 217,
> in assert_equals_within
> res = func()
>   File "/home/jenkins/workspace/ovirt_master_hc-system-tests/
> ovirt-system-tests/hc-basic-suite-master/test-scenarios/002_bootstrap.py",
> line 151, in _host_is_up
> raise RuntimeError('Host %s failed to install' % host.name())
> RuntimeError: Host lago-hc-basic-suite-master-host2 failed to install
> - >> end captured logging << -
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] [Lago] [HC] Hosted engine deploy fails on master

2017-06-23 Thread Sahina Bose
hosted-engine --deploy fails with

["***L:ERROR Internal error: No module named M2Crypto"]

Known issue?



On Fri, Jun 23, 2017 at 7:22 PM,  wrote:

> Project: http://jenkins.ovirt.org/job/ovirt_master_hc-system-tests/
> Build: http://jenkins.ovirt.org/job/ovirt_master_hc-system-tests/144/
> Build Number: 144
> Build Status:  Still Failing
> Triggered By: Started by user Sandro Bonazzola
>
> -
> Changes Since Last Success:
> -
> Changes for Build #141
> [Shani Leviim] Remove 'provisioned_size' disk parameter
>
> [Shirly Radco] ovirt-engine-metrics: add build-on-demand job
>
>
> Changes for Build #142
> [Shani Leviim] Changed some parameters on 004_basic_sanity#add_disk test
>
> [Gil Shinar] Separate JJB deploy to project and template
>
> [Gil Shinar] Added upstream-source-collector to jjb deploy job
>
>
> Changes for Build #143
> [Yaniv Kaul] Move disks to be ext4 based instead of XFS
>
> [Evgheni Dereveanchin] OVIRT-1451 - Add a CI mirror for Gluster 3.10
>
> [Juan Hernandez] Fix branches of metamodel jobs
>
> [Eyal Edri] reduce experimental history to 14
>
>
> Changes for Build #144
> [Yaniv Kaul] Move disks to be ext4 based instead of XFS
>
>
>
>
> -
> Failed Tests:
> -
> 1 tests failed.
> FAILED:  002_bootstrap.py.junit.xml.[empty]
>
> Error Message:
>
>
> Stack Trace:
> Test report file /home/jenkins/workspace/ovirt_master_hc-system-tests/
> exported-artifacts/002_bootstrap.py.junit.xml was length 0
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] ovirt_master_hc-system-tests and gluster

2017-06-16 Thread Sahina Bose
Thanks for pointing that out, Sandro!

Posted https://gerrit.ovirt.org/78246 to fix it.

On Fri, Jun 16, 2017 at 6:30 PM, Sandro Bonazzola 
wrote:

> Hi,
>
> I was looking at Bug 1429537
>  - [RFE] Rebase on
> gluster-3.10
> for oVirt Hosted Engine and pretty happy that ovirt_master_hc-system-tests
> is green.
>
> But despite ovirt-release-master is enabling gluster 10
> repositories[1], ovirt_master_hc-system-tests is consuming glusterfs
> 3.8.12[2]
>
> This jenkins job is therefore ineffective. Can you please fix the job for
> consuming glusterfs 3.10?
>
> Thanks,
>
> [1] https://gerrit.ovirt.org/#/c/74129/
> [2] http://jenkins.ovirt.org/job/ovirt_master_hc-system-
> tests/133/artifact/exported-artifacts/test_logs/hc-basic-
> suite-master/post-002_bootstrap.py/lago-hc-basic-
> suite-master-host0/_var_log/yum.log
>
> --
>
> SANDRO BONAZZOLA
>
> ASSOCIATE MANAGER, SOFTWARE ENGINEERING, EMEA ENG VIRTUALIZATION R
>
> Red Hat EMEA 
> 
> TRIED. TESTED. TRUSTED. 
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [oVirt Jenkins] ovirt_4.1_hc-system-tests - Build # 76 - Failure!

2017-05-29 Thread Sahina Bose
Yes, it is. See
http://jenkins.ovirt.org/job/ovirt_4.1_hc-system-tests/81/artifact/exported-artifacts/test_logs/hc-basic-suite-4.1/post-002_bootstrap.py/lago-hc-basic-suite-4-1-engine/_var_log/ovirt-engine/engine.log

On Mon, May 29, 2017 at 4:45 PM, Sandro Bonazzola <sbona...@redhat.com>
wrote:

>
>
> On Sat, May 27, 2017 at 8:23 PM, Eyal Edri <ee...@redhat.com> wrote:
>
>> Since HC tests are using appliance, I think we need to verify the
>> appliance build is working and includes the latest fix.
>>
>
> Is this still an issue?
>
>
>
>>
>> On Fri, May 26, 2017 at 3:39 PM, Sahina Bose <sab...@redhat.com> wrote:
>>
>>> I see that the 4.1 hc test is still failing with same error :
>>>
>>> Caused by: java.lang.ClassNotFoundException: 
>>> org.apache.commons.lang.StringUtils from [Module 
>>> "org.ovirt.vdsm-jsonrpc-java:main" from local module loader @5e91993f 
>>> (finder: local module finder @1c4af82c (roots: 
>>> /usr/share/ovirt-engine-wildfly-overlay/modules,/usr/share/ovirt-engine/modules/common,/usr/share/ovirt-engine-extension-aaa-jdbc/modules,/usr/share/ovirt-engine-wildfly/modules,/usr/share/ovirt-engine-wildfly/modules/system/layers/base))]
>>>
>>> Does the lago test need to be updated, as this fix has already landed?
>>>
>>> On Thu, May 25, 2017 at 2:03 PM, Eyal Edri <ee...@redhat.com> wrote:
>>>
>>>> So that was fixed already since hc suite is running once a day, I guess
>>>> triggering it will fix it.
>>>>
>>>> On Thu, May 25, 2017 at 11:31 AM, Piotr Kliczewski <
>>>> piotr.kliczew...@gmail.com> wrote:
>>>>
>>>>> This is the issue I described in the other thread. Engine you are
>>>>> using do not have a patch [1].
>>>>>
>>>>> [1] https://gerrit.ovirt.org/#/c/77203
>>>>>
>>>>> On Thu, May 25, 2017 at 10:26 AM, Eyal Edri <ee...@redhat.com> wrote:
>>>>>
>>>>>> Adding devel and Piotr.
>>>>>>
>>>>>> On Thu, May 25, 2017 at 9:31 AM, Sahina Bose <sab...@redhat.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Connecting to lago-hc-basic-suite-4-1-host0.lago.local/192.168.200.2
>>>>>>> 2017-05-24 23:31:00,492-04 ERROR 
>>>>>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.PollVDSCommand] 
>>>>>>> (org.ovirt.thread.pool-7-thread-1) [712d05a6] Error: 
>>>>>>> org.ovirt.engine.core.vdsbroker.TransportRunTimeException: Connection 
>>>>>>> issues during send request
>>>>>>> 2017-05-24 23:31:00,492-04 ERROR 
>>>>>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.PollVDSCommand] 
>>>>>>> (org.ovirt.thread.pool-7-thread-1) [712d05a6] Exception: 
>>>>>>> java.util.concurrent.ExecutionException: 
>>>>>>> org.ovirt.engine.core.vdsbroker.TransportRunTimeException: Connection 
>>>>>>> issues during send request
>>>>>>> Caused by: java.lang.ClassNotFoundException: 
>>>>>>> org.apache.commons.lang.StringUtils from [Module 
>>>>>>> "org.ovirt.vdsm-jsonrpc-java:main" from local module loader @5e91993f 
>>>>>>> (finder: local module finder @1c4af82c (roots: 
>>>>>>> /usr/share/ovirt-engine-wildfly-overlay/modules,/usr/share/ovirt-engine/modules/common,/usr/share/ovirt-engine-extension-aaa-jdbc/modules,/usr/share/ovirt-engine-wildfly/modules,/usr/share/ovirt-engine-wildfly/modules/system/layers/base))]
>>>>>>>
>>>>>>>
>>>>>>> Known issue?
>>>>>>>
>>>>>>>
>>>>>>> On Thu, May 25, 2017 at 9:16 AM, <jenk...@jenkins.phx.ovirt.org>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Project: http://jenkins.ovirt.org/job/ovirt_4.1_hc-system-tests/
>>>>>>>> Build: http://jenkins.ovirt.org/job/ovirt_4.1_hc-system-tests/76/
>>>>>>>> Build Number: 76
>>>>>>>> Build Status:  Failure
>>>>>>>> Triggered By: Started by timer
>>>>>>>>
>>>>>>>> -
>>>>>>>> Changes Since Last Success:
>>>>>>>> -
>>>>>>>> Changes for Build #76
>>>>>>>> [OndÅ™

Re: [ovirt-devel] [oVirt Jenkins] ovirt_4.1_hc-system-tests - Build # 76 - Failure!

2017-05-26 Thread Sahina Bose
I see that the 4.1 hc test is still failing with same error :

Caused by: java.lang.ClassNotFoundException:
org.apache.commons.lang.StringUtils from [Module
"org.ovirt.vdsm-jsonrpc-java:main" from local module loader @5e91993f
(finder: local module finder @1c4af82c (roots:
/usr/share/ovirt-engine-wildfly-overlay/modules,/usr/share/ovirt-engine/modules/common,/usr/share/ovirt-engine-extension-aaa-jdbc/modules,/usr/share/ovirt-engine-wildfly/modules,/usr/share/ovirt-engine-wildfly/modules/system/layers/base))]

Does the lago test need to be updated, as this fix has already landed?

On Thu, May 25, 2017 at 2:03 PM, Eyal Edri <ee...@redhat.com> wrote:

> So that was fixed already since hc suite is running once a day, I guess
> triggering it will fix it.
>
> On Thu, May 25, 2017 at 11:31 AM, Piotr Kliczewski <
> piotr.kliczew...@gmail.com> wrote:
>
>> This is the issue I described in the other thread. Engine you are using
>> do not have a patch [1].
>>
>> [1] https://gerrit.ovirt.org/#/c/77203
>>
>> On Thu, May 25, 2017 at 10:26 AM, Eyal Edri <ee...@redhat.com> wrote:
>>
>>> Adding devel and Piotr.
>>>
>>> On Thu, May 25, 2017 at 9:31 AM, Sahina Bose <sab...@redhat.com> wrote:
>>>
>>>> Connecting to lago-hc-basic-suite-4-1-host0.lago.local/192.168.200.2
>>>> 2017-05-24 23:31:00,492-04 ERROR 
>>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.PollVDSCommand] 
>>>> (org.ovirt.thread.pool-7-thread-1) [712d05a6] Error: 
>>>> org.ovirt.engine.core.vdsbroker.TransportRunTimeException: Connection 
>>>> issues during send request
>>>> 2017-05-24 23:31:00,492-04 ERROR 
>>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.PollVDSCommand] 
>>>> (org.ovirt.thread.pool-7-thread-1) [712d05a6] Exception: 
>>>> java.util.concurrent.ExecutionException: 
>>>> org.ovirt.engine.core.vdsbroker.TransportRunTimeException: Connection 
>>>> issues during send request
>>>> Caused by: java.lang.ClassNotFoundException: 
>>>> org.apache.commons.lang.StringUtils from [Module 
>>>> "org.ovirt.vdsm-jsonrpc-java:main" from local module loader @5e91993f 
>>>> (finder: local module finder @1c4af82c (roots: 
>>>> /usr/share/ovirt-engine-wildfly-overlay/modules,/usr/share/ovirt-engine/modules/common,/usr/share/ovirt-engine-extension-aaa-jdbc/modules,/usr/share/ovirt-engine-wildfly/modules,/usr/share/ovirt-engine-wildfly/modules/system/layers/base))]
>>>>
>>>>
>>>> Known issue?
>>>>
>>>>
>>>> On Thu, May 25, 2017 at 9:16 AM, <jenk...@jenkins.phx.ovirt.org> wrote:
>>>>
>>>>> Project: http://jenkins.ovirt.org/job/ovirt_4.1_hc-system-tests/
>>>>> Build: http://jenkins.ovirt.org/job/ovirt_4.1_hc-system-tests/76/
>>>>> Build Number: 76
>>>>> Build Status:  Failure
>>>>> Triggered By: Started by timer
>>>>>
>>>>> -
>>>>> Changes Since Last Success:
>>>>> -
>>>>> Changes for Build #76
>>>>> [Ondřej Svoboda] Use SDKv4 code in add_labeled_network.
>>>>>
>>>>> [Barak Korren] Refactor: Use macros in defaults file
>>>>>
>>>>> [Barak Korren] Default values for 'arch' and 'distro' in STD-CI
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> -
>>>>> Failed Tests:
>>>>> -
>>>>> 1 tests failed.
>>>>> FAILED:  002_bootstrap.add_hosts
>>>>>
>>>>> Error Message:
>>>>>
>>>>> status: 409
>>>>> reason: Conflict
>>>>> detail: Cannot add Host. There is no available server in the cluster
>>>>> to probe the new server.
>>>>>  >> begin captured logging << 
>>>>> lago.utils: ERROR: Error while running thread
>>>>> Traceback (most recent call last):
>>>>>   File "/usr/lib/python2.7/site-packages/lago/utils.py", line 58, in
>>>>> _ret_via_queue
>>>>> queue.put({'return': func()})
>>>>>   File "/home/jenkins/workspace/ovirt_4.1_hc-system-tests/ovirt-sys
>>>>> tem-tests/hc-basic-suite-4.1/test-scenarios/002_bootstrap.py", line
>>>>> 141, in _add_host
>>>>> return api.hosts.add(p)
>>>>>

Re: [ovirt-devel] [OST] [HC] HE VM fails to start

2017-04-06 Thread Sahina Bose
On Thu, Apr 6, 2017 at 5:41 PM, Yaniv Kaul <yk...@redhat.com> wrote:

>
>
> On Thu, Apr 6, 2017 at 3:02 PM, Sahina Bose <sab...@redhat.com> wrote:
>
>>
>>
>> On Thu, Apr 6, 2017 at 2:24 PM, Dan Kenigsberg <dan...@redhat.com> wrote:
>>
>>> On Thu, Apr 6, 2017 at 11:31 AM, Sahina Bose <sab...@redhat.com> wrote:
>>> >
>>> >
>>> > On Thu, Apr 6, 2017 at 1:31 PM, Dan Kenigsberg <dan...@redhat.com>
>>> wrote:
>>> >>
>>> >> I've merged the fix of https://gerrit.ovirt.org/#/c/75134/
>>> >>
>>> >> With it, the hc-basic suit no longer fail - it hangs for hours, and I
>>> >> don't know why.
>>> >>
>>> >> Sahina, can you look at
>>> >> http://jenkins.ovirt.org/job/ovirt-system-tests_manual/199/p
>>> arameters/
>>> >
>>> >
>>> > The gluster setup and HE install does take around 15-20 minutes. Looks
>>> like
>>> > you aborted in between?
>>>
>>>
>>> What does the log say?
>>> http://jenkins.ovirt.org/job/ovirt-system-tests_manual/199/console
>>>
>>> I aborted the job after 1 hour and 47 minutes.
>>>
>>
>> Sorry, missed that. I could not make out much from logs apart from the
>> fact that it's stuck on starting gluster services. Since there are no
>> gluster logs available from the host VMs, cannot dig further.
>>
>
> Which logs are needed? I thought we collect everything from /var/log ?
>

I think logs are collected only for the tests?
This is prior to running the tests in tests-scenario


> Y.
>
>
>>
>>
>>
>> ___
>> Devel mailing list
>> Devel@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/devel
>>
>
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [OST] [HC] HE VM fails to start

2017-04-04 Thread Sahina Bose
Job's still failing on master.
Could this be related to network patches that got merged on Mar 28, for
instance https://gerrit.ovirt.org/#/c/74390/ ?

On Thu, Mar 30, 2017 at 11:41 AM, Sahina Bose <sab...@redhat.com> wrote:

> The error in vdsm.log
>
> Traceback (most recent call last):
>   File "/usr/share/vdsm/virt/vm.py", line 2016, in _setup_devices
> dev_object.setup()
>   File "/usr/lib/python2.7/site-packages/vdsm/virt/vmdevices/graphics.py", 
> line 63, in setup
> net_api.create_libvirt_network(display_network, self.conf['vmId'])
>   File "/usr/lib/python2.7/site-packages/vdsm/network/api.py", line 90, in 
> create_libvirt_network
> libvirt.create_network(netname, user_reference)
>   File "/usr/lib/python2.7/site-packages/vdsm/network/libvirt.py", line 94, 
> in create_network
> if not is_libvirt_network(netname):
>   File "/usr/lib/python2.7/site-packages/vdsm/network/libvirt.py", line 159, 
> in is_libvirt_network
> netname = LIBVIRT_NET_PREFIX + netname
> TypeError: cannot concatenate 'str' and 'NoneType' objects
> 2017-03-29 22:58:39,559-0400 ERROR (vm/d71bdf4e) [virt.vm] 
> (vmId='d71bdf4e-1eb3-4762-bd0e-05bb9f5e43ef') The vm start process failed 
> (vm:659)
>
> The tests last passed on Mar 28. Did a recent patch break this?
>
> The full build logs at http://jenkins.ovirt.org/job/
> ovirt_master_hc-system-tests/52/artifact/exported-artifacts/test_logs/
>
> thanks
> sahina
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] Gluster/virt ports clarifications.

2017-04-03 Thread Sahina Bose
On Sun, Apr 2, 2017 at 7:23 PM, Leon Goldberg  wrote:

> Hey,
>
> We're gathering information regarding the ports we open as part of the
> firewalld migration research.
>
> We have most of the current ports covered by either firewalld itself or by
> 3rd party packages, however some questions remain unanswered:
>
>
> IPTablesConfigForVirt:
>
> - serial consoles (tcp/2223): Is this required? can't find a single
> reference to a listening entity. Either way, I couldn't find a relevant
> service that provides it.
>
>
> IPTablesConfigForGluster:
>
> - Gluster swift (tcp/8080): Doesn't appear in Gluster's firewalld service.
> Should be added to Gluster's firewalld service?
>

This is required when gluster-swift service is running on the hosts.
gluster-swift is no longer installed as part of glusterfs-server
installation, so this can be removed.


>
> - tcp/39543 and tcp/55863, appear under "status". Couldn't find a relevant
> service that provides them. Should be added? (and if so, where?)
>

The https://access.redhat.com/documentation/en-us/red_hat_gluste
r_storage/3.2/html/installation_guide/port_information mentions these as
needed by oVirt. Could be legacy? These can be removed if oVirt no longer
uses these ports


>
> - nlockmgr (tcp/38468, udp/963, tcp/965): tcp/38468 appears in gluster's
> service. Couldn't find a relevant service that provides the other two.
> Should be added? (and if so, where?)
>

These are needed by NFS LockManager, and needed when gluster nfs access is
enabled on gluster volume


>
> - ctdbd (tcp/4379): Couldn't find a relevant service that provides this.
> Should be added? (and if so, where?)
>

These are needed to access gluster volume using SMB. CTDB service uses this
port


>
>
> Thanks,
> Leon
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] [OST] [HC] HE VM fails to start

2017-03-30 Thread Sahina Bose
The error in vdsm.log

Traceback (most recent call last):
  File "/usr/share/vdsm/virt/vm.py", line 2016, in _setup_devices
dev_object.setup()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vmdevices/graphics.py",
line 63, in setup
net_api.create_libvirt_network(display_network, self.conf['vmId'])
  File "/usr/lib/python2.7/site-packages/vdsm/network/api.py", line
90, in create_libvirt_network
libvirt.create_network(netname, user_reference)
  File "/usr/lib/python2.7/site-packages/vdsm/network/libvirt.py",
line 94, in create_network
if not is_libvirt_network(netname):
  File "/usr/lib/python2.7/site-packages/vdsm/network/libvirt.py",
line 159, in is_libvirt_network
netname = LIBVIRT_NET_PREFIX + netname
TypeError: cannot concatenate 'str' and 'NoneType' objects
2017-03-29 22:58:39,559-0400 ERROR (vm/d71bdf4e) [virt.vm]
(vmId='d71bdf4e-1eb3-4762-bd0e-05bb9f5e43ef') The vm start process
failed (vm:659)

The tests last passed on Mar 28. Did a recent patch break this?

The full build logs at
http://jenkins.ovirt.org/job/ovirt_master_hc-system-tests/52/artifact/exported-artifacts/test_logs/

thanks
sahina
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] OST: HE installation on 4.1 fails due to default cluster level

2017-03-02 Thread Sahina Bose
On Mon, Feb 27, 2017 at 9:01 PM, Sandro Bonazzola <sbona...@redhat.com>
wrote:

>
>
> On Mon, Feb 27, 2017 at 4:03 PM, Eyal Edri <ee...@redhat.com> wrote:
>
>>
>>
>> On Fri, Feb 24, 2017 at 2:49 PM, Martin Perina <mper...@redhat.com>
>> wrote:
>>
>>>
>>>
>>> On Fri, Feb 24, 2017 at 11:58 AM, Sahina Bose <sab...@redhat.com> wrote:
>>>
>>>> Hi all,
>>>>
>>>> The ovirt-engine 4.1 appliance has the Default cluster set to 4.2 and
>>>> the hyperconverged OST tests fail here as the 4.1 host cannot be added to
>>>> cluster.  (CLUSTER_VERSION_INCOMPATIBLE_WITH_CLUSTER)
>>>>
>>>> The appliance is from  http://resources.ovirt.org/rep
>>>> os/ovirt/tested/master/rpm/el7/noarch/ovirt-engine-appliance
>>>> -4.1-20170215.1.el7.centos.noarch.rpm
>>>>
>>>
>>> ​This is the tested master repository, so it should contain master code
>>> where 4.2 cluster level is already default. So probably only
>>> ovirt-engine-appliance version has not yet been bumped to 4.2 ...
>>> ​
>>>
>>> ​I'm just wondering why we don't have any ovirt-engine-appliance in
>>> latest 4.1 tested repo:
>>>
>>
>> We can't publish appliance to tested repo if its not tested/verified.
>> The correct thing to do is to make appliance build from tested repo and
>> then run the HE/HC suite and only if it pass publish it to verified tested
>> repo.
>>
>> I don't think we're there yet, but there is some progress.
>> Maybe Sandro can update on current status.
>>
>
>
> No update yet. We're out of capacity to handle more automation until
> 4.0.7, and 4.1.1 are out.
>


Is there a 4.1 appliance rpm available in any repo, so that I can try the
HC tests on the 4.1 branch?



>
>
>
>
>>
>>
>>>
>>> http://plain.resources.ovirt.org/repos/ovirt/tested/4.1/rpm/el7/noarch/
>>> ​
>>> @Ryan?
>>>
>>>
>>>>
>>>> Has the default cluster level been changed in an updated appliance?
>>>>
>>>> Test results at http://jenkins.ovirt.org/view/oVirt system
>>>> tests/job/ovirt_4.1_hc-system-tests/1/artifact/exported-artifacts
>>>>
>>>> thanks
>>>> sahina
>>>>
>>>> ___
>>>> Devel mailing list
>>>> Devel@ovirt.org
>>>> http://lists.ovirt.org/mailman/listinfo/devel
>>>>
>>>
>>>
>>> ___
>>> Devel mailing list
>>> Devel@ovirt.org
>>> http://lists.ovirt.org/mailman/listinfo/devel
>>>
>>
>>
>>
>> --
>> Eyal Edri
>> Associate Manager
>> RHV DevOps
>> EMEA ENG Virtualization R
>> Red Hat Israel
>>
>> phone: +972-9-7692018 <+972%209-769-2018>
>> irc: eedri (on #tlv #rhev-dev #rhev-integ)
>>
>
>
>
> --
> Sandro Bonazzola
> Better technology. Faster innovation. Powered by community collaboration.
> See how it works at redhat.com
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] OST: HE installation on 4.1 fails due to default cluster level

2017-02-24 Thread Sahina Bose
Hi all,

The ovirt-engine 4.1 appliance has the Default cluster set to 4.2 and the
hyperconverged OST tests fail here as the 4.1 host cannot be added to
cluster.  (CLUSTER_VERSION_INCOMPATIBLE_WITH_CLUSTER)

The appliance is from
http://resources.ovirt.org/repos/ovirt/tested/master/rpm/el7/noarch/ovirt-engine-appliance-4.1-20170215.1.el7.centos.noarch.rpm

Has the default cluster level been changed in an updated appliance?

Test results at http://jenkins.ovirt.org/view/oVirt system
tests/job/ovirt_4.1_hc-system-tests/1/artifact/exported-artifacts

thanks
sahina
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] HE setup failure

2017-02-13 Thread Sahina Bose
Hi all,

The HE setup fails in ovirt-system-tests while deploying HE on
hyperconverged gluster setup using master

Error :
Failed to execute stage 'Misc configuration': "

Traceback from hosted-engine log:
ProtocolError: 
  File
"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/storage_backends.py",
line 279, in create_volume
volUUID=volume_uuid
  File
"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/storage_backends.py",
line 245, in _get_volume_path
volUUID
  File "/usr/lib64/python2.7/xmlrpclib.py", line **FILTERED**3, in __call__
return self.__send(self.__name, args)
  File "/usr/lib64/python2.7/xmlrpclib.py", line 1587, in __request
verbose=self.__verbose
  File "/usr/lib64/python2.7/xmlrpclib.py", line 1273, in request
return self.single_request(host, handler, request_body, verbose)
  File "/usr/lib64/python2.7/xmlrpclib.py", line 1321, in single_request
response.msg,
ProtocolError: 

Is this a regression?
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] [lago-devel] vdsm service fails to start on HC setup

2017-02-07 Thread Sahina Bose
On Tue, Feb 7, 2017 at 1:38 AM, Nir Soffer <nsof...@redhat.com> wrote:

> Fixed in master.
>

Thanks! works now.


>
> On Mon, Feb 6, 2017 at 3:40 PM, Nir Soffer <nsof...@redhat.com> wrote:
> > On Mon, Feb 6, 2017 at 11:34 AM, Yaniv Bronheim <ybron...@redhat.com>
> wrote:
> >> we merged https://gerrit.ovirt.org/#/c/71231/ yesterday, calling
> vdsm-tool
> >> configure without specifying modules will run this new lvm configure.
> didn't
> >> see any issues that can come up, but it is a regression
> >>
> >> On Mon, Feb 6, 2017 at 10:26 AM, Yaniv Kaul <yk...@redhat.com> wrote:
> >>>
> >>> +Nir
> >>>
> >>> On Feb 6, 2017 10:12 AM, "Sahina Bose" <sab...@redhat.com> wrote:
> >>>>
> >>>> Hi all,
> >>>>
> >>>> While verifying the test to deploy hyperconverged HE [1], I'm running
> >>>> into an issue today where vdsm fails to start.
> >>>>
> >>>> In the logs -
> >>>>  lago-basic-suite-hc-host0 vdsmd_init_common.sh: Error:
> >>>> Feb  6 02:21:32 lago-basic-suite-hc-host0 vdsmd_init_common.sh: One of
> >>>> the modules is not configured to work with VDSM.
> >>>>
> >>>> Starting manually - vdsm-tool configure --force gives:
> >>>> Units need configuration: {'lvm2-lvmetad.service': {'LoadState':
> >>>> 'masked', 'ActiveState': 'failed'}}
> >>>>
> >>>> Is this a known issue?
> >>>>
> >>>> [1] - https://gerrit.ovirt.org/57283
> >
> > Yes, I posted a fix yesterday:
> > https://gerrit.ovirt.org/71677
> >
> > We are waiting for review.
> >
> > Nir
> >
> >>>>
> >>>> thanks
> >>>> sahina
> >>>>
> >>>>
> >>>> ___
> >>>> lago-devel mailing list
> >>>> lago-de...@ovirt.org
> >>>> http://lists.ovirt.org/mailman/listinfo/lago-devel
> >>>>
> >>>
> >>> ___
> >>> Devel mailing list
> >>> Devel@ovirt.org
> >>> http://lists.ovirt.org/mailman/listinfo/devel
> >>
> >>
> >>
> >>
> >> --
> >> Yaniv Bronhaim.
> >>
> >> ___
> >> Devel mailing list
> >> Devel@ovirt.org
> >> http://lists.ovirt.org/mailman/listinfo/devel
> ___
> Devel mailing list
> Devel@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/devel
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] vdsm service fails to start on HC setup

2017-02-06 Thread Sahina Bose
Hi all,

While verifying the test to deploy hyperconverged HE [1], I'm running into
an issue today where vdsm fails to start.

In the logs -
 lago-basic-suite-hc-host0 vdsmd_init_common.sh: Error:
Feb  6 02:21:32 lago-basic-suite-hc-host0 vdsmd_init_common.sh: One of the
modules is not configured to work with VDSM.

Starting manually - vdsm-tool configure --force gives:
Units need configuration: {'lvm2-lvmetad.service': {'LoadState': 'masked',
'ActiveState': 'failed'}}

Is this a known issue?

[1] - https://gerrit.ovirt.org/57283

thanks
sahina
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] blockcommit and gluster network disk path

2016-11-22 Thread Sahina Bose
On Mon, Nov 21, 2016 at 3:32 PM, Sahina Bose <sab...@redhat.com> wrote:

> Hi,
>
> I'm running into problems with blockcommit and gluster network disks -
> wanted to check how to pass path for network disks. How's the protocol and
> host parameters specified?
>
> For a backing volume chain as below, executing
> virsh blockcommit fioo5 vmstore/912d9062-3881-479b-
> a6e5-7b074a252cb6/images/27b0cbcb-4dfd-4eeb-8ab0-
> 8fda54a6d8a4/027a3b37-77d4-4fa9-8173-b1fedba1176c --base
> vmstore/912d9062-3881-479b-a6e5-7b074a252cb6/images/
> 27b0cbcb-4dfd-4eeb-8ab0-8fda54a6d8a4/d4c23ec6-20ce-4a2f-9b32-ca91e65a114a
> --top vmstore/912d9062-3881-479b-a6e5-7b074a252cb6/images/
> 27b0cbcb-4dfd-4eeb-8ab0-8fda54a6d8a4/027a3b37-77d4-4fa9-8173-b1fedba1176c
> --verbose --wait
>
> gives "error: invalid argument: No device found for specified path".
>


I can get this to work if I provide index based argument. Is this the only
supported way?

virsh blockcommit fioo5 vda --base vda[1] --active --verbose --wait


>
> 
>  io='threads'/>
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 27b0cbcb-4dfd-4eeb-8ab0-8fda54a6d8a4
> 
> 
> 
> 
>
>
>
> thanks
> sahina
>
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] blockcommit and gluster network disk path

2016-11-21 Thread Sahina Bose
Hi,

I'm running into problems with blockcommit and gluster network disks -
wanted to check how to pass path for network disks. How's the protocol and
host parameters specified?

For a backing volume chain as below, executing
virsh blockcommit fioo5
vmstore/912d9062-3881-479b-a6e5-7b074a252cb6/images/27b0cbcb-4dfd-4eeb-8ab0-8fda54a6d8a4/027a3b37-77d4-4fa9-8173-b1fedba1176c
--base
vmstore/912d9062-3881-479b-a6e5-7b074a252cb6/images/27b0cbcb-4dfd-4eeb-8ab0-8fda54a6d8a4/d4c23ec6-20ce-4a2f-9b32-ca91e65a114a
--top
vmstore/912d9062-3881-479b-a6e5-7b074a252cb6/images/27b0cbcb-4dfd-4eeb-8ab0-8fda54a6d8a4/027a3b37-77d4-4fa9-8173-b1fedba1176c
--verbose --wait

gives "error: invalid argument: No device found for specified path".





















27b0cbcb-4dfd-4eeb-8ab0-8fda54a6d8a4







thanks
sahina
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] Integrating oVirt and Gluster geo-replication to provide a DR solution

2016-09-14 Thread Sahina Bose
Hi all,

Though there are many solutions that integrate with oVirt to provide
disaster recovery for the guest images, these solutions either rely on
backup agents running on guests or third party software and are complicated
to setup

Since oVirt already integrates with glusterfs, we can leverage gluster's
geo-replication feature to mirror contents to a remote/secondary site
periodically for disaster recovery, without the need for additional software

Please review the PR[1] for the feature page outlining the solution and
integration in oVirt.
Comments and feedback welcome.

[1] https://github.com/oVirt/ovirt-site/pull/453

thanks,
sahina
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] Maintainer rights on vdsm - ovirt-3.5-gluster

2015-11-17 Thread Sahina Bose

Thanks!

On 11/16/2015 07:44 PM, David Caro wrote:

Done!

On 11/16 15:28, Dan Kenigsberg wrote:

On Wed, Apr 29, 2015 at 10:03:47AM +0200, David Caro wrote:

Done

On 04/20, Dan Kenigsberg wrote:

On Mon, Apr 20, 2015 at 03:20:18PM +0530, Sahina Bose wrote:

Hi!

On the vdsm branch "ovirt-3.5-gluster", could you provide merge rights to
Bala (barum...@redhat.com) ?

+1 from me.

ovirt-3.5-gluster needs a rebase on top of the current ovirt-3.5

Thanks. Few months have passed, and now we need Sahina herself as an
admin of the this branch. Would you please add her as well?


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] [TICKET] failing gluster tests -- RefreshGlusterVolumeDetailsCommandTest

2015-06-29 Thread Sahina Bose


On 06/26/2015 06:28 PM, Sandro Bonazzola wrote:

Il 26/06/2015 14:29, Martin Perina ha scritto:

Hi,

there's something really strange with this test. I executed the build including
unit test execution to verify usage of OpenJDK 1.8 for building the engine and
here are results:

  Fedora20/OpenJDK 1.7 - build successful
  Fedora21/OpenJDK 1.8 - build always failed with error mentioned below
  Fedora22/OpenJDK 1.8 - build successful

it fails for me on Fedora22/OpenJDK 1.8 
java-1.8.0-openjdk-1.8.0.45-40.b14.fc22.x86_64


Looking into this.






Martin

- Original Message -

From: Doron Fediuck dfedi...@redhat.com
To: Eyal Edri ee...@redhat.com, rhev-gluster rhev-glus...@redhat.com, Vijay 
Bellur vbel...@redhat.com
Cc: infra in...@ovirt.org, devel@ovirt.org
Sent: Friday, June 26, 2015 1:18:27 PM
Subject: Re: [ovirt-devel] [TICKET] failing gluster tests   --  
RefreshGlusterVolumeDetailsCommandTest

Adding rhev-gluster.
Vijay / Sahina can you please review?

On Jun 26, 2015 12:12, Eyal Edri ee...@redhat.com wrote:



- Original Message -

From: Sandro Bonazzola sbona...@redhat.com
To: Doron Fediuck dfedi...@redhat.com, Greg Sheremeta
gsher...@redhat.com, infra in...@ovirt.org,
devel@ovirt.org, Sahina Bose sab...@redhat.com
Sent: Friday, June 26, 2015 10:55:51 AM
Subject: Re: [ovirt-devel] [TICKET] failing gluster tests --
RefreshGlusterVolumeDetailsCommandTest

Il 25/06/2015 17:53, Doron Fediuck ha scritto:

Sahina, any idea?

On 25/06/15 18:06, Greg Sheremeta wrote:

These tests appear to be randomly failing in Jenkins. Was this a
memory issue? Pretty odd error -- Could not initialize class --
never seen that one before.

It worked on a dependent patch's build, and I didn't touch any
gluster stuff.

http://jenkins.ovirt.org/job/ovirt-engine_master_unit-tests_gerrit/40725/console

Thanks,
Greg


15:56:32 Results :
15:56:32
15:56:32 Tests in error:
15:56:32
org.ovirt.engine.core.bll.gluster.RefreshGlusterVolumeDetailsCommandTest
15:56:32
testRefreshLightWeight(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest):
Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
testRefreshHeavyWeightFor31(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest):
Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
testRefreshHeavyWeightFor32(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest):
Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
testRefreshLightWeightFor33(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest):
Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
15:56:32 Tests run: 2341, Failures: 0, Errors: 5, Skipped: 6
15:56:32

Just sent an email about the same error. It's reproducible near to 100%
please address as soon as possible.


this was the latest gluster patch related:
https://gerrit.ovirt.org/#/c/42863/
might worth to revert until fixed.




Greg Sheremeta
Red Hat, Inc.
Sr. Software Engineer, RHEV
Cell: 919-807-1086
gsher...@redhat.com
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel



--
Sandro Bonazzola
Better technology. Faster innovation. Powered by community collaboration.
See how it works at redhat.com
___
Infra mailing list
in...@ovirt.org
http://lists.ovirt.org/mailman/listinfo/infra




--
Eyal Edri
Supervisor, RHEV CI
EMEA ENG Virtualization RD
Red Hat Israel

phone: +972-9-7692018
irc: eedri (on #tlv #rhev-dev #rhev-integ)

___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel





___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] [rhev-gluster] [TICKET] failing gluster tests -- RefreshGlusterVolumeDetailsCommandTest

2015-06-29 Thread Sahina Bose


On 06/29/2015 02:39 PM, Sahina Bose wrote:


On 06/26/2015 06:28 PM, Sandro Bonazzola wrote:

Il 26/06/2015 14:29, Martin Perina ha scritto:

Hi,

there's something really strange with this test. I executed the 
build including
unit test execution to verify usage of OpenJDK 1.8 for building the 
engine and

here are results:

  Fedora20/OpenJDK 1.7 - build successful
  Fedora21/OpenJDK 1.8 - build always failed with error mentioned below
  Fedora22/OpenJDK 1.8 - build successful
it fails for me on Fedora22/OpenJDK 1.8 
java-1.8.0-openjdk-1.8.0.45-40.b14.fc22.x86_64


Looking into this.



https://gerrit.ovirt.org/42993 merged to fix this.








Martin

- Original Message -

From: Doron Fediuck dfedi...@redhat.com
To: Eyal Edri ee...@redhat.com, rhev-gluster 
rhev-glus...@redhat.com, Vijay Bellur vbel...@redhat.com

Cc: infra in...@ovirt.org, devel@ovirt.org
Sent: Friday, June 26, 2015 1:18:27 PM
Subject: Re: [ovirt-devel] [TICKET] failing gluster tests --
RefreshGlusterVolumeDetailsCommandTest


Adding rhev-gluster.
Vijay / Sahina can you please review?

On Jun 26, 2015 12:12, Eyal Edri ee...@redhat.com wrote:



- Original Message -

From: Sandro Bonazzola sbona...@redhat.com
To: Doron Fediuck dfedi...@redhat.com, Greg Sheremeta
gsher...@redhat.com, infra in...@ovirt.org,
devel@ovirt.org, Sahina Bose sab...@redhat.com
Sent: Friday, June 26, 2015 10:55:51 AM
Subject: Re: [ovirt-devel] [TICKET] failing gluster tests --
RefreshGlusterVolumeDetailsCommandTest

Il 25/06/2015 17:53, Doron Fediuck ha scritto:

Sahina, any idea?

On 25/06/15 18:06, Greg Sheremeta wrote:

These tests appear to be randomly failing in Jenkins. Was this a
memory issue? Pretty odd error -- Could not initialize class --
never seen that one before.

It worked on a dependent patch's build, and I didn't touch any
gluster stuff.

http://jenkins.ovirt.org/job/ovirt-engine_master_unit-tests_gerrit/40725/console 



Thanks,
Greg


15:56:32 Results :
15:56:32
15:56:32 Tests in error:
15:56:32
org.ovirt.engine.core.bll.gluster.RefreshGlusterVolumeDetailsCommandTest 


15:56:32
testRefreshLightWeight(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest): 


Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
testRefreshHeavyWeightFor31(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest): 


Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
testRefreshHeavyWeightFor32(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest): 


Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
testRefreshLightWeightFor33(org.ovirt.engine.core.bll.gluster.GlusterSyncJobTest): 


Could not initialize class
org.ovirt.engine.core.bll.gluster.GlusterSyncJob
15:56:32
15:56:32 Tests run: 2341, Failures: 0, Errors: 5, Skipped: 6
15:56:32
Just sent an email about the same error. It's reproducible near 
to 100%

please address as soon as possible.


this was the latest gluster patch related:
https://gerrit.ovirt.org/#/c/42863/
might worth to revert until fixed.




Greg Sheremeta
Red Hat, Inc.
Sr. Software Engineer, RHEV
Cell: 919-807-1086
gsher...@redhat.com
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel



--
Sandro Bonazzola
Better technology. Faster innovation. Powered by community 
collaboration.

See how it works at redhat.com
___
Infra mailing list
in...@ovirt.org
http://lists.ovirt.org/mailman/listinfo/infra




--
Eyal Edri
Supervisor, RHEV CI
EMEA ENG Virtualization RD
Red Hat Israel

phone: +972-9-7692018
irc: eedri (on #tlv #rhev-dev #rhev-integ)

___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel







___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] [oVirt 3.6 Localization Question #2] Geo-replication mount brocker has been setup

2015-05-12 Thread Sahina Bose


On 05/08/2015 03:18 PM, Yuko Katabami wrote:

Hi all again,

I have another question.

*File:***LocalizedEnums*
**Resource ID:*** AuditLogType___GLUSTER_SETUP_GEOREP_MOUNT_BROKER*
**Strings:***Geo-replication mount brocker has been setup
*Question:* This should be a typo for broker?


yes, it is a typo



Kind regards,

Yuko Katabami


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

[ovirt-devel] Maintainer rights on vdsm - ovirt-3.5-gluster

2015-04-20 Thread Sahina Bose

Hi!

On the vdsm branch ovirt-3.5-gluster, could you provide merge rights 
to Bala (barum...@redhat.com) ?


thanks
sahina
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] DAO test is failing on master

2015-01-28 Thread Sahina Bose


On 01/28/2015 01:35 PM, Barak Korren wrote:

Sandro,
I remember vaguely that you have to use the 'postgres' user when connecting.
I don't see user information in the URL, maybe it needs to be:

jdbc:postgresql://postgres@localhost/engine_dao_tests

Or maybe:

jdbc:postgresql://localhost/engine_dao_tests;user=postgreas

I'm not 100% sure about that, as my jdbc-foo is not that strong...


ovirt-db-scheduler-test.properties does have a username and password property set to 
engine now. Do the *DaoTest use postgres user on Jenkins server?

thanks!
sahina




Barak.

- Original Message -

From: Sandro Bonazzola sbona...@redhat.com
To: Sahina Bose sab...@redhat.com, infra in...@ovirt.org, devel@ovirt.org, 
Barak Korren
bkor...@redhat.com, David Caro dcaro...@redhat.com
Sent: Wednesday, January 28, 2015 9:41:23 AM
Subject: Re: [ovirt-devel] DAO test is failing on master

Il 28/01/2015 07:59, Sahina Bose ha scritto:

Sandro,

Could be devel error. This test looks for database settings as per
backend/manager/modules/scheduler/src/test/resources/ovirt-db-scheduler-test.properties

The Data source URL is specified as
jdbc:postgresql://localhost/engine_dao_tests. Does this need to be
changed for Jenkins jobs?

David? Barak?



On 01/27/2015 09:29 PM, Sandro Bonazzola wrote:

http://jenkins.ovirt.org/job/ovirt-engine_master_dao-unit-tests_merged/10022/console

15:50:12 Failed tests:
15:50:12
scheduleARecurringJob(org.ovirt.engine.core.utils.timer.DBSchedulerUtilQuartzImplTest):
Unexpected exception occured -Failed to obtain DB
connection from data source 'EngineDS': java.sql.SQLException: Connections
could not be acquired from the underlying database!
15:50:12
scheduleAJob(org.ovirt.engine.core.utils.timer.DBSchedulerUtilQuartzImplTest):
Unexpected exception occured -Failed to obtain DB connection
from data source 'EngineDS': java.sql.SQLException: Connections could not
be acquired from the underlying database!

Not sure if it's infra or devel issue.



--
Sandro Bonazzola
Better technology. Faster innovation. Powered by community collaboration.
See how it works at redhat.com



___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] DAO test is failing on master

2015-01-27 Thread Sahina Bose

Sandro,

Could be devel error. This test looks for database settings as per 
backend/manager/modules/scheduler/src/test/resources/ovirt-db-scheduler-test.properties


The Data source URL is specified as 
jdbc:postgresql://localhost/engine_dao_tests. Does this need to be 
changed for Jenkins jobs?


On 01/27/2015 09:29 PM, Sandro Bonazzola wrote:

http://jenkins.ovirt.org/job/ovirt-engine_master_dao-unit-tests_merged/10022/console

15:50:12 Failed tests:
15:50:12   
scheduleARecurringJob(org.ovirt.engine.core.utils.timer.DBSchedulerUtilQuartzImplTest):
 Unexpected exception occured -Failed to obtain DB
connection from data source 'EngineDS': java.sql.SQLException: Connections 
could not be acquired from the underlying database!
15:50:12   
scheduleAJob(org.ovirt.engine.core.utils.timer.DBSchedulerUtilQuartzImplTest): 
Unexpected exception occured -Failed to obtain DB connection
from data source 'EngineDS': java.sql.SQLException: Connections could not be 
acquired from the underlying database!

Not sure if it's infra or devel issue.



___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] [ovirt-users] [Feature review] Select network to be used for glusterfs

2015-01-15 Thread Sahina Bose


On 01/15/2015 02:27 PM, Dan Kenigsberg wrote:

On Thu, Jan 15, 2015 at 12:34:18PM +0530, Sahina Bose wrote:


I've updated the feature page with the REST API and other comments. On
further thought, there will be no change to Add brick API, as the engine
will select the network to be used based on the networks setup for the host.
If Storage network role is associated with any of the networks, this will
be used. Otherwise, the host's address will be used to add the brick.


snip

The paragraph above rules out the use case I lay below. Could you relate
to it? Isn't it a reasonable use case?


If I am not mistaken, it could make sense to have a setup with one brick
using network A and another - using network B. Does your design support
this? I think that this would be particularly important on upgraded
clusters, where the management network is already used, but newly
created bricks should start using another network.




On upgraded clusters, the user would have to assign a network with the 
role Storage network. Any newly created brick would then start using 
this, rather than the management network.


I'm not sure if the use case where each brick on a host is added using 
different networks is a common one (apart from the upgrade scenario, 
that is). If it is, we could provide an Advanced edit option in the UI 
to select network in Add Bricks dialog.
The entity design supports setting different network per brick and the 
REST API already provides a way to set this as an optional parameter.



May I repeat my follow request? It would help me understand the content
of the feature.


Sorry, I missed these before!



Would you add a feature page section regarding modification to the
Vdsm/Engine API?


http://www.ovirt.org/Features/Select_Network_For_Gluster#Change_to_VDSM_API
http://www.ovirt.org/Features/Select_Network_For_Gluster#Change_to_REST_API



One last comment - may I ask that new APIs accept both ipv4 and ipv6
addresses? There is an ongoing effort to support ipv6 on Vdsm.



Glusterfs does not support ipv6 yet, so addition of bricks using ipv6 
addresses would not work.


thanks,
sahina

___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] [ovirt-users] [Feature review] Select network to be used for glusterfs

2015-01-13 Thread Sahina Bose


On 01/12/2015 06:21 PM, Lior Vernia wrote:

Hi Sahina! :)

Cool feature, and I think long-awaited by many users. I have a few comments:

1. In the Add Bricks dialog, it seems like the IP Address field is a
list box - I presume the items contained there are all IP addresses
configured on the host's interfaces.

1. a. May I suggest that this contain network names instead of IP
addresses? Would be easier for users to think about things (they surely
remember the meaning of network names, not necessarily of IP addresses).





1. b. If I correctly understood the mock-up, then configuring a Storage
Network role only affects the default entry chosen in the list box. Is
it really worth the trouble of implementing this added role? It's quite
different than display/migration roles, which are used to determine what
IP address to use at a later time (i.e. not when configuring the host),
when a VM is run/migrated in the cluster.



If not for Storage network role, how would we default which network to 
use. In fact, we are planning to remove the drop down to choose network 
from the Add Brick UI, to avoid confusion and just use the network with 
this role, if available - otherwise use the host address. (host_address 
in vds_static)


Will update page accordingly




1. c. A word of warning: sometimes a host interface's IP address is
missing in the engine - this usually happens when they're configured for
the first time with DHCP, and the setup networks command returns before
an IP address is allocated (this can later be resolved by refreshing
host capabilities, there's a button for that). So when displaying items
in the list box, you should really check that an IP address exists for
each network.

2. Storage Network: if you intend to keep this role in the feature (I
don't think it adds a lot of functionality, see article 1b), it might be
better to call it Gluster Network - otherwise people using virt mode
might think this network is gonna be used to communicate with other
types of storage domains.



Could this network be reused for other storage needs also. If not, we 
can rename it gluster network




Yours, Lior.

On 12/01/15 14:00, Sahina Bose wrote:

Hi all,

Please review the feature page for this proposed solution and provide
your inputs - http://www.ovirt.org/Features/Select_Network_For_Gluster

thanks
sahina


___
Users mailing list
us...@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] [ovirt-users] [Feature review] Select network to be used for glusterfs

2015-01-13 Thread Sahina Bose


On 01/12/2015 06:14 PM, Oved Ourfali wrote:

Hi Sahina,

Some comments:

1. As far as I understand, you might not have an IP available immediately after 
setupNetworks runs (getCapabilities should run, but it isn't run automatically, 
afair).
2. Perhaps you should pass not the IP but the name of the network? IPs might 
change.
3. Adding to 2, perhaps using DNS names is a more valid approach?


To the gluster volume add brick command, the brick information needs to 
be passed in the form ip address or host name:directory path


So even if we do show the network names in the UI, we will need the 
underlying IP address to form this command.
Regarding DNS names, currently is there a way to query for the DNS 
aliases for a host? I would need to use hostname in the command above, 
and assume that the user has setup his DNS outside of oVirt to correctly 
resolve to internal/external network, correct?




4. You're using the terminology role, but it might be confusing, as we have roles with regards 
to permissions. Consider changing storage usage and not storage role in the feature page.

Thanks,
Oved

- Original Message -

From: Sahina Bose sab...@redhat.com
To: devel@ovirt.org, users us...@ovirt.org
Sent: Monday, January 12, 2015 2:00:16 PM
Subject: [ovirt-users] [Feature review] Select network to be used for   
glusterfs

Hi all,

Please review the feature page for this proposed solution and provide
your inputs - http://www.ovirt.org/Features/Select_Network_For_Gluster

thanks
sahina


___
Users mailing list
us...@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users



___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] [ovirt-users] [Feature review] Select network to be used for glusterfs

2015-01-13 Thread Sahina Bose


On 01/12/2015 08:52 PM, Dan Kenigsberg wrote:

On Mon, Jan 12, 2015 at 02:59:50PM +0200, Lior Vernia wrote:


On 12/01/15 14:44, Oved Ourfali wrote:

Hi Sahina,

Some comments:

1. As far as I understand, you might not have an IP available immediately after 
setupNetworks runs (getCapabilities should run, but it isn't run automatically, 
afair).
2. Perhaps you should pass not the IP but the name of the network? IPs might 
change.

Actually, IP address can indeed change - which would be very bad for
gluster functioning! I think moving networks or changing their IP
addresses via Setup Networks should be blocked if they're used by
gluster bricks.

In the suggested feature, there is no real storage role. The storage
role title means only default value for glusterfs IP.

For example, once a brick was created, nothing protects the admin from
accidently removing the storage network, or changing its IP address.

Another proof that this is not a real role, is that it affects only
GUI: I am guessing that REST API would not make use of it at all. (maybe
I'm wrong; for sure, REST must be defined in the feature page)


REST API that lists the available networks (with IP addresses) would be 
used to select the network and pass to the create gluster volume API


I'll update the feature page with the REST API changes as well.



Maybe that's the behavior we want. But alternatively, Engine can enforce
a stronger linkage between the brick to the network that it uses. When
adding a brick, the dialog would list available networks instead of the
specific IP. As long as the brick is being used, the admin would be
blocked/warned against deleting the network.


Is there a way to block against changing IP address used by a network?



I'm missing a discussion regarding the upgrade path. If we would opt to
requiring a single storage role network in a cluster, in an upgraded
cluster the management network should take this role.


There would not be any change to existing volumes on upgrade, as bricks 
have already been added. Users can use the Edit brick option to update 
the network to be used, if required as mentioned in Change network used 
by brick 






3. Adding to 2, perhaps using DNS names is a more valid approach?
4. You're using the terminology role, but it might be confusing, as we have roles with regards 
to permissions. Consider changing storage usage and not storage role in the feature page.

Well, we've already been using this terminology for a while now
concerning display/migration roles for networks... That's probably the
terminology to use.


Thanks,
Oved

- Original Message -

From: Sahina Bose sab...@redhat.com
To: devel@ovirt.org, users us...@ovirt.org
Sent: Monday, January 12, 2015 2:00:16 PM
Subject: [ovirt-users] [Feature review] Select network to be used for   
glusterfs

Hi all,

Please review the feature page for this proposed solution and provide
your inputs - http://www.ovirt.org/Features/Select_Network_For_Gluster

thanks
sahina

___
Users mailing list
us...@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


[ovirt-devel] Help with patch reviews!

2014-12-22 Thread Sahina Bose

Hi all,

Could any of you help with these patch reviews - these are related to 
the gluster geo-replication management feature [1]


http://gerrit.ovirt.org/#/c/34552/
http://gerrit.ovirt.org/#/c/34630/
http://gerrit.ovirt.org/#/c/34216
http://gerrit.ovirt.org/#/c/33845/

thanks
sahina

[1] - http://www.ovirt.org/Features/Gluster_Geo_Replication
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] Certain questions about Quartz based scheduling in oVirt

2014-11-25 Thread Sahina Bose


On 11/24/2014 06:27 PM, Moti Asayag wrote:

Hi Shubhendu,

- Original Message -

From: Shubhendu Tripathi shtri...@redhat.com
To: devel@ovirt.org
Sent: Monday, November 24, 2014 8:58:34 AM
Subject: [ovirt-devel] Certain questions about Quartz based scheduling in   
oVirt

Hi All,

We are in a requirement where we need to schedule jobs at certain time
interval, hourly, daily, weekly and monthly (i.e. repetitive and cron
kind of scheduling).
I was trying to understand quartz based scheduling mechanism in oVirt to
achieve the scenarios.

Have some basic questions regarding the same -
1. Is there is mechanism to persist the scheduling data in oVirt ?

In ovirt we do not persist the jobs. The application reschedule the jobs when 
it starts
and programmatically triggers jobs when required.

On packaging/services/ovirt-engine/ovirt-engine.xml.in we specify the job store 
configuration
as RAMJobStore, which is a volatile:

 property name=org.quartz.jobStore.class 
value=org.quartz.simpl.RAMJobStore/

You may select other implementation. See:
http://quartz-scheduler.org/api/2.2.0/org/quartz/spi/JobStore.html


Would the correct approach then be to have multiple scheduler instances?

1. - that uses the in memory job store

2. that uses DB to persist the jobs.

The second instance would be used to schedule and manage any dynamic 
jobs, for instance like the ones required for gluster volume snapshot 
scheduling


Using a separate instance also would mean there's no change to existing 
jobs that are scheduled in Backend bean.






2. How to tackle edit and rescheduling of jobs ?


You can have a look at org.ovirt.engine.core.utils.timer.SchedulerUtil 
interface in
the 'scheduler' project which provides that functionality, and the related 
classes.

But if not special requirements, i guess the shipped quartz implementation 
should be
enough:

http://quartz-scheduler.org/documentation/quartz-2.x/tutorials/tutorial-lesson-05


Kindly guide on how to achieve these.

Thanks and Regards,
Shubhendu
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] Remote DataCenter/Cluster

2014-09-24 Thread Sahina Bose


On 08/25/2014 10:13 AM, Dusmant Kumar Pati wrote:

On 08/21/2014 03:57 PM, Sahina Bose wrote:


On 08/21/2014 03:42 PM, Itamar Heim wrote:

On 08/21/2014 05:00 AM, Sahina Bose wrote:

Hi,

We ran into a requirement where we need to mark a DataCenter or 
Cluster
as remote. This is for the feature Gluster Geo-replication 
management -

where the same oVirt instance will be managing both the master volume
and as well as the remote site volume

The reason we want to call out a Datacenter as remote, is to ensure 
that
the polling frequency on this is different from one that is 
co-located.


Is there a similar requirement for virtualization?

thanks
sahina


I thought in the gluster view there are no DCs, only clusters?
(actually, i wonder if we shouldn't consider the same for the virt 
use case, and deprecate the DC entity, though that would be a major 
effort and not sure worth it).


In fact, we were wondering if we need to bring in the concept of DC 
for the gluster view and mark it remote...or just stick with flagging 
cluster as Remote :)
I think your second option looks better, considering gluster does not 
have any DC concept. Just stick with flagging cluster as Remote.
Once managed by RHSC (either local or remote), almost all features of 
RHSC, should be applicable for that cluster as well. Isn't it?




If we do introduce a Remote cluster, how would that affect the Data 
Center for virt+gluster mode - having remote as well as local cluster?


Thoughts?


Users may be more familiar with DC w.r.t DR



to your question, yes. there is a similar use case for virt - 
flagging a DC or Cluster and Storage as remote for a DR use case.

though i don't think anyone spent cycles on modeling that


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel




___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] Remote DataCenter/Cluster

2014-08-21 Thread Sahina Bose


On 08/21/2014 03:42 PM, Itamar Heim wrote:

On 08/21/2014 05:00 AM, Sahina Bose wrote:

Hi,

We ran into a requirement where we need to mark a DataCenter or Cluster
as remote. This is for the feature Gluster Geo-replication management -
where the same oVirt instance will be managing both the master volume
and as well as the remote site volume

The reason we want to call out a Datacenter as remote, is to ensure that
the polling frequency on this is different from one that is co-located.

Is there a similar requirement for virtualization?

thanks
sahina


I thought in the gluster view there are no DCs, only clusters?
(actually, i wonder if we shouldn't consider the same for the virt use 
case, and deprecate the DC entity, though that would be a major effort 
and not sure worth it).


In fact, we were wondering if we need to bring in the concept of DC for 
the gluster view and mark it remote...or just stick with flagging 
cluster as Remote :)

Users may be more familiar with DC w.r.t DR



to your question, yes. there is a similar use case for virt - flagging 
a DC or Cluster and Storage as remote for a DR use case.

though i don't think anyone spent cycles on modeling that


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [ovirt-devel] Hide automatic comments in Gerrit

2014-08-01 Thread Sahina Bose


On 08/01/2014 01:07 AM, Moti Asayag wrote:


- Original Message -

From: Vojtech Szocs vsz...@redhat.com
To: Mike Kolesnik mkole...@redhat.com
Cc: devel@ovirt.org
Sent: Thursday, July 31, 2014 7:49:09 PM
Subject: Re: [ovirt-devel] Hide automatic comments in Gerrit



- Original Message -

From: Mike Kolesnik mkole...@redhat.com
To: devel@ovirt.org
Sent: Thursday, July 17, 2014 7:06:07 PM
Subject: [ovirt-devel] Hide automatic comments in Gerrit

Hi,

I've written a Greasemonkey script to hide bot comments (Jenkins,
automation,
etc) on the review pages:
https://gist.github.com/mkolesni/abaedba07d820df6352c

Nice!

+1, I've been using it for quite a while and it is very helpful.

Attached two images to show the difference with and without it.


+1

I find this useful to hide the noise.






It adds a button Hide automatic comments in the comments section menu.

There's also an AUTO_HIDE option so that the bot comments are hidden
automatically when the review page loads.
I've set it to true, but you can disable it if you don't need it.

Enjoy! (And if there's bugs, fix them ;))

Regards,
Mike


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel



___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel

Re: [ovirt-devel] oVirt 3.5 beta2 - results

2014-07-30 Thread Sahina Bose

Piotr,

Thanks for the test report!

On 07/29/2014 08:48 PM, Piotr Kliczewski wrote:

Hi all,

I tested gluster related features:


Nagios Integration - http://www.ovirt.org/Features/Nagios_Integration#HOW_TO

I installed Nagios dependencies on f20 which went smoothly but when I
did the same for rhel6 I noticed that I had to install manually
additional rpm which was not covered by howto.

rrdtool-perl-1.3.8-6.el6.x86_64.rpm


I will retry this and update the How_To




During discovery of the Nagios server I got following issue:

[root@rhel gluster]# /usr/lib64/nagios/plugins/gluster/discovery.py -c
Default -H 192.168.1.9
Failed to execute NRPE command 'discover_volume_list' in host '192.168.1.9'
Error : Make sure NPRE server in host '192.168.1.9' is configured to
accept requests from Nagios server


Did you get this error even after following the step to edit 
allowed_hosts in /etc/nagios/nrpe.cfg?





so I followed http://tecadmin.net/install-nrpe-on-centos-rhel/.

Nagios server reported status of the cluster. When I had configured
first nagios server I saw:

OK : None of the Volumes in the cluster are in Critical State

but for the second there was:

(null).


Do you mean configuring second cluster in the same Nagios server?




I followed howto and installed oVirt UI plugin but after restart I was
not able to see monitoring details tab so I opened:
https://bugzilla.redhat.com/show_bug.cgi?id=1124371




Volume performance stats -
http://www.ovirt.org/Features/Gluster_Volume_Performance_Statistics#HOW_TO

I reused already existing setup. I enabled stats and added a volume.
When checking stats details I saw could not fetch stats.

I wanted to generate some stats so I mount volume previously created using:

mount -t nfs 192.168.1.9:/vol1 /media/volume

I had to redo it several times do to:

mount.nfs: requested NFS version or transport protocol is not supported

After several attempts I lost connectivity to the machine. After host
recovered I tried to run:

mount -o mountproto=tcp -t nfs 192.168.1.9:/vol1 /media/volume

but the result was the same.

I opened: https://bugzilla.redhat.com/show_bug.cgi?id=1124376


I checked whether gluster still works with jsonrpc. I removed the host
that I installed before and added new one using jsonrpc protocol.
After the installation I noticed that host was moved to Non-Operation
state. In the logs I found:

{jsonrpc: 2.0, id: 101bf460-6529-42d6-9370-a9629daad628,
error: {message: The method does not exist / is not available.,
code: -32601}}

I checked what was the reason and there was no apiwrapper.py module so I opened:

https://bugzilla.redhat.com/show_bug.cgi?id=1124481



Thanks,
Piotr
___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


___
Devel mailing list
Devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/devel


Re: [Engine-devel] 401 Authorization Error

2014-02-17 Thread Sahina Bose


On 02/17/2014 05:09 PM, Vikas Kokare wrote:
I am using oVirt Java SDK 3.4.2 to connect to RHEV-M environment. The 
code being used is


org.ovirt.engine.sdk.Api api = new 
Api(https://HOST:PORT/api,USER,PASSWORD;, true);



Have you tried user@domain for the user parameter?




The response from the server to this call is

/oVirt API error htmlheadtitleJBoss Web/7.2.2.Final-redhat-1 - 
JBWEB64: Error report/titlestyle!--H1 
{font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:22px;} 
H2 
{font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:16px;} 
H3 
{font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:14px;} 
BODY 
{font-family:Tahoma,Arial,sans-serif;color:black;background-color:white;} 
B 
{font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;} 
P 
{font-family:Tahoma,Arial,sans-serif;background:white;color:black;font-size:12px;}A 
{color : black;}A.name {color : black;}HR {color : 
#525D76;}--/style /headbodyh1J*BWEB65: HTTP Status 401* - 
/h1HR size=1 noshade=noshadepbJBWEB000309: type/b 
JBWEB67: Status report/ppbJBWEB68: message/b 
u/u/ppbJBWEB69: description/b uJ*BWEB000121: This 
request requires HTTP authentication*./u/pHR size=1 
noshade=noshadeh3JBoss Web/7.2.2.Final-redhat-1/h3/body/html

/
The documentation talks about the 401 error when the request doesn't 
contain Authorization header.


Even when i access https://HOST:PORT/api , i am prompted to login, but 
the credentials used to login to
https://HOST:PORT/webadmin/webadmin/WebAdmin.html don't work. So if i 
login to webadmin console first, then since the Authorization header 
is set, access to /api is now possible in the browser.


I need help here specifically for making the Java SDK work with RHeVM 
server.


-Vikas


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] Junit code coverage

2014-02-04 Thread Sahina Bose


On 02/04/2014 03:51 AM, Eyal Edri wrote:


- Original Message -

From: Sahina Bose sab...@redhat.com
To: engine-devel engine-devel@ovirt.org
Sent: Friday, January 31, 2014 3:33:10 PM
Subject: [Engine-devel] Junit code coverage

Hi,

Do we have a jenkins job that does cobertura / similar code coverage
metrics for oVirt?

[cc'ing infra]

hi sahina,
we currently don't have a job like that, i assume this is something that can be 
added if needed,
for unit tests, there are indeed some options - cobertura/jacoco or sonar 
server to display aggregated results,
though i'm not sure we have enough resources now to add another VM to run sonar 
(we should have soon though).

what did you had in mind you'd like to test and monitor? ovirt-engine unit 
tests?

Thanks, Eyal.

I wanted an idea about the unit test coverage that we currently have. I 
was able to run the cobertura:cobertura target from command line locally 
(it worked for the backend modules), so was checking to see if we had an 
equivalent jenkins job.





Eyal.


thanks
sahina
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


[Engine-devel] Junit code coverage

2014-01-31 Thread Sahina Bose

Hi,

Do we have a jenkins job that does cobertura / similar code coverage 
metrics for oVirt?


thanks
sahina
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] Proposal to add Juan Hernandez as maintainer to api/sdk/cli

2013-12-17 Thread Sahina Bose


On 12/16/2013 09:04 PM, Michael Pasternak wrote:

Juan has worked on oVirt for a long period of time, developing
several features in the different areas (including api and cli),
and obviously gained a lot of experience and knowledge,

I'd like to propose Juan as a maintainer of the  api/sdk/cli projects.



+1
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] UX inputs on gluster volume async tasks

2013-09-02 Thread Sahina Bose

Hi Malini, Eldan

Thanks so much for the detailed feedback. Comments inline

On 09/01/2013 07:11 PM, Eldan Hildesheim wrote:

Hi all,
I have few more questions.
1. How often do we get the data of the activity changes: I wonder if we can 
change the activity icon by a progress one.
The refresh of data can be configured, defaulting it to 30 seconds. We 
have no way however of knowing the time left / number of files yet to be 
rebalanced on a volume. So a progress bar may not be possible.

2. Normally in oVirt / Rhev we show more data as a sub tab. Can we assume that 
Rebalance status is like more data and then put all the data that is now in the modal 
inside a new sub tab name (activity)?

We were averse to creating a new sub tab -
1. To avoid proliferation of sub tabs
2. since this sub-tab will only be relevant when rebalance operation is 
going on.

3. The icon of Migration data from Brick in progress: Is this apart of the 
Rebalance process? Is this a derived aspect of the rebalance?

This is for remove brick...Not rebalance

4. Do you have a phone num we can call Monday?

Will contact you off list.

Thanks,
Eldan
  


- Original Message -
From: Malini Rao m...@redhat.com
To: Sahina Bose sab...@redhat.com
Cc: Eldan Hildesheim ehild...@redhat.com, engine-devel engine-devel@ovirt.org, 
Dusmant Pati dp...@redhat.com
Sent: Friday, August 30, 2013 4:22:44 PM
Subject: Re: UX inputs on gluster volume async tasks

Sahina,

Attached are my detailed comments and questions about this feature from a UX 
perspective. If it is easier, we can get on a call to discuss the questions and 
other points. Let me know what you prefer.

Thanks
Malini

- Original Message -
From: Sahina Bose sab...@redhat.com
To: Malini Rao m...@redhat.com, Eldan Hildesheim ehild...@redhat.com
Cc: engine-devel engine-devel@ovirt.org, Dusmant Pati dp...@redhat.com
Sent: Wednesday, August 28, 2013 7:30:59 AM
Subject: UX inputs on gluster volume async tasks

Hi Malini, Eldan,

Could you provide feedback from UX perspective on this feature?

The feature description and User flows are at
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management

thanks
sahina



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


[Engine-devel] UX inputs on gluster volume async tasks

2013-08-28 Thread Sahina Bose

Hi Malini, Eldan,

Could you provide feedback from UX perspective on this feature?

The feature description and User flows are at
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management

thanks
sahina

___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] Gluster Volume asynchronous tasks

2013-08-21 Thread Sahina Bose


On 08/21/2013 03:54 PM, Itamar Heim wrote:

On 08/21/2013 12:46 AM, Sahina Bose wrote:


On 08/20/2013 04:00 AM, Itamar Heim wrote:

On 08/12/2013 06:09 AM, Sahina Bose wrote:


On 08/12/2013 03:28 PM, Yair Zaslavsky wrote:


- Original Message -

From: Sahina Bose sab...@redhat.com
To: Eli Mesika emes...@redhat.com
Cc: engine-devel engine-devel@ovirt.org, a...@ovirt.org
Sent: Monday, August 12, 2013 11:51:15 AM
Subject: Re: [Engine-devel] Gluster Volume asynchronous tasks


On 08/12/2013 01:21 PM, Eli Mesika wrote:

- Original Message -

From: Sahina Bose sab...@redhat.com
To: engine-devel engine-devel@ovirt.org, a...@ovirt.org,
Michael
Pasternak mpast...@redhat.com
Sent: Monday, August 12, 2013 8:41:55 AM
Subject: [Engine-devel] Gluster Volume asynchronous tasks

Hi all,

We are working on a feature to add support to start and monitor
gluster
volume asynchronous tasks (like rebalancing a gluster volume,
removing
brick from volume ) from the oVirt engine.

The operations can be started from the Volumes tab or the Bricks
sub-tab
using the Rebalance, Remove options.
These are long running operations which can be monitored using a
task id
returned from Gluster. We are planning to add the monitoring in 
the

existing Task sub tab

The feature description and User flows are at
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management 





The detailed design (including REST API design) is at
http://www.ovirt.org/Features/Detailed_Gluster_Volume_Asynchronous_Tasks_Management. 





I would really appreciate if you could review and provide your
valuable
feedback.

I Sahina
Why not using 6the External Tasks feature introduced for 3.3 for
those
Gluster tasks ???
http://www.ovirt.org/Features/Design/DetailedExternalTasks

Hi Eli,

We still want to be able to start and stop these operations from the
engine.
So, when a user wants to say, rebalance a volume, they would go 
select

the volume and click on Rebalance Start.
This would then call the BLL command to start rebalance which will
invoke the corresponding vdsm verb to start the rebalance on the
volume.
This is the same as existing flow for other commands. The only
difference is the vdsm verb will return the task id from gluster, 
for

the rebalance operation that was started. And we will monitor the
progress of the task using the gluster task id (by calling a gluster
command)

I'm not sure how ExternalTasks would fit in here? I was thinking of
using ExternalTask support for adding Job/Steps to engine when the
operation is started outside of engine, that is, from Gluster CLI.
Please correct me if I'm missing something.

Does this mean that from Gluster CLI you will not try and invoke the
rebalance command ?
(I mean, I should either use Gluster CLI or Engine's REST API?)
Rebalance volume command could be invoked in any of the following 
ways:

1. From the console UI (clicking on Rebalance as shown in
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management#Rebalance_Volume) 




2. Using REST API
3. Outside of engine, from Gluster CLI - In such cases, the engine
should detect that a user has triggered rebalance operation outside 
the
engine, and allow the user to monitor progress of this from the 
engine.

This is where, we need support to add a Job for an operation that was
started externally, so that it can be seen in the Tasks tab.


and still, it should be considered an internal task, since the engine
is managing it / detected it.



Itamar, yes, you are right. This would need to be treated as an internal
task as the engine needs to be able to stop it and also monitor it. We
would probably need a similar mechanism as external task injection, to
add a Job for the task started from gluster CLI.




even if it was started from CLI, wouldn't it be better if engine 
detected it, and still treated it as an internal task, allowing to 
cancel it, etc.?


Yes, but I need to add a Job for this internal task, so that it can be 
monitored in the Tasks pane. Any idea if I can use any existing 
framework to do it? I was thinking I would use 
ExecutionHandler.createJob to do this (similar to what's done in 
AddExternalJobCommand)


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] Gluster Volume asynchronous tasks

2013-08-20 Thread Sahina Bose


On 08/20/2013 04:00 AM, Itamar Heim wrote:

On 08/12/2013 06:09 AM, Sahina Bose wrote:


On 08/12/2013 03:28 PM, Yair Zaslavsky wrote:


- Original Message -

From: Sahina Bose sab...@redhat.com
To: Eli Mesika emes...@redhat.com
Cc: engine-devel engine-devel@ovirt.org, a...@ovirt.org
Sent: Monday, August 12, 2013 11:51:15 AM
Subject: Re: [Engine-devel] Gluster Volume asynchronous tasks


On 08/12/2013 01:21 PM, Eli Mesika wrote:

- Original Message -

From: Sahina Bose sab...@redhat.com
To: engine-devel engine-devel@ovirt.org, a...@ovirt.org, 
Michael

Pasternak mpast...@redhat.com
Sent: Monday, August 12, 2013 8:41:55 AM
Subject: [Engine-devel] Gluster Volume asynchronous tasks

Hi all,

We are working on a feature to add support to start and monitor
gluster
volume asynchronous tasks (like rebalancing a gluster volume, 
removing

brick from volume ) from the oVirt engine.

The operations can be started from the Volumes tab or the Bricks
sub-tab
using the Rebalance, Remove options.
These are long running operations which can be monitored using a
task id
returned from Gluster. We are planning to add the monitoring in the
existing Task sub tab

The feature description and User flows are at
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management 




The detailed design (including REST API design) is at
http://www.ovirt.org/Features/Detailed_Gluster_Volume_Asynchronous_Tasks_Management. 




I would really appreciate if you could review and provide your
valuable
feedback.

I Sahina
Why not using 6the External Tasks feature introduced for 3.3 for 
those

Gluster tasks ???
http://www.ovirt.org/Features/Design/DetailedExternalTasks

Hi Eli,

We still want to be able to start and stop these operations from the
engine.
So, when a user wants to say, rebalance a volume, they would go select
the volume and click on Rebalance Start.
This would then call the BLL command to start rebalance which will
invoke the corresponding vdsm verb to start the rebalance on the 
volume.

This is the same as existing flow for other commands. The only
difference is the vdsm verb will return the task id from gluster, for
the rebalance operation that was started. And we will monitor the
progress of the task using the gluster task id (by calling a gluster
command)

I'm not sure how ExternalTasks would fit in here? I was thinking of
using ExternalTask support for adding Job/Steps to engine when the
operation is started outside of engine, that is, from Gluster CLI.
Please correct me if I'm missing something.

Does this mean that from Gluster CLI you will not try and invoke the
rebalance command ?
(I mean, I should either use Gluster CLI or Engine's REST API?)

Rebalance volume command could be invoked in any of the following ways:
1. From the console UI (clicking on Rebalance as shown in
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management#Rebalance_Volume) 



2. Using REST API
3. Outside of engine, from Gluster CLI - In such cases, the engine
should detect that a user has triggered rebalance operation outside the
engine, and allow the user to monitor progress of this from the engine.
This is where, we need support to add a Job for an operation that was
started externally, so that it can be seen in the Tasks tab.


and still, it should be considered an internal task, since the engine 
is managing it / detected it.




Itamar, yes, you are right. This would need to be treated as an internal 
task as the engine needs to be able to stop it and also monitor it. We 
would probably need a similar mechanism as external task injection, to 
add a Job for the task started from gluster CLI.



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] Gluster Volume asynchronous tasks

2013-08-12 Thread Sahina Bose


On 08/12/2013 03:28 PM, Yair Zaslavsky wrote:


- Original Message -

From: Sahina Bose sab...@redhat.com
To: Eli Mesika emes...@redhat.com
Cc: engine-devel engine-devel@ovirt.org, a...@ovirt.org
Sent: Monday, August 12, 2013 11:51:15 AM
Subject: Re: [Engine-devel] Gluster Volume asynchronous tasks


On 08/12/2013 01:21 PM, Eli Mesika wrote:

- Original Message -

From: Sahina Bose sab...@redhat.com
To: engine-devel engine-devel@ovirt.org, a...@ovirt.org, Michael
Pasternak mpast...@redhat.com
Sent: Monday, August 12, 2013 8:41:55 AM
Subject: [Engine-devel] Gluster Volume asynchronous tasks

Hi all,

We are working on a feature to add support to start and monitor gluster
volume asynchronous tasks (like rebalancing a gluster volume, removing
brick from volume ) from the oVirt engine.

The operations can be started from the Volumes tab or the Bricks sub-tab
using the Rebalance, Remove options.
These are long running operations which can be monitored using a task id
returned from Gluster. We are planning to add the monitoring in the
existing Task sub tab

The feature description and User flows are at
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management

The detailed design (including REST API design) is at
http://www.ovirt.org/Features/Detailed_Gluster_Volume_Asynchronous_Tasks_Management.

I would really appreciate if you could review and provide your valuable
feedback.

I Sahina
Why not using 6the External Tasks feature introduced for 3.3 for those
Gluster tasks ???
http://www.ovirt.org/Features/Design/DetailedExternalTasks

Hi Eli,

We still want to be able to start and stop these operations from the engine.
So, when a user wants to say, rebalance a volume, they would go select
the volume and click on Rebalance Start.
This would then call the BLL command to start rebalance which will
invoke the corresponding vdsm verb to start the rebalance on the volume.
This is the same as existing flow for other commands. The only
difference is the vdsm verb will return the task id from gluster, for
the rebalance operation that was started. And we will monitor the
progress of the task using the gluster task id (by calling a gluster
command)

I'm not sure how ExternalTasks would fit in here? I was thinking of
using ExternalTask support for adding Job/Steps to engine when the
operation is started outside of engine, that is, from Gluster CLI.
Please correct me if I'm missing something.

Does this mean that from Gluster CLI you will not try and invoke the rebalance 
command ?
(I mean, I should either use Gluster CLI or Engine's REST API?)

Rebalance volume command could be invoked in any of the following ways:
1. From the console UI (clicking on Rebalance as shown in 
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management#Rebalance_Volume)

2. Using REST API
3. Outside of engine, from Gluster CLI - In such cases, the engine 
should detect that a user has triggered rebalance operation outside the 
engine, and allow the user to monitor progress of this from the engine. 
This is where, we need support to add a Job for an operation that was 
started externally, so that it can be seen in the Tasks tab.








thanks
sahina
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


[Engine-devel] Gluster Volume asynchronous tasks

2013-08-11 Thread Sahina Bose

Hi all,

We are working on a feature to add support to start and monitor gluster 
volume asynchronous tasks (like rebalancing a gluster volume, removing 
brick from volume ) from the oVirt engine.


The operations can be started from the Volumes tab or the Bricks sub-tab 
using the Rebalance, Remove options.
These are long running operations which can be monitored using a task id 
returned from Gluster. We are planning to add the monitoring in the 
existing Task sub tab


The feature description and User flows are at 
http://www.ovirt.org/Features/Gluster_Volume_Asynchronous_Tasks_Management


The detailed design (including REST API design) is at 
http://www.ovirt.org/Features/Detailed_Gluster_Volume_Asynchronous_Tasks_Management.


I would really appreciate if you could review and provide your valuable 
feedback.


thanks
sahina
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] ovirt 3.3 RC packages

2013-08-07 Thread Sahina Bose


On 08/07/2013 01:42 PM, Ofer Schreiber wrote:

Dear maintainers,

As you probably know, we're heading towards the 3.3 release of ovirt.
I'd like to get a short status about your project, and it's readiness for the 
upcoming release.
If your project is blocker free, please let me know of the relevant build to 
pick up into the RC repo.

Current known blockers (as in 
https://bugzilla.redhat.com/show_bug.cgi?id=918494 - Tracker: oVirt 3.3 
release):

ovirt-engine

984586  ovirt-engine-backendinfra   Cannot start a VM with USB 
Native - Exit message: internal error Could not format channel target type.
988299  ovirt-engine-core   gluster Impossible to start VM from 
Gluster Storage Domain
There's an issue with running this on CentOS 6.4 as there is no qemu 1.3 
available for this distro.
Deepak (deepakcs) has initiated a conversation asking for inputs on how 
to handle this dependency. We could take a call based on the resolution.



987939  ovirt-engine-installer  integration engine-setup - engine-cleanup - 
engine-setup - fails

vsdm

988004  vdsm   network  [vdsm] OSError: [Errno 2] No such 
file or directory: '/sys/class/net/ovirtmgmt/brif'
988065  vdsm   virt Migration fails - AttributeError: 
'ConsoleDevice' object has no attribute 'alias'
988397  vdsm   network  ovirt-node post-installation setup 
networks fails when NetworkManager is running
988990  vdsm   network  oVirt 3.3 - (vdsm-network): netinfo 
- ValueError: unknown bridge ens3
990854  vdsm   network  Multiple Gateways: Upgrade VDSM to 
3.3 must reconfigure networking on host
990963  vdsmvdsm must require 
selinux-policy-3.12.1-68.fc19

ovirt-node

988986 ovirt-node   libvirt network directory is not 
persisted

other
=
990509 selinux-policy   Current selinux policy prevents 
running a VM with volumes under /var/run/vdsm/storage

Thanks,

Ofer Schreiber
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] [vdsm] How to handle qemu 1.3 dep for Gluster Storage Domain

2013-08-06 Thread Sahina Bose

[Adding engine-devel]

On 08/06/2013 10:48 AM, Deepak C Shetty wrote:

Hi All,
There were 2 learnings from BZ 
https://bugzilla.redhat.com/show_bug.cgi?id=988299


1) Gluster RPM deps were not proper in VDSM when using Gluster Storage 
Domain. This has been partly addressed
by the gluster-devel thread @ 
http://lists.gnu.org/archive/html/gluster-devel/2013-08/msg8.html
and will be fully addressed once Gluster folks ensure their packaging 
is friendly enuf for VDSM to consume
just the needed bits. Once that happens, i will be sending a patch to 
vdsm.spec.in to update the gluster

deps correctly. So this issue gets addressed in near term.

2) Gluster storage domain needs minimum libvirt 1.0.1 and qemu 1.3.

libvirt 1.0.1 has the support for representing gluster as a network 
block device and qemu 1.3 has the
native support for gluster block backend which supports gluster://... 
URI way of representing a gluster
based file (aka volume/vmdisk in VDSM case). Many distros (incl. 
centos 6.4 in the BZ) won't have qemu

1.3 in their distro repos! How do we handle this dep in VDSM ?

Do we disable gluster storage domain in oVirt engine if VDSM reports 
qemu  1.3 as part of getCapabilities ?

or
Do we ensure qemu 1.3 is present in ovirt.repo assuming ovirt.repo is 
always present on VDSM hosts in which
case when VDSM gets installed, qemu 1.3 dep in vdsm.spec.in will 
install qemu 1.3 from the ovirt.repo
instead of the distro repo. This means vdsm.spec.in will have qemu = 
1.3 under Requires.


Is this possible to make this a conditional install? That is, only if 
Storage Domain = GlusterFS in the Data center, the bootstrapping of host 
will install the qemu 1.3 and dependencies.


(The question still remains as to where the qemu 1.3 rpms will be available)


What will be a good way to handle this ?
Appreciate your response

thanx,
deepak

___
vdsm-devel mailing list
vdsm-de...@lists.fedorahosted.org
https://lists.fedorahosted.org/mailman/listinfo/vdsm-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] Introducing limited branding support.

2013-06-13 Thread Sahina Bose


On 06/11/2013 10:09 PM, Alexander Wels wrote:

Hi Guys,

We recently merged at a patch (http://gerrit.ovirt.org/#/c/13181/) that allows
for limited branding support of oVirt user portal and web admin. We also moved
the styles needed to support this branding out of the application and into its
own module. The styles can now be found in ovirt-
engine/packaging/branding/ovirt.brand.

In this directory you will find the following files:
- branding.properties. This file controls the branding theme.
- ovirt_messages.properties. A standard java resource bundle properties file
containing the messages that can be changed.
- A bunch of .css files that contain the classes that can be altered.

I have created a wiki page with some information and pictures of what parts of
the interface can be changed at this point in time. It is located here:
http://www.ovirt.org/Feature/Branding

There is also more information in README.branding that got introduced with
this patch.

Alexander

ps.
If your user interface looks messed up (missing borders and things of that
nature) the engine cannot find the default branding. This means you are not
using the make commands recently introduced. We highly recommend you use this
to have a complete environment. If you are unwilling or unable to use that you
can make a symlink in /etc/ovirt-engine/branding/00-ovirt.brand to ovirt-
engine/packaging/branding/ovirt.brand
If creating a symlink, make sure it matches the ENGINE_ETC property 
that's loaded.
So, for symlink /etc/ovirt-engine/branding/00-ovirt.brand , the 
ENGINE_ETC should be pointing to /etc/ovirt-engine.


I ran into issues, so thought I would share.

thanks
sahina



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


[Engine-devel] Failing Junit tests in ovirt-engine

2013-06-07 Thread Sahina Bose

Hi,
I have send a patch to fix failures in junit tests [1] during 
ovirt-engine build.

Could someone please review and merge?

http://gerrit.ovirt.org/15434

thanks
sahina

[1]
Results :

Failed tests:
validateParameters(org.ovirt.engine.core.bll.SetupNetworksParametersTest)
 at org.junit.Assert.assertFalse(Assert.java:79)
at 
org.ovirt.engine.core.bll.SetupNetworksParametersTest.validateParameters(SetupNetworksParametersTest.java:37)

at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

Tests in error:
gatewayChanged(org.ovirt.engine.core.bll.network.host.SetupNetworksHelperTest)
gatewayChanged(org.ovirt.engine.core.bll.network.host.SetupNetworksHelperTest) 
Time elapsed: 0.001 sec   ERROR!

java.lang.NullPointerException
at 
org.ovirt.engine.core.common.FeatureSupported.supportedInConfig(FeatureSupported.java:14)
at 
org.ovirt.engine.core.common.FeatureSupported.multipleGatewaysSupported(FeatureSupported.java:114)


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] Failed to configure management network on the host

2013-06-04 Thread Sahina Bose

Hi Moti,

Not sure I understand your comment about host activation flow.
commit bdd8966d97e1e02e775ee6f755d0a3637a668911  (engine: Remove mgmt 
network setup from host activation) is present in our code base, but we 
were not using the host activation flow.


Here's what we tried to do.
Add a host (with vdsm 4.10.2 installed) to engine (built from Jun 2 
upstream codebase).


The installation failed with Failed to configure manamgent network on 
the host


and the logs had this

2013-06-03 19:49:03,390 INFO 
[org.ovirt.engine.core.bll.network.NetworkConfigurator] 
(pool-4-thread-50) [7bf46c88] Engine managed to communicate with VDSM 
agent on host s1
2013-06-03 19:49:03,412 ERROR 
[org.ovirt.engine.core.vdsbroker.vdsbroker.CollectVdsNetworkDataVDSCommand] 
(pool-4-thread-50) [7bf46c88] Command CollectVdsNetworkDataVDS execution 
failed. Exception: VDSNetworkException: java.net.ConnectException: 
Connection refused


As Kanagaraj mentioned, reverting the Change Iaf82e104: engine:
Allow engine to configure management network , fixed this issue.

From the flowchart on 
http://www.ovirt.org/Features/Normalized_ovirtmgmt_Initialization, it 
seems to fail on the Configure Management Network (by setup networks) 
action.


Any help on solving this would be appreciated.

thanks
sahina



On 06/04/2013 11:53 AM, Moti Asayag wrote:

Hi,

Same host activation flow should work on the master as well as the
management network will not be created as part of the host activation.
So reverted changes were already merged by commit
bdd8966d97e1e02e775ee6f755d0a3637a668911

For more details about host installation see:
http://www.ovirt.org/Features/Normalized_ovirtmgmt_Initialization

Thanks,
Moti

See
On 06/04/2013 08:51 AM, Kanagaraj wrote:

Host installation went through successfully after reverting the change
eca95e192ebf3e1fe0e9440bb8fac694214ca8fd (Change Iaf82e104: engine:
Allow engine to configure management network ).

Thanks,
Kanagaraj

On 06/04/2013 12:23 AM, Dead Horse wrote:

Seeing this as well but during host activation with latest master
vdsm/engine.
-- http://lists.ovirt.org/pipermail/engine-devel/2013-June/004752.html



On Mon, Jun 3, 2013 at 9:45 AM, Kanagaraj kmayi...@redhat.com
mailto:kmayi...@redhat.com wrote:

 Hi,

  After adding a host to a 3.2 cluster, bootstrap went through fine
 and in the end it failed with the following error.

 Host s2 installation failed. Failed to configure manamgent
 network on the host.

 Attached engine.log and vdsm.log

 Vdsm rpms:
 vdsm-4.10.2-22.2.el6rhs.x86_64
 vdsm-python-4.10.2-22.2.el6rhs.x86_64
 vdsm-cli-4.10.2-22.2.el6rhs.noarch
 vdsm-gluster-4.10.2-22.2.el6rhs.noarch
 vdsm-xmlrpc-4.10.2-22.2.el6rhs.noarch

 Engine rpms: recent code from master

 Can someone please help in resolving this issue,

 Thanks,
 Kanagaraj

 ___
 Engine-devel mailing list
 Engine-devel@ovirt.org mailto:Engine-devel@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/engine-devel



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


[Engine-devel] SQL procedure - row mapper

2013-05-17 Thread Sahina Bose

Hi all,

In org.ovirt.engine.core.dal.dbbroker.SimpleJdbcCallsHandler , there's a 
map maintained for procedure name and SimpleJdbcCall.


If I have the same procedure with different row mappers, this results in 
an error - because the map already contains a mapping for the procedure 
name but with different row mapper.


Do we intend to support calling the same procedure with different 
RowMappers? If so, I can change this class to handle this.


thanks
sahina
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] SQL procedure - row mapper

2013-05-17 Thread Sahina Bose

Hi Laszlo,

I wanted a way to return a hook entity (GlusterHookEntity) as well as 
content of the hook. So had a single procedure getGlusterHookById(id 
UUID, includeContent BOOLEAN) but 2 dao methods

getGlusterHook - returns GlusterHookEntity
getGlusterHookContent - returns String (used a RowMapper to get only 
content from resultset)


But the second method caused a ClassCastException due to the first 
RowMapper being used.


Anyways, this is the patch where I changed the implementation to use 
separate sp - http://gerrit.ovirt.org/#/c/14832/


thanks!
sahina

On 05/17/2013 03:48 PM, Laszlo Hornyak wrote:

Hi Sahina,

Could you share more details what you are trying to do?
A procedure always returns the same structure of data, so maybe it is more 
simple if you to return all the data you received from the plpgsql stored 
procedure and then just use the data mapped into the beans to build your own 
data structures.

Laszlo

- Original Message -

From: Sahina Bose sab...@redhat.com
To: engine-devel engine-devel@ovirt.org
Sent: Friday, May 17, 2013 11:34:31 AM
Subject: [Engine-devel] SQL procedure - row mapper

Hi all,

In org.ovirt.engine.core.dal.dbbroker.SimpleJdbcCallsHandler , there's a
map maintained for procedure name and SimpleJdbcCall.

If I have the same procedure with different row mappers, this results in
an error - because the map already contains a mapping for the procedure
name but with different row mapper.

Do we intend to support calling the same procedure with different
RowMappers? If so, I can change this class to handle this.

thanks
sahina
___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] find-bugs errors - please investigate/fix

2013-04-24 Thread Sahina Bose


On 04/24/2013 07:47 AM, Einav Cohen wrote:

ovirt find-bugs errors in jenkins - please investigate/fix asap.
[@Eyal - FYI: some of them seem really strange - see Michal's section below]

@Sahina Bose:
find-bugs error details:
http://jenkins.ovirt.org/job/ovirt_engine_find_bugs/4064/findbugsResult/NORMAL/module.-267688716/source.349621/#23
caused by: http://gerrit.ovirt.org/#/c/13831/
Have submitted a patch (http://gerrit.ovirt.org/#/c/14187/) fixing this. 
Thanks!


@Michael Kublin:
find-bugs error details:
http://jenkins.ovirt.org/job/ovirt_engine_find_bugs/4064/findbugsResult/NORMAL/module.-609324659/source.347305/#346
caused by:
http://gerrit.ovirt.org/#/c/13740/

@Michal Skrivanek:

the following seem strange - can you please investigate? could be false 
positives.

- find-bugs error details:
http://jenkins.ovirt.org/job/ovirt_engine_find_bugs/4064/findbugsResult/NORMAL/module.935532853/package.-2082646592/source.350390/#1349
[not sure when introduced - seems old, hence I suspect it is a false positive]

- Two additional errors in the same file as ^^^, you can see them at:
http://jenkins.ovirt.org/job/ovirt_engine_find_bugs/4064/findbugsResult/NORMAL/module.935532853/package.-2082646592/
when clicking on them, there aren't any details, however according to their 
tool-tips, it seems to be an unreachable
OnSuccess method within an anonymous class or something similar.

- another suspicious error is in VmListModel.java:
http://jenkins.ovirt.org/job/ovirt_engine_find_bugs/4064/findbugsResult/NORMAL/module.935532853/package.-1516395789/source.350485/#2406
[again - not sure when introduced - seems old, hence I suspect it is a false 
positive]


Thanks,
Einav


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] [vdsm] ovirt-host-deploy and multible bridges

2013-04-10 Thread Sahina Bose


On 04/10/2013 02:19 PM, Balamurugan Arumugam wrote:

On 04/10/2013 11:18 AM, Sahina Bose wrote:


On 04/10/2013 12:25 AM, Dan Kenigsberg wrote:

On Tue, Apr 09, 2013 at 07:28:17PM +0530, Balamurugan Arumugam wrote:

On 04/09/2013 06:37 PM, Sahina Bose wrote:

Decoding correct address  - glusterHostsList should return any
ipAddress that engine knows as being associated with host.
It could be either ipAddress used while adding host (stored as 
hostname

in vds_static) or any of the ipAddresses populated in vds_interface
table (addr column) .
I do not have enough knowledge about this bit of code to say what
entries are made in vds_interface table. I know there's an entry for
ovirtmgmt here but not sure if this gets added as part of addHost 
flow

or not.


I guess, vds_interface table is populated by ips given by vdsm
through getVdsCaps.

Current glusterHostsList provides one of ipaddress of the local host
(other than 127.*.*.*).   If virbr0 is enabled, it picks up
192.168.122.1 ip address of the bridge and sends to the engine, but
this entry is missing in the table.

The requirement is that we need a ip of the local host which is also
stored in the database.

The database has entries of ips of a host those are from physical
nics and/or bridges who has slaves to nics.

It's not something I've tested, or want to encourage, but currently,
outside of gluster, Vdsm may run behind a fancy NAT as a virtual 
server.

I.e., its local undress may be utterly different from the address used
by Engine.

I'd like to keep having this flexibility, and not to assume otherwise.

Why does glusterHostsList need to return the ip of the management
network? The client that issued this verb has to know that IP in the
first place.

I notice that the idiom _getLocalIpAddress() or _getGlusterHostName()
is used all too often in vdsm/gluster/cli.py.

How about changing the Vdsm/Engine API so that the string 
localhost is

returned instead? Then, Engine can replace it with whatever it seems
fit.

Dan.

Dan,

Thanks for clarifying. Looks like relying on the IpAddress to determine
the host will be prone to errors going forward.
We will change the approach and start using the UUID that gluster peer
status returns to identify host - will create a new verb glusterPeerList
that does this.



Current glusterHostsList provides list of
{'hostname': HOSTNAME, 'uuid': UUID, 'status': STATE} including local 
host.


What will be the difference between new glusterPeerList and existing 
glusterHostsList?


If this is the case, we just need to make sure at engine we use UUID and 
not IP address to identify host. We would still need a vdsm verb that 
will return the current host gluster UUID, to store in engine in case of 
Add Host flow.




And for the current host, like you mentioned, since the engine already
knows which vdsm host this command is executed on, the engine will not
rely on vdsm to return the host's IP.




Regards,
Bala




___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] build stuck on RunVmCommandTest

2013-04-10 Thread Sahina Bose

Juan, thanks! This patch fixed the problem.

On 04/10/2013 04:28 PM, Juan Hernandez wrote:

On 04/10/2013 12:12 PM, Juan Hernandez wrote:

On 04/10/2013 10:31 AM, Allon Mureinik wrote:
Einav had a similar issue yesterday with RemoveDiskTest (IIRC), 
which at

first pointed me to the direction of Java 7, but this is unrelated.

The root of all these problems is commit
fd6835059f110f4e14d67c9d2d31aa786a822f4b (core: Locate data source in a
loop) - now, whenever we have unmocked DAO calls (like in 
RunVmCommand),

instead of failing them fast and silently, we'll get stuck in a loop.

We need to see if we can offer a quick workaround, or perhaps revert
this patch until we can offer such a solution.
Juan, your input would be appreciates here.



As a workaround create /etc/ovirt-engine/engine.conf and add the
following two lines:

ENGINE_DB_CONNECTION_TIMEOUT=0
ENGINE_DB_CHECK_INTERVAL=0

Then the tests should run faster.



This is a way to solve it:

http://gerrit.ovirt.org/13782



Thanks,
Allon

 



 *From: *Shireesh Anjal san...@redhat.com
 *To: *Allon Mureinik amure...@redhat.com
 *Cc: *engine-devel@ovirt.org
 *Sent: *Wednesday, April 10, 2013 11:15:40 AM
 *Subject: *Re: [Engine-devel] build stuck on RunVmCommandTest

 On 04/10/2013 01:39 PM, Allon Mureinik wrote:

 Real oddness.
 out of interest, can you run
 java -version

 and report the version here?


 shireesh@localhost ovirt-engine]$ java -version
 java version 1.7.0_09-icedtea
 OpenJDK Runtime Environment (fedora-2.3.4.fc18-x86_64)
 OpenJDK 64-Bit Server VM (build 23.2-b09, mixed mode)



 



 *From: *Shireesh Anjal san...@redhat.com
 *To: *engine-devel@ovirt.org
 *Sent: *Wednesday, April 10, 2013 9:40:19 AM
 *Subject: *[Engine-devel] build stuck on RunVmCommandTest

 Hi,

  From last night onwards, my mvn build is getting stuck 
for

 a long time (  30 minutes) on RunVmCommandTest

 Running org.ovirt.engine.core.bll.MoveDisksCommandTest
 Tests run: 9, Failures: 0, Errors: 0, Skipped: 0, Time
 elapsed: 0.04 sec
 *Running org.ovirt.engine.core.bll.RunVmCommandTest**
 **Tests run: 25, Failures: 0, Errors: 0, Skipped: 0, Time
 elapsed: 1,983.033 sec**
 *Running 
org.ovirt.engine.core.bll.lock.InMemoryLockManagerTest

 Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time
 elapsed: 0.007 sec
 Running org.ovirt.engine.core.bll.RemoveImageCommandTest

 The same issue is happening on one of my colleague's 
system
 as well. Any help in fixing this will be highly 
appreciated.


 Regards,
 Shireesh

 ___
 Engine-devel mailing list
 Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel






___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel









___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


[Engine-devel] ovirt-host-deploy and multible bridges

2013-04-09 Thread Sahina Bose

Hi all,

I'm testing the bootstrapping of host without reboot on Fedora 18. After
host's bootstrap,
Ifconfig output returns this:

ovirtmgmt: flags=4163UP,BROADCAST,RUNNING,MULTICAST  mtu 1500
 inet 10.70.37.219  netmask 255.255.254.0  broadcast 10.70.37.255
   snipped

virbr0: flags=4099UP,BROADCAST,MULTICAST  mtu 1500
 inet 192.168.122.1  netmask 255.255.255.0  broadcast
192.168.122.255
snipped

Running*glusterHostsList*  vdsm verb, returns the ip address
192.168.122.1, whereas my host has been added with ip address 10.70.37.219

If I reboot the host, the virbr0 bridge is removed, and there's no issue.

The vdsm verb glusterHostsList - returns ipAddress of host + output of 
gluster peer probe. This is needed because a periodic sync job needs to 
make sure that the hosts added in engine are in sync with the gluster 
cli (hosts could also be added/removed from gluster cli).


How can we make sure glusterHostsList picks the correct ipAddress? 
Reading the inetinfo based on bridge has been vetoed as we are doing 
away with bridges.


It would also work if virbr0 was updated in vds_interfaces table. Since 
this is not happening either - we have an issue.


thanks
sahina

___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] ovirt-host-deploy and multible bridges

2013-04-09 Thread Sahina Bose

[Adding vdsm-devel]

On 04/09/2013 03:40 PM, Sahina Bose wrote:

Hi all,

I'm testing the bootstrapping of host without reboot on Fedora 18. After
host's bootstrap,
Ifconfig output returns this:

ovirtmgmt: flags=4163UP,BROADCAST,RUNNING,MULTICAST  mtu 1500
  inet 10.70.37.219  netmask 255.255.254.0  broadcast 10.70.37.255
snipped

virbr0: flags=4099UP,BROADCAST,MULTICAST  mtu 1500
  inet 192.168.122.1  netmask 255.255.255.0  broadcast
192.168.122.255
 snipped

Running*glusterHostsList*  vdsm verb, returns the ip address
192.168.122.1, whereas my host has been added with ip address 10.70.37.219

If I reboot the host, the virbr0 bridge is removed, and there's no issue.

The vdsm verb glusterHostsList - returns ipAddress of host + output of 
gluster peer probe. This is needed because a periodic sync job needs 
to make sure that the hosts added in engine are in sync with the 
gluster cli (hosts could also be added/removed from gluster cli).


How can we make sure glusterHostsList picks the correct ipAddress? 
Reading the inetinfo based on bridge has been vetoed as we are doing 
away with bridges.


It would also work if virbr0 was updated in vds_interfaces table. 
Since this is not happening either - we have an issue.


thanks
sahina



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] ovirt-host-deploy and multible bridges

2013-04-09 Thread Sahina Bose
Decoding correct address  - glusterHostsList should return any 
ipAddress that engine knows as being associated with host.
It could be either ipAddress used while adding host (stored as hostname 
in vds_static) or any of the ipAddresses populated in vds_interface 
table (addr column) .
I do not have enough knowledge about this bit of code to say what 
entries are made in vds_interface table. I know there's an entry for 
ovirtmgmt here but not sure if this gets added as part of addHost flow 
or not.


thx
sahina

On 04/09/2013 06:05 PM, Dan Kenigsberg wrote:

On Tue, Apr 09, 2013 at 03:55:25PM +0530, Sahina Bose wrote:

[Adding vdsm-devel]

On 04/09/2013 03:40 PM, Sahina Bose wrote:

Hi all,

I'm testing the bootstrapping of host without reboot on Fedora 18. After
host's bootstrap,
Ifconfig output returns this:

ovirtmgmt: flags=4163UP,BROADCAST,RUNNING,MULTICAST  mtu 1500
  inet 10.70.37.219  netmask 255.255.254.0  broadcast 10.70.37.255
snipped

virbr0: flags=4099UP,BROADCAST,MULTICAST  mtu 1500
  inet 192.168.122.1  netmask 255.255.255.0  broadcast
192.168.122.255
 snipped

Running*glusterHostsList*  vdsm verb, returns the ip address
192.168.122.1, whereas my host has been added with ip address 10.70.37.219

If I reboot the host, the virbr0 bridge is removed, and there's no issue.

The vdsm verb glusterHostsList - returns ipAddress of host +
output of gluster peer probe. This is needed because a periodic
sync job needs to make sure that the hosts added in engine are in
sync with the gluster cli (hosts could also be added/removed from
gluster cli).

How can we make sure glusterHostsList picks the correct ipAddress?

Can you define (in plain English) what is the correct address?
The host may have multiple valid addresses (storage, migration, display,
whatnot).

Only when it's clear to us, we can start expressing this in Python.


Reading the inetinfo based on bridge has been vetoed as we are
doing away with bridges.

It would also work if virbr0 was updated in vds_interfaces table.
Since this is not happening either - we have an issue.

It might be a valid hack to drop this default virbr0 on vdsm start - not
only the libvirt definition thereof, but also the running kernel device.

However, as expressed above, this would not solve your problem when you
have a currently-running host with multiple addresses.

Dan.


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] FeatureSupported and compatibility versions

2013-04-04 Thread Sahina Bose
The idea should be to make sure that maintaining the feature supported 
matrix is not a nightmare. If we need to go and replicate entries in 
_config.sql file for each new version, then this is an issue. And I 
think we are all in agreement that this is not the way to go.


Either we go with
[1].  default value in ConfigValues, and only changed value in db script 
like in patch set 5 (http://gerrit.ovirt.org/#/c/12970/5)


If this mechanism is broken, do you know where/what is broken?

or  [2] . with the new approach where a Feature supported changes to 
true/false **from** a particular version.
(I think, for gluster features, the Feature From works for us as we do 
not see it changing from version to version once supported. )


But if there's a way to fix 1, let's do that to get this moving.
Mike, could you elaborate on what needs to be fixed in [1] ?



On 04/02/2013 04:30 PM, Shireesh Anjal wrote:

On 04/02/2013 02:47 PM, Mike Kolesnik wrote:



On 03/27/2013 05:48 PM, Mike Kolesnik wrote:

- Original Message -

On 03/20/2013 08:20 PM, Yair Zaslavsky wrote:

- Original Message -

From: Shireesh Anjalsan...@redhat.com
To: Mike Kolesnikmkole...@redhat.com
Cc:engine-devel@ovirt.org
Sent: Wednesday, March 20, 2013 4:47:08 PM
Subject: Re: [Engine-devel] FeatureSupported and 
compatibility
versions

On 03/18/2013 01:11 PM, Shireesh Anjal wrote:

On 03/18/2013 12:59 PM, Mike Kolesnik wrote:

- Original Message -

Hi all,

The current mechanism in oVirt to check whether 
a feature is
supported
in a particular compatibility version is to use 
the
FeatureSupported
class. e.g.


FeatureSupported.networkLinking(getVm().getVdsGroupCompatibilityVersion())


Checks whether the network linking feature is 
supported for
the
the
VM's cluster compatibility version. This 
internally checks
whether
the
value of the corresponding config 
(NetworkLinkingSupported) for
the
given compatibility version is true/false.

I'm not sure if this is a good idea, since a 
feature is
typically
supported from a particular version. E.g. 
Gluster support was
introduced in 3.1, and it continues to be 
available in all
subsequent
versions. So I see no point in adding 
configuration for every
version
indicating whether the feature is supported in 
that version or
not. I
suggest to use either of the following options:

You can merge the configs into a single config 
when older
versions
go out of the supported versions for the system.

i.e. in 4.0 you can have upgrade script that merges 
all
GlusterFeatureSupported to one entry instead of 
several.

Why are we even storing this information in config? Is this
something
that can be configured at customer site?

As previously explained (but off list :) ) , Config gives you 
the
ability to have a cachable map of entry (i.e - feature name)
per version and value.
I guess it was convinient for the developers to use that.
I also mentioned that customers/oVirt users should config the
entries of vdc_options using engine-config tool only.
Not all entries are exposed via engine-config.properties (and 
no,
not just is feature supported entries are hidden).




1) Instead of using a boolean config for each 
version, use a
single
string config that indicates the supported 
from version e.g.
GlusterSupportedFrom = 3.1. There could be rare 
 cases where a
feature,
 

Re: [Engine-devel] Async Task Manager improvements

2013-03-13 Thread Sahina Bose

Hi Yair,

Thanks for the detailed design.
Had some questions

1. Can we think about introducing some DI framework in the Task 
Management package. This could be used to inject the DAL, VDS Broker, 
Commons etc dependencies. Even the list of providers and TaskStatusEvent 
handlers could be registered using this framework.


2. You mention Several providers that refer to instances of the same 
external system type have the same ProviderLogic object.  I'm not sure 
I understand this. Could you clarify?


3. Will TaskManager also talk to Job entity and update/end Job if necessary?

4. Are we planning to support custom actions on tasks? That is, 
depending on status of task, task can be paused/ resumed/ aborted 
/custom action performed etc


thanks
sahina



On 03/11/2013 03:38 PM, Yair Zaslavsky wrote:

Hi all,

I would like to present you a document I'm working on (still in 
draft/working-in-progress mode) of changes to be done at the engine async task 
manager.

Regarding the detailed design -

The suggested design breaks the task management into two modules - task 
management/polling part + command management (in context of completion of 
tasks/commands).
The current status of the design is that the design of task management is 
provided (needs some polishing) - the command management design will be 
provided soon.

In addition, we already have some ideas for an alternative design for the task 
management part (as suggested by Saggi Mizrahi).
After converging , we will present the complete design.
The reason we're sending the Wiki now is that community members will be aware 
mainly to the motivations behind the changes

(Perhaps we should create separate documents for the design and for the 
motivation/requirements)

http://www.ovirt.org/Wiki/AsyncTaskManagerChanges


Yair


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] NPE during addStorageServer command

2013-02-06 Thread Sahina Bose
Looking at the code, it looks like you need a Storage helper called 
GLUSTERFSStorageHelper in package org.ovirt.engine.core.bll.storage. The 
NPE seems to be because the Storage helper is null for the storage type.


On 02/06/2013 04:42 PM, Deepak C Shetty wrote:

Hi All,
   I am trying to compile ovirt engine after applying sharad's 
glusterfs domain support patches @
http://gerrit.ovirt.org/#/q/project:ovirt-engine+branch:master+topic:glusterfs,n,z 



After compiling, deploying the engine ( by following the steps in 
wiki.ovirt.org/Building_Ovirt_Engine )
I connect to the weadmin GUI, add a VDSM host ( this is my own vdsm 
host with VDSM glsuter domain support ) and the host is Up state.


Then I select New SD-None in DC - select my vdsm host-provide the 
args (remote path = volfileserver:volume, vfstype=glusterfs, mount = 
left blank) and click on OK


I see NPE in the engine side during addStorageServer cmd ( IIUC ) and 
then engine tries to send disconnectStorageServer, which reaches my 
VDSM host and it throws a excp, since the domain is not mounted at all.


I have captured the logs on the engien and vdsm side and attached here.

I am looking for some help on why the NPE is being seen on the engien 
side during adding a new SD ?


thanx,
deepak



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


Re: [Engine-devel] NPE during addStorageServer command

2013-02-06 Thread Sahina Bose


On 02/06/2013 05:36 PM, Shireesh Anjal wrote:

On 02/06/2013 04:55 PM, Sahina Bose wrote:
Looking at the code, it looks like you need a Storage helper called 
GLUSTERFSStorageHelper in package org.ovirt.engine.core.bll.storage. 


Which is introduced by Sharad's this patch: http://gerrit.ovirt.org/8834
I guess you don't have this patch in your local repository.
In this patch, you will need to rename the file as it's case sensitive 
on linux- GlusterFsStorageHelper should be GLUSTERFSStorageHelper.


Hope this helps,
cheers
sahina


The NPE seems to be because the Storage helper is null for the 
storage type.


On 02/06/2013 04:42 PM, Deepak C Shetty wrote:

Hi All,
   I am trying to compile ovirt engine after applying sharad's 
glusterfs domain support patches @
http://gerrit.ovirt.org/#/q/project:ovirt-engine+branch:master+topic:glusterfs,n,z 



After compiling, deploying the engine ( by following the steps in 
wiki.ovirt.org/Building_Ovirt_Engine )
I connect to the weadmin GUI, add a VDSM host ( this is my own vdsm 
host with VDSM glsuter domain support ) and the host is Up state.


Then I select New SD-None in DC - select my vdsm host-provide the 
args (remote path = volfileserver:volume, vfstype=glusterfs, mount 
= left blank) and click on OK


I see NPE in the engine side during addStorageServer cmd ( IIUC ) 
and then engine tries to send disconnectStorageServer, which reaches 
my VDSM host and it throws a excp, since the domain is not mounted 
at all.


I have captured the logs on the engien and vdsm side and attached here.

I am looking for some help on why the NPE is being seen on the 
engien side during adding a new SD ?


thanx,
deepak



___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel




___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel




___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel


___
Engine-devel mailing list
Engine-devel@ovirt.org
http://lists.ovirt.org/mailman/listinfo/engine-devel