On Tue, 13 Nov 2018 13:01:09 +0100 Martin Perina <mper...@redhat.com> wrote:
> On Tue, Nov 13, 2018 at 12:49 PM Michal Skrivanek <mskri...@redhat.com> > wrote: > > > > > > > On 13 Nov 2018, at 12:20, Dominik Holler <dhol...@redhat.com> wrote: > > > > On Tue, 13 Nov 2018 11:56:37 +0100 > > Martin Perina <mper...@redhat.com> wrote: > > > > On Tue, Nov 13, 2018 at 11:02 AM Dafna Ron <d...@redhat.com> wrote: > > > > Martin? can you please look at the patch that Dominik sent? > > We need to resolve this as we have not had an engine build for the last 11 > > days > > > > > > Yesterday I've merged Dominik's revert patch > > https://gerrit.ovirt.org/95377 > > which should switch cluster level back to 4.2. Below mentioned change > > https://gerrit.ovirt.org/95310 is relevant only to cluster level 4.3, am I > > right Michal? > > > > The build mentioned > > > > https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11121/ > > is from yesterday. Are we sure that it was executed only after #95377 was > > merged? I'd like to see the results from latest > > > > https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11127/ > > but unfortunately it already waits more than an hour for available hosts > > ... > > > > > > > > > > > > https://gerrit.ovirt.org/#/c/95283/ results in > > > > http://jenkins.ovirt.org/job/ovirt-engine_master_build-artifacts-el7-x86_64/8071/ > > which is used in > > > > https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3489/parameters/ > > results in run_vms succeeding. > > > > The next merged change > > https://gerrit.ovirt.org/#/c/95310/ results in > > > > http://jenkins.ovirt.org/job/ovirt-engine_master_build-artifacts-el7-x86_64/8072/ > > which is used in > > > > https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3490/parameters/ > > results in run_vms failing with > > 2018-11-12 17:35:10,109-05 INFO > > [org.ovirt.engine.core.bll.RunVmOnceCommand] (default task-1) > > [6930b632-5593-4481-bf2a-a1d8b14a583a] Running command: RunVmOnceCommand > > internal: false. Entities affected : ID: > > d10aa133-b9b6-455d-8137-ab822d1c1971 Type: VMAction group RUN_VM with role > > type USER > > 2018-11-12 17:35:10,113-05 DEBUG > > [org.ovirt.engine.core.common.di.interceptor.DebugLoggingInterceptor] > > (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] method: > > getVmManager, params: [d10aa133-b9b6-455d-8137-ab822d1c1971], timeElapsed: > > 4ms > > 2018-11-12 17:35:10,128-05 DEBUG > > [org.ovirt.engine.core.common.di.interceptor.DebugLoggingInterceptor] > > (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] method: > > getAllForClusterWithStatus, params: [2ca9ccd8-61f0-470c-ba3f-07766202f260, > > Up], timeElapsed: 7ms > > 2018-11-12 17:35:10,129-05 INFO > > [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-1) > > [6930b632-5593-4481-bf2a-a1d8b14a583a] Candidate host > > 'lago-basic-suite-master-host-1' ('282860ab-8873-4702-a2be-100a6da111af') > > was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' > > (correlation id: 6930b632-5593-4481-bf2a-a1d8b14a583a) > > 2018-11-12 17:35:10,129-05 INFO > > [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-1) > > [6930b632-5593-4481-bf2a-a1d8b14a583a] Candidate host > > 'lago-basic-suite-master-host-0' ('c48eca36-ea98-46b2-8473-f184833e68a8') > > was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' > > (correlation id: 6930b632-5593-4481-bf2a-a1d8b14a583a) > > 2018-11-12 17:35:10,130-05 ERROR [org.ovirt.engine.core.bll.RunVmCommand] > > (default task-1) [6930b632-5593-4481-bf2a-a1d8b14a583a] Can't find VDS to > > run the VM 'd10aa133-b9b6-455d-8137-ab822d1c1971' on, so this VM will not > > be run. > > in > > > > https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3490/artifact/exported-artifacts/test_logs/basic-suite-master/post-004_basic_sanity.py/lago-basic-suite-master-engine/_var_log/ovirt-engine/engine.log/*view*/ > > > > Is this helpful for you? > > > > > > > > actually, there ire two issues > > 1) cluster is still 4.3 even after Martin’s revert. > > > > https://gerrit.ovirt.org/#/c/95409/ should align cluster level with dc level > This change aligns the cluster level, but https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3502/parameters/ consuming build result from https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11121/ looks like that this does not solve the issue: File "/home/jenkins/workspace/ovirt-system-tests_manual/ovirt-system-tests/basic-suite-master/test-scenarios/004_basic_sanity.py", line 698, in run_vms api.vms.get(VM0_NAME).start(start_params) File "/usr/lib/python2.7/site-packages/ovirtsdk/infrastructure/brokers.py", line 31193, in start headers={"Correlation-Id":correlation_id} File "/usr/lib/python2.7/site-packages/ovirtsdk/infrastructure/proxy.py", line 122, in request persistent_auth=self.__persistent_auth File "/usr/lib/python2.7/site-packages/ovirtsdk/infrastructure/connectionspool.py", line 79, in do_request persistent_auth) File "/usr/lib/python2.7/site-packages/ovirtsdk/infrastructure/connectionspool.py", line 162, in __do_request raise errors.RequestError(response_code, response_reason, response_body) RequestError: status: 400 reason: Bad Request engine.log: 2018-11-14 03:10:36,802-05 INFO [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-3) [99e282ea-577a-4dab-857b-285b1df5e6f6] Candidate host 'lago-basic-suite-master-host-0' ('4dbfb937-ac4b-4cef-8ae3-124944829add') was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' (correlation id: 99e282ea-577a-4dab-857b-285b1df5e6f6) 2018-11-14 03:10:36,802-05 INFO [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-3) [99e282ea-577a-4dab-857b-285b1df5e6f6] Candidate host 'lago-basic-suite-master-host-1' ('731e5055-706e-4310-a062-045e32ffbfeb') was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' (correlation id: 99e282ea-577a-4dab-857b-285b1df5e6f6) 2018-11-14 03:10:36,802-05 ERROR [org.ovirt.engine.core.bll.RunVmCommand] (default task-3) [99e282ea-577a-4dab-857b-285b1df5e6f6] Can't find VDS to run the VM 'dc1e1e92-1e5c-415e-8ac2-b919017adf40' on, so this VM will not be run. > 2) the patch is wrong too, as in HandleVdsCpuFlagsOrClusterChangedCommand > > it just goes ahead and sets the cluster cpu to whatever the host reported > > regardless if it is valid or not. Steven, please fix that (line 96 in > > backend/manager/modules/bll/src/main/java/org/ovirt/engine/core/bll/HandleVdsCpuFlagsOrClusterChangedCommand.java). > > It needs to pass the validation or we need some other solution. > > 3) regardless, we should make 4.3 work too , I tried to play with it a bit > > in https://gerrit.ovirt.org/#/c/95407/, let’s see… > > > > Thanks, > > michal > > > > > > > > On Mon, Nov 12, 2018 at 3:58 PM Dominik Holler <dhol...@redhat.com> wrote: > > > > On Mon, 12 Nov 2018 13:45:54 +0100 > > Martin Perina <mper...@redhat.com> wrote: > > > > On Mon, Nov 12, 2018 at 12:58 PM Dominik Holler <dhol...@redhat.com> > > > > wrote: > > > > > > On Mon, 12 Nov 2018 12:29:17 +0100 > > Martin Perina <mper...@redhat.com> wrote: > > > > On Mon, Nov 12, 2018 at 12:20 PM Dafna Ron <d...@redhat.com> wrote: > > > > There are currently two issues failing ovirt-engine on CQ ovirt > > > > master: > > > > > > 1. edit vm pool is causing failure in different tests. it has a > > > > patch > > > > *waiting > > > > to be merged*: https://gerrit.ovirt.org/#/c/95354/ > > > > > > Merged > > > > > > 2. we have a failure in upgrade suite as well to run vm but this > > > > seems > > > > to > > > > be related to the tests as well: > > 2018-11-12 05:41:07,831-05 WARN > > [org.ovirt.engine.core.bll.validator.VirtIoRngValidator] > > > > (default > > > > task-1) > > > > [] Random number source URANDOM is not supported in cluster > > > > 'test-cluster' > > > > compatibility version 4.0. > > > > here is the full error from the upgrade suite failure in run vm: > > https://pastebin.com/XLHtWGGx > > > > Here is the latest failure: > > > > > > > > https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11118/ > > > > > > > > I will try to take a look later today > > > > > > I have the idea that this might be related to > > https://gerrit.ovirt.org/#/c/95377/ , and I check in > > > > > > https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3485/console > > > > > > , but I have to stop now, if not solved I can go on later today. > > > > > > OK, both CI and above manual OST job went fine, so I've just merged the > > revert patch. I will take a look at it later in detail, we should > > > > really be > > > > testing 4.3 on master and not 4.2 > > > > > > Ack. > > > > Now > > > > > > https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/11121/ > > is failing on > > File > > > > "/home/jenkins/workspace/ovirt-master_change-queue-tester/ovirt-system-tests/basic-suite-master/test-scenarios/004_basic_sanity.py", > > line 698, in run_vms > > api.vms.get(VM0_NAME).start(start_params) > > status: 400 > > reason: Bad Request > > > > 2018-11-12 10:06:30,722-05 INFO > > [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-3) > > [b8d11cb0-5be9-4b7e-b45a-c95fa1f18681] Candidate host > > 'lago-basic-suite-master-host-1' ('dbfe1b0c-f940-4dba-8fb1-0cfe5ca7ddfc') > > was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' > > (correlation id: b8d11cb0-5be9-4b7e-b45a-c95fa1f18681) > > 2018-11-12 10:06:30,722-05 INFO > > [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-3) > > [b8d11cb0-5be9-4b7e-b45a-c95fa1f18681] Candidate host > > 'lago-basic-suite-master-host-0' ('e83a63ca-381e-40db-acb2-65a3e7953e11') > > was filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'CPU-Level' > > (correlation id: b8d11cb0-5be9-4b7e-b45a-c95fa1f18681) > > 2018-11-12 10:06:30,723-05 ERROR [org.ovirt.engine.core.bll.RunVmCommand] > > (default task-3) [b8d11cb0-5be9-4b7e-b45a-c95fa1f18681] Can't find VDS to > > run the VM '57a66eff-8cbf-4643-b045-43d4dda80c66' on, so this VM will not > > be run. > > > > Is this related to > > https://gerrit.ovirt.org/#/c/95310/ > > ? > > > > > > > > > > > > > > Thanks, > > Dafna > > > > > > > > > > On Mon, Nov 12, 2018 at 9:23 AM Dominik Holler < > > > > dhol...@redhat.com> > > > > wrote: > > > > > > On Sun, 11 Nov 2018 19:04:40 +0200 > > Dan Kenigsberg <dan...@redhat.com> wrote: > > > > On Sun, Nov 11, 2018 at 5:27 PM Eyal Edri <ee...@redhat.com> > > > > wrote: > > > > > > > > > > On Sun, Nov 11, 2018 at 5:24 PM Eyal Edri <ee...@redhat.com> > > > > > > wrote: > > > > > > > > > > On Sun, Nov 11, 2018 at 5:20 PM Dan Kenigsberg < > > > > dan...@redhat.com> > > > > wrote: > > > > > > On Sun, Nov 11, 2018 at 4:36 PM Ehud Yonasi < > > > > eyon...@redhat.com> > > > > > > wrote: > > > > > > Hey, > > I've seen that CQ Master is not passing ovirt-engine for > > > > 10 > > > > days > > > > and fails on test suite called restore_vm0_networking > > > > here's a snap error regarding it: > > > > https://pastebin.com/7msEYqKT > > > > Link to a sample job with the error: > > > > > > > > > > > > http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/11113/artifact/basic-suite.el7.x86_64/004_basic_sanity.py.junit.xml > > > > > > > > I cannot follow this link because I'm 4 minutes too late > > > > jenkins.ovirt.org uses an invalid security certificate. > > > > The > > > > certificate expired on November 11, 2018, 5:13:25 PM > > > > GMT+2. The > > > > current time is November 11, 2018, 5:17 PM. > > > > > > > > Yes, we're looking into that issue now. > > > > > > > > Fixed, you should be able to access it now. > > > > > > OST fails during restore_vm0_networking in line 101 of > > 004_basic_sanity.py while comparing > > vm_service.get().status == state > > > > It seems that instead of reporting back the VM status, Engine > > > > set > > > > garbage > > > > "The response content type 'text/html; charset=iso-8859-1' > > > > isn't the > > > > expected XML" > > > > > > The relevant line in > > > > > > > > > > https://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/11113/artifact/basic-suite.el7.x86_64/test_logs/basic-suite-master/post-004_basic_sanity.py/lago-basic-suite-master-engine/_var_log/httpd/ssl_access_log/*view*/ > > > > seems to be > > 192.168.201.1 - - [11/Nov/2018:04:27:43 -0500] "GET > > /ovirt-engine/api/vms/26088164-d1a0-4254-a377-5d3c242c8105 > > > > HTTP/1.1" > > > > 503 299 > > > > and I guess the 503 error message is sent in HTML instead of XML. > > > > If I run manually > > https://gerrit.ovirt.org/#/c/95354/ > > with latest build of engine-master > > > > > > > > > > http://jenkins.ovirt.org/job/ovirt-engine_master_build-artifacts-el7-x86_64/8074/ > > > > basic suite seems to be happy: > > https://jenkins.ovirt.org/view/oVirt system > > tests/job/ovirt-system-tests_manual/3484/ > > > > > > I do not know what could cause that, and engine.log does not > > > > mention > > > > it. But it seems like a problem in engine API hence +Martin > > > > Perina > > > > and > > > > +Ondra Machacek . > > > > > > > > > > > > > > > > > > > > > > Can some1 have a look at it and help to resolve the > > > > issue? > > > > > > > > _______________________________________________ > > Infra mailing list -- in...@ovirt.org > > To unsubscribe send an email to infra-le...@ovirt.org > > Privacy Statement: > > > > https://www.ovirt.org/site/privacy-policy/ > > > > oVirt Code of Conduct: > > > > https://www.ovirt.org/community/about/community-guidelines/ > > > > List Archives: > > > > > > > > > > https://lists.ovirt.org/archives/list/in...@ovirt.org/message/ZQAYWTLZJKGPJ25F33E6ICVDXQDYSKSQ/ > > > > > > _______________________________________________ > > Devel mailing list -- devel@ovirt.org > > To unsubscribe send an email to devel-le...@ovirt.org > > Privacy Statement: > > > > https://www.ovirt.org/site/privacy-policy/ > > > > oVirt Code of Conduct: > > > > https://www.ovirt.org/community/about/community-guidelines/ > > > > List Archives: > > > > > > > > > > https://lists.ovirt.org/archives/list/devel@ovirt.org/message/R5LOJH73XCLLFOUTKPM5GUCS6PNNKGTE/ > > > > > > > > > > > > -- > > > > Eyal edri > > > > > > MANAGER > > > > RHV/CNV DevOps > > > > EMEA VIRTUALIZATION R&D > > > > > > Red Hat EMEA > > > > TRIED. TESTED. TRUSTED. > > phone: +972-9-7692018 > > irc: eedri (on #tlv #rhev-dev #rhev-integ) > > > > > > > > > > -- > > > > Eyal edri > > > > > > MANAGER > > > > RHV/CNV DevOps > > > > EMEA VIRTUALIZATION R&D > > > > > > Red Hat EMEA > > > > TRIED. TESTED. TRUSTED. > > phone: +972-9-7692018 > > irc: eedri (on #tlv #rhev-dev #rhev-integ) > > > > _______________________________________________ > > Devel mailing list -- devel@ovirt.org > > To unsubscribe send an email to devel-le...@ovirt.org > > Privacy Statement: https://www.ovirt.org/site/privacy-policy/ > > oVirt Code of Conduct: > > > > https://www.ovirt.org/community/about/community-guidelines/ > > > > List Archives: > > > > > > > > > > https://lists.ovirt.org/archives/list/devel@ovirt.org/message/DA6Q5RE5JO3FYIKN2QLKLWMCUBQA2HBX/ > > > > > > > > > > > > > > > > > > > > > > > > > > > _______________________________________________ Devel mailing list -- devel@ovirt.org To unsubscribe send an email to devel-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/devel@ovirt.org/message/PVPUSWIAQMK5FX4CEVTSDOYBPJTAPOHH/