I had to re-trigger didi's fix due to a issue in Jenkins but I can now confirm that the issue is resolved in ovirt-engine and that we have a new ovirt-engine package in tested for el7:
https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/10340/ https://resources.ovirt.org/repos/ovirt/tested/master/rpm/el7/noarch/ On Mon, Sep 17, 2018 at 4:04 PM Martin Perina <mper...@redhat.com> wrote: > > > On Mon, 17 Sep 2018, 16:25 Ravi Shankar Nori, <rn...@redhat.com> wrote: > >> host-deploy is still broken on master fc28 >> > > Yes, there are multiple issues on FC28, but the question is if this fixed > OST on CentOS? > > >> On Mon, Sep 17, 2018 at 8:01 AM, Yuval Turgeman <yturg...@redhat.com> >> wrote: >> >>> I'm pretty sure I verified this on el7 as well, i'll check again, but >>> thinking about it, tar will stop when it gets to the first empty block, so >>> if the record size on the engine's side is large and the end is filled with >>> zeros, -b1 will make it stop at the first empty block so the next read on >>> the host's side would get the trailing zeros which is what otopi reads. >>> Btw, it could be a problem with deployed el7 systems as well, if for any >>> reason the default on the host is set to something that is more than 20 >>> blocks (can be set with export TAR_BLOCKING_FACTOR for the root account on >>> the host side). >>> It's ok to revert the patch to fix the regression, but I don't see any >>> other way other than -b1... perhaps add a `cat -` after to just read until >>> EOF or something, or have otopi strip the input. >>> >>> On Mon, Sep 17, 2018 at 2:30 PM, Galit Rosenthal <grose...@redhat.com> >>> wrote: >>> >>>> Didi, >>>> >>>> Is this what you are looking for >>>> https://ovirt-jira.atlassian.net/browse/OVIRT-2259 >>>> ? >>>> Galit >>>> >>>> On Mon, Sep 17, 2018 at 1:54 PM Dafna Ron <d...@redhat.com> wrote: >>>> >>>>> I think that in ovirt-engine we currently only build to centos. >>>>> since we have not had an engine build for 2 weeks (on master) I think >>>>> we should merge and worry about fc28 once it would be relevant. >>>>> >>>>> the failure we have now could be another regression missed since the >>>>> project has been broken for two weeks. >>>>> >>>>> Thanks, >>>>> Dafna >>>>> >>>>> >>>>> >>>>> On Mon, Sep 17, 2018 at 10:30 AM Yedidyah Bar David <d...@redhat.com> >>>>> wrote: >>>>> >>>>>> On Mon, Sep 17, 2018 at 11:49 AM Dafna Ron <d...@redhat.com> wrote: >>>>>> > >>>>>> > Didi, Marin, any update on the patch? >>>>>> >>>>>> Yes - it passed. Actually failed, but only after host-deploy: >>>>>> >>>>>> >>>>>> https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3189/ >>>>>> >>>>>> I'd rather not merge it as-is, because it will break fedora. >>>>>> >>>>>> If someone can have a look at the code generating the tar file, and >>>>>> can see if >>>>>> it's easy to make it work well for both centos and fedora, perhaps by >>>>>> explicitly >>>>>> setting all relevant params to some reasonable values, great. >>>>>> Otherwise, I guess >>>>>> we can merge for now, as fedora is still not supported anyway. >>>>>> >>>>>> Thanks, >>>>>> >>>>>> > >>>>>> > >>>>>> > On Sun, Sep 16, 2018 at 11:09 AM Yedidyah Bar David < >>>>>> d...@redhat.com> wrote: >>>>>> >> >>>>>> >> On Sun, Sep 16, 2018 at 12:53 PM Yedidyah Bar David < >>>>>> d...@redhat.com> wrote: >>>>>> >> > >>>>>> >> > On Fri, Sep 14, 2018 at 6:06 PM Martin Perina < >>>>>> mper...@redhat.com> wrote: >>>>>> >> > > >>>>>> >> > > >>>>>> >> > > >>>>>> >> > > On Fri, Sep 14, 2018 at 4:51 PM, Ravi Shankar Nori < >>>>>> rn...@redhat.com> wrote: >>>>>> >> > >> >>>>>> >> > >> I see the same errors on my dev env. From the logs attached >>>>>> by Andrej the response received by otopi has a bunch of null chars before >>>>>> the actual response CONFIRM DEPLOY_PROCEED=yes >>>>>> >> > >> >>>>>> >> > >> >>>>>> >> > >> >>>>>> >> > >> 2018-09-14 15:49:23,018+0200 DEBUG >>>>>> otopi.plugins.otopi.dialog.machine dialog.__logString:204 DIALOG:SEND >>>>>> ### Response is CONFIRM DEPLOY_PROCEED=yes|no or ABORT DEPLOY_PROCEED >>>>>> >> > >> >>>>>> >> > >> ^@^@^@^@^@^@^@^@^@CONFIRM DEPLOY_PROCEED=yes >>>>>> >> > > >>>>>> >> > > >>>>>> >> > > Didi/Sandro, could you please take a look? Below error seems >>>>>> like some issue in otopi, where an error is raised when handling binary >>>>>> input: >>>>>> >> > >>>>>> >> > Not sure the issue is "binary input" in general, but simply >>>>>> illegal >>>>>> >> > input. The prompt expects, as it says, one of these 3 replies: >>>>>> >> > >>>>>> >> > CONFIRM DEPLOY_PROCEED=yes >>>>>> >> > CONFIRM DEPLOY_PROCEED=no >>>>>> >> > ABORT DEPLOY_PROCEED >>>>>> >> > >>>>>> >> > Instead, judging from the file supplied by Andrej, it gets from >>>>>> the engine: >>>>>> >> > <7169 null bytes>CONFIRM DEPLOY_PROCEED=yes >>>>>> >> > >>>>>> >> > So either the engine now sends, for some reason, 7169 null >>>>>> bytes, in >>>>>> >> > this response, or there is some low-level change causing this to >>>>>> be >>>>>> >> > eventually supplied to otopi - a change in apache-sshd, openssh, >>>>>> some >>>>>> >> > library, the kernel, no idea. >>>>>> >> > >>>>>> >> > Well, thinking a bit, I have a wild guess: Perhaps it's related >>>>>> to the >>>>>> >> > patch introduced recently to change the tar blocking? >>>>>> >> >>>>>> >> https://gerrit.ovirt.org/94357 >>>>>> >> >>>>>> >> I am leaving soon, perhaps someone can try the manual job with the >>>>>> >> result of the check-patch job for above patch, to see if it fixes. >>>>>> >> Otherwise I'll do this tomorrow. >>>>>> >> >>>>>> >> > >>>>>> >> > > >>>>>> >> > > >>>>>> >> > > 2018-09-14 15:49:23,032+0200 DEBUG otopi.context >>>>>> context._executeMethod:143 method exception >>>>>> >> > > Traceback (most recent call last): >>>>>> >> > > File "/usr/lib/python2.7/site-packages/otopi/context.py", >>>>>> line 133, in _executeMethod >>>>>> >> > > method['method']() >>>>>> >> > > File >>>>>> "/tmp/ovirt-O6CfS4aUHI/otopi-plugins/ovirt-host-deploy/core/misc.py", >>>>>> line >>>>>> 87, in _confirm >>>>>> >> > > prompt=True, >>>>>> >> > > File >>>>>> "/tmp/ovirt-O6CfS4aUHI/otopi-plugins/otopi/dialog/machine.py", line 478, >>>>>> in >>>>>> confirm >>>>>> >> > > code=opcode, >>>>>> >> > > >>>>>> >> > > >>>>>> >> > >> >>>>>> >> > >> On Fri, Sep 14, 2018 at 10:44 AM, Dafna Ron <d...@redhat.com> >>>>>> wrote: >>>>>> >> > >>> >>>>>> >> > >>> if you run it with mock you would remove any environmental >>>>>> conditions that can effect the outcome so I recommend using mock >>>>>> >> > >>> >>>>>> >> > >>> >>>>>> >> > >>> On Fri, Sep 14, 2018 at 3:32 PM, Martin Perina < >>>>>> mper...@redhat.com> wrote: >>>>>> >> > >>>> >>>>>> >> > >>>> >>>>>> >> > >>>> >>>>>> >> > >>>> On Fri, Sep 14, 2018 at 3:49 PM, Dafna Ron <d...@redhat.com> >>>>>> wrote: >>>>>> >> > >>>>> >>>>>> >> > >>>>> did you use mock to reproduce? >>>>>> >> > >>>> >>>>>> >> > >>>> >>>>>> >> > >>>> No, just run_suite under myself >>>>>> >> > >>>>> >>>>>> >> > >>>>> >>>>>> >> > >>>>> On Fri, Sep 14, 2018 at 2:39 PM, Martin Perina < >>>>>> mper...@redhat.com> wrote: >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> Hi, >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> the problem is that we haven't fetched the temporary >>>>>> host-deploy log from /tmp directory, so we don't know which string that >>>>>> host-deploy process sent to engine is causing that issue. I tried to >>>>>> reproduce on my local machine, but I was unable to reproduce it, >>>>>> 002_bootstrap phase finished successfully (other phases are still >>>>>> running). >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> So if anyone is able to reproduce, please try to fetch >>>>>> host-deploy log from /tmp directory after the error is raised and share >>>>>> it. >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> Thanks >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> Martin >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> On Fri, Sep 14, 2018 at 1:52 PM, Dafna Ron < >>>>>> d...@redhat.com> wrote: >>>>>> >> > >>>>>>> >>>>>> >> > >>>>>>> Full logs can be found here: >>>>>> >> > >>>>>>> >>>>>> >> > >>>>>>> >>>>>> https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/10307/artifact/upgrade-from-release-suite.el7.x86_64/test_logs/upgrade-from-release-suite-master/post-002_bootstrap.py/ >>>>>> >> > >>>>>>> >>>>>> >> > >>>>>>> On Fri, Sep 14, 2018 at 12:48 PM, Dafna Ron < >>>>>> d...@redhat.com> wrote: >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> Hi, >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> The previous regression was resolved and we now have a >>>>>> new regression. >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> I don't think that the reported change is related so >>>>>> can someone from ovirt-engine take a look? >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> The failure is add host on the upgrade suite. >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> Please note that we have not had an engine-ovirt build >>>>>> for over 10 days due to several consecutive regressions and I would ask >>>>>> you >>>>>> to stop merging until we can stabilize the project and have a new package >>>>>> of engine. >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> error: >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> 2018-09-14 05:51:07,670-04 INFO >>>>>> [org.ovirt.engine.core.uutils.ssh.SSHDialog] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] SSH execute >>>>>> 'root@lago-upgrade-from-release-suite-master-host-0' 'umask 0077; >>>>>> MYTMP="$(TMPDIR="${OVIRT_TMPDIR}" mktemp -d -t ovirt-XXXXXXXXXX)"; trap >>>>>> "chmod -R u+rwX \"${MYTMP}\" > /dev/null 2>&1; rm -fr \"${MYTMP}\" > >>>>>> /dev/null 2>&1" 0; tar -b1 --warning=no-timestamp -C "${MYTMP}" -x && >>>>>> "${MYTMP}"/ovirt-host-deploy DIALOG/dialect=str:machine >>>>>> DIALOG/customization=bool:True' >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,550-04 INFO >>>>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] >>>>>> (VdsDeploy) [5c91fcbd] EVENT_ID: VDS_INSTALL_IN_PROGRESS(509), Installing >>>>>> Host lago-upgrade-from-release-suite-master-host-0. Stage: Initializing. >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,565-04 INFO >>>>>> [org.ovirt.engine.core.utils.transaction.TransactionSupport] (VdsDeploy) >>>>>> [5c91fcbd] transaction rolled back >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,574-04 ERROR >>>>>> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy) >>>>>> [5c91fcbd] >>>>>> Error during deploy dialog >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,578-04 ERROR >>>>>> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Error during host >>>>>> lago-upgrade-from-release-suite-master-host-0 install >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,586-04 ERROR >>>>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] EVENT_ID: >>>>>> VDS_INSTALL_IN_PROGRESS_ERROR(511), An error has occurred during >>>>>> installation of Host lago-upgrade-from-release-suite-master-host-0: >>>>>> CallableStatementCallback; SQL [{call insertauditlog(?, ?, ?, ?, ?, ?, ?, >>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, >>>>>> ?, >>>>>> ?, ?)}ERROR: invalid byte sequence for encoding "UTF8": 0x00; nested >>>>>> exception is org.postgresql.util.PSQLException: ERROR: invalid byte >>>>>> sequence for encoding "UTF8": 0x00. >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,586-04 ERROR >>>>>> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Error during host >>>>>> lago-upgrade-from-release-suite-master-host-0 install, preferring first >>>>>> exception: CallableStatementCallback; SQL [{call insertauditlog(?, ?, ?, >>>>>> ?, >>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, >>>>>> ?, >>>>>> ?, ?, ?, ?, ?)}ERROR: invalid byte sequence for encoding "UTF8": 0x00; >>>>>> nested exception is org.postgresql.util.PSQLException: ERROR: invalid >>>>>> byte >>>>>> sequence for encoding "UTF8": 0x00 >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,586-04 ERROR >>>>>> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Host installation >>>>>> failed for host 'e475e93a-63b3-4573-b242-162c2ed864f0', >>>>>> 'lago-upgrade-from-release-suite-master-host-0': >>>>>> CallableStatementCallback; >>>>>> SQL [{call insertauditlog(?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, >>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)}ERROR: invalid byte >>>>>> sequence for encoding "UTF8": 0x00; nested exception is >>>>>> org.postgresql.util.PSQLException: ERROR: invalid byte sequence for >>>>>> encoding "UTF8": 0x00 >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,615-04 INFO >>>>>> [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] START, >>>>>> SetVdsStatusVDSCommand(HostName = >>>>>> lago-upgrade-from-release-suite-master-host-0, >>>>>> SetVdsStatusVDSCommandParameters:{hostId='e475e93a-63b3-4573-b242-162c2ed864f0', >>>>>> status='InstallFailed', nonOperationalReason='NONE', >>>>>> stopSpmFailureLogged='false', maintenanceReason='null'}), log id: >>>>>> 146cdc08 >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,626-04 INFO >>>>>> [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] FINISH, >>>>>> SetVdsStatusVDSCommand, return: , log id: 146cdc08 >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,639-04 ERROR >>>>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] EVENT_ID: >>>>>> VDS_INSTALL_FAILED(505), Host >>>>>> lago-upgrade-from-release-suite-master-host-0 >>>>>> installation failed. CallableStatementCallback; SQL [{call >>>>>> insertauditlog(?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, >>>>>> ?, >>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)}ERROR: invalid byte sequence >>>>>> for >>>>>> encoding "UTF8": 0x00; nested exception is >>>>>> org.postgresql.util.PSQLException: ERROR: invalid byte sequence for >>>>>> encoding "UTF8": 0x00. >>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,652-04 INFO >>>>>> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand] >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Lock freed to object >>>>>> 'EngineLock:{exclusiveLocks='[e475e93a-63b3-4573-b242-162c2ed864f0=VDS]', >>>>>> sharedLocks=''}' >>>>>> >> > >>>>>>>> 2018-09-14 05:51:37,996-04 INFO >>>>>> [org.ovirt.engine.core.bll.quota.QuotaManager] >>>>>> (EE-ManagedThreadFactory-engineScheduled-Thread-44) [] Quota Cache >>>>>> updated. >>>>>> (19 msec) >>>>>> >> > >>>>>>>> (END) >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>>> Thanks, >>>>>> >> > >>>>>>>> Dafna >>>>>> >> > >>>>>>>> >>>>>> >> > >>>>>>> >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> >>>>>> >> > >>>>>> -- >>>>>> >> > >>>>>> Martin Perina >>>>>> >> > >>>>>> Associate Manager, Software Engineering >>>>>> >> > >>>>>> Red Hat Czech s.r.o. >>>>>> >> > >>>>> >>>>>> >> > >>>>> >>>>>> >> > >>>> >>>>>> >> > >>>> >>>>>> >> > >>>> >>>>>> >> > >>>> -- >>>>>> >> > >>>> Martin Perina >>>>>> >> > >>>> Associate Manager, Software Engineering >>>>>> >> > >>>> Red Hat Czech s.r.o. >>>>>> >> > >>> >>>>>> >> > >>> >>>>>> >> > >> >>>>>> >> > > >>>>>> >> > > >>>>>> >> > > >>>>>> >> > > -- >>>>>> >> > > Martin Perina >>>>>> >> > > Associate Manager, Software Engineering >>>>>> >> > > Red Hat Czech s.r.o. >>>>>> >> > >>>>>> >> > >>>>>> >> > >>>>>> >> > -- >>>>>> >> > Didi >>>>>> >> >>>>>> >> >>>>>> >> >>>>>> >> -- >>>>>> >> Didi >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Didi >>>>>> >>>>> _______________________________________________ >>>>> Infra mailing list -- infra@ovirt.org >>>>> To unsubscribe send an email to infra-le...@ovirt.org >>>>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/ >>>>> oVirt Code of Conduct: >>>>> https://www.ovirt.org/community/about/community-guidelines/ >>>>> List Archives: >>>>> https://lists.ovirt.org/archives/list/infra@ovirt.org/message/CG2IYPXSSEFTL6XCN72JHUSWOUY7QRSA/ >>>>> >>>> >>>> >>>> -- >>>> >>>> GALIT ROSENTHAL >>>> >>>> SOFTWARE ENGINEER >>>> >>>> Red Hat >>>> >>>> <https://www.redhat.com/> >>>> >>>> ga...@gmail.com T: 972-9-7692230 >>>> <https://red.ht/sig> >>>> >>> >>> >>> _______________________________________________ >>> Infra mailing list -- infra@ovirt.org >>> To unsubscribe send an email to infra-le...@ovirt.org >>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/ >>> oVirt Code of Conduct: >>> https://www.ovirt.org/community/about/community-guidelines/ >>> List Archives: >>> https://lists.ovirt.org/archives/list/infra@ovirt.org/message/QMRM2INTCRDPT7GPF24EEPNJAZRP4CUQ/ >>> >>> >> _______________________________________________ > Infra mailing list -- infra@ovirt.org > To unsubscribe send an email to infra-le...@ovirt.org > Privacy Statement: https://www.ovirt.org/site/privacy-policy/ > oVirt Code of Conduct: > https://www.ovirt.org/community/about/community-guidelines/ > List Archives: > https://lists.ovirt.org/archives/list/infra@ovirt.org/message/45YRF2S6Q4Q4RVGHPZFM25RCN2AO2G7D/ >
_______________________________________________ Infra mailing list -- infra@ovirt.org To unsubscribe send an email to infra-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/infra@ovirt.org/message/NDEZENRJFGE5E47KMH4OD5N573KH7IGL/