On Mon, 17 Sep 2018, 16:25 Ravi Shankar Nori, <rn...@redhat.com> wrote:

> host-deploy is still broken on master fc28
>

Yes, there are multiple issues on FC28, but the question is if this fixed
OST on CentOS?


> On Mon, Sep 17, 2018 at 8:01 AM, Yuval Turgeman <yturg...@redhat.com>
> wrote:
>
>> I'm pretty sure I verified this on el7 as well, i'll check again, but
>> thinking about it, tar will stop when it gets to the first empty block, so
>> if the record size on the engine's side is large and the end is filled with
>> zeros, -b1 will make it stop at the first empty block so the next read on
>> the host's side would get the trailing zeros which is what otopi reads.
>> Btw, it could be a problem with deployed el7 systems as well, if for any
>> reason the default on the host is set to something that is more than 20
>> blocks (can be set with export TAR_BLOCKING_FACTOR for the root account on
>> the host side).
>> It's ok to revert the patch to fix the regression, but I don't see any
>> other way other than -b1... perhaps add a `cat -` after to just read until
>> EOF or something, or have otopi strip the input.
>>
>> On Mon, Sep 17, 2018 at 2:30 PM, Galit Rosenthal <grose...@redhat.com>
>> wrote:
>>
>>> Didi,
>>>
>>> Is this what you are looking for
>>> https://ovirt-jira.atlassian.net/browse/OVIRT-2259
>>> ?
>>> Galit
>>>
>>> On Mon, Sep 17, 2018 at 1:54 PM Dafna Ron <d...@redhat.com> wrote:
>>>
>>>> I think that in ovirt-engine we currently only build to centos.
>>>> since we have not had an engine build for 2 weeks (on master) I think
>>>> we should merge and worry about fc28 once it would be relevant.
>>>>
>>>> the failure we have now could be another regression missed since the
>>>> project has been broken for two weeks.
>>>>
>>>> Thanks,
>>>> Dafna
>>>>
>>>>
>>>>
>>>> On Mon, Sep 17, 2018 at 10:30 AM Yedidyah Bar David <d...@redhat.com>
>>>> wrote:
>>>>
>>>>> On Mon, Sep 17, 2018 at 11:49 AM Dafna Ron <d...@redhat.com> wrote:
>>>>> >
>>>>> > Didi, Marin, any update on the patch?
>>>>>
>>>>> Yes - it passed. Actually failed, but only after host-deploy:
>>>>>
>>>>>
>>>>> https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-tests_manual/3189/
>>>>>
>>>>> I'd rather not merge it as-is, because it will break fedora.
>>>>>
>>>>> If someone can have a look at the code generating the tar file, and
>>>>> can see if
>>>>> it's easy to make it work well for both centos and fedora, perhaps by
>>>>> explicitly
>>>>> setting all relevant params to some reasonable values, great.
>>>>> Otherwise, I guess
>>>>> we can merge for now, as fedora is still not supported anyway.
>>>>>
>>>>> Thanks,
>>>>>
>>>>> >
>>>>> >
>>>>> > On Sun, Sep 16, 2018 at 11:09 AM Yedidyah Bar David <d...@redhat.com>
>>>>> wrote:
>>>>> >>
>>>>> >> On Sun, Sep 16, 2018 at 12:53 PM Yedidyah Bar David <
>>>>> d...@redhat.com> wrote:
>>>>> >> >
>>>>> >> > On Fri, Sep 14, 2018 at 6:06 PM Martin Perina <mper...@redhat.com>
>>>>> wrote:
>>>>> >> > >
>>>>> >> > >
>>>>> >> > >
>>>>> >> > > On Fri, Sep 14, 2018 at 4:51 PM, Ravi Shankar Nori <
>>>>> rn...@redhat.com> wrote:
>>>>> >> > >>
>>>>> >> > >> I see the same errors on my dev env. From the logs attached by
>>>>> Andrej the response received by otopi has a bunch of null chars before the
>>>>> actual response CONFIRM DEPLOY_PROCEED=yes
>>>>> >> > >>
>>>>> >> > >>
>>>>> >> > >>
>>>>> >> > >> 2018-09-14 15:49:23,018+0200 DEBUG
>>>>> otopi.plugins.otopi.dialog.machine dialog.__logString:204 DIALOG:SEND
>>>>>  ### Response is CONFIRM DEPLOY_PROCEED=yes|no or ABORT DEPLOY_PROCEED
>>>>> >> > >>
>>>>> >> > >> ^@^@^@^@^@^@^@^@^@CONFIRM DEPLOY_PROCEED=yes
>>>>> >> > >
>>>>> >> > >
>>>>> >> > > Didi/Sandro, could you please take a look? Below error seems
>>>>> like some issue in otopi, where an error is raised when handling binary
>>>>> input:
>>>>> >> >
>>>>> >> > Not sure the issue is "binary input" in general, but simply
>>>>> illegal
>>>>> >> > input. The prompt expects, as it says, one of these 3 replies:
>>>>> >> >
>>>>> >> > CONFIRM DEPLOY_PROCEED=yes
>>>>> >> > CONFIRM DEPLOY_PROCEED=no
>>>>> >> > ABORT DEPLOY_PROCEED
>>>>> >> >
>>>>> >> > Instead, judging from the file supplied by Andrej, it gets from
>>>>> the engine:
>>>>> >> > <7169 null bytes>CONFIRM DEPLOY_PROCEED=yes
>>>>> >> >
>>>>> >> > So either the engine now sends, for some reason, 7169 null bytes,
>>>>> in
>>>>> >> > this response, or there is some low-level change causing this to
>>>>> be
>>>>> >> > eventually supplied to otopi - a change in apache-sshd, openssh,
>>>>> some
>>>>> >> > library, the kernel, no idea.
>>>>> >> >
>>>>> >> > Well, thinking a bit, I have a wild guess: Perhaps it's related
>>>>> to the
>>>>> >> > patch introduced recently to change the tar blocking?
>>>>> >>
>>>>> >> https://gerrit.ovirt.org/94357
>>>>> >>
>>>>> >> I am leaving soon, perhaps someone can try the manual job with the
>>>>> >> result of the check-patch job for above patch, to see if it fixes.
>>>>> >> Otherwise I'll do this tomorrow.
>>>>> >>
>>>>> >> >
>>>>> >> > >
>>>>> >> > >
>>>>> >> > > 2018-09-14 15:49:23,032+0200 DEBUG otopi.context
>>>>> context._executeMethod:143 method exception
>>>>> >> > > Traceback (most recent call last):
>>>>> >> > >   File "/usr/lib/python2.7/site-packages/otopi/context.py",
>>>>> line 133, in _executeMethod
>>>>> >> > >     method['method']()
>>>>> >> > >   File
>>>>> "/tmp/ovirt-O6CfS4aUHI/otopi-plugins/ovirt-host-deploy/core/misc.py", line
>>>>> 87, in _confirm
>>>>> >> > >     prompt=True,
>>>>> >> > >   File
>>>>> "/tmp/ovirt-O6CfS4aUHI/otopi-plugins/otopi/dialog/machine.py", line 478, 
>>>>> in
>>>>> confirm
>>>>> >> > >     code=opcode,
>>>>> >> > >
>>>>> >> > >
>>>>> >> > >>
>>>>> >> > >> On Fri, Sep 14, 2018 at 10:44 AM, Dafna Ron <d...@redhat.com>
>>>>> wrote:
>>>>> >> > >>>
>>>>> >> > >>> if you run it with mock you would remove any environmental
>>>>> conditions that can effect the outcome so I recommend using mock
>>>>> >> > >>>
>>>>> >> > >>>
>>>>> >> > >>> On Fri, Sep 14, 2018 at 3:32 PM, Martin Perina <
>>>>> mper...@redhat.com> wrote:
>>>>> >> > >>>>
>>>>> >> > >>>>
>>>>> >> > >>>>
>>>>> >> > >>>> On Fri, Sep 14, 2018 at 3:49 PM, Dafna Ron <d...@redhat.com>
>>>>> wrote:
>>>>> >> > >>>>>
>>>>> >> > >>>>> did you use mock to reproduce?
>>>>> >> > >>>>
>>>>> >> > >>>>
>>>>> >> > >>>> No, just run_suite under myself
>>>>> >> > >>>>>
>>>>> >> > >>>>>
>>>>> >> > >>>>> On Fri, Sep 14, 2018 at 2:39 PM, Martin Perina <
>>>>> mper...@redhat.com> wrote:
>>>>> >> > >>>>>>
>>>>> >> > >>>>>> Hi,
>>>>> >> > >>>>>>
>>>>> >> > >>>>>> the problem is that we haven't fetched the temporary
>>>>> host-deploy log from /tmp directory, so we don't know which string that
>>>>> host-deploy process sent to engine is causing that issue. I tried to
>>>>> reproduce on my local machine, but I was unable to reproduce it,
>>>>> 002_bootstrap phase finished successfully (other phases are still 
>>>>> running).
>>>>> >> > >>>>>>
>>>>> >> > >>>>>> So if anyone is able to reproduce, please try to fetch
>>>>> host-deploy log from /tmp directory after the error is raised and share 
>>>>> it.
>>>>> >> > >>>>>>
>>>>> >> > >>>>>> Thanks
>>>>> >> > >>>>>>
>>>>> >> > >>>>>> Martin
>>>>> >> > >>>>>>
>>>>> >> > >>>>>>
>>>>> >> > >>>>>> On Fri, Sep 14, 2018 at 1:52 PM, Dafna Ron <
>>>>> d...@redhat.com> wrote:
>>>>> >> > >>>>>>>
>>>>> >> > >>>>>>> Full logs can be found here:
>>>>> >> > >>>>>>>
>>>>> >> > >>>>>>>
>>>>> https://jenkins.ovirt.org/view/Change%20queue%20jobs/job/ovirt-master_change-queue-tester/10307/artifact/upgrade-from-release-suite.el7.x86_64/test_logs/upgrade-from-release-suite-master/post-002_bootstrap.py/
>>>>> >> > >>>>>>>
>>>>> >> > >>>>>>> On Fri, Sep 14, 2018 at 12:48 PM, Dafna Ron <
>>>>> d...@redhat.com> wrote:
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> Hi,
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> The previous regression was resolved and we now have a
>>>>> new regression.
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> I don't think that the reported change is related so can
>>>>> someone from ovirt-engine take a look?
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> The failure is add host on the upgrade suite.
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> Please note that we have not had an engine-ovirt build
>>>>> for over 10 days due to several consecutive regressions and I would ask 
>>>>> you
>>>>> to stop merging until we can stabilize the project and have a new package
>>>>> of engine.
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> error:
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> 2018-09-14 05:51:07,670-04 INFO
>>>>> [org.ovirt.engine.core.uutils.ssh.SSHDialog]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] SSH execute
>>>>> 'root@lago-upgrade-from-release-suite-master-host-0' 'umask 0077;
>>>>> MYTMP="$(TMPDIR="${OVIRT_TMPDIR}" mktemp -d -t ovirt-XXXXXXXXXX)"; trap
>>>>> "chmod -R u+rwX \"${MYTMP}\" > /dev/null 2>&1; rm -fr \"${MYTMP}\" >
>>>>> /dev/null 2>&1" 0; tar -b1 --warning=no-timestamp -C "${MYTMP}" -x &&
>>>>> "${MYTMP}"/ovirt-host-deploy DIALOG/dialect=str:machine
>>>>> DIALOG/customization=bool:True'
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,550-04 INFO
>>>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>>>>> (VdsDeploy) [5c91fcbd] EVENT_ID: VDS_INSTALL_IN_PROGRESS(509), Installing
>>>>> Host lago-upgrade-from-release-suite-master-host-0. Stage: Initializing.
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,565-04 INFO
>>>>> [org.ovirt.engine.core.utils.transaction.TransactionSupport] (VdsDeploy)
>>>>> [5c91fcbd] transaction rolled back
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,574-04 ERROR
>>>>> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy) 
>>>>> [5c91fcbd]
>>>>> Error during deploy dialog
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,578-04 ERROR
>>>>> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Error during host
>>>>> lago-upgrade-from-release-suite-master-host-0 install
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,586-04 ERROR
>>>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] EVENT_ID:
>>>>> VDS_INSTALL_IN_PROGRESS_ERROR(511), An error has occurred during
>>>>> installation of Host lago-upgrade-from-release-suite-master-host-0:
>>>>> CallableStatementCallback; SQL [{call insertauditlog(?, ?, ?, ?, ?, ?, ?,
>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?,
>>>>> ?, ?)}ERROR: invalid byte sequence for encoding "UTF8": 0x00; nested
>>>>> exception is org.postgresql.util.PSQLException: ERROR: invalid byte
>>>>> sequence for encoding "UTF8": 0x00.
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,586-04 ERROR
>>>>> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Error during host
>>>>> lago-upgrade-from-release-suite-master-host-0 install, preferring first
>>>>> exception: CallableStatementCallback; SQL [{call insertauditlog(?, ?, ?, 
>>>>> ?,
>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?,
>>>>> ?, ?, ?, ?, ?)}ERROR: invalid byte sequence for encoding "UTF8": 0x00;
>>>>> nested exception is org.postgresql.util.PSQLException: ERROR: invalid byte
>>>>> sequence for encoding "UTF8": 0x00
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,586-04 ERROR
>>>>> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Host installation
>>>>> failed for host 'e475e93a-63b3-4573-b242-162c2ed864f0',
>>>>> 'lago-upgrade-from-release-suite-master-host-0': 
>>>>> CallableStatementCallback;
>>>>> SQL [{call insertauditlog(?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?,
>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)}ERROR: invalid byte
>>>>> sequence for encoding "UTF8": 0x00; nested exception is
>>>>> org.postgresql.util.PSQLException: ERROR: invalid byte sequence for
>>>>> encoding "UTF8": 0x00
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,615-04 INFO
>>>>> [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] START,
>>>>> SetVdsStatusVDSCommand(HostName =
>>>>> lago-upgrade-from-release-suite-master-host-0,
>>>>> SetVdsStatusVDSCommandParameters:{hostId='e475e93a-63b3-4573-b242-162c2ed864f0',
>>>>> status='InstallFailed', nonOperationalReason='NONE',
>>>>> stopSpmFailureLogged='false', maintenanceReason='null'}), log id: 146cdc08
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,626-04 INFO
>>>>> [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] FINISH,
>>>>> SetVdsStatusVDSCommand, return: , log id: 146cdc08
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,639-04 ERROR
>>>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] EVENT_ID:
>>>>> VDS_INSTALL_FAILED(505), Host 
>>>>> lago-upgrade-from-release-suite-master-host-0
>>>>> installation failed. CallableStatementCallback; SQL [{call
>>>>> insertauditlog(?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?,
>>>>> ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)}ERROR: invalid byte sequence for
>>>>> encoding "UTF8": 0x00; nested exception is
>>>>> org.postgresql.util.PSQLException: ERROR: invalid byte sequence for
>>>>> encoding "UTF8": 0x00.
>>>>> >> > >>>>>>>> 2018-09-14 05:51:08,652-04 INFO
>>>>> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [5c91fcbd] Lock freed to object
>>>>> 'EngineLock:{exclusiveLocks='[e475e93a-63b3-4573-b242-162c2ed864f0=VDS]',
>>>>> sharedLocks=''}'
>>>>> >> > >>>>>>>> 2018-09-14 05:51:37,996-04 INFO
>>>>> [org.ovirt.engine.core.bll.quota.QuotaManager]
>>>>> (EE-ManagedThreadFactory-engineScheduled-Thread-44) [] Quota Cache 
>>>>> updated.
>>>>> (19 msec)
>>>>> >> > >>>>>>>> (END)
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>> Thanks,
>>>>> >> > >>>>>>>> Dafna
>>>>> >> > >>>>>>>>
>>>>> >> > >>>>>>>
>>>>> >> > >>>>>>
>>>>> >> > >>>>>>
>>>>> >> > >>>>>>
>>>>> >> > >>>>>> --
>>>>> >> > >>>>>> Martin Perina
>>>>> >> > >>>>>> Associate Manager, Software Engineering
>>>>> >> > >>>>>> Red Hat Czech s.r.o.
>>>>> >> > >>>>>
>>>>> >> > >>>>>
>>>>> >> > >>>>
>>>>> >> > >>>>
>>>>> >> > >>>>
>>>>> >> > >>>> --
>>>>> >> > >>>> Martin Perina
>>>>> >> > >>>> Associate Manager, Software Engineering
>>>>> >> > >>>> Red Hat Czech s.r.o.
>>>>> >> > >>>
>>>>> >> > >>>
>>>>> >> > >>
>>>>> >> > >
>>>>> >> > >
>>>>> >> > >
>>>>> >> > > --
>>>>> >> > > Martin Perina
>>>>> >> > > Associate Manager, Software Engineering
>>>>> >> > > Red Hat Czech s.r.o.
>>>>> >> >
>>>>> >> >
>>>>> >> >
>>>>> >> > --
>>>>> >> > Didi
>>>>> >>
>>>>> >>
>>>>> >>
>>>>> >> --
>>>>> >> Didi
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Didi
>>>>>
>>>> _______________________________________________
>>>> Infra mailing list -- infra@ovirt.org
>>>> To unsubscribe send an email to infra-le...@ovirt.org
>>>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
>>>> oVirt Code of Conduct:
>>>> https://www.ovirt.org/community/about/community-guidelines/
>>>> List Archives:
>>>> https://lists.ovirt.org/archives/list/infra@ovirt.org/message/CG2IYPXSSEFTL6XCN72JHUSWOUY7QRSA/
>>>>
>>>
>>>
>>> --
>>>
>>> GALIT ROSENTHAL
>>>
>>> SOFTWARE ENGINEER
>>>
>>> Red Hat
>>>
>>> <https://www.redhat.com/>
>>>
>>> ga...@gmail.com    T: 972-9-7692230
>>> <https://red.ht/sig>
>>>
>>
>>
>> _______________________________________________
>> Infra mailing list -- infra@ovirt.org
>> To unsubscribe send an email to infra-le...@ovirt.org
>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
>> oVirt Code of Conduct:
>> https://www.ovirt.org/community/about/community-guidelines/
>> List Archives:
>> https://lists.ovirt.org/archives/list/infra@ovirt.org/message/QMRM2INTCRDPT7GPF24EEPNJAZRP4CUQ/
>>
>>
>
_______________________________________________
Infra mailing list -- infra@ovirt.org
To unsubscribe send an email to infra-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/infra@ovirt.org/message/45YRF2S6Q4Q4RVGHPZFM25RCN2AO2G7D/

Reply via email to