Hi Gerard, I did a few checks today with following outcome:
* (HPE15) installation with VIRT_NUMBER=1 does not pass "reboot_hosts" stage, it simply terminates after this messages is shown; that's why I was not able to see anything else yesterday * (HPE15) installation with VIRT_NUMBER=2 went correctly (i.e. I see the "Installation Complete!" banner :-)) * I tried to retest it (with VIRT_NUMBER=2) at "clean" server HPE16 and I hit the same issue as you did (i.e. RuntimeError: OS installation timeout) I'm wondering if it can be related to the OS state. May be we should firstly try to do an apt update & upgrade (may be even followed by the sever reboot), before execution of compass installation. I'll give it a try at another server. One additional question. How did you "installed" opnfv-clean binary at your server? I've used alien to create a deb package from http://artifacts.opnfv.org/apex/master/opnfv-apex-common-2.1-20160306.noarch.rpm. After its installation at HPE15, I was able to execute opnfv-clean, but I doubt that it works. So I'm wondering if it is enough to install this package or other two rpms/debs are also required (in that case I would expect that such dependency would be enforced by package spec file...). BTW, I did a small updates to your compass install script. May be it's time to comment out one of the options, so it is directly usable for OPNFV deployment at LaaS server. What do you think? Cheers, Martin Od: gerard.d...@wipro.com <gerard.d...@wipro.com> Odesláno: středa 1. srpna 2018 23:20 Komu: opnfv-tech-discuss@lists.opnfv.org Kopie: huangxiangyu; Klozik Martin Předmět: RE: [opnfv-tech-discuss] [compass4nfv][auto] Compass4nfv on LaaS x86 server Hi, Thanks Martin and Harry for your feedback ! On hpe15, the log file doesn't end with the string "compass deploy success", so the process may not have completed correctly. When you reach the point of "reboot_hosts do nothing", it takes quite a while (30-60 minutes ?) to get to the conclusion. Is it possible you stopped the process before it finished ? Also, I didn't see the setting of "VIRT_NUMBER" in the "deploy.sh" file in "/home/opnfv/compass4nfv", but then again it's possible you changed it afterwards. One directory up ("/home/opnfv"), the file "deply.sh.log" ends in "launch_compass failed". Out of curiosity, I tried option 1 on hpe15 (see script in ~/auto) (so: master branch, build+deploy, NOHA scenario, VIRT_NUMBER=2). This one failed with the "get_installing_progress" error. You can check the logs in /opt/opnfv-compass/compass4nfv, and in ~/auto). Best regards, Gerard From: Klozik Martin [mailto:martin.klo...@tieto.com] Sent: Wednesday, August 1, 2018 4:50 AM To: Gerard Damm (Product Engineering Service) <gerard.d...@wipro.com>; opnfv-tech-discuss@lists.opnfv.org Cc: huangxiangyu <huangxiang...@huawei.com> Subject: Re: [opnfv-tech-discuss] [compass4nfv][auto] Compass4nfv on LaaS x86 server ** This mail has been sent from an external source. Treat hyperlinks and attachments in this email with caution** Hi Gerard, I tried to follow your installation procedure (i.e. point 1 below) at HPE15 with the only difference, i.e. VIRT_NUMBER=1. I've not observed the same error as you, but some (probably not fatal) assertion error (search log for client.py). Anyhow the installation process seemed to finish somehow. I plan to have a more detailed look at the machine tomorrow. Feel free to have a look yourself, I'll forward you appropriate credentials. The installation log (output of deploy.sh) is available at /home/opnfv/compass4nfv/deploy.log Have a nice day, Martin Od: opnfv-tech-discuss@lists.opnfv.org <opnfv-tech-discuss@lists.opnfv.org> za uživatele Gerard Damm <gerard.d...@wipro.com> Odesláno: středa 1. srpna 2018 2:06:43 Komu: opnfv-tech-discuss@lists.opnfv.org Kopie: huangxiangyu Předmět: Re: [opnfv-tech-discuss] [compass4nfv][auto] Compass4nfv on LaaS x86 server Thanks for pointing out that other possible issue. The instructions I use as a reference: https://docs.opnfv.org/en/latest/submodules/compass4nfv/docs/release/installation/index.html My spelled out version/script for these instructions (case of virtual deployment on Ubuntu): https://wiki.opnfv.org/display/AUTO/Script%3A+Compass4nfv I did 3 more attempts on hpe32, and unfortunately they also failed: 1) tarball 6.2, stable/fraser branch, noha scenario, set VIRT_NUMBER to 5, deploy.sh 2) quickstart.sh (i.e. in master branch, build.sh, ha scenario, deploy.sh) 3) master branch, build.sh, noha scenario, set VIRT_NUMBER to 5, deploy.sh I got twice with the get_ansible_print error, and once a get_installing_progress error. (details below) At this point, probably the most efficient next step would be for you to try yourself on a LaaS server, write down exactly the sequence of commands you used, so as to find out the missing commands. Then I'll updates my notes, and the compass4nfv docs may also be updated. Best regards, Gerard 1) tarball 6.2, stable/fraser branch, noha scenario, set VIRT_NUMBER to 5, deploy.sh downloaded 6.2 tarball, checked out to stable/fraser, added these lines in deploy.sh (and with noha scenario): export VIRT_NUMBER=5 export VIRT_CPUS=4 export VIRT_MEM=16384 export VIRT_DISK=200G error: Traceback (most recent call last): File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1127, in <module> main() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1122, in main deploy() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1076, in deploy ansible_print = client.get_ansible_print() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 968, in get_ansible_print raise RuntimeError("OS installation timeout") RuntimeError: OS installation timeout + RET=1 + sleep 25 + [[ 1 -eq 0 ]] + /bin/false + exit 1 2) quickstart.sh (i.e. in master branch, build.sh, ha scenario, deploy.sh) error: 2018-07-31 21:57:59,756 p=130 u=root | hostname: host2 2018-07-31 21:57:59,782 p=130 u=root | host=compass-deck,url=/api/clusterhosts/2/state,body={"state": "ERROR"},headers={'Content-type': 'application/json', 'Accept': '*/*', 'X-Auth-Token': '$1$UohR2peC$xirMX8ctPjiZv5d1amTDf/'} 2018-07-31 21:57:59,817 p=130 u=root | notify host status success!!! status=200, body={ "severity": "INFO", "created_at": "2018-07-31 21:37:19", "updated_at": "2018-07-31 21:57:59", "id": 2, "state": "ERROR", "ready": false, "percentage": 0.0, "message": "" } 2018-07-31 21:57:59,818 p=130 u=root | hostname: host1 2018-07-31 21:57:59,845 p=130 u=root | host=compass-deck,url=/api/clusterhosts/1/state,body={"state": "ERROR"},headers={'Content-type': 'application/json', 'Accept': '*/*', 'X-Auth-Token': '$1$F7YoEKlk$1/6TRpRf7crU2U6t8S0lE1'} 2018-07-31 21:57:59,892 p=130 u=root | notify host status success!!! status=200, body={ "severity": "INFO", "created_at": "2018-07-31 21:37:19", "updated_at": "2018-07-31 21:57:59", "id": 1, "state": "ERROR", "ready": false, "percentage": 0.0, "message": "" } Traceback (most recent call last): File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1136, in <module> main() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1131, in main deploy() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1086, in deploy client.get_installing_progress(cluster_id, ansible_print) File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1029, in get_installing_progress _get_installing_progress() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1015, in _get_installing_progress (cluster_id, status, cluster_state) RuntimeError: ('get cluster %s state status %s: %s, error', (1, 200, {u'status': {u'completed_hosts': 0, u'total_hosts': 5, u'installing_hosts': 0, u'failed_hosts': 5}, u'severity': u'ERROR', u'created_at': u'2018-07-31 21:37:19', u'message': u'total 5, installing 0, completed: 0, error 5', u'updated_at': u'2018-07-31 21:57:59', u'state': u'ERROR', u'ready': False, u'percentage': 0.0, u'id': 1})) + RET=1 + sleep 25 + [[ 1 -eq 0 ]] + /bin/false + exit 1 3) master branch, build.sh, noha scenario, set VIRT_NUMBER to 5, deploy.sh error: Traceback (most recent call last): File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1136, in <module> main() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1131, in main deploy() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 1085, in deploy ansible_print = client.get_ansible_print() File "/opt/opnfv-compass/compass4nfv/deploy/client.py", line 977, in get_ansible_print raise RuntimeError("OS installation timeout") RuntimeError: OS installation timeout + RET=1 + sleep 25 + [[ 1 -eq 0 ]] + /bin/false + exit 1 From: huangxiangyu [mailto:huangxiang...@huawei.com] Sent: Tuesday, July 31, 2018 2:05 AM To: Gerard Damm (Product Engineering Service) <gerard.d...@wipro.com>; opnfv-tech-discuss@lists.opnfv.org Subject: 答复: [opnfv-tech-discuss] [compass4nfv][auto] Compass4nfv on LaaS x86 server ** This mail has been sent from an external source ** Hi Gerard This Error comes from the missing of variable VIRT_NUMBER. Because in the script, you are using noha scenario (os-nosdn-nofeature-noha.yml) which contains only one controller and one compute. Compass will expect five virtual machines for virtual deploy and user need to export VIRT_NUMBER to specify the number if it’s not 5. BTW, which doc do you follow when testing the compass installer? I will add this operation if there isn’t. Regards Harry The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. www.wipro.com -=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#21696): https://lists.opnfv.org/g/opnfv-tech-discuss/message/21696 Mute This Topic: https://lists.opnfv.org/mt/23992905/21656 Group Owner: opnfv-tech-discuss+ow...@lists.opnfv.org Unsubscribe: https://lists.opnfv.org/g/opnfv-tech-discuss/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-