[
https://issues.apache.org/jira/browse/SINGA-397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16658514#comment-16658514
]
Ngin Yun Chuan commented on SINGA-397:
--------------------------------------
Hi Junyou,
It is possible that in scenario `1.png`, the train job is still running, so it
appears as the system halted. I edit `client-usage.py` to emit more feedback at
this stage.
For `2.png`, it seems like you have not started a train job nor added any
models before you tried to start an inference job. What were exact sequence of
commands you ran after system setup? I will add better error messages to notify
users in this case.
Check the sequence of commands in `client-usage.py` for the full
train-inference flow. Model developers have to add models for a specific task
first, then app developers can start a train job, and only after the train job
has completed, an inference job associated to that train job can be started.
Let me know which parts are more confusing, so that I can add better
documentation for those parts.
> Deployment Issues for Rafiki
> ----------------------------
>
> Key: SINGA-397
> URL: https://issues.apache.org/jira/browse/SINGA-397
> Project: Singa
> Issue Type: Test
> Environment: ubuntu 18, local machine
> Reporter: Junyou Chen
> Priority: Major
> Attachments: 1.png, 2.png
>
>
> After I deployed Rafiki on my own machine by following the Quick Setup
> section in the Developer Guide, I tried to run the $client-usage.py$ but
> failed(as picture 1 shows. system halted with no feedback). Also, I followed
> several examples in the User Guide, but there are still some problems(shown
> in picture 2).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)