[
https://issues.apache.org/jira/browse/SUBMARINE-347?focusedWorklogId=373624&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373624
]
ASF GitHub Bot logged work on SUBMARINE-347:
--------------------------------------------
Author: ASF GitHub Bot
Created on: 17/Jan/20 14:18
Start Date: 17/Jan/20 14:18
Worklog Time Spent: 10m
Work Description: jiwq commented on pull request #153: SUBMARINE-347.
Refine the TF job on K8s document of submarine server
URL: https://github.com/apache/submarine/pull/153
### What is this PR for?
Fix the issue in parsing JobSpec to TFJob
Refine the doc of submarine server
### What type of PR is it?
[Bug Fix | Documentation]
### Todos
### What is the Jira issue?
https://issues.apache.org/jira/browse/SUBMARINE-347
### How should this be tested?
https://travis-ci.com/jiwq/submarine/builds/144941387
### Screenshots (if appropriate)
### Questions:
* Do the license files need an update? No
* Are there breaking changes for older versions? No
* Does this need documentation? No
----------------------------------------------------------------
Issue Time Tracking
-------------------
Worklog Id: (was: 373624)
Remaining Estimate: 0h
Time Spent: 10m
> Refine the TF job on K8s document of submarine server
> -----------------------------------------------------
>
> Key: SUBMARINE-347
> URL: https://issues.apache.org/jira/browse/SUBMARINE-347
> Project: Apache Submarine
> Issue Type: Improvement
> Components: Doc
> Reporter: Zhankun Tang
> Assignee: Wanqiang Ji
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> 1. Deploying the TF-operator by following the document fails:
> {code:java}
> $ kubectl kustomize ./dev-support/k8s/tfjob/operator | kubectl apply -f -
> clusterrole.rbac.authorization.k8s.io/kubeflow-tfjobs-admin created
> clusterrole.rbac.authorization.k8s.io/kubeflow-tfjobs-edit created
> clusterrole.rbac.authorization.k8s.io/kubeflow-tfjobs-view created
> clusterrole.rbac.authorization.k8s.io/tf-job-operator created
> clusterrolebinding.rbac.authorization.k8s.io/tf-job-operator created
> Error from server (NotFound): error when creating "STDIN": namespaces "submarine" not found
> Error from server (NotFound): error when creating "STDIN": namespaces "submarine" not found
> Error from server (NotFound): error when creating "STDIN": namespaces "submarine" not found
> Error from server (NotFound): error when creating "STDIN": namespaces "submarine" not found{code}
> The document should mention that the namespace has to be created first:
> {code:java}
> kubectl create namespace submarine{code}
> 2. The curl command in the document is not correct: the "\" line
> continuations do not work, and the "`" character should be "'".
> {code:java}
> curl -H "Content-Type: application/json" --request POST --data '{"name":"mnist","librarySpec":{"name":"TensorFlow","version":"2.1.0","image":"gcr.io/kubeflow-ci/tf-mnist-with-summaries:1.0","cmd":"python /var/tf_mnist/mnist_with_summaries.py --log_dir=/train/log --learning_rate=0.01 --batch_size=150","envVars":{"ENV_1":"ENV1"}},"submitterSpec":{"type":"k8s","configPath":null,"namespace":"submarine","kind":"TFJob","apiVersion":"kubeflow.org/v1"},"taskSpecs":{"Ps":{"name":"tensorflow","replicas":2,"resources":"cpu=4,memory=2048M,nvidia.com/gpu=1"},"Worker":{"name":"tensorflow","replicas":2,"resources":"cpu=4,memory=2048M"}}}' http://127.0.0.1:8080/api/v1/jobs
> {code}
> 3. The document should include a note telling users how to check the
> running job, e.g. "kubectl get TFJob".
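The three points in the issue description (create the namespace before deploying the operator, send the job spec as valid JSON, and check the job afterwards) could be sketched as a small shell script. This is only an illustration under stated assumptions: the function names and the idea of keeping the payload in a file are hypothetical and do not come from the Submarine documentation, while the kustomize path, the server URL, and the JSON fields are copied from the issue. Reading the payload with `--data @file` is a substitute for the single-line command in the issue; it avoids both the "\" continuation and the quoting problems.

```shell
#!/bin/sh
# Sketch only: function names are illustrative and not part of Submarine.

NAMESPACE="submarine"
SERVER_URL="http://127.0.0.1:8080/api/v1/jobs"

# Create the namespace only if it is missing, so the step can be repeated safely.
ensure_namespace() {
  kubectl get namespace "$NAMESPACE" >/dev/null 2>&1 ||
    kubectl create namespace "$NAMESPACE"
}

# Deploy the TF operator with the command quoted in the issue.
deploy_operator() {
  kubectl kustomize ./dev-support/k8s/tfjob/operator | kubectl apply -f -
}

# Write the job spec from the issue to the file given as $1.
# The quoted heredoc delimiter keeps the shell from expanding anything in the JSON.
write_job_spec() {
  cat > "$1" <<'EOF'
{
  "name": "mnist",
  "librarySpec": {
    "name": "TensorFlow",
    "version": "2.1.0",
    "image": "gcr.io/kubeflow-ci/tf-mnist-with-summaries:1.0",
    "cmd": "python /var/tf_mnist/mnist_with_summaries.py --log_dir=/train/log --learning_rate=0.01 --batch_size=150",
    "envVars": {"ENV_1": "ENV1"}
  },
  "submitterSpec": {
    "type": "k8s",
    "configPath": null,
    "namespace": "submarine",
    "kind": "TFJob",
    "apiVersion": "kubeflow.org/v1"
  },
  "taskSpecs": {
    "Ps": {"name": "tensorflow", "replicas": 2, "resources": "cpu=4,memory=2048M,nvidia.com/gpu=1"},
    "Worker": {"name": "tensorflow", "replicas": 2, "resources": "cpu=4,memory=2048M"}
  }
}
EOF
}

# POST the spec file given as $1; curl reads the request body from the file.
submit_job() {
  curl -H "Content-Type: application/json" --request POST --data @"$1" "$SERVER_URL"
}

# List TFJob resources in the namespace to check on the submitted job.
list_jobs() {
  kubectl get TFJob --namespace "$NAMESPACE"
}
```

With a cluster and the Submarine server available, the functions would be called in order: `ensure_namespace`, `deploy_operator`, `write_job_spec job.json`, `submit_job job.json`, and finally `list_jobs`. Nothing runs until a function is called, so the script can be sourced and the steps tried one at a time.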