[jira] [Commented] (SUBMARINE-548) [Umbrella] Predefined Experiment

Kevin Su (Jira) Sun, 02 Aug 2020 04:17:27 -0700


    [ 
https://issues.apache.org/jira/browse/SUBMARINE-548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17169515#comment-17169515
 ]


Kevin Su commented on SUBMARINE-548:
------------------------------------

[~wangda], A few questions, make sure I understand it correctly.
 # If I want to use a predefined template to submit an experiment, we would 
register the *ExperimentTemplateSpec* first.
*ExperimentTemplateSpec* will look like below

{
   template_name: mnist_template
   experiment_spec: 
     meta:
       name: \{{name}}
       namespace: "default"
       framework: "TensorFlow"
       cmd: "python /var/tf_mnist/mnist_with_summaries.py --input 
\{{input.train_data}} --log_dir=/train/log --                 
             learning_rate=\{{training.learning_rate}} 
--batch_size=\{{training.batch_size}}"
       envVars:
         ENV_1: "ENV1"
     environment:
       image: "gcr.io/kubeflow-ci/tf-mnist-with-summaries:1.0"
     spec:
       Ps:
         replicas: 1
         resources: "cpu=1,memory=1024M"
       Worker:
         replicas: 1
         resources: "cpu=1,memory=1024M"  
   parameters: 
     - name: input.train_data
       required: true
       description: > 
         Train data is expected in SVM format, and can be stored in HDFS/S3
     - name: training.learning_rate
       required: true
       description: > 
         Learning rate for mnist model, default is 0.001
     - name: training.batch_size
       required: true
       description: > 
         Integer or `None`. Number of samples per gradient update. If 
unspecified, `batch_size` will default to 32 
} 
Should we add *Author* and *description* in *ExperimentTemplateSpec,* as 
mention in 
[https://github.com/apache/submarine/blob/master/docs/design/experiment-implementation.md#predefined-experiment-template-api-to-run-experiment]

 

       2. After registering, we will submit a list of parameters to run an 
experiment like below
{
   experiment_name: mnist_example
   template_name: mnist_template
   parameters: 
     input.train_data: "hdsf://foo/bar"
     training.learning_rate: 0.01
     training.batch_size: 64
} 
 

IIUC, It's a great proposal that users could very easily submit an experiment 
with a list of parameters, and no need to worry about other system resources 
and the environment.

> [Umbrella] Predefined Experiment
> --------------------------------
>
>                 Key: SUBMARINE-548
>                 URL: https://issues.apache.org/jira/browse/SUBMARINE-548
>             Project: Apache Submarine
>          Issue Type: New Feature
>          Components: experiment template
>            Reporter: JohnTing
>            Assignee: JohnTing
>            Priority: Major
>             Fix For: 0.5.0
>
>
> Predefined-experiment features
>  * [API] Define Experiment API for pre-defined template
>  * [SDK] Add Python SDK to support pre-defined experiment
>  * [UI] Allow Run pre-defined experiment
>  * [API] Define Swagger API for pre-defined template submission
>  * [API] Define Swagger API for pre-defined template registration/delete, etc.
>  * [Sever] Support submit pre-defined template, and translate it to actual job
> [https://github.com/apache/submarine/blob/master/docs/design/experiment-implementation.md#support-predefined-experiment-templates]
> [https://cwiki.apache.org/confluence/display/SUBMARINE/Roadmap]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@submarine.apache.org
For additional commands, e-mail: dev-h...@submarine.apache.org

[jira] [Commented] (SUBMARINE-548) [Umbrella] Predefined Experiment

Reply via email to