Hi Kevin,

Thank you so much for running this release.

Trying to follow the helm install stage, but notebook controller is failed
to start.

I downloaded RC1 source code, and follow the guidance:
https://github.com/apache/submarine/blob/master/docs/userdocs/k8s/helm.md

The pods in my system:

NAMESPACE     NAME                                              READY
STATUS             RESTARTS   AGE
default       notebook-controller-deployment-58797bdd75-9fstx   0/1
CrashLoopBackOff   7          12m
default       pytorch-operator-75fd845678-dpc52                 1/1
Running            0          12m
default       submarine-database-54776644c6-x2zrm               1/1
Running            0          12m
default       submarine-server-5d846f7b4f-m2fw8                 1/1
Running            0          12m
default       submarine-traefik-d55c689b5-cq6db                 1/1
Running            0          12m
default       tf-job-operator-598686fd84-2fwt9                  1/1
Running            0          12m

And notebook-controller pod has the following information:  (kubectl
describe pods notebook-controller-deployment-58797bdd75-9fstx)

Events:
  Type     Reason     Age                   From                     Message
  ----     ------     ----                  ----                     -------
  Normal   Scheduled  12m                   default-scheduler
 Successfully assigned
default/notebook-controller-deployment-58797bdd75-9fstx to docker-desktop
  Normal   Pulling    12m                   kubelet, docker-desktop
 Pulling image "apache/submarine:notebook-controller-v1.1.0-g253890cb"
  Normal   Pulled     12m                   kubelet, docker-desktop
 Successfully pulled image
"apache/submarine:notebook-controller-v1.1.0-g253890cb"
  Normal   Pulled     11m (x4 over 12m)     kubelet, docker-desktop
 Container image "apache/submarine:notebook-controller-v1.1.0-g253890cb"
already present on machine
  Normal   Created    11m (x5 over 12m)     kubelet, docker-desktop
 Created container manager
  Normal   Started    11m (x5 over 12m)     kubelet, docker-desktop
 Started container manager
  Warning  BackOff    2m45s (x51 over 12m)  kubelet, docker-desktop
 Back-off restarting failed container

Logs: (kubectl log notebook-controller-deployment-58797bdd75-9fstx)

2020-11-02T17:11:53.159Z ERROR setup unable to create controller
{"controller": "Notebook", "error": "no matches for kind \"Notebook\" in
version \"kubeflow.org/v1beta1\""}
github.com/go-logr/zapr.(*zapLogger).Error
/go/pkg/mod/github.com/go-logr/zapr@v0.1.0/zapr.go:128
main.main
/workspace/notebook-controller/main.go:76
runtime.main
/usr/local/go/src/runtime/proc.go:200

I guess it might be version of K8s, I'm using DockerDesktop:

kubectl version
Client Version: version.Info{Major:"1", Minor:"15", GitVersion:"v1.15.6",
GitCommit:"7015f71e75f670eb9e7ebd4b5749639d42e20079", GitTreeState:"clean",
BuildDate:"2019-11-13T11:20:18Z", GoVersion:"go1.12.12", Compiler:"gc",
Platform:"darwin/amd64"}
Server Version: version.Info{Major:"1", Minor:"16+",
GitVersion:"v1.16.6-beta.0",
GitCommit:"e7f962ba86f4ce7033828210ca3556393c377bcc", GitTreeState:"clean",
BuildDate:"2020-01-15T08:18:29Z", GoVersion:"go1.13.5", Compiler:"gc",
Platform:"linux/amd64"}

I'm not sure what I should do now? Last time when we release 0.4.0, we
verified 1.14, 1.15, 1.16 should all work.

https://github.com/apache/submarine/blob/master/docs/userdocs/k8s/README.md

What is our recommendation for now?

Thanks,
Wangda





On Fri, Oct 30, 2020 at 8:47 PM Wanqiang Ji <j...@apache.org> wrote:

> +1 for this RC1. Thanks Kevin drive this release and everyone for the great
> works.
> I've done the following tests:
> 1. Install the helm charts against the minikube
> 2. Install the helm charts against the kind
> 3. Check the UI's features, create the notebook/experiment and run work
>
> BR,
> Wanqiang Ji
>
>
> On Wed, Oct 28, 2020 at 4:52 PM Zhankun Tang <zt...@apache.org> wrote:
>
> > Thanks for the great efforts! Kevin!
> > I've done the below testing
> > 1. Verify the signatures (I signed Kevin's key and updated the KEYS file)
> > 2. Build from source
> > 3. Install the helm charts against Docker desktop k8s (v1.14.8)
> > 4. Check basic UI's experiment and notebook features
> >
> > And I like the built-in examples in our notebook image. It's
> > straightforward for beginners to get familiar with our SDK.
> > One minor suggestion is that we can add a link in parent readme
> > <
> >
> https://github.com/apache/submarine/blob/master/docs/userdocs/k8s/README.md
> > >
> > to our notebook.md doc. But this is not a blocker, we can improve docs
> > later since it's in GitHub for now.
> >
> > I'll give my *+1(binding)* to this RC1.
> >
> > @Wei-Chiu Chuang <weic...@apache.org> BTW, I can download the
> > "apache/submarine:mini-0.5.0-RC1" image on my laptop.
> >
> > BR,
> > Zhankun
> >
> > On Mon, 26 Oct 2020 at 10:21, Wei-Chiu Chuang
> <weic...@cloudera.com.invalid
> > >
> > wrote:
> >
> > > Curious -- I see mini-0.5.0-RC1 and RC0 docker images, however, the
> > > operator, server, database and jupyter-notebook docker images are all
> > > tagged version 0.5.0. Was this intentional?
> > >
> > > I kept getting disk full error trying to pull the RC1 minisubmarine
> > > image: docker pull apache/submarine:mini-0.5.0-RC1
> > >
> > > failed to register layer: Error processing tar file(exit status 1):
> write
> > >
> > >
> >
> /home/yarn/submarine/tf2-venv/lib/python3.6/site-packages/tensorflow_core/python/_pywrap_tensorflow_internal.so:
> > > no space left on device
> > >
> > > I've already run docker system prune to remove unnecessary volumes and
> > > images, and my local disk has more than 200GB of space.
> > >
> > > After the release, we should update the release information in the
> Apache
> > > system: https://reporter.apache.org/addrelease.html?submarine (I
> believe
> > > the access is restricted to PMCs)
> > >
> > > The docker images should be regarded as convenience binary, and our
> vote
> > is
> > > for the source code. So if you need to update the images, no need to
> > > bote again.
> > >
> > > On Sun, Oct 25, 2020 at 12:46 PM Wei-Chiu Chuang <weic...@apache.org>
> > > wrote:
> > >
> > > > Not a blocker, but I do notice our doc has lots of "TODO" and "FIXME"
> > and
> > > > then realized our doc is a WIP from SUBMARINE-518.
> > > > I'd like to find a time to contribute to the docs later.
> > > >
> > > > On Thu, Oct 22, 2020 at 4:16 AM Kevin Su <pings...@gmail.com> wrote:
> > > >
> > > >> Hi folks,
> > > >>
> > > >>
> > > >> Thanks to everyone's help on this release. Special thanks to Wangda,
> > > >>
> > > >> Zhankun, Xun, Wei-Chiu, Wanqiang, Ryan, Manikandan, and JohnTing!
> > > >>
> > > >> I've created a release candidate (RC1) for submarine 0.5.0. The
> > > >> highlighted
> > > >>
> > > >> features are as follows:
> > > >>
> > > >> 1. Submarine Experiments: Redefined the experiment spec, sync up
> code
> > > from
> > > >> Git, it could be https and ssh
> > > >>
> > > >> 2. Predefined experiment template: Register A experiment template
> and
> > > >> submit related parameter to run an experiment using Rest API
> > > >>
> > > >> 3. Environment profile: Users could easily manage their docker image
> > and
> > > >> conda environment
> > > >>
> > > >> 4. Jupyter Notebook: Spawn a jupyter notebook using Rest API, and
> > > execute
> > > >> ML code on K8s, or submit an experiment to submarine server
> > > >>
> > > >> 5. Submarine Workbench UI: CRUD Experiment, Environment, Notebook
> > > through
> > > >> the UI
> > > >>
> > > >> The RC tag in git is here:
> > > >> https://github.com/apache/submarine/releases/tag/release-0.5.0-RC1
> > > >> The RC release artifacts are available at:
> > > >> http://home.apache.org/~pingsutw/submarine-0.5.0-RC1
> > > >>
> > > >> The mini-submarine image is here:
> > > >>
> > > >>
> > >
> >
> https://hub.docker.com/layers/apache/submarine/mini-0.5.0-RC1/images/sha256-b8bc864a9a6409361de96d93e467c6458c96f6d2d85c74639a201b1c1b9af3a0?context=explore
> > > >>
> > > >> The server image is here:
> > > >>
> > > >>
> > >
> >
> https://hub.docker.com/layers/apache/submarine/server-0.5.0/images/sha256-3c197696a773cebf3409acc5ed89504e9f56240a1748b107031bf32a4ba79e40?context=explore
> > > >>
> > > >> The database image is here:
> > > >>
> > > >>
> > >
> >
> https://hub.docker.com/layers/apache/submarine/database-0.5.0/images/sha256-fcf72289e0aa46e83fc8e65c8aca79be4bba96ec9813d54568e5679925cdc94f?context=explore
> > > >>
> > > >> The Jupyter Notebook image is here:
> > > >>
> > > >>
> > >
> >
> https://hub.docker.com/layers/apache/submarine/jupyter-notebook-0.5.0/images/sha256-1e05cdd3c814063b3cac9de12bdecd70475d38d708c06794c2d3b55ef97de82a?context=explore
> > > >>
> > > >> The Maven staging repository is here:
> > > >>
> > >
> >
> https://repository.apache.org/content/repositories/orgapachesubmarine-1015
> > > >>
> > > >> My public key is here:
> > > >> https://dist.apache.org/repos/dist/release/submarine/KEYS
> > > >>
> > > >> *This vote will run for 7 days, ending on Oct 29, 2020, at 11:59 pm
> > > PST.*
> > > >>
> > > >> For the testing, I have verified the
> > > >>
> > > >> 1. Build from source, Run the mnist on Hadoop
> > > >>
> > > >> 2. Example with mini-submarine(both local and remote mode)
> > > >>
> > > >> 3. Verified the experiment operations to K8s by Submarine Server
> REST
> > > and
> > > >> PySubmarine.
> > > >>
> > > >> 4. Workbench UI (experiment, environment, notebook)
> > > >>
> > > >> 5. HTTP sync code in the experiment
> > > >>
> > > >> 6. Environment profile REST API
> > > >>
> > > >> 7. Notebook REST API
> > > >>
> > > >>
> > > >> Please follow the document to test these features.
> > > >> *
> > > >>
> > >
> >
> https://github.com/apache/submarine/tree/master/dev-support/mini-submarine
> > > >> *
> > >
> https://github.com/apache/submarine/blob/master/docs/user-guide-home.md
> > > >>
> > > >> My +1 to start. Thanks!
> > > >>
> > > >> BR,
> > > >> Kevin Su
> > > >>
> > > >
> > >
> >
>

Reply via email to