This will greatly simplify trying out portable runners: https://beam.apache.org/documentation/runners/flink/#executing-a-beam-pipeline-on-a-flink-cluster
Can't wait for following to disappear from the instructions page: ./gradlew :sdks:python:container:docker On Wed, Sep 4, 2019 at 3:35 PM Thomas Weise <t...@apache.org> wrote: > Awesome, thank you! > > > On Wed, Sep 4, 2019 at 3:22 PM Hannah Jiang <hannahji...@google.com> > wrote: > >> Hi Thomas >> >> I created snapshot images from head as of around 2PM today. >> You can pull images from gcr.io/apache-beam-testing/beam/sdks/snapshot. >> >> Thanks, >> Hannah >> >> On Wed, Sep 4, 2019 at 1:41 PM Thomas Weise <t...@apache.org> wrote: >> >>> Hi Hannah, >>> >>> Thank you, I know how to build the containers locally, but not how to >>> publish them! >>> >>> The cwiki says "Publishing images to gcr.io/beam requires permissions >>> in apache-beam-testing project." >>> >>> Can I get access to the testing project (at least temporarily) and what >>> would I need to setup to run the publish target that is shown on cwiki? >>> >>> Thanks, >>> Thomas >>> >>> >>> On Wed, Sep 4, 2019 at 11:06 AM Hannah Jiang <hannahji...@google.com> >>> wrote: >>> >>>> Hi Thomas >>>> >>>> I haven't uploaded any snapshot images yet. Here is how you can create >>>> one from head. >>>> > cd [...]/beam/ >>>> # For Python >>>> > ./gradlew :sdks:python:container:py{version}:docker *where version >>>> is {2,35,36,37}* >>>> # For Java >>>> > ./gradlew -p sdks/java/container docker >>>> # For Go >>>> > ./gradlew -p sdks/go/container docker >>>> >>>> The 2.15 one is just for testing, not a real 2.15.0, nor a snapshot >>>> from head. >>>> >>>> Please let me know if you have any questions. >>>> Hannah >>>> >>>> On Wed, Sep 4, 2019 at 10:57 AM Thomas Weise <t...@apache.org> wrote: >>>> >>>>> I actually found something in [1], but it is 2.15 unfortunately. >>>>> >>>>> [1] >>>>> https://console.cloud.google.com/gcr/images/apache-beam-testing/GLOBAL/beam/sdks/release/python2.7?gcrImageListsize=30 >>>>> >>>>> On Wed, Sep 4, 2019 at 10:35 AM Thomas Weise <t...@apache.org> wrote: >>>>> >>>>>> Thanks for working on this. Do you happen to have publicly accessible >>>>>> snapshots published for your testing currently (even when the final >>>>>> location isn't sorted out)? >>>>>> >>>>>> I would like to use a 2.16 based Python SDK image for working on my >>>>>> downstream project, but could not find anything in >>>>>> gcr.io/apache-beam-testing/beam/sdks/rc/snapshot >>>>>> >>>>>> Thanks, >>>>>> Thomas >>>>>> >>>>>> On Fri, Aug 30, 2019 at 10:56 AM Valentyn Tymofieiev < >>>>>> valen...@google.com> wrote: >>>>>> >>>>>>> On Tue, Aug 27, 2019 at 3:35 PM Hannah Jiang <hannahji...@google.com> >>>>>>> wrote: >>>>>>> >>>>>>>> Hi team >>>>>>>> >>>>>>>> I am working on improving docker container support for Beam. We >>>>>>>> would like to publish prebuilt containers for each release version and >>>>>>>> daily snapshot. Current work focuses on release images only and it >>>>>>>> would be >>>>>>>> part of the release process. >>>>>>>> >>>>>>>> The release images will be pushed to GCR which is publicly >>>>>>>> accessible(pullable). We will use the following locations. >>>>>>>> *Repository*: gcr.io/beam >>>>>>>> *Project*: apache-beam-testing >>>>>>>> More details, including naming and tagging scheme, can be found at >>>>>>>> wiki >>>>>>>> <https://cwiki.apache.org/confluence/display/BEAM/%5BWIP%5D+SDKHarness+Container+Image+Release+Process> >>>>>>>> which >>>>>>>> is written by several contributors. >>>>>>>> >>>>>>>> I would like to discuss these two questions. >>>>>>>> *1. How many tests do we need to run before pushing images to gcr*? >>>>>>>> Publishing artifacts is the last step of the release process, so at >>>>>>>> this moment, we already verified all codebase. In addition, many >>>>>>>> Jenkins >>>>>>>> tests use containers, so it is already verified several times. Do we >>>>>>>> need >>>>>>>> to run it again? >>>>>>>> >>>>>>> >>>>>>> In a docker repository, one container image can have multiple tags. >>>>>>> One possibility is that on the last step of the release process, after >>>>>>> sufficient testing, we place a production tag on an image that was >>>>>>> already >>>>>>> pushed with a dev tag. >>>>>>> >>>>>>> For example a dev tag may look like: >>>>>>> gcr.io/apache-beam/python37:2.16.0-RC4, and production tag may look >>>>>>> like: >>>>>>> gcr.io/apache-beam/python37:2.16.0 and both will refer to the same >>>>>>> image at the end. >>>>>>> >>>>>>> We should also plan what the process of updating the container image >>>>>>> will look like, if we need to release the image with additional changes, >>>>>>> and how we will test these changes before the final push (or placing >>>>>>> production tag). >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> *2. How many tests do we need to run to validate pushed images?* >>>>>>>> When we push the images, we assume the images would work and pass >>>>>>>> all the tests. After pushing, we should confirm the images are >>>>>>>> pullable and >>>>>>>> useable. I suggest we run several tests on dataflow with each pushed >>>>>>>> image. >>>>>>>> What do you think? >>>>>>>> >>>>>>> >>>>>>> I think it makes sense to do - Beam runners that use SDK container >>>>>>> images should have some continuously running tests, which periodically >>>>>>> check that all supported images are pullable and still compatible with >>>>>>> the >>>>>>> runner. >>>>>>> >>>>>>> This work can be refined later as we explore more during our release >>>>>>>> process. >>>>>>>> Please comment or edit the wiki page or reply to this email with >>>>>>>> your opinions. >>>>>>>> >>>>>>>> Thanks, >>>>>>>> Hannah >>>>>>>> >>>>>>>