Environments for External Transforms

Maximilian Michels Wed, 22 May 2019 09:17:42 -0700

Hi,

Robert and I were discussing user-specified environments for external transforms [1]. We couldn't decide whether users should have direct control over the environment when they use an external transform in their pipeline.

In my mind, it is quite natural that the Expansion Service is a long-running service that gets started with a list of available environments. Such a list can become outdated, and users may write transforms for a new environment they want to use in their pipeline. The easiest way would be to allow passing the environment with the transform. Note that we already give users control over the "main" environment via the PortablePipelineOptions, so this wouldn't be an entirely new concept.

The contrary position is that the Expansion Service should have full control over which environment is chosen. Going back to the discussion about artifact staging [2], this could enable more optimizations, such as merging environments or detecting conflicts. However, this only works if this information has been provided upfront to the Expansion Service. It wouldn't be impossible to provide these hints alongside the environment, as suggested in the previous paragraph.
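
To make the two positions concrete, here is a minimal sketch of what an optional user-specified environment could look like. This is not Beam code: `Environment`, `ExpansionService`, `resolve`, and the URNs and image names used with them are hypothetical names invented for illustration, and the real Expansion Service API may look quite different.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Environment:
    """Hypothetical stand-in for a portable environment (e.g. a container image)."""
    urn: str
    config: str


class ExpansionService:
    """Toy model of a long-running service started with a fixed environment list."""

    def __init__(self, known: Dict[str, Environment]):
        # Copy the mapping so later changes by the caller don't leak in.
        self._known = dict(known)

    def resolve(self, transform_urn: str,
                user_env: Optional[Environment] = None) -> Environment:
        # First position: an environment passed with the transform wins,
        # so users are not blocked by an outdated service-side list.
        if user_env is not None:
            return user_env
        # Second position: otherwise the service picks from what it was
        # given upfront, which is where it could merge or check conflicts.
        try:
            return self._known[transform_urn]
        except KeyError:
            raise LookupError(f"no environment registered for {transform_urn}")
```

With this shape, a call such as `service.resolve("my:transform", user_env=my_env)` bypasses the service's list, while `service.resolve("my:transform")` leaves the choice entirely to the service.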

Any opinions? Should we allow users to optionally specify an environment for external transforms?


Thanks,
Max

[1] https://github.com/apache/beam/pull/8639

[2] https://lists.apache.org/thread.html/6fcee7047f53cf1c0636fb65367ef70842016d57effe2e5795c4137d@%3Cdev.beam.apache.org%3E
