Hi Charles,

Maybe you can set up data sets and use TestPipeline to validate (with PAssert) that your pipeline works as expected.
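
Something like this, just as a sketch (the class name and the transform are placeholders, you would plug in your own transform under test):

import org.apache.beam.sdk.testing.PAssert;
import org.apache.beam.sdk.testing.TestPipeline;
import org.apache.beam.sdk.transforms.Create;
import org.apache.beam.sdk.transforms.MapElements;
import org.apache.beam.sdk.values.PCollection;
import org.apache.beam.sdk.values.TypeDescriptors;
import org.junit.Rule;
import org.junit.Test;

public class MyTransformTest {

  // TestPipeline is a JUnit rule that builds and runs the pipeline for the test
  @Rule
  public final transient TestPipeline pipeline = TestPipeline.create();

  @Test
  public void testTransformOnKnownInput() {
    // known input data set (could also be loaded from storage, see below)
    PCollection<String> input = pipeline.apply(Create.of("a", "b", "c"));

    // stand-in for the transform under test
    PCollection<String> output = input.apply(
        MapElements.into(TypeDescriptors.strings()).via(s -> s.toUpperCase()));

    // PAssert validates the contents of the output PCollection
    PAssert.that(output).containsInAnyOrder("A", "B", "C");

    pipeline.run().waitUntilFinish();
  }
}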

The data sets can be stored somewhere (a database or filesystem) and loaded in the tests (basically as we do in the Beam ITs).
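
For instance (the path and the expected count are made up, you would point this at your own fixture data set and assert on properties you know about it):

import org.apache.beam.sdk.io.TextIO;
import org.apache.beam.sdk.testing.PAssert;
import org.apache.beam.sdk.testing.TestPipeline;
import org.apache.beam.sdk.transforms.Count;
import org.apache.beam.sdk.values.PCollection;
import org.junit.Rule;
import org.junit.Test;

public class StoredDataSetTest {

  @Rule
  public final transient TestPipeline pipeline = TestPipeline.create();

  @Test
  public void testPipelineAgainstStoredDataSet() {
    // load the fixture data set from the filesystem (could be GCS, HDFS, a database, ...)
    PCollection<String> input =
        pipeline.apply(TextIO.read().from("/data/fixtures/input.txt"));

    // apply your production transform(s) here, then assert on known properties
    // of the fixture data, e.g. the expected number of records
    PAssert.thatSingleton(input.apply(Count.globally())).isEqualTo(100L);

    pipeline.run().waitUntilFinish();
  }
}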

Thoughts?

Regards
JB

On 01/03/2018 12:37 AM, Charles Allen wrote:
Hello Beam list!

We are looking at adopting some more advanced use cases with Beam code at its core, including automated testing and data dependency tracking.

Specifically I'm interested in things like making sure data changes don't break pipelines, or things that depend on pipeline output, especially if the Beam code isn't managed by the same team that is producing the data or the systems that consume the Beam output.

This becomes more complex if you consider certain runners with non-zero replacement time doing a rolling or staged restart/upgrade/replacement that depend on data producers that ALSO have non-zero replacement time. Are there any best practices for Beam code management / data dependency management when the code in /master is not necessarily what is running live in your production systems? Is it all just "pretend all data is bad and try to be backwards compatible", or are there any Beam features that help with this?

Thanks,
Charles Allen

--
Jean-Baptiste Onofré
jbono...@apache.org
http://blog.nanthrax.net
Talend - http://www.talend.com
