[3/3] beam-site git commit: Regenerate website
Regenerate website Project: http://git-wip-us.apache.org/repos/asf/beam-site/repo Commit: http://git-wip-us.apache.org/repos/asf/beam-site/commit/5c993c61 Tree: http://git-wip-us.apache.org/repos/asf/beam-site/tree/5c993c61 Diff: http://git-wip-us.apache.org/repos/asf/beam-site/diff/5c993c61 Branch: refs/heads/asf-site Commit: 5c993c612fe47912d25c305e9ad15bbeee7c1ed1 Parents: 93c7c5f Author: Kenneth Knowles Authored: Fri Jun 9 10:43:44 2017 -0700 Committer: Kenneth Knowles Committed: Fri Jun 9 10:43:44 2017 -0700 -- .../2016/03/17/capability-matrix.html |1 + .../2016/04/03/presentation-materials.html |1 + .../sdk/2016/02/25/python-sdk-now-public.html |1 + .../beam/release/2016/06/15/first-release.html |1 + .../10/11/strata-hadoop-world-and-beam.html |1 + .../website/2016/02/22/beam-has-a-logo.html |1 + .../blog/2016/05/18/splitAtFraction-method.html |1 + .../05/27/where-is-my-pcollection-dot-map.html |1 + .../06/13/flink-batch-runner-milestone.html |1 + content/blog/2016/08/03/six-months.html |1 + content/blog/2016/10/20/test-stream.html|1 + content/blog/2017/01/09/added-apex-runner.html |1 + content/blog/2017/01/10/beam-graduates.html |1 + .../blog/2017/02/01/graduation-media-recap.html |1 + .../blog/2017/02/13/stateful-processing.html|1 + content/blog/2017/03/16/python-sdk-release.html |1 + .../2017/05/17/beam-first-stable-release.html |1 + content/blog/index.html |1 + content/coming-soon.html|1 + .../contribute/contribution-guide/index.html|1 + content/contribute/design-principles/index.html |1 + content/contribute/index.html |1 + content/contribute/logos/index.html |1 + content/contribute/maturity-model/index.html|1 + .../presentation-materials/index.html |1 + .../ptransform-style-guide/index.html |1 + content/contribute/release-guide/index.html |1 + content/contribute/runner-guide/index.html | 1375 ++ content/contribute/source-repository/index.html |1 + content/contribute/team/index.html |1 + content/contribute/testing/index.html |1 + content/contribute/work-in-progress/index.html |1 + content/documentation/index.html|1 + .../documentation/io/authoring-java/index.html |1 + .../io/authoring-overview/index.html|1 + .../io/authoring-python/index.html |1 + .../documentation/io/built-in/hadoop/index.html |1 + content/documentation/io/built-in/index.html|1 + .../documentation/io/contributing/index.html|1 + content/documentation/io/io-toc/index.html |1 + content/documentation/io/testing/index.html |1 + .../pipelines/create-your-pipeline/index.html |1 + .../pipelines/design-your-pipeline/index.html |1 + .../pipelines/test-your-pipeline/index.html |1 + .../documentation/programming-guide/index.html |1 + content/documentation/resources/index.html |1 + content/documentation/runners/apex/index.html |1 + .../runners/capability-matrix/index.html|1 + .../documentation/runners/dataflow/index.html |1 + content/documentation/runners/direct/index.html |1 + content/documentation/runners/flink/index.html |1 + .../documentation/runners/gearpump/index.html |1 + content/documentation/runners/spark/index.html |1 + .../sdks/java-extensions/index.html |1 + content/documentation/sdks/java/index.html |1 + .../sdks/python-custom-io/index.html|1 + .../python-pipeline-dependencies/index.html |1 + .../sdks/python-type-safety/index.html |1 + content/documentation/sdks/python/index.html|1 + content/get-started/beam-overview/index.html|1 + content/get-started/downloads/index.html|1 + content/get-started/index.html |1 + .../mobile-gaming-example/index.html|1 + content/get-started/quickstart-java/index.html |1 + content/get-started/quickstart-py/index.html|1 + content/get-started/support/index.html |1 + .../get-started/wordcount-example/index.html|1 + content/index.html |1 + content/js/language-switch.js |2 +- content/privacy_policy/index.html |1 + 70 files changed, 1444 insertions(+), 1 deletion(-) -- http://git-wip-us.apache.org/repos/asf/beam-site/blob/5c993c61/content/beam/capability/2016/03/17/capability-matrix.html -- diff --
[3/3] beam-site git commit: Regenerate website after merge
Regenerate website after merge Project: http://git-wip-us.apache.org/repos/asf/beam-site/repo Commit: http://git-wip-us.apache.org/repos/asf/beam-site/commit/ce15747f Tree: http://git-wip-us.apache.org/repos/asf/beam-site/tree/ce15747f Diff: http://git-wip-us.apache.org/repos/asf/beam-site/diff/ce15747f Branch: refs/heads/asf-site Commit: ce15747f32d776feb457472fe964311cd921510f Parents: 56c289f Author: Dan Halperin Authored: Thu May 25 09:43:48 2017 -0700 Committer: Dan Halperin Committed: Thu May 25 09:43:48 2017 -0700 -- content/documentation/io/built-in/index.html | 4 1 file changed, 4 insertions(+) -- http://git-wip-us.apache.org/repos/asf/beam-site/blob/ce15747f/content/documentation/io/built-in/index.html -- diff --git a/content/documentation/io/built-in/index.html b/content/documentation/io/built-in/index.html index 6b3de1b..688e24f 100644 --- a/content/documentation/io/built-in/index.html +++ b/content/documentation/io/built-in/index.html @@ -259,6 +259,10 @@ TikaIOJava https://issues.apache.org/jira/browse/BEAM-2328";>BEAM-2328 + +Cloud SpannerJava +https://issues.apache.org/jira/browse/BEAM-1542";>BEAM-1542 +
[3/3] beam-site git commit: Regenerate website
Regenerate website Project: http://git-wip-us.apache.org/repos/asf/beam-site/repo Commit: http://git-wip-us.apache.org/repos/asf/beam-site/commit/703e0bb2 Tree: http://git-wip-us.apache.org/repos/asf/beam-site/tree/703e0bb2 Diff: http://git-wip-us.apache.org/repos/asf/beam-site/diff/703e0bb2 Branch: refs/heads/asf-site Commit: 703e0bb2b0ecfaddf6716bd5774c4ff660788871 Parents: b78d316 Author: Dan Halperin Authored: Fri Feb 10 09:25:23 2017 -0800 Committer: Dan Halperin Committed: Fri Feb 10 09:25:23 2017 -0800 -- .../pipelines/test-your-pipeline/index.html | 14 +++--- 1 file changed, 7 insertions(+), 7 deletions(-) -- http://git-wip-us.apache.org/repos/asf/beam-site/blob/703e0bb2/content/documentation/pipelines/test-your-pipeline/index.html -- diff --git a/content/documentation/pipelines/test-your-pipeline/index.html b/content/documentation/pipelines/test-your-pipeline/index.html index a122117..cbfb489 100644 --- a/content/documentation/pipelines/test-your-pipeline/index.html +++ b/content/documentation/pipelines/test-your-pipeline/index.html @@ -201,7 +201,7 @@ Create a DoFnTester. Youâll need to pass an instance of the DoFn you want to test to the static factory method for DoFnTester. Create one or more main test inputs of the appropriate type for your DoFn. If your DoFn takes side inputs and/or produces side outputs, you should also create the side inputs and the side output tags. Call DoFnTester.processBundle to process the main inputs. - Use JUnitâs Assert.assertThat method to ensure the test outputs returned from processBatch match your expected values. + Use JUnitâs Assert.assertThat method to ensure the test outputs returned from processBundle match your expected values. Creating a DoFnTester @@ -261,28 +261,28 @@ Processing Test Inputs and Checking Results -To process the inputs (and thus run the test on your DoFn), you call the method DoFnTester.processBatch. When you call processBatch, you pass one or more main test input values for your DoFn. If you set side inputs, the side inputs are available to each batch of main inputs that you provide. +To process the inputs (and thus run the test on your DoFn), you call the method DoFnTester.processBundle. When you call processBundle, you pass one or more main test input values for your DoFn. If you set side inputs, the side inputs are available to each batch of main inputs that you provide. -DoFnTester.processBatch returns a List of outputsâthat is, objects of the same type as the DoFnâs specified output type. For a DoFn, processBatch returns a List : +DoFnTester.processBundle returns a List of outputsâthat is, objects of the same type as the DoFnâs specified output type. For a DoFn , processBundle returns a List : static class MyDoFn extends DoFn { ... } MyDoFn myDoFn = ...; DoFnTester fnTester = DoFnTester.of(myDoFn); String testInput = "test1"; -List testOutputs = fnTester.processBatch(testInput); +List testOutputs = fnTester.processBundle(testInput); -To check the results of processBatch, you use JUnitâs Assert.assertThat method to test if the List of outputs contains the values you expect: +To check the results of processBundle, you use JUnitâs Assert.assertThat method to test if the List of outputs contains the values you expect: String testInput = "test1"; -List testOutputs = fnTester.processBatch(testInput); +List testOutputs = fnTester.processBundle(testInput); Assert.assertThat(testOutputs, Matchers.hasItems(...)); // Process a larger batch in a single step. -Assert.assertThat(fnTester.processBatch("input1", "input2", "input3"), Matchers.hasItems(...)); +Assert.assertThat(fnTester.processBundle("input1", "input2", "input3"), Matchers.hasItems(...));
[3/3] beam-site git commit: Regenerate website
Regenerate website Project: http://git-wip-us.apache.org/repos/asf/beam-site/repo Commit: http://git-wip-us.apache.org/repos/asf/beam-site/commit/0b277be1 Tree: http://git-wip-us.apache.org/repos/asf/beam-site/tree/0b277be1 Diff: http://git-wip-us.apache.org/repos/asf/beam-site/diff/0b277be1 Branch: refs/heads/asf-site Commit: 0b277be133642f3166ef29afa4ec8eff2dc19222 Parents: 29ddf77 Author: Dan Halperin Authored: Fri Feb 10 09:23:49 2017 -0800 Committer: Dan Halperin Committed: Fri Feb 10 09:23:49 2017 -0800 -- content/documentation/programming-guide/index.html | 7 +++ 1 file changed, 7 insertions(+) -- http://git-wip-us.apache.org/repos/asf/beam-site/blob/0b277be1/content/documentation/programming-guide/index.html -- diff --git a/content/documentation/programming-guide/index.html b/content/documentation/programming-guide/index.html index 9830735..f02fd40 100644 --- a/content/documentation/programming-guide/index.html +++ b/content/documentation/programming-guide/index.html @@ -187,6 +187,7 @@ Side Inputs and Side Outputs + Composite Transforms Pipeline I/O Running the Pipeline Data Encoding and Type Safety @@ -1208,6 +1209,12 @@ tree, [2] +Composite Transforms + + + Note: This section is in progress (https://issues.apache.org/jira/browse/BEAM-1452";>BEAM-1452). + + Pipeline I/O When you create a pipeline, you often need to read data from some external source, such as a file in external data sink or a database. Likewise, you may want your pipeline to output its result data to a similar external data sink. Beam provides read and write transforms for a number of common data storage types. If you want your pipeline to read from or write to a data storage format that isnât supported by the built-in transforms, you can implement your own read and write transforms.
[3/3] beam-site git commit: Regenerate website
Regenerate website Project: http://git-wip-us.apache.org/repos/asf/beam-site/repo Commit: http://git-wip-us.apache.org/repos/asf/beam-site/commit/d9ade058 Tree: http://git-wip-us.apache.org/repos/asf/beam-site/tree/d9ade058 Diff: http://git-wip-us.apache.org/repos/asf/beam-site/diff/d9ade058 Branch: refs/heads/asf-site Commit: d9ade0584b0c4683594825df233a54ac3132787c Parents: 57c0a0d Author: Dan Halperin Authored: Wed Dec 28 23:46:00 2016 -0800 Committer: Dan Halperin Committed: Wed Dec 28 23:46:00 2016 -0800 -- content/documentation/programming-guide/index.html | 4 ++-- content/documentation/runners/dataflow/index.html | 2 +- content/documentation/runners/direct/index.html| 2 +- content/documentation/runners/flink/index.html | 2 +- 4 files changed, 5 insertions(+), 5 deletions(-) -- http://git-wip-us.apache.org/repos/asf/beam-site/blob/d9ade058/content/documentation/programming-guide/index.html -- diff --git a/content/documentation/programming-guide/index.html b/content/documentation/programming-guide/index.html index 1042062..2549abe 100644 --- a/content/documentation/programming-guide/index.html +++ b/content/documentation/programming-guide/index.html @@ -233,7 +233,7 @@ The pipeline configuration options determine, among other things, the PipelineRunner that determines where the pipeline gets executed: locally, or using a distributed back-end of your choice. Depending on where your pipeline gets executed and what your specifed Runner requires, the options can also help you specify other aspects of execution. -To set your pipelineâs configuration options and create the pipeline, create an object of type PipelineOptionshttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/utils/options.py";>PipelineOptions and pass it to Pipeline.Create(). The most common way to do this is by parsing arguments from the command-line: +To set your pipelineâs configuration options and create the pipeline, create an object of type PipelineOptionshttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/utils/pipeline_options.py";>PipelineOptions and pass it to Pipeline.Create(). The most common way to do this is by parsing arguments from the command-line: public static void main(String[] args) { // Will parse the arguments passed into the application and construct a PipelineOptions @@ -246,7 +246,7 @@ -from apache_beam.utils.options import PipelineOptions +from apache_beam.utils.pipeline_options import PipelineOptions # Will parse the arguments passed into the application and construct a PipelineOptions # Note that --help will print registered options. http://git-wip-us.apache.org/repos/asf/beam-site/blob/d9ade058/content/documentation/runners/dataflow/index.html -- diff --git a/content/documentation/runners/dataflow/index.html b/content/documentation/runners/dataflow/index.html index cc4e510..f61b477 100644 --- a/content/documentation/runners/dataflow/index.html +++ b/content/documentation/runners/dataflow/index.html @@ -256,7 +256,7 @@ -See the reference documentation for the DataflowPipelineOptionshttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/utils/options.py";>PipelineOptions interface (and its subinterfaces) for the complete list of pipeline configuration options. +See the reference documentation for the DataflowPipelineOptionshttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/utils/pipeline_options.py";>PipelineOptions interface (and its subinterfaces) for the complete list of pipeline configuration options. Additional information and caveats http://git-wip-us.apache.org/repos/asf/beam-site/blob/d9ade058/content/documentation/runners/direct/index.html -- diff --git a/content/documentation/runners/direct/index.html b/content/documentation/runners/direct/index.html index 53d07db..335632f 100644 --- a/content/documentation/runners/direct/index.html +++ b/content/documentation/runners/direct/index.html @@ -180,7 +180,7 @@ When executing your pipeline from the command-line, set runner to direct. The default values for the other pipeline options are generally sufficient. -See the reference documentation for the DirectOptionshttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/utils/options.py";>PipelineOptions interface (and its subinterfaces) for defaults and the complete list of pipeline configuration options. +See the reference documentation for the DirectOptionshttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/utils/pipeline_options.py";>PipelineOptions interface (and its subinterfac
[3/3] beam-site git commit: Regenerate website
Regenerate website Project: http://git-wip-us.apache.org/repos/asf/beam-site/repo Commit: http://git-wip-us.apache.org/repos/asf/beam-site/commit/1e2528f1 Tree: http://git-wip-us.apache.org/repos/asf/beam-site/tree/1e2528f1 Diff: http://git-wip-us.apache.org/repos/asf/beam-site/diff/1e2528f1 Branch: refs/heads/asf-site Commit: 1e2528f17ac449431ad567ef2f4e0b4e088d05f4 Parents: a60e7db Author: Dan Halperin Authored: Tue Dec 27 18:48:46 2016 -0800 Committer: Dan Halperin Committed: Tue Dec 27 18:48:46 2016 -0800 -- .../documentation/programming-guide/index.html | 179 +++ content/js/language-switch.js | 10 +- content/styles/site.css | 14 ++ 3 files changed, 160 insertions(+), 43 deletions(-) -- http://git-wip-us.apache.org/repos/asf/beam-site/blob/1e2528f1/content/documentation/programming-guide/index.html -- diff --git a/content/documentation/programming-guide/index.html b/content/documentation/programming-guide/index.html index 1781e53..1042062 100644 --- a/content/documentation/programming-guide/index.html +++ b/content/documentation/programming-guide/index.html @@ -148,6 +148,14 @@ The Beam Programming Guide is intended for Beam users who want to use the Beam SDKs to create data processing pipelines. It provides guidance for using the Beam SDK classes to build and test your pipeline. It is not intended as an exhaustive reference, but as a language-agnostic, high-level guide to programmatically building your Beam pipeline. As the programming guide is filled out, the text will include code samples in multiple languages to help illustrate how to implement Beam concepts in your programs. + + Adapt for: + +Java SDK +Python SDK + + + Contents @@ -219,13 +227,13 @@ Creating the Pipeline -The Pipeline abstraction encapsulates all the data and steps in your data processing task. Your Beam driver program typically starts by constructing a https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/Pipeline.java";>Pipeline object, and then using that object as the basis for creating the pipelineâs data sets as PCollections and its operations as Transforms. +The Pipeline abstraction encapsulates all the data and steps in your data processing task. Your Beam driver program typically starts by constructing a Pipelinehttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/pipeline.py";>Pipeline object, and then using that object as the basis for creating the pipelineâs data sets as PCollections and its operations as Transforms. To use Beam, your driver program must first create an instance of the Beam SDK class Pipeline (typically in the main() function). When you create your Pipeline, youâll also need to set some configuration options. You can set your pipelineâs configuration options programatically, but itâs often easier to set the options ahead of time (or read them from the command line) and pass them to the Pipeline object when you create the object. The pipeline configuration options determine, among other things, the PipelineRunner that determines where the pipeline gets executed: locally, or using a distributed back-end of your choice. Depending on where your pipeline gets executed and what your specifed Runner requires, the options can also help you specify other aspects of execution. -To set your pipelineâs configuration options and create the pipeline, create an object of type https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/options/PipelineOptions.java";>PipelineOptions and pass it to Pipeline.Create(). The most common way to do this is by parsing arguments from the command-line: +To set your pipelineâs configuration options and create the pipeline, create an object of type PipelineOptionshttps://github.com/apache/beam/blob/python-sdk/sdks/python/apache_beam/utils/options.py";>PipelineOptions and pass it to Pipeline.Create(). The most common way to do this is by parsing arguments from the command-line: public static void main(String[] args) { // Will parse the arguments passed into the application and construct a PipelineOptions @@ -238,11 +246,19 @@ +from apache_beam.utils.options import PipelineOptions + +# Will parse the arguments passed into the application and construct a PipelineOptions +# Note that --help will print registered options. +p = beam.Pipeline(options=PipelineOptions()) + + + The Beam SDKs contain various subclasses of PipelineOptions that correspond to different Runners. For example, DirectPipelineOptions contains options for the Direct (local) pipeline runner, while DataflowPipelineOptions contains options for using the runn