This is an automated email from the ASF dual-hosted git repository.

git-site-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam.git
The following commit(s) were added to refs/heads/asf-site by this push:
     new d6c2ba6  Publishing website 2021/09/17 00:03:29 at commit 50997ad
d6c2ba6 is described below

commit d6c2ba66df754ad9901f6557b35faef2b02b5072
Author: jenkins <bui...@apache.org>
AuthorDate: Fri Sep 17 00:03:29 2021 +0000

    Publishing website 2021/09/17 00:03:29 at commit 50997ad
---
 .../dsls/dataframes/differences-from-pandas/index.html    |  3 ++-
 .../documentation/dsls/dataframes/overview/index.html     |  6 ++++--
 website/generated-content/get-started/index.xml           | 15 +++++++++++++++
 .../generated-content/get-started/tour-of-beam/index.html |  8 ++++++--
 website/generated-content/sitemap.xml                     |  2 +-
 5 files changed, 28 insertions(+), 6 deletions(-)

diff --git a/website/generated-content/documentation/dsls/dataframes/differences-from-pandas/index.html b/website/generated-content/documentation/dsls/dataframes/differences-from-pandas/index.html
index 0934ce1..3fd7838 100644
--- a/website/generated-content/documentation/dsls/dataframes/differences-from-pandas/index.html
+++ b/website/generated-content/documentation/dsls/dataframes/differences-from-pandas/index.html
@@ -23,7 +23,8 @@ function openMenu(){addPlaceholder();blockScroll();}</script><div class="clearfi
 with dataframe.allow_non_parallel_operations():
     quantiles = df.quantile()
-</code></pre><p>Note that this collects the entire input dataset on a single node, so there’s a risk of running out of memory. You should only use this workaround if you’re sure that the input is small enough to process on a single worker.</p><h3 id=operations-that-produce-non-deferred-columns>Operations that produce non-deferred columns</h3><p>Beam DataFrame operations are deferred, but the schemas of the resulting DataFrames are not, meaning that result columns must be computable witho [...]
+</code></pre><p>Note that this collects the entire input dataset on a single node, so there’s a risk of running out of memory. 
You should only use this workaround if you’re sure that the input is small enough to process on a single worker.</p><h3 id=operations-that-produce-non-deferred-columns>Operations that produce non-deferred columns</h3><p>Beam DataFrame operations are deferred, but the schemas of the resulting DataFrames are not, meaning that result columns must be computable witho [...] +Run in Colab</a></td></table><p><br><br><br><br></p><p>To get started with Beam in a notebook, see <a href=https://beam.apache.org/get-started/try-apache-beam/>Try Apache Beam</a>.</p></div></div><footer class=footer><div class=footer__contained><div class=footer__cols><div class="footer__cols__col footer__cols__col__logos"><div class=footer__cols__col__logo><img src=/images/beam_logo_circle.svg class=footer__logo alt="Beam logo"></div><div class=footer__cols__col__logo><img src=/images/ [...] <a href=http://www.apache.org>The Apache Software Foundation</a> | <a href=/privacy_policy>Privacy Policy</a> | <a href=/feed.xml>RSS Feed</a><br><br>Apache Beam, Apache, Beam, the Beam logo, and the Apache feather logo are either registered trademarks or trademarks of The Apache Software Foundation. 
All other products or name brands are trademarks of their respective holders, including The Apache Software Foundation.</div></div></div></div></footer></body></html> \ No newline at end of file diff --git a/website/generated-content/documentation/dsls/dataframes/overview/index.html b/website/generated-content/documentation/dsls/dataframes/overview/index.html index 6a6a60e..fda1042 100644 --- a/website/generated-content/documentation/dsls/dataframes/overview/index.html +++ b/website/generated-content/documentation/dsls/dataframes/overview/index.html @@ -18,7 +18,8 @@ function addPlaceholder(){$('input:text').attr('placeholder',"What are you looking for?");} function endSearch(){var search=document.querySelector(".searchBar");search.classList.add("disappear");var icons=document.querySelector("#iconsBar");icons.classList.remove("disappear");} function blockScroll(){$("body").toggleClass("fixedPosition");} -function openMenu(){addPlaceholder();blockScroll();}</script><div class="clearfix container-main-content"><div class="section-nav closed" data-offset-top=90 data-offset-bottom=500><span class="section-nav-back glyphicon glyphicon-menu-left"></span><nav><ul class=section-nav-list data-section-nav><li><span class=section-nav-list-main-title>Languages</span></li><li><span class=section-nav-list-title>Java</span><ul class=section-nav-list><li><a href=/documentation/sdks/java/>Java SDK overvi [...] +function openMenu(){addPlaceholder();blockScroll();}</script><div class="clearfix container-main-content"><div class="section-nav closed" data-offset-top=90 data-offset-bottom=500><span class="section-nav-back glyphicon glyphicon-menu-left"></span><nav><ul class=section-nav-list data-section-nav><li><span class=section-nav-list-main-title>Languages</span></li><li><span class=section-nav-list-title>Java</span><ul class=section-nav-list><li><a href=/documentation/sdks/java/>Java SDK overvi [...] 
+Run in Colab</a></td></table><p><br><br><br><br></p><p>The Apache Beam Python SDK provides a DataFrame API for working with pandas-like <a href=https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html>DataFrame</a> objects. The feature lets you convert a PCollection to a DataFrame and then interact with the DataFrame using the standard methods available on the pandas DataFrame API. The DataFrame API is built on top of the pandas implementation, and pandas DataFram [...] <span class=k>with</span> <span class=n>beam</span><span class=o>.</span><span class=n>Pipeline</span><span class=p>()</span> <span class=k>as</span> <span class=n>p</span><span class=p>:</span> <span class=n>df</span> <span class=o>=</span> <span class=n>p</span> <span class=o>|</span> <span class=n>read_csv</span><span class=p>(</span><span class=s2>"gs://apache-beam-samples/nyc_taxi/misc/sample.csv"</span><span class=p>)</span> @@ -62,7 +63,8 @@ along with other <a href=https://github.com/apache/beam/blob/master/sdks/python/ <span class=n>pc1</span><span class=p>,</span> <span class=n>pc2</span> <span class=o>=</span> <span class=p>{</span><span class=s1>'a'</span><span class=p>:</span> <span class=n>pc</span><span class=p>}</span> <span class=o>|</span> <span class=n>DataframeTransform</span><span class=p>(</span><span class=k>lambda</span> <span class=n>a</span><span class=p>:</span> <span class=n>expr1</span><span class=p>,</span> <span class=n>expr2</span><span class=p>)</span> -<span class=p>{</span><span class=o>...</span><span class=p>}</span> <span class=o>=</span> <span class=p>{</span><span class=n>a</span><span class=p>:</span> <span class=n>pc</span><span class=p>}</span> <span class=o>|</span> <span class=n>DataframeTransform</span><span class=p>(</span><span class=k>lambda</span> <span class=n>a</span><span class=p>:</span> <span class=p>{</span><span class=o>...</span><span class=p>})</span></code></pre></div></div></div></div></div><footer class=foot [...] 
+<span class=p>{</span><span class=o>...</span><span class=p>}</span> <span class=o>=</span> <span class=p>{</span><span class=n>a</span><span class=p>:</span> <span class=n>pc</span><span class=p>}</span> <span class=o>|</span> <span class=n>DataframeTransform</span><span class=p>(</span><span class=k>lambda</span> <span class=n>a</span><span class=p>:</span> <span class=p>{</span><span class=o>...</span><span class=p>})</span></code></pre></div></div></div><table align=left><td><a class [...] +Run in Colab</a></td></table><p><br><br><br><br></p></div></div><footer class=footer><div class=footer__contained><div class=footer__cols><div class="footer__cols__col footer__cols__col__logos"><div class=footer__cols__col__logo><img src=/images/beam_logo_circle.svg class=footer__logo alt="Beam logo"></div><div class=footer__cols__col__logo><img src=/images/apache_logo_circle.svg class=footer__logo alt="Apache logo"></div></div><div class=footer-wrapper><div class=wrapper-grid><div class [...] <a href=http://www.apache.org>The Apache Software Foundation</a> | <a href=/privacy_policy>Privacy Policy</a> | <a href=/feed.xml>RSS Feed</a><br><br>Apache Beam, Apache, Beam, the Beam logo, and the Apache feather logo are either registered trademarks or trademarks of The Apache Software Foundation. 
All other products or name brands are trademarks of their respective holders, including The Apache Software Foundation.</div></div></div></div></footer></body></html>
\ No newline at end of file
diff --git a/website/generated-content/get-started/index.xml b/website/generated-content/get-started/index.xml
index a95ce8d..df139c5 100644
--- a/website/generated-content/get-started/index.xml
+++ b/website/generated-content/get-started/index.xml
@@ -4630,6 +4630,21 @@ Run in Colab
 </td>
 </table>
 <p><br><br><br><br></p>
+<h3 id="dataframes">DataFrames</h3>
+<p>Beam DataFrames provide a pandas-like <a href="https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html">DataFrame</a>
+API to declare Beam pipelines.
+To learn more about Beam DataFrames, take a look at the
+<a href="https://beam.apache.org/documentation/dsls/dataframes/overview">Beam DataFrames overview</a> page.</p>
+<table align="left">
+<td>
+<a class="button" target="_blank" href="https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/tour-of-beam/dataframes.ipynb">
+<img alt="Run in Colab" width="32px" height="32px"
+src="https://github.com/googlecolab/open_in_colab/raw/master/images/icon32.png" />
+Run in Colab
+</a>
+</td>
+</table>
+<p><br><br><br><br></p>
 <h2 id="transforms">Transforms</h2>
 <p>Check the <a href="/documentation/transforms/python/overview/">Python transform catalog</a>
 for a complete list of the available transforms.</p>
diff --git a/website/generated-content/get-started/tour-of-beam/index.html b/website/generated-content/get-started/tour-of-beam/index.html
index f58d755..ada98bc 100644
--- a/website/generated-content/get-started/tour-of-beam/index.html
+++ b/website/generated-content/get-started/tour-of-beam/index.html
@@ -18,7 +18,7 @@
 function addPlaceholder(){$('input:text').attr('placeholder',"What are you looking for?");}
 function endSearch(){var search=document.querySelector(".searchBar");search.classList.add("disappear");var 
icons=document.querySelector("#iconsBar");icons.classList.remove("disappear");} function blockScroll(){$("body").toggleClass("fixedPosition");} -function openMenu(){addPlaceholder();blockScroll();}</script><div class="clearfix container-main-content"><div class="section-nav closed" data-offset-top=90 data-offset-bottom=500><span class="section-nav-back glyphicon glyphicon-menu-left"></span><nav><ul class=section-nav-list data-section-nav><li><span class=section-nav-list-main-title>Get started</span></li><li><a href=/get-started/beam-overview/>Beam Overview</a></li><li><a href=/get-started/tour-of-beam/>Tour of Beam</a></li><li><s [...] +function openMenu(){addPlaceholder();blockScroll();}</script><div class="clearfix container-main-content"><div class="section-nav closed" data-offset-top=90 data-offset-bottom=500><span class="section-nav-back glyphicon glyphicon-menu-left"></span><nav><ul class=section-nav-list data-section-nav><li><span class=section-nav-list-main-title>Get started</span></li><li><a href=/get-started/beam-overview/>Beam Overview</a></li><li><a href=/get-started/tour-of-beam/>Tour of Beam</a></li><li><s [...] <a href=https://colab.research.google.com>Colab</a>. The notebooks allow you to interactively play with the code and see how your changes affect the pipeline. You don’t need to install anything or modify your computer in any way to use these notebooks.</p><p>You can also <a href=/get-started/try-apache-beam>try an Apache Beam pipeline</a> using the Java, Python, and Go SDKs.</p><h2 id=get-started>Get started</h2><h3 id=learn-the-basics>Learn the basics</h3><p>In this notebook we go through the basics of what is Apache Beam and how to get started. 
@@ -28,13 +28,17 @@ We introduce the built-in <code>ReadFromText</code> and <code>WriteToText</code> We also see how we can read from CSV files, read from a SQLite database, write fixed-sized batches of elements, and write windows of elements.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/tour-of-beam/reading-and-writing-data.ipynb><img alt="Run in Colab" width=32px height=32px src=https://github.com/googlecolab/open_in_colab/raw/master/images/icon32.png> Run in Colab</a></td></table><p><br><br><br><br></p><h3 id=windowing>Windowing</h3><p>In this notebook we go through how to aggregate data based on time intervals, or in streaming pipelines. We introduce the <code>GlobalWindow</code>, <code>FixedWindows</code>, <code>SlidingWindows</code>, and <code>Sessions</code>.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/tour-of-beam/windowing.ipynb><img alt="Run in Colab" width=32px height=32px src=https://github.com/googlecolab/open_in_colab/raw/master/images/icon32.png> +Run in Colab</a></td></table><p><br><br><br><br></p><h3 id=dataframes>DataFrames</h3><p>Beam DataFrames provide a pandas-like <a href=https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html>DataFrame</a> +API to declare Beam pipelines. 
+To learn more about Beam DataFrames, take a look at the +<a href=https://beam.apache.org/documentation/dsls/dataframes/overview>Beam DataFrames overview</a> page.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/tour-of-beam/dataframes.ipynb><img alt="Run in Colab" width=32px height=32px src=https://github.com/googlecolab/open_in_colab/raw/master/images/icon32.png> Run in Colab</a></td></table><p><br><br><br><br></p><h2 id=transforms>Transforms</h2><p>Check the <a href=/documentation/transforms/python/overview/>Python transform catalog</a> for a complete list of the available transforms.</p><h3 id=element-wise-transforms>Element-wise transforms</h3><h4 id=map>Map</h4><p>Applies a simple one-to-one mapping function over each element in the collection.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/documentation/transforms/python/elementwise/map-py.ipynb><img alt="Run in Colab" width=32px height=32px src=https://github.com/google [...] Run in Colab</a></td></table><p><br><br><br><br></p><h4 id=flatmap>FlatMap</h4><p>Applies a simple one-to-many mapping function over each element in the collection. The many elements are flattened into the resulting collection.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/documentation/transforms/python/elementwise/flatmap-py.ipynb><img alt="Run in Colab" width=32px height=32px src=https:// [...] 
Run in Colab</a></td></table><p><br><br><br><br></p><h4 id=filter>Filter</h4><p>Given a predicate, filter out all elements that don’t satisfy that predicate.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/documentation/transforms/python/elementwise/filter-py.ipynb><img alt="Run in Colab" width=32px height=32px src=https://github.com/googlecolab/open_in_colab/raw/master/images/icon32.png> Run in Colab</a></td></table><p><br><br><br><br></p><h4 id=partition>Partition</h4><p>Separates elements in a collection into multiple output collections.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/documentation/transforms/python/elementwise/partition-py.ipynb><img alt="Run in Colab" width=32px height=32px src=https://github.com/googlecolab/open_in_colab/raw/master/images/icon32.png> Run in Colab</a></td></table><p><br><br><br><br></p><h4 id=pardo>ParDo</h4><p>A transform for generic parallel processing. It’s recommended to use <code>Map</code>, <code>FlatMap</code>, <code>Filter</code> or other more specific transforms when possible.</p><table align=left><td><a class=button target=_blank href=https://colab.research.google.com/github/apache/beam/blob/master/examples/notebooks/documentation/transforms/python/elementwise/pardo-py.ipynb><img alt="Run in Colab" wid [...] -Run in Colab</a></td></table><p><br><br><br><br></p><div class=feedback><p class=update>Last updated on 2021/07/01</p><h3>Have you found everything you were looking for?</h3><p class=description>Was it all useful and clear? Is there anything that you would like to change? Let us know!</p><button class=load-button><a href="mailto:d...@beam.apache.org?subject=Beam Website Feedback">SEND FEEDBACK</a></button></div></div></div><footer class=footer><div class=footer__contained><div class=foote [...] 
+Run in Colab</a></td></table><p><br><br><br><br></p><div class=feedback><p class=update>Last updated on 2021/09/16</p><h3>Have you found everything you were looking for?</h3><p class=description>Was it all useful and clear? Is there anything that you would like to change? Let us know!</p><button class=load-button><a href="mailto:d...@beam.apache.org?subject=Beam Website Feedback">SEND FEEDBACK</a></button></div></div></div><footer class=footer><div class=footer__contained><div class=foote [...] <a href=http://www.apache.org>The Apache Software Foundation</a> | <a href=/privacy_policy>Privacy Policy</a> | <a href=/feed.xml>RSS Feed</a><br><br>Apache Beam, Apache, Beam, the Beam logo, and the Apache feather logo are either registered trademarks or trademarks of The Apache Software Foundation. All other products or name brands are trademarks of their respective holders, including The Apache Software Foundation.</div></div></div></div></footer></body></html> \ No newline at end of file diff --git a/website/generated-content/sitemap.xml b/website/generated-content/sitemap.xml index 901b536..d70e461 100644 --- a/website/generated-content/sitemap.xml +++ b/website/generated-content/sitemap.xml @@ -1 +1 @@ -<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml"><url><loc>/blog/beam-2.32.0/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/categories/blog/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/blog/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/categories/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/blog/b [...] 
\ No newline at end of file +<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml"><url><loc>/blog/beam-2.32.0/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/categories/blog/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/blog/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/categories/</loc><lastmod>2021-09-15T13:15:43-07:00</lastmod></url><url><loc>/blog/b [...] \ No newline at end of file