[image: Beam.png]

February-March 2019 | Newsletter

What’s been done

------------------------------

Apache Beam 2.10.0 released (by: many contributors)

   -

   Download the release here.
   <https://beam.apache.org/get-started/downloads/>
   -

   See the blog post
   <https://beam.apache.org/blog/2019/02/15/beam-2.10.0.html> for more
   details.


Apache Beam awarded the 2019 Technology of the Year Award!

   -

   InfoWorld just awarded Beam the 2019 Technology of the Year Award.
   -

   See this  article
   
<https://www.infoworld.com/article/3336072/application-development/infoworlds-2019-technology-of-the-year-award-winners.html?nsdr=true>
   for more details.


Kettle Beam 0.5 released with support for flink (by: Matt Casters)

   -

   Kettle now supports Apache Flink as well as Cloud Dataflow and Spark.
   -

   See Matt’s Blog
   
<http://sandbox.kettle.be/wordpress/index.php/2019/02/24/kettle-beam-update-0-5-0/>
   for more details.



What we’re working on...

------------------------------

Apache Beam 2.11.0 release (by: many contributors)


Hive Metastore Table provider for SQL (by: Anton Kedin)

   -

   Support for plugging table providers through Beam SQL API to allow
   obtaining table schemas from external sources.
   -

   See the PR <https://github.com/apache/beam/pull/7746> for more details.


User Defined Coders for the Beam Go SDK (by: Robert Burke)

   -

   Working on expanding the variety of user defined types that can be a
   member of a PCollection in the Go SDK.
   -

   See BEAM-3306 <https://issues.apache.org/jira/browse/BEAM-3306> for more
   details.


Python 3 (by: Ahmet Altay, Robert Bradshaw, Charles Chen, Mark Liu, Robbe
Sneyders, Juta Staes, Valentyn Tymofieiev)

   -

   Beam 2.11.0 is the first release offering partial Python 3 support.
   -

   Many thanks to all contributors who helped to reach this milestone.
   -

   IO availablility on Python 3 is currently limited and only Python 3.5
   version has been tested extensively.
   -

   Stay tuned on BEAM-1251 for more details.


Notebooks for quickstarts and custom I/O (by: David Cavazos)

   -

   Adding IPython notebooks and snippets
   -

   See [BEAM-6557] <https://github.com/apache/beam/pull/7679> for more
   details.




     New members
------------------------------

New PMC member!

   -

   Etienne Chauchot, Nantes, France


New Committers!

   -

   Gleb Kanterov, Stockholm, Sweden
   -

   Michael Luckey


New Contributors!

   -

   Kyle Weaver, San Francisco, CA
   -

      Would like to help begin implementing portability support for the
      Spark runner
      -

   Tanay Tummapalli, Delhi, India
   -

      Would like to contribute to Open Source this summer as part of Google
      Summer of Code
      -

   Brian Hulette, Seattle, WA
   -

      Contributing to Beam Portability
      -

   Michał Walenia, Warsaw, Poland
   -

      Working on integration and load testing
      -

   Daniel Chen, San Francisco, CA
   -

      Working on Beam Samza runner



     Talks & meetups
------------------------------


Plugin Machine Intelligence and Apache Beam with Pentaho - Feb 7 @ London

   -

   Watch the How to Run Kettle on Apache Beam video here
   
<https://skillsmatter.com/skillscasts/13405-how-to-run-kettle-on-apache-beam#video>.

   -

   See event details here
   <https://www.meetup.com/Pentaho-London-User-Group/events/256773962/>..


Beam @Lyft / Streaming, TensorFlow and use-cases - Feb 7 @ San Francisco, CA

   -

   Organized by Thomas Weise and Austin Bennet, with speakers Tyler Akidau,
   Robert Crowe, Thomas Weise and Amar Pai
   -

   See event details here
   <https://www.meetup.com/San-Francisco-Apache-Beam/events/257482350/> and
   the slides for these presentation: Overview of Apache Beam and
   TensorFlow Transform (TFX) with Apache Beam
   <http://s.apache.org/beam-intro-feb-2019>, Python Streaming Pipelines
   with Beam on Flink
<http://go.lyft.com/python-flink-beam-meetup-2019>, Dynamic
   pricing of Lyft rides using streaming
   
<https://www.slideshare.net/AmarPai2/dynamic-pricing-of-lyft-rides-using-streaming>

.

Flink meetup - Feb 21@ Seattle, WA

   -

   Speakers from Alibaba, Google, and Uber gave talks about Apache Flink
   with Hive, Tensorflow, Beam, and AthenaX.
   -

   See event details here
   <https://www.meetup.com/seattle-flink/events/258723322/> and
   presentations here <https://www.slideshare.net/BowenLi9/presentations>.


Beam Summit Europe 2019 - June 19-20 @ Berlin

   -

   Beam Summit Europe 2019 will take place in Berlin on June 19-20.
   -

   Speaker CfP and other details to follow soon!
   -

   Twitter announcement!
   <https://twitter.com/matthiasbaetens/status/1098854758893273088>



     Resources
------------------------------

Apache Jira Beginner’s Guide (by:  Daniel Oliveira)

   -

   A guide
   
<https://cwiki.apache.org/confluence/display/BEAM/Beam+Jira+Beginner%27s+Guide>
   to introduce Beam contributors to the basics of using the Apache Jira for
   Beam development. Feedback welcomed!


An approach to community building from Apache Beam (by: Kenn Knowles)

   -

   The Apache Software Foundation has published committer guidelines to
   help Beam's community building work.
   -

   See the post <https://blogs.apache.org/comdev/date/20190222> on the ASF
   blog.


Exploring Beam SQL on Google Cloud Platform (by: Graham Polley)

   -

   “In this article, I’ll dive into this new feature of Beam, and see how
   it works by using a pipeline to read a data file from GCS, transform it,
   and then perform a basic calculation on the values contained in the file”.
   -

   See article
   
<https://medium.com/weareservian/exploring-beam-sql-on-google-cloud-platform-b6c77f9b4af4>
   and full source code
   
<https://github.com/polleyg/gcp-batch-ingestion-bigquery/blob/beam_sql/src/main/java/org/polleyg/BeamSQLMagic.java>
   .


*Until Next Time!*
-- 
Rose Thị Nguyễn

Reply via email to