Re: [Question] Change default file encoding in Dataflow runners

2023-06-15 Thread Ramana Venkata
Hi Bruno, I added a log statement in a DoFn, logger.info(System.getProperty("file.encoding")), and it showed ANSI as the file encoding. Nothing in our code sets an ANSI file encoding. I will check with Google Support. On Fri, Jun 16, 2023 at 7:27 AM Bruno Volpato via user
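
For readers who want to repeat the check described above, here is a minimal sketch in plain Java. The class name DefaultEncodingReport and the report() helper are hypothetical and not taken from the thread; inside a pipeline the same string could be passed to whatever logger the DoFn already uses. It prints both the file.encoding system property and the charset the JVM actually resolved, since the two are related but not guaranteed to read identically.

```java
import java.nio.charset.Charset;

public class DefaultEncodingReport {

    // Builds a one-line summary of the JVM's encoding settings.
    // Hypothetical helper, written for illustration only.
    static String report() {
        // The system property the thread's log statement reads.
        String fileEncoding = System.getProperty("file.encoding");
        // The charset the JVM uses when no charset is passed explicitly.
        String defaultCharset = Charset.defaultCharset().name();
        return "file.encoding=" + fileEncoding + ", defaultCharset=" + defaultCharset;
    }

    public static void main(String[] args) {
        System.out.println(report());
    }
}
```

The output depends on the JVM version and the environment it runs in, so no expected value is shown here.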

Re: [Question] Change default file encoding in Dataflow runners

2023-06-15 Thread Bruno Volpato via user
Hi Ramana, Curious where you got ANSI_X3.4-1968 from -- I don't think there's any trace of this encoding anywhere in Dataflow Workers (as far as I am aware and looked around). The default encoding for JVM is UTF-8, and Dataflow doesn't appear to set it anywhere. I was able to check using: $

Re: Tour of Beam - an interactive Apache Beam learning guide

2023-06-15 Thread Kerry Donny-Clark via user
Sounds good to me, working on it now. The current content is all notebooks; I will move them into other learning materials sections and update the names to change the "tour of beam" language. Kerry On Thu, Jun 15, 2023 at 10:16 AM XQ Hu wrote: > We already have a Beam Overview there. >

Re: Tour of Beam - an interactive Apache Beam learning guide

2023-06-15 Thread XQ Hu via user
We already have a Beam Overview there. https://beam.apache.org/get-started/tour-of-beam/ contains some good Colab notebooks, which mainly are just for Python. I suggest we link this to https://tour.beam.apache.org/ but move the current content under Python Quickstart. On Thu, Jun 15, 2023 at

Re: Tour of Beam - an interactive Apache Beam learning guide

2023-06-15 Thread Svetak Sundhar via user
+1, I got confused when I first tried to navigate. Given that the examples in the former are all going to be in Colab notebooks, I propose we name it something like "Colab Examples". @XQ Hu Svetak Sundhar Data Engineer svetaksund...@google.com On Thu, Jun 15, 2023 at 10:00 AM Austin

Re: Tour of Beam - an interactive Apache Beam learning guide

2023-06-15 Thread Kerry Donny-Clark via user
Thanks for bringing this up, Austin. I noticed this as well when I tried to search Google for "Tour of Beam". I propose we rebrand the Get Started tour as "An Intro to Beam" or "An Overview of Beam". I'll put up a draft PR today and update this thread so folks can comment and review. Kerry On

[Question] Change default file encoding in Dataflow runners

2023-06-15 Thread Ramana Venkata
Hi, I accidentally discovered that the default file encoding in my Dataflow runners is ANSI_X3.4-1968. We expected it to be UTF-8, and as a result, some of our data has been corrupted. I came across this Stack Overflow answer (link: https://stackoverflow.com/a/362006), but to the best of my
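
As an illustration of how this kind of corruption can arise, here is a small Java sketch. It assumes the pipeline code converts between strings and bytes without naming a charset, which the thread does not confirm. ANSI_X3.4-1968 is an alias of US-ASCII in Java's charset registry, so any non-ASCII character encoded under that default is replaced with '?'. The class name AsciiCorruptionDemo and its helpers are hypothetical.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class AsciiCorruptionDemo {

    // True if the given charset name resolves to US-ASCII.
    static boolean isAsciiAlias(String name) {
        return Charset.forName(name).equals(StandardCharsets.US_ASCII);
    }

    // Encodes text to bytes and decodes it back with the same charset.
    // Characters the charset cannot represent come back as '?'.
    static String roundTrip(String text, Charset charset) {
        return new String(text.getBytes(charset), charset);
    }

    public static void main(String[] args) {
        String text = "caf\u00e9"; // "café", written with an escape to avoid source-encoding issues

        System.out.println(isAsciiAlias("ANSI_X3.4-1968"));               // prints "true"
        System.out.println(roundTrip(text, StandardCharsets.US_ASCII));   // prints "caf?"
        System.out.println(roundTrip(text, StandardCharsets.UTF_8).equals(text)); // prints "true"
    }
}
```

Passing StandardCharsets.UTF_8 explicitly wherever bytes and strings are converted makes the result independent of the worker's default encoding, whatever that default turns out to be.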