Valentyn Tymofieiev created BEAM-7326:
-----------------------------------------

             Summary: Document that Beam BigQuery IO expects users to pass 
base64-encoded bytes, and BQ IO serves base64-encoded bytes to the user.
                 Key: BEAM-7326
                 URL: https://issues.apache.org/jira/browse/BEAM-7326
             Project: Beam
          Issue Type: Bug
          Components: io-java-gcp, io-python-gcp
            Reporter: Valentyn Tymofieiev


BYTES is one of the Datatypes supported by Google Cloud BigQuery, and Apache 
Beam BigQuery IO connector.

Current implementation of BigQuery connector in Java and Python SDKs expects 
that users base64-encode bytes before passing them to BigQuery IO, see 
discussion on dev: [1] 

This needs to be reflected in public documentation, see [2-4]

cc: [~juta] [~chamikara] [~pabloem] 

cc: [~rebo] [~kedin] FYI and to advise whether similar action needs to be done 
for Go SDK and/or Beam SQL.

[1] 
https://lists.apache.org/thread.html/f35c836887014e059527ed1a806e730321e2f9726164a3030575f455@%3Cdev.beam.apache.org%3E
[2] https://beam.apache.org/documentation/io/built-in/google-bigquery/
[3] 
https://beam.apache.org/releases/pydoc/2.12.0/apache_beam.io.gcp.bigquery.html
[4] 
https://beam.apache.org/releases/javadoc/2.12.0/org/apache/beam/sdk/io/gcp/bigquery/BigQueryIO.html



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to