[GitHub] [beam] pabloem commented on a diff in pull request #17159: [WIP][BEAM-11587] Convert to BeamSchema type from ReadfromBQ

GitBox Wed, 27 Jul 2022 01:05:52 -0700


pabloem commented on code in PR #17159:
URL: https://github.com/apache/beam/pull/17159#discussion_r930748919



##########
sdks/python/apache_beam/io/gcp/bigquery.py:
##########
@@ -2422,6 +2422,9 @@ class ReadFromBigQuery(PTransform):
       to run queries with INTERACTIVE priority. This option is ignored when
       reading from a table rather than a query. To learn more about query
       priority, see: https://cloud.google.com/bigquery/docs/running-queries
+    output_type (str): By default, the schema returned from this transform
+      would be of type PYTHON_DICT. Other schema types can be specified
+      ("BEAM_ROW").

Review Comment:
   we don't need to have schema inference for this, though, right? We just need 
to ensure the output values look like dicts to consumer transforms, look like 
namedtuples to the pcollection coder, and the schema coder should be set for 
the pcollection - is that it? may not need to wait for #22086?
   
   However, like I said, I don't really know all of the implications of 
actually implementing those things - so if it's too much trouble, I'm okay with 
moving forward as-is. Thanks!



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [beam] pabloem commented on a diff in pull request #17159: [WIP][BEAM-11587] Convert to BeamSchema type from ReadfromBQ

Reply via email to