[
https://issues.apache.org/jira/browse/BEAM-9180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17548339#comment-17548339
]
Danny McCormick commented on BEAM-9180:
---------------------------------------
This issue has been migrated to https://github.com/apache/beam/issues/19981
> [ZetaSQL] Support 4-byte unicode in literal string unparsing
> ------------------------------------------------------------
>
> Key: BEAM-9180
> URL: https://issues.apache.org/jira/browse/BEAM-9180
> Project: Beam
> Issue Type: Improvement
> Components: dsl-sql-zetasql
> Reporter: Kirill Kozlov
> Priority: P3
> Labels: stale-assigned
>
> When unprasing literal strings we need to escape special symbols (ex: `\n`,
> `\r`, `\u0012`).
> ZetaSQL supports for some 4-byte (or 8 hex digit) unicode via `\Uhhhhhhhh`.
> As of
> [now|[https://github.com/apache/beam/blob/8a35f408f640d04c38ad6e2a497d30410b3bff32/sdks/java/extensions/sql/src/main/java/org/apache/beam/sdk/extensions/sql/meta/provider/bigquery/BeamSqlUnparseContext.java#L59]]
> only 2-byte (or 4 hex digit) unicode is supported by escaping it via `\u`.
>
> More about escape sequences here (need to scroll down a little):
> https://cloud.google.com/bigquery/docs/reference/standard-sql/lexical
--
This message was sent by Atlassian Jira
(v8.20.7#820007)