Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16851 )

Change subject: IMPALA-10358: Correct Iceberg type mappings
......................................................................


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16851/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/16851/1//COMMIT_MSG@13
PS1, Line 13:  * annotates strings with UTF8 in Parquet metadata
> These changes make sense to me. I did add a TYPE_FIXED_UDA_INTERMEDIATE typ
Thanks for the review.

Seems like in SQL the BINARY type is used for fixed-length binary data, and 
VARBINARY for variable-length binary data.

However, Hive added the BINARY type for variable-length binary data, but at 
least it's consistent with Iceberg which also has a BINARY type with the same 
semantics. Maybe FIXED_BINARY would be a good name in HiveQL for fixed-length 
data.

Thanks for bringing up TIMESTAMPTZ. Yeah, on the read-side we use Parquet 
metadata to decide if we need to convert the data to the local timezone or just 
read it as it is. But once we've read the values as TimestampValue objects in 
memory, we loose all the timezone-related information, and at the end we just 
write/output them as they are.

Until now we could write Iceberg tables with TIMESTAMPTZ columns which was 
clearly wrong, because in the Parquet files Impala didn't convert the 
timestamps to UTC. In the new patch set I raise an error when Impala tries to 
write TIMESTAMPTZ.



--
To view, visit http://gerrit.cloudera.org:8080/16851
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I652565f82708824f5cf7497139153b06f116ccd3
Gerrit-Change-Number: 16851
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Reviewer: Gabor Kaszab <gaborkas...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Tim Armstrong <tarmstr...@cloudera.com>
Gerrit-Reviewer: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Reviewer: wangsheng <sky...@163.com>
Gerrit-Comment-Date: Fri, 11 Dec 2020 14:21:59 +0000
Gerrit-HasComments: Yes

Reply via email to