Kuinox commented on code in PR #48255:
URL: https://github.com/apache/arrow/pull/48255#discussion_r3209291405
##########
python/pyarrow/tests/parquet/test_metadata.py:
##########
@@ -814,3 +814,22 @@ def msg(c):
with pytest.raises(TypeError, match=msg("FileMetaData")):
pq.FileMetaData()
+
+
+def test_read_schema_uuid_extension_type(tmp_path):
+ data = [
+ b'\xe4`\xf9p\x83QGN\xac\x7f\xa4g>K\xa8\xcb',
+ b'\x1et\x14\x95\xee\xd5C\xea\x9b\xd7s\xdc\x91BK\xaf',
+ None,
+ ]
+ table = pa.table([pa.array(data, type=pa.uuid())], names=["ext"])
+
+ file_path = tmp_path / "uuid.parquet"
+ file_path_str = str(file_path)
+ pq.write_table(table, file_path_str, store_schema=False)
Review Comment:
That was 6 months ago, so I'm only guessing now:
I remember that the behavior differed depending on whether Arrow loaded its
stored schema or not.
I don't remember whether it was needed here, but store_schema=False makes
sure that a UUID logical type is detected as such, without Arrow getting
the information from its own stored schema.
I can confirm this if you want.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]