HonahX commented on issue #8248:
URL: https://github.com/apache/iceberg/issues/8248#issuecomment-1670899322
More context:
if we applied a row filter on UUID col,
```python
unpartitioned_uuid =
catalog.load_table("default.test_uuid_and_fixed_unpartitioned")
arrow_table_eq = unpartitioned_uuid.scan(row_filter="uuid_col ==
'102cb62f-e6f8-4eb0-9973-d9b012ff0967'").to_arrow()
```
it fails with
```
pyarrow.lib.ArrowInvalid: Could not convert
UUID('102cb62f-e6f8-4eb0-9973-d9b012ff0967') with type UUID: did not recognize
Python value type when inferring an Arrow data type
```
raised here:
https://github.com/apache/iceberg/blob/f5f543a54ff7460648bb864f4f06a29eb28938b9/python/pyiceberg/io/pyarrow.py#L451-L454
https://github.com/apache/iceberg/blob/f5f543a54ff7460648bb864f4f06a29eb28938b9/python/pyiceberg/io/pyarrow.py#L480-L481
because UUID is a `fixed_binary[16]` in pyarrow but the UUID literal stores
UUID in its value
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]