mattmartin14 commented on PR #1534:
URL: https://github.com/apache/iceberg-python/pull/1534#issuecomment-2634290851
@Fokko - as an FYI, I do have working code that uses just pyarrow to identify
the rows to insert; the rows to update are a little trickier. Would you like me
to replace the existing DataFusion function for the rows_to_insert part with my
pyarrow code? This is the code I could implement:
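```python
import pyarrow as pa


def get_rows_not_in_target(source_table: pa.Table, target_table: pa.Table, join_cols: list) -> pa.Table:
    # Left anti join: keep source rows whose join keys have no match in the target.
    return source_table.join(target_table, keys=join_cols, join_type='left anti')
```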
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]