Hey all,

Recently been working on supporting pagination in the list methods in the
rest catalog in pyiceberg and I think we have formed an opinion about
wanting to do this lazily to respect the case that a user has a trillion
objects in a response, we don't eagerly load a trillion objects into
memory.  This has led to the following PR
<https://github.com/apache/iceberg-python/pull/2172> which introduces a
breaking but ideally welcome change to return iterators instead of lists in
the list methods for the catalog.  I am hoping to get the following:

1. Opinions on whether this is a welcome change
2. Alternative solutions if it is not a welcome change (Fokko has suggested
we could subclass the List object so it is more hidden)
3. Eyes on the PR
4. Other places it might make sense to return iterators instead of list
objects that we can implement and merge now, as to not introduce more
breaking changes in the future

Jayce

Reply via email to