Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-29 Thread Fokko Driesprong
Hey everyone, Since #305 has been merged, I think we're good for the release. Thank you Sung for the PR and Honah for the great review! I think it would be nice to get #311 to get people started

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-28 Thread Honah J.
Really excited for the upcoming 0.6.0 release and its new features! Big thanks to everyone for their hard work. I'm looking forward to the community feedback and future enhancements. Best regards, Honah On Fri, Jan 26, 2024 at 1:56 PM Daniel Weeks wrote: > I'm also strongly in favor of getting

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Daniel Weeks
I'm also strongly in favor of getting this release out even with the limitations as it's still a huge step forward and we can build incrementally on the write support. Incredible work everyone, I'm really excited about the progress here. -Dan On Fri, Jan 26, 2024 at 11:16 AM Fokko Driesprong wr

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Fokko Driesprong
Thanks everyone for the responses and great to see everyone is as excited as I am :D I have some good news. The guys from Eventual have been working on integrating PyIceberg into their Daft dataframe

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Ryan Blue
Like the Java implementation, we've been building toward a library that can be used in distributed applications as well as directly on a single node. For example, job planning can produce a set of file scan tasks or a scan can be pushed to duckdb (to_duckdb) or pandas (to_pandas). The write side is

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Steve Zhang
I am really excited to see the both append and overwrite support are finally checked and in favor of seeing it in 0.6.0 release. It’s a big milestone worth celebrating! I am happy to help with partitioned write and sort order. Thanks, Steve Zhang > On Jan 26, 2024, at 5:22 AM, Fokko Driespro

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Pucheng Yang
I have similar questions as Yufei's. My organization has interest in Ray Iceberg integration and during the conversation with the Ray team, we know they would also like the have Iceberg integration as well. I think this is a good opportunity for both projects to collaborate. On Fri, Jan 26, 2024 a

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Sung Yun
It’s so exciting to see the project take another step forward, Fokko!Really great job to everyone involved.Best,SungOn Jan 26, 2024, at 11:48 AM, Ryan Blue wrote:It's great to see all the progress in PyIceberg. Thanks to everyone that's been contributing!I'm all for getting a release out as soon

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Yufei Gu
Thanks for the update on PyIceberg's new features. It's exciting to see the progress! I have a quick question: Currently, as I understand, PyIceberg operates within a single process. Are there any plans to expand its capabilities to support distributed computation, particularly for write operation

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Ryan Blue
It's great to see all the progress in PyIceberg. Thanks to everyone that's been contributing! I'm all for getting a release out as soon as possible and following up with more features in the write path in 0.7.0. On Fri, Jan 26, 2024 at 5:22 AM Fokko Driesprong wrote: > Hey everyone, > > I want

[DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Fokko Driesprong
Hey everyone, I want to discuss the 0.6.0 release that will bring a lot of functionality to the public: - Write support for writing to unpartitioned tables - Includes snapshot generation - Constructing Avro writer trees - Support writing metadata which allows to commit support f