Re: [DISCUSS] Spark 3.3 support?

2024-11-18 Thread Yufei Gu
+1 to deprecate it and remove it. Yufei On Wed, Nov 13, 2024 at 9:17 AM Fokko Driesprong wrote: > +1 to deprecating and removing it > > Kind regards, > Fokko > > Op wo 13 nov 2024 om 18:02 schreef Jean-Baptiste Onofré : > >> +1 to deprecating and removing. >> >> Users can still use previous Ic

Re: [ANNOUNCE] Apache Iceberg Go release v0.1.0

2024-11-18 Thread Yufei Gu
Congrats! Thanks Matt for driving it. Thanks everyone for the contribution! Yufei On Mon, Nov 18, 2024 at 11:27 AM Kevin Liu wrote: > Excited to see the first official release of the Apache Iceberg Go > library! > Thanks everyone for contributing! And thanks Matt & Fokko for working on > the

Re: [DISCUSS] - Deprecate Equality Deletes

2024-11-18 Thread Ajantha Bhat
Hi, What's the conclusion on this thread? Users are looking for Upsert (CDC) support for OSS Iceberg kafka connect sink. We only support appends at the moment. Can we go ahead and implement the upserts using equality deletes? - Ajantha On Sun, Nov 10, 2024 at 11:56 AM Vignesh wrote: > Hi, > I

Re: [DISCUSS] Removal of last-column-id of public API

2024-11-18 Thread Jean-Baptiste Onofré
Hi Fokko I think it makes sense to deprecate and remote the field. +1 Regards JB On Thu, Nov 14, 2024 at 10:01 AM Fokko Driesprong wrote: > > Hi everyone, > > While reviewing the TableMetadataBuilder PR on Iceberg-Rust the other day, I > noticed that it exposes the last-column-id to the publi

Re: [DISCUSS] REST: Way to query if metadata pointer is the latest

2024-11-18 Thread Yufei Gu
Hi Taeyun, Thank you for the clear explanation. I agree that the ETag solution is more suitable. If we were going that way, I'd propose a customized version number as an ETag—for instance, leveraging the metadata.json file name as the identifier. To summarize, HTTP caching relies on headers (e.g

Re: [DISCUSS] REST: Way to query if metadata pointer is the latest

2024-11-18 Thread Taeyun Kim
Hi, Here are my thoughts: - HTTP Layer: To my knowledge, there isn’t a separate "HTTP layer" in this context, so concerns about control over caching shouldn’t be an issue. The header approach I mentioned simply involves handling additional headers when using HTTP client libraries to interact w

Re: [DISCUSS] Removal of last-column-id of public API

2024-11-18 Thread Renjie Liu
+1 for deprecating and removing this. I don't think it makes sense to ask users to pass this while we can infer it from existing arguments. Also I see java code

Re: [ANNOUNCE] Apache Iceberg Go release v0.1.0

2024-11-18 Thread Kevin Liu
Excited to see the first official release of the Apache Iceberg Go library! Thanks everyone for contributing! And thanks Matt & Fokko for working on the release. Cheers, Kevin Liu On Mon, Nov 18, 2024 at 11:10 AM Matt Topol wrote: > Hi everyone, > > I'm pleased to announce the release of Apach

[ANNOUNCE] Apache Iceberg Go release v0.1.0

2024-11-18 Thread Matt Topol
Hi everyone, I'm pleased to announce the release of Apache Iceberg Go v0.1.0! Apache Iceberg is an open table format for huge analytic datasets, Iceberg delivers high query performance for tables with tens of petabytes of data, along with atomic commits, concurrent writes, and SQL-compatible tabl

Re: [VOTE] Release Apache PyIceberg 0.8.0rc2

2024-11-18 Thread Kevin Liu
Thanks everyone for voting! The 72 hours have passed, and a minimum of 3 binding votes have been cast: The vote passes with 3 non-binding +1 votes and 3 binding +1 votes and no -1 votes: non-binding: Kevin, Sung, Andre binding: Fokko, Honah, Daniel The release candidate has been accepted as PyIce

[ANNOUNCE] Apache PyIceberg release 0.8.0

2024-11-18 Thread Kevin Liu
Hi everyone, I'm pleased to announce the release of Apache PyIceberg 0.8.0! Apache Iceberg is an open table format for huge analytic datasets. Iceberg delivers high query performance for tables with tens of petabytes of data, along with atomic commits, concurrent writes, and SQL-compatible table

Re: [DISCUSS] REST: Way to query if metadata pointer is the latest

2024-11-18 Thread Zoltán Borók-Nagy
Hey Everyone, Thanks Gábor, I think the proposed interface would be very useful to any engine that employs caching, e.g. Impala. And it is pretty neat that it is catalog-agnostic, i.e. we just give all the information we have about the table and let the catalog implementation efficiently reload it

Re: [VOTE] Release Apache PyIceberg 0.8.0rc2

2024-11-18 Thread Daniel Weeks
+1 (binding) Verified sigs/sums/license/tests+s3 (Python 3.11.9) -Dan On Sat, Nov 16, 2024 at 4:03 PM André Luis Anastácio wrote: > +1 (non-binding) > > - verified signature and checksum > - verified license check > - ran install and some manual tests in python 3.11 > > André Anastácio > > On

Re: [VOTE][Go] Release Apache Iceberg Go v0.1.0 RC2

2024-11-18 Thread Matt Topol
Thanks everyone! The vote passes with 4 non-binding +1 votes and 3 binding +1 votes and no -1 votes: non-binding: Me, JB, Kevin Liu, Alex Dutra binding: Fokko, Eduard, Amogh We'll do the release shortly, thanks! --Matt On Sun, Nov 17, 2024 at 10:25 PM Amogh Jahagirdar <2am...@gmail.com> wrote:

Re: [DISCUSS] REST: Way to query if metadata pointer is the latest

2024-11-18 Thread Gabor Kaszab
Hi Iceberg Community, This is a great conversation so far, and thanks everyone for the valuable inputs! I'd like to articulate 2 things that we have to keep in mind with the design: *1: There are 2 interfaces here that we should consider:* What I mean by this is that so far we have been talking a