Hi devs, AFAIK, hudi has been designed to have primary keys in the hudi's key. However it is possible to also choose a non unique field. I have listed several trouble with such design:
Non unique key yield to : - cannot delete / update a unique record - cannot apply primary key for new sql tables feature Is there other downsides to choose a non unique key you have in mind ? In my case, having user_id as a hudi key will help to apply deletion on the user level in any user table. The table are insert only, so the drawbacks listed above do not really apply. In case of error in the tables I have several options: - rollback to a previous commit - read partition/filter overwrite partition Thanks
