Hello,

I'm trying to understand the purpose of the rowid column inside ORC delta
file
{"transactionid":11359,"bucketid":5,"*rowid*":0}
Orc view: {"operation":0,"originalTransaction":11359,"bucket":5,"*rowId*
":0,"currentTransaction":11359,"row":...}
I use HDP 2.6 => Hive 2

If I want to be idempotent with INSERT / DELETE / INSERT.
Do we have to keep the same rowid ?
It seems that when the rowid is changed during the second INSERT I have a
duplicate row.
For me, I can create a new rowid for the new transaction during the second
INSERT but that seems to generate duplicate records.

Regards,
David

Reply via email to