Re: [PR] [core] Support convert TopN to limit for primary-key table in deletion-vector mode [paimon]

via GitHub Mon, 08 Sep 2025 21:07:10 -0700


JingsongLi commented on PR #6193:
URL: https://github.com/apache/paimon/pull/6193#issuecomment-3268793819


   > > I remember that when DV is turned on, the returned rows are out of order, not sorted by primary key.
   > 
   > Why would it be out of order? IMO, the rows within a single DataFile are sorted by primary key, and the deletion vector only marks the row ids of its deleted rows.
   > 
   > The main idea of this PR is to convert a TopN on the primary keys into a limit when reading a single DataFile; the compute engine (e.g. Apache Spark) then performs the global TopN.
   
   I see. For a single data file, that is true.
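
To make the idea discussed above concrete, here is a minimal sketch of the two steps: a per-file limit over rows that are already sorted by primary key, followed by an engine-side global TopN. It does not use Paimon's actual reader API. `DataFile`, `readWithLimit`, and `globalTopN` are hypothetical names invented for illustration, the primary key is simplified to an `int`, and the sketch assumes ascending order and that each key has at most one live row across files.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.BitSet;
import java.util.Collections;
import java.util.List;

public class TopNToLimitSketch {

    /** Hypothetical stand-in for one data file: keys stored in ascending order,
     *  plus a deletion vector that marks deleted row positions. */
    static final class DataFile {
        final int[] sortedKeys;
        final BitSet deleted = new BitSet();

        DataFile(int[] sortedKeys, int... deletedPositions) {
            this.sortedKeys = sortedKeys;
            for (int pos : deletedPositions) {
                deleted.set(pos);
            }
        }
    }

    /** Per-file step: scan rows in stored (sorted) order, skip positions marked
     *  in the deletion vector, and stop as soon as n live rows were produced. */
    static List<Integer> readWithLimit(DataFile file, int n) {
        List<Integer> out = new ArrayList<>();
        for (int pos = 0; pos < file.sortedKeys.length && out.size() < n; pos++) {
            if (!file.deleted.get(pos)) {
                out.add(file.sortedKeys[pos]);
            }
        }
        return out;
    }

    /** Engine-side step (assumed to run in e.g. Spark): combine the per-file
     *  candidates and keep the n smallest keys overall. */
    static List<Integer> globalTopN(List<DataFile> files, int n) {
        List<Integer> candidates = new ArrayList<>();
        for (DataFile file : files) {
            candidates.addAll(readWithLimit(file, n));
        }
        Collections.sort(candidates);
        return new ArrayList<>(candidates.subList(0, Math.min(n, candidates.size())));
    }

    public static void main(String[] args) {
        DataFile a = new DataFile(new int[] {1, 4, 7, 9}, 0); // position 0 (key 1) is deleted
        DataFile b = new DataFile(new int[] {2, 3, 8}, 1);    // position 1 (key 3) is deleted
        System.out.println(globalTopN(Arrays.asList(a, b), 3)); // prints [2, 4, 7]
    }
}
```

Each file contributes at most n rows, so the engine sorts at most n rows per file instead of every row in the table. The result is only correct because the order within each file matches the requested order, which is the single-file property discussed above.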


