liujinhui1994 commented on pull request #2438:
URL: https://github.com/apache/hudi/pull/2438#issuecomment-1073504190


   The purpose of introducing timestamps: Mainly when users want to consume 
from a certain location, deltastreamer can only specify checkpoint sites in the 
past. For example, kafka may have 50+ partitions, and users need to manually 
configure the checkpoint string. Introducing this simplifies this operation
   
   Regarding your example: I think you are right and agree with your idea. 
Partition 2 should not be populated with this value.
   At that time, the main consideration of this PR was to solve the problem of 
complex user configuration. It can simplify consumption data as much as 
possible. This example of partition 2 makes sense for some businesses. Maybe 
your current scenario may be a bit contradictory, and I feel like we can 
improve it and make it better


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to