vinishjail97 opened a new pull request, #10736:
URL: https://github.com/apache/hudi/pull/10736

   ### Change Logs
   
   There were test failures in the original PR and had to be reverted, bringing 
back the change and fixed the tests now. 
   https://github.com/apache/hudi/pull/10687 
   
   Introducing a new class known as SourceProfile which contains details about 
how the next sync round in StreamSync should be consumed. For eg:
   
   `KafkaSourceProfile` contains number of events to consume in this sync round.
   `S3SourceProfile` contains the list of files to consume in this sync round
   `HudiIncrementalSourceProfile` contains the beginInstant and endInstant 
commit times to consume in this sync round.
   In future we can add the method for choosing the writeOperationType and 
indexType as well, for `sourceProfile.getSourceSpecificContext()` will be used 
to consume the data from the source.
   
   ### Impact
   
   No change in public API's, Option has been used to define the new field in 
the constructors and previous constructors are backwards compatible.
   
   ### Risk level (write none, low medium or high below)
   
   Low
   
   ### Documentation Update
   
   None, this is just adding an optional interface that can be used to consume 
and write data in StreamSync utility.
   
   ### Contributor's checklist
   
   - [x] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [x] Change Logs and Impact were stated clearly
   - [x] Adequate tests were added if applicable
   - [ ] CI passed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to