liming30 opened a new issue, #846:
URL: https://github.com/apache/incubator-paimon/issues/846

   ### Search before asking
   
   - [X] I searched in the 
[issues](https://github.com/apache/incubator-paimon/issues) and found nothing 
similar.
   
   
   ### Paimon version
   
   0.4-SNAPSHOT
   
   ### Compute Engine
   
   Flink-1.16
   
   ### Minimal reproduce step
   
   1. Set `scan.mode` to `from-snapshot` or `from-snapshot-full`, and set 
`scan.snapshot-id` correctly.
   2. Set `log.system` to `kafka`
   3. Start the job in streaming mode.
   
   ### What doesn't meet your expectations?
   
   The job only reads the latest data from the `logSourceProvider`, and the 
historical snapshot's data is lost. 
   **related 
code**:https://github.com/apache/incubator-paimon/blob/master/paimon-flink/paimon-flink-common/src/main/java/org/apache/paimon/flink/source/FlinkSourceBuilder.java#L148
   
   I think the current `LogSourceProvider` is not compatible with stream 
reading from the specified snapshot, we should do a pre-check and throw an 
exception.
   
   ### Anything else?
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [X] I'm willing to submit a PR!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to