Hi All, I'd like to share a design document aiming to be the basis of a new replication methodology in Kudu, that we are working on with Marton Greber. The basic idea is to create a Flink job to handle the scheduling/resource management aspect and initially use Kudu's diffscan to move data between two clusters. Please find the details in the following document: https://docs.google.com/document/d/1oaAn_cOY7aKth0C6MbNXgKU3R-PYols-V4got-_gpDk/edit?usp=sharing
Any and all comments and reviews are welcome! Best regards, Zoltan