[GitHub] [hudi] ad1happy2go commented on issue #9471: [SUPPORT] When using Deltasteamer JdbcSource to extract data, there are issues with data loss and slow query of source side data

2023-08-29 Thread via GitHub


ad1happy2go commented on issue #9471:
URL: https://github.com/apache/hudi/issues/9471#issuecomment-1697755515

   @yihao-tcf Thanks. Feel free to contribute if you are interested, and feel 
free to reach out to me on the Hudi community Slack (Aditya Goenka). 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [hudi] ad1happy2go commented on issue #9471: [SUPPORT] When using Deltasteamer JdbcSource to extract data, there are issues with data loss and slow query of source side data

2023-08-22 Thread via GitHub


ad1happy2go commented on issue #9471:
URL: https://github.com/apache/hudi/issues/9471#issuecomment-1687945459

   @yihao-tcf Thanks for raising this. I understand your concern. Ideally, 
run-once mode (without --continuous) should also handle incremental fetch 
according to the source limit. We need a fix here.
   
   





[GitHub] [hudi] ad1happy2go commented on issue #9471: [SUPPORT] When using Deltasteamer JdbcSource to extract data, there are issues with data loss and slow query of source side data

2023-08-18 Thread via GitHub


ad1happy2go commented on issue #9471:
URL: https://github.com/apache/hudi/issues/9471#issuecomment-1683999061

   @yyh2954360585 JDBC is slow and puts a lot of load on the source system, so 
a full query on a large table can cause high load or even downtime on the 
database server. You can set the value of source-limit according to your 
dataset and requirements. You can even set it to a very high value.
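
   To illustrate the general idea of bounding each pull with a limit, here is a 
minimal sketch in Python using an in-memory SQLite table. It is not Hudi code: 
the table `orders`, the column `updated_at`, and the helper `fetch_batch` are 
hypothetical names, and the query shape is an assumption about how a capped 
incremental fetch might look, not a description of what JdbcSource runs.

```python
import sqlite3


def fetch_batch(conn, checkpoint, source_limit):
    """Fetch at most source_limit rows newer than checkpoint (hypothetical helper)."""
    rows = conn.execute(
        # Assumes updated_at values are unique. With a strict '>' filter, rows that
        # share the boundary value of a full batch would be skipped on the next pull.
        "SELECT id, updated_at FROM orders "
        "WHERE updated_at > ? ORDER BY updated_at LIMIT ?",
        (checkpoint, source_limit),
    ).fetchall()
    # Advance the checkpoint to the largest value seen, or keep it if nothing came back.
    new_checkpoint = rows[-1][1] if rows else checkpoint
    return rows, new_checkpoint


# Hypothetical source table with seven rows (updated_at = 10, 20, ..., 70).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, updated_at INTEGER)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(i, i * 10) for i in range(1, 8)])

# Pull repeatedly with a limit of 3 rows per round until the source is drained.
checkpoint = 0
batches = []
while True:
    rows, checkpoint = fetch_batch(conn, checkpoint, 3)
    if not rows:
        break
    batches.append(rows)
```

   Each round only asks the database for a small slice, so the load on the 
source stays bounded. A larger limit means fewer rounds but heavier queries.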
   
   

