Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-31 Thread via GitHub


nicholasxu commented on issue #10539:
URL: https://github.com/apache/hudi/issues/10539#issuecomment-1920406638

   > @nicholasxu Closing out this issue. Please reopen or create a new one in 
case of any further queries/issues. Thanks.
   
   ok!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-31 Thread via GitHub


codope closed issue #10539: [SUPPORT] Flink streaming read MOR table, thrown 
Unexpected cdc file split infer case: LOG_FILE Exception 
URL: https://github.com/apache/hudi/issues/10539





Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-31 Thread via GitHub


ad1happy2go commented on issue #10539:
URL: https://github.com/apache/hudi/issues/10539#issuecomment-1918968410

   @nicholasxu Closing out this issue. Please reopen or create a new one in 
case of any further queries/issues. Thanks.





Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-23 Thread via GitHub


nicholasxu commented on issue #10539:
URL: https://github.com/apache/hudi/issues/10539#issuecomment-1907345218

   > Changelog mode is meant for an upstream data source that is already a 
changelog stream (each row carries an insert or delete operation), so Hudi can 
keep these operations as they are and maintain good write throughput. CDC mode 
is more general and works for all data sources, but Hudi itself maintains the 
changes, so it is more costly, and for Flink only COW tables are currently supported.
   
   Thx, danny





Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-20 Thread via GitHub


danny0405 commented on issue #10539:
URL: https://github.com/apache/hudi/issues/10539#issuecomment-1902478208

   Changelog mode is meant for an upstream data source that is already a 
changelog stream (each row carries an insert or delete operation), so Hudi can 
keep these operations as they are and maintain good write throughput. CDC mode 
is more general and works for all data sources, but Hudi itself maintains the 
changes, so it is more costly, and for Flink only COW tables are currently supported.
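
   The constraints described in this thread can be sketched as a small Python check that runs before a Flink SQL DDL is generated. This is an illustrative sketch, not Hudi code: `check_change_options` and `build_ddl` are hypothetical helpers, the path and column definitions are placeholders, and the option keys other than `changelog.enabled` and `cdc.enabled` (`connector`, `path`, `table.type`, `read.streaming.enabled`) are assumed to match the Hudi Flink options in your version.

```python
def check_change_options(options: dict) -> None:
    """Hypothetical helper: reject option maps that violate the thread's constraints."""
    changelog = options.get("changelog.enabled", "false") == "true"
    cdc = options.get("cdc.enabled", "false") == "true"
    # Only one of the two options is supported at a time.
    if changelog and cdc:
        raise ValueError("enable either changelog.enabled or cdc.enabled, not both")
    # Per the thread, cdc mode with Flink only supports COW tables for now.
    if cdc and options.get("table.type") == "MERGE_ON_READ":
        raise ValueError("cdc.enabled is reported to support only COW tables with Flink")


def build_ddl(table: str, columns: str, options: dict) -> str:
    """Hypothetical helper: render a CREATE TABLE statement after validating options."""
    check_change_options(options)
    with_clause = ",\n".join(f"  '{k}' = '{v}'" for k, v in options.items())
    return f"CREATE TABLE {table} (\n{columns}\n) WITH (\n{with_clause}\n)"


# MOR table read as a stream: use changelog mode rather than cdc mode.
mor_changelog = {
    "connector": "hudi",
    "path": "file:///tmp/hudi/t1",  # placeholder path
    "table.type": "MERGE_ON_READ",
    "read.streaming.enabled": "true",
    "changelog.enabled": "true",
}

ddl = build_ddl(
    "t1",
    "  id INT PRIMARY KEY NOT ENFORCED,\n  name STRING",  # placeholder columns
    mor_changelog,
)
print(ddl)
```

   Passing a map with both options set to `'true'`, or with `cdc.enabled` on a `MERGE_ON_READ` table, raises `ValueError` instead of producing a DDL that would fail at read time.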





Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-19 Thread via GitHub


nicholasxu commented on issue #10539:
URL: https://github.com/apache/hudi/issues/10539#issuecomment-1901814817

   > Enable either `changelog.enabled` or `cdc.enabled`; only one option is 
supported at a time.
   
   I tried both. Enabling only `changelog.enabled` works, but enabling only `cdc.enabled` does not.
   
   The latter fails in two scenarios:
   1. When no base file has been generated, it throws 
java.lang.IllegalArgumentException: Can not create a Path from an empty string 
(https://github.com/apache/hudi/issues/10458)
   2. When base files have already been generated, it still throws AssertionError: 
Unexpected cdc file split infer case: LOG_FILE





Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-19 Thread via GitHub


nicholasxu commented on issue #10539:
URL: https://github.com/apache/hudi/issues/10539#issuecomment-1901778415

   > cdc.enabled
   
   @danny0405 Thx! So what's the difference between them?
   I just followed the official example:
   
![image](https://github.com/apache/hudi/assets/12593964/893b7ea0-f44a-4f34-b4e8-c5653176e98f)
   





Re: [I] [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception [hudi]

2024-01-19 Thread via GitHub


danny0405 commented on issue #10539:
URL: https://github.com/apache/hudi/issues/10539#issuecomment-1901628057

   Enable either `changelog.enabled` or `cdc.enabled`; only one option is 
supported at a time.

