[GitHub] [hudi] tandonraghav commented on issue #3078: [SUPPORT] combineAndGetUpdateValue is not getting called when Schema evolution happens

2021-06-29 Thread GitBox


tandonraghav commented on issue #3078:
URL: https://github.com/apache/hudi/issues/3078#issuecomment-870831328


   @fanaticjo ok, but you should implement `preCombine` also, as the logic 
should be uniform, might be that is the reason?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] tandonraghav commented on issue #3078: [SUPPORT] combineAndGetUpdateValue is not getting called when Schema evolution happens

2021-06-29 Thread GitBox


tandonraghav commented on issue #3078:
URL: https://github.com/apache/hudi/issues/3078#issuecomment-870826823


   @fanaticjo As per the above comments, you should also implement `preCombine` 
method.
   And also do u faced the same problem while Schema Evolution? or something 
else?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] tandonraghav commented on issue #3078: [SUPPORT] combineAndGetUpdateValue is not getting called when Schema evolution happens

2021-06-27 Thread GitBox


tandonraghav commented on issue #3078:
URL: https://github.com/apache/hudi/issues/3078#issuecomment-869211898


   @guanziyue Well, in my testing combinAndGetUpdateValue in not getting called 
at all. I am not worried about how compaction is happening for delta records.
   Do you have a reference implementation?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] tandonraghav commented on issue #3078: [SUPPORT] combineAndGetUpdateValue is not getting called when Schema evolution happens

2021-06-22 Thread GitBox


tandonraghav commented on issue #3078:
URL: https://github.com/apache/hudi/issues/3078#issuecomment-866269573


   @n3nash I am trying to implement exactly 
`https://github.com/apache/hudi/pull/2106/files` , in this also 
`PartialAvroPayload` class only implements `combineAndGetUpdateValue` .
   Also, i do not have `Schema` in preCombine method. 
   Also, i am saying if the record preexists in (.parquet) and if i add the 
field in incoming records (log file .avro files) then on compaction 
`combineAndGetUpdateValue` is not at all getting called but this doesnt happen 
if the schema is unchanged.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] tandonraghav commented on issue #3078: [SUPPORT] combineAndGetUpdateValue is not getting called when Schema evolution happens

2021-06-20 Thread GitBox


tandonraghav commented on issue #3078:
URL: https://github.com/apache/hudi/issues/3078#issuecomment-864611953


   @n3nash I am not able to resolve this. `preCombine` can be called even if 
there are no duplicates in the incoming records?
   Can you please explain with a sample ?
   
   I fail to understand how will i combine the existing record if it doesn't 
call the method `combineAndGetUpdateValue` as nowhere else i have ref to 
`IndexedRecord currentValue`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] tandonraghav commented on issue #3078: [SUPPORT] combineAndGetUpdateValue is not getting called when Schema evolution happens

2021-06-16 Thread GitBox


tandonraghav commented on issue #3078:
URL: https://github.com/apache/hudi/issues/3078#issuecomment-862168089


   @n3nash I thought `preCombine` is only used to dedup incoming records.
   Then why `OverwriteNonDefaultsWithLatestAvroPayload` is not overriding 
`preCombine`? 
   As well as in this [PR](https://github.com/apache/hudi/pull/3035/files)
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org