[GitHub] [hudi] kasured commented on issue #5298: [SUPPORT] File is deleted during inline compaction on MOR table causing subsequent FileNotFoundException on a reader

GitBox Sat, 16 Apr 2022 10:41:48 -0700


kasured commented on issue #5298:
URL: https://github.com/apache/hudi/issues/5298#issuecomment-1100717281


   @nsivabalan Sure, let me provide more details. There is a StreamingQuery 
entity which s started by Spark to consume the stream. This is basically what 
we use and described here 
https://hudi.apache.org/docs/compaction#spark-structured-streaming
   
   So what we do is we create multiple StreamingQuery streams and start them. 
Each of them though consumes from single kafka topic and writes to single Hudi 
table. So it is `3 different streaming pipeline writing to 3 diff hudi table 
but using same spark session` with the only exception that we use 3 different 
SparkSession objects. Each of them are reusing single sparkContext which is 
okay as there should be only one spark context per jvm.
   
   As to 4753 I have already specified it in the section **Possibly Related 
Issues** HUDI-3370. However, from what I checked it is related to metadata 
service which we do not use "hoodie.metadata.enable" = "false". May it also be 
relevant even if we do not use metadata table? I am asking cause we are using 
0.9.0 from Amazon and I will need to replace it with the one with patch  


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[GitHub] [hudi] kasured commented on issue #5298: [SUPPORT] File is deleted during inline compaction on MOR table causing subsequent FileNotFoundException on a reader

Reply via email to