nsivabalan commented on issue #5492:
URL: https://github.com/apache/hudi/issues/5492#issuecomment-1302887335
@ashah-lightbox : gentle ping. any updates please. if you got the issue
resolved, can we close it out.
--
This is an automated message from the Apache Git Service.
To respond
nsivabalan commented on issue #5492:
URL: https://github.com/apache/hudi/issues/5492#issuecomment-1244809050
also, I see you have used 2 diff scripts in both.
Can you try your EMR script
https://gist.github.com/ashays83/6beaf642bd55b4c46292b8f382d0088b also in
docker and share what you
nsivabalan commented on issue #5492:
URL: https://github.com/apache/hudi/issues/5492#issuecomment-1244808566
@ashah-lightbox : sorry to have dropped the ball on this. I can help me
understand what you mean by docker(case 2)? is it hdfs that you are using
within docker?
when you say EMR(
nsivabalan commented on issue #5492:
URL: https://github.com/apache/hudi/issues/5492#issuecomment-1125508485
my hunch is that setting nullable did not work as expected. Can you do
df1.printSchema before ingesting to hudi and confirm that nullable for
hoodie_is_deleted is set to true in both
nsivabalan commented on issue #5492:
URL: https://github.com/apache/hudi/issues/5492#issuecomment-1124401057
w/ docker, can you ensure the operation is "upsert" and save mode is
"Append".
--
This is an automated message from the Apache Git Service.
To respond to the message, please l
nsivabalan commented on issue #5492:
URL: https://github.com/apache/hudi/issues/5492#issuecomment-1124400786
hmmm, interesting. this is the first time I am hearing someone saying that
they are seeing diff behavior in emr and docker using same script in
spark-datasource.
Can you paste th