[GitHub] [hudi] nsivabalan commented on issue #5492: _hoodie_is_delete works differently on hudi spark datasource on docker compare to hudi on emr.

2022-11-03 Thread GitBox
nsivabalan commented on issue #5492: URL: https://github.com/apache/hudi/issues/5492#issuecomment-1302887335 @ashah-lightbox : gentle ping. any updates please. if you got the issue resolved, can we close it out. -- This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] nsivabalan commented on issue #5492: _hoodie_is_delete works differently on hudi spark datasource on docker compare to hudi on emr.

2022-09-12 Thread GitBox
nsivabalan commented on issue #5492: URL: https://github.com/apache/hudi/issues/5492#issuecomment-1244809050 also, I see you have used 2 diff scripts in both. Can you try your EMR script https://gist.github.com/ashays83/6beaf642bd55b4c46292b8f382d0088b also in docker and share what you

[GitHub] [hudi] nsivabalan commented on issue #5492: _hoodie_is_delete works differently on hudi spark datasource on docker compare to hudi on emr.

2022-09-12 Thread GitBox
nsivabalan commented on issue #5492: URL: https://github.com/apache/hudi/issues/5492#issuecomment-1244808566 @ashah-lightbox : sorry to have dropped the ball on this. I can help me understand what you mean by docker(case 2)? is it hdfs that you are using within docker? when you say EMR(

[GitHub] [hudi] nsivabalan commented on issue #5492: _hoodie_is_delete works differently on hudi spark datasource on docker compare to hudi on emr.

2022-05-12 Thread GitBox
nsivabalan commented on issue #5492: URL: https://github.com/apache/hudi/issues/5492#issuecomment-1125508485 my hunch is that setting nullable did not work as expected. Can you do df1.printSchema before ingesting to hudi and confirm that nullable for hoodie_is_deleted is set to true in both

[GitHub] [hudi] nsivabalan commented on issue #5492: _hoodie_is_delete works differently on hudi spark datasource on docker compare to hudi on emr.

2022-05-11 Thread GitBox
nsivabalan commented on issue #5492: URL: https://github.com/apache/hudi/issues/5492#issuecomment-1124401057 w/ docker, can you ensure the operation is "upsert" and save mode is "Append". -- This is an automated message from the Apache Git Service. To respond to the message, please l

[GitHub] [hudi] nsivabalan commented on issue #5492: _hoodie_is_delete works differently on hudi spark datasource on docker compare to hudi on emr.

2022-05-11 Thread GitBox
nsivabalan commented on issue #5492: URL: https://github.com/apache/hudi/issues/5492#issuecomment-1124400786 hmmm, interesting. this is the first time I am hearing someone saying that they are seeing diff behavior in emr and docker using same script in spark-datasource. Can you paste th