KevinyhZou commented on code in PR #12327:
URL: https://github.com/apache/gluten/pull/12327#discussion_r3456972203


##########
gluten-flink/runtime/src/main/java/org/apache/gluten/table/runtime/operators/GlutenOneInputOperator.java:
##########
@@ -238,16 +260,31 @@ public void processWatermark2(Watermark mark) throws Exception {
 
   @Override
   public void close() throws Exception {
-    if (task != null) {
-      task.close();
-    }
-    if (inputQueue != null) {
-      inputQueue.noMoreInput();
-      inputQueue.close();
-    }
-    if (sessionResource != null) {
-      sessionResource.close();
-    }
+    GlutenCloseables.runWithCleanup(
+        () -> {

Review Comment:
   `GlutenCloseables.runWithCleanup` seems to flush the in-memory data to the
filesystem, but the offsets for that data may not have been committed to the MQ
yet. If the job then fails over or restarts, the source would replay from the
last committed offset and the data would be duplicated.
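
To make the concern concrete, here is a toy simulation. It is not Gluten or Flink code: `CloseFlushSketch`, `BufferingSink`, `runAttempt`, and `simulate` are hypothetical names, a plain list stands in for the filesystem, and an integer stands in for the committed MQ offset. It only assumes that `close()` flushes buffered rows while the offset stays where the last completed checkpoint left it.

```java
import java.util.ArrayList;
import java.util.List;

public class CloseFlushSketch {

  /** Toy sink: buffers rows in memory and appends them to "files" on flush. */
  static final class BufferingSink {
    private final List<Integer> buffer = new ArrayList<>();
    private final List<Integer> files; // stands in for the filesystem; survives restarts

    BufferingSink(List<Integer> files) {
      this.files = files;
    }

    void write(int row) {
      buffer.add(row);
    }

    void flush() {
      files.addAll(buffer);
      buffer.clear();
    }
  }

  /** Reads source offsets [committedOffset, stopAt), then optionally flushes the buffer. */
  static void runAttempt(List<Integer> files, int committedOffset, int stopAt, boolean flush) {
    BufferingSink sink = new BufferingSink(files);
    for (int offset = committedOffset; offset < stopAt; offset++) {
      sink.write(offset);
    }
    if (flush) {
      sink.flush();
    }
  }

  /** Simulates a failed attempt followed by a restart from the same committed offset. */
  static List<Integer> simulate(boolean flushOnClose) {
    List<Integer> files = new ArrayList<>();
    int committedOffset = 0; // no checkpoint completed, so the offset never advances

    // Attempt 1 fails after three rows; close() runs and may flush the buffer.
    runAttempt(files, committedOffset, 3, flushOnClose);

    // Attempt 2 replays from the same offset and flushes as part of a completed checkpoint.
    runAttempt(files, committedOffset, 3, true);
    return files;
  }

  public static void main(String[] args) {
    System.out.println("flush on close:    " + simulate(true));
    System.out.println("no flush on close: " + simulate(false));
  }
}
```

With flush-on-close, rows 0 to 2 land in the output twice; without it, they land once. Whether the real operator behaves this way depends on what the cleanup path actually flushes, which is what I would like to confirm.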



##########
gluten-flink/ut/src/test/resources/nexmark/q10.sql:
##########
@@ -9,7 +9,7 @@ CREATE TABLE nexmark_q10 (
 ) PARTITIONED BY (dt, hm) WITH (
   'connector' = 'filesystem',
   'path' = 'file:///tmp/data/output/bid/',
-  'format' = 'csv',
+  'format' = 'orc',

Review Comment:
   Maybe we can add another SQL file, `q10_orc`, to test this? Or change the
format value in `NexmarkTest` to test this?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

