fujianhua168 commented on issue #8754: URL: https://github.com/apache/hudi/issues/8754#issuecomment-1562377431
> Here's a branch in my Trino fork which has MOR Snapshot query support - https://github.com/codope/trino/tree/mor-snapshot-async-split When I performed data validation check on the branch, I found a strange error: after the flink yarn-session process was suddenly interrupted, I executed the sql statement again through the trino branch( but spark3 read this hudi table is normal) , and an error would be generated: Query 20230525_ 063757_ 00006_ bdgnm failed: Index 0 out of bounds for length 0. error sql statement: select count(1) as cnt from hudi_demo.ods_pub_log_qt_collect_mor_d; Here is then error Stack Trace: java.lang.IndexOutOfBoundsException: Index 0 out of bounds for length 0 at java.base/jdk.internal.util.Preconditions.outOfBounds(Preconditions.java:64) at java.base/jdk.internal.util.Preconditions.outOfBoundsCheckIndex(Preconditions.java:70) at java.base/jdk.internal.util.Preconditions.checkIndex(Preconditions.java:266) at java.base/java.util.Objects.checkIndex(Objects.java:361) at java.base/java.util.ArrayList.get(ArrayList.java:427) at io.trino.plugin.hudi.HudiUtil.getHudiBaseFile(HudiUtil.java:176) at io.trino.plugin.hudi.HudiRecordCursor.createRealtimeRecordCursor(HudiRecordCursor.java:71) at io.trino.plugin.hudi.HudiPageSourceProvider.createPageSource(HudiPageSourceProvider.java:193) at io.trino.plugin.base.classloader.ClassLoaderSafeConnectorPageSourceProvider.createPageSource(ClassLoaderSafeConnectorPageSourceProvider.java:49) at io.trino.split.PageSourceManager.createPageSource(PageSourceManager.java:62) at io.trino.operator.TableScanOperator.getOutput(TableScanOperator.java:298) at io.trino.operator.Driver.processInternal(Driver.java:402) at io.trino.operator.Driver.lambda$process$8(Driver.java:305) at io.trino.operator.Driver.tryWithLock(Driver.java:701) at io.trino.operator.Driver.process(Driver.java:297) at io.trino.operator.Driver.processForDuration(Driver.java:268) at io.trino.execution.SqlTaskExecution$DriverSplitRunner.processFor(SqlTaskExecution.java:888) at io.trino.execution.executor.PrioritizedSplitRunner.process(PrioritizedSplitRunner.java:187) at io.trino.execution.executor.TaskExecutor$TaskRunner.run(TaskExecutor.java:561) at io.trino.$gen.Trino_359_8548_g9e61519_dirty____20230525_055552_2.run(Unknown Source) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) at java.base/java.lang.Thread.run(Thread.java:833) ![image](https://github.com/apache/hudi/assets/5379274/97397f2a-96ad-4a23-b7a4-1d9b6c0375d0) ![image](https://github.com/apache/hudi/assets/5379274/e56aa65d-5eeb-48b3-b9ea-f9e9b0705abe) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org