[ https://issues.apache.org/jira/browse/HUDI-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Balaji Varadarajan updated HUDI-1091: ------------------------------------- Status: Open (was: New) > Handle empty input batch gracefully in ParquetDFSSource > ------------------------------------------------------- > > Key: HUDI-1091 > URL: https://issues.apache.org/jira/browse/HUDI-1091 > Project: Apache Hudi > Issue Type: Bug > Components: DeltaStreamer > Reporter: Balaji Varadarajan > Priority: Blocker > Fix For: 0.6.0 > > > [https://github.com/apache/hudi/issues/1813] > Looking at 0.5.3, it is possible the below exception can happen when running > in standalone mode and the next batch to write is empty. > ERROR HoodieDeltaStreamer: Got error running delta sync once. Shutting down > org.apache.hudi.exception.HoodieException: Please provide a valid schema > provider class! at > org.apache.hudi.utilities.sources.InputBatch.getSchemaProvider(InputBatch.java:53) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.readFromSource(DeltaSync.java:312) > at > org.apache.hudi.utilities.deltastreamer.DeltaSync.syncOnce(DeltaSync.java:226) > at > org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.sync(HoodieDeltaStreamer.java:121) > at > org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.main(HoodieDeltaStreamer.java:294) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:498) at > org.apache.spark.deploy.JavaMainApplication.start(SparkApplication.scala:52) > at > org.apache.spark.deploy.SparkSubmit.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:853) > at org.apache.spark.deploy.SparkSubmit.doRunMain$1(SparkSubmit.scala:161) at > org.apache.spark.deploy.SparkSubmit.submit(SparkSubmit.scala:184) at > org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:86) at > org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:928) > at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:937) at > org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala) -- This message was sent by Atlassian Jira (v8.3.4#803005)