Raymond Xu created HUDI-4549:
--------------------------------

Summary: hive sync bundle causes class loader issue
Key: HUDI-4549
URL: https://issues.apache.org/jira/browse/HUDI-4549
Project: Apache Hudi
Issue Type: Bug
Components: dependencies
Reporter: Raymond Xu
Fix For: 0.12.0

A weird classpath issue I found: when testing Deltastreamer with hudi-utilities-slim-bundle, if I pass --jars hudi-hive-sync-bundle.jar,hudi-spark-bundle.jar, I get this error when writing:

{code:java}
Caused by: java.lang.NoSuchMethodError: org.apache.hudi.avro.MercifulJsonConverter.convert(Ljava/lang/String;Lorg/apache/avro/Schema;)Lorg/apache/avro/generic/GenericRecord;
	at org.apache.hudi.utilities.sources.helpers.AvroConvertor.fromJson(AvroConvertor.java:86)
	at org.apache.spark.api.java.JavaPairRDD$.$anonfun$toScalaFunction$1(JavaPairRDD.scala:1070)
{code}

If I put the spark bundle before the hive sync bundle, there is no issue. Without hive-sync-bundle, there is also no issue. So hive-sync-bundle somehow messes up the classpath? Not sure why it reports a hudi-common API as not found… caused by shading avro?

I observed the same behavior with aws-bundle, which makes sense, as it is a superset of hive-sync-bundle.
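
One way to narrow this down is to check which jar the running JVM actually resolves MercifulJsonConverter from under each --jars ordering. The snippet below is only a rough diagnostic sketch, not part of Hudi: the ClassOrigin class and its locate method are hypothetical names made up for illustration, and it relies only on the standard Class.forName and ProtectionDomain.getCodeSource calls. It assumes it runs with the same classpath as the failing job (for example, invoked from the driver); Spark may use its own class loaders, so the thread context class loader is tried first.

```java
import java.security.CodeSource;

// Hypothetical helper (not a Hudi class): reports where a class was loaded from.
final class ClassOrigin {

    // Returns the code source location (typically a jar URL) for the given class name,
    // or a short parenthesized note if the class cannot be loaded or has no code source.
    static String locate(String className) {
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        if (loader == null) {
            loader = ClassOrigin.class.getClassLoader();
        }
        try {
            // 'false' avoids running static initializers; we only need the Class object.
            Class<?> cls = Class.forName(className, false, loader);
            CodeSource src = cls.getProtectionDomain().getCodeSource();
            if (src == null) {
                return "(no code source; likely loaded by the bootstrap loader)";
            }
            return String.valueOf(src.getLocation());
        } catch (ClassNotFoundException | LinkageError e) {
            return "(not loadable: " + e + ")";
        }
    }

    public static void main(String[] args) {
        // Defaults to the class named in the NoSuchMethodError above.
        String name = args.length > 0 ? args[0] : "org.apache.hudi.avro.MercifulJsonConverter";
        System.out.println(name + " -> " + locate(name));
    }
}
```

If the printed location is hudi-hive-sync-bundle.jar when that bundle is listed first, and hudi-spark-bundle.jar when the order is swapped, that would be consistent with the hive sync bundle carrying its own copy of the class whose convert signature differs (for example, because avro is relocated there), but that remains a guess until checked.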