David Perkins created SPARK-47768: ------------------------------------- Summary: Data Source names unavailable when using Delta Share and Kafka SQL Key: SPARK-47768 URL: https://issues.apache.org/jira/browse/SPARK-47768 Project: Spark Issue Type: Bug Components: Input/Output Affects Versions: 3.5.1 Environment: I'm using Spark 3.5.1 on Kubernetes with the Spark operator.
My project includes these depenedencies: implementation 'org.apache.spark:spark-core_2.12:3.5.1' implementation 'org.apache.spark:spark-sql_2.12:3.5.1' implementation 'com.fasterxml.jackson.dataformat:jackson-dataformat-yaml:2.17.0' sparkConnectorShadowJar 'org.apache.spark:spark-sql-kafka-0-10_2.12:3.5.1' sparkConnectorShadowJar 'io.delta:delta-sharing-spark_2.12:3.1.0' The `sparkConnectorShadowJar` is packaged into a shadow jar and copied onto the 'apache/spark:3.5.1' docker image. Reporter: David Perkins I have a simple Spark application that is reading from a csv file via Delta Share and writing the contents to Kafka. When both the Delta Share Kafka SQL libraries are included in the project, Spark is unable to load them by their format short names. If I use one of them without the other, everything works fine. When both are included, then I get this root exception: ClassNotFoundException: deltaSharing.DefaultSource. If I specify the source class names ( io.delta.sharing.spark.DeltaSharingDataSource, org.apache.spark.sql.kafka010.KafkaSourceProvider) instead of the short names, it works correctly. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org