Hello folks,
I have a use case where I save two pyspark dataframes as parquet files and
then use them later to join with each other or with other tables and
perform multiple aggregations.
Since I know the column being used in the downstream joins and groupby, I
was hoping I could use
Hi scrypso,
Thanks for the help so far, and I think you're definitely on to something
here. I tried loading the class as you suggested with the code below:
try {
Thread.currentThread().getContextClassLoader().loadClass(MyS3ClientFactory.class.getCanonicalName());
logger.info("Loaded
UNSUBSCRIBE
UNSUBSCRIBE
yixu2...@163.com
UNSUBSCRIBE
Your DDL statement doesn't look right. You may want to check the Spark
SQL Reference online for how to create table in Hive format
(https://spark.apache.org/docs/latest/sql-ref-syntax-ddl-create-table-hiveformat.html).
You should be able to populate the table directly using CREATE by
I'm on my phone, so can't compare with the Spark source, but that looks to
me like it should be well after the ctx loader has been set. You could try
printing the classpath of the loader
Thread.currentThread().getThreadContextClassLoader(), or try to load your
class from that yourself to see if
Thanks for the response, scrypso! I will try adding the extraClassPath
option. Meanwhile, please find the full stack trace below (I have
masked/removed references to proprietary code)
java.lang.RuntimeException: java.lang.RuntimeException:
java.lang.ClassNotFoundException: Class
Two ideas you could try:
You can try spark.driver.extraClassPath as well. Spark loads the user's jar
in a child classloader, so Spark/Yarn/Hadoop can only see your classes
reflectively. Hadoop's Configuration should use the thread ctx classloader,
and Spark should set that to the loader that
Missed to mention it above, but just to add, the error is coming from the
driver. I tried using *--driver-class-path /path/to/my/jar* as well, but no
luck.
Thanks!
On Mon, Dec 12, 2022 at 4:21 PM Hariharan wrote:
> Hello folks,
>
> I have a spark app with a custom implementation of
>
10 matches
Mail list logo