
In my spark-submit to an EKS cluster, I use the standard command to submit
to the cluster, as below:

spark-submit --verbose \
   --master k8s://$KUBERNETES_MASTER_IP:443 \
   --deploy-mode cluster \
   --name sparkOnEks \
   --py-files local://$CODE_DIRECTORY/spark_on_eks.zip \

In Google Kubernetes Engine (GKE) I simply load them from a gs:// storage
bucket, and it works fine.

I am getting the following error in the driver pod:

    + CMD=("$SPARK_HOME/bin/spark-submit" --conf "spark.driver.bindAddress=$SPARK_DRIVER_BIND_ADDRESS" --deploy-mode client "$@")
    + exec /usr/bin/tini -s -- /opt/spark/bin/spark-submit --conf spark.driver.bindAddress= --deploy-mode client --properties-file /opt/spark/conf/spark.properties --class
    23/04/11 23:07:23 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
    /usr/bin/python3: can't open file
    [Errno 2] No such file or directory
    log4j:WARN No appenders could be found for logger

It says "can't open file" with "[Errno 2] No such file or directory", but
the file is there!

ls -l /home/hduser/dba/bin/python/spark_on_eks/src/RandomDataBigQuery.py
    -rw-rw-rw- 1 hduser hadoop 5060 Mar 18 14:16
So I am not sure what is going on. I suspect that it is looking inside
the Docker image itself for this file.

Is that a correct assumption?
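
To make the suspicion concrete: as far as I understand the Spark on
Kubernetes documentation, a local:// URI refers to a file already present
inside the container image, a file:// URI or a bare path refers to the
machine running spark-submit, and schemes such as gs:// or s3a:// refer to
remote storage. Below is a minimal Python sketch of that rule. The helper
name where_spark_looks and the example paths are hypothetical and are not
part of Spark; this only illustrates my reading of the documentation.

```python
from urllib.parse import urlparse


def where_spark_looks(uri: str) -> str:
    """Return where a dependency URI is expected to live (hypothetical helper).

    Assumption: local:// means "already inside the container image",
    file:// or no scheme means "on the submission client", and any
    other scheme means "remote storage".
    """
    scheme = urlparse(uri).scheme
    if scheme == "local":
        return "container image"
    if scheme in ("", "file"):
        return "submission client"
    return "remote storage"


if __name__ == "__main__":
    # Example paths are illustrative only.
    for uri in (
        "local:///opt/app/spark_on_eks.zip",
        "/opt/app/spark_on_eks.zip",
        "gs://some-bucket/spark_on_eks.zip",
    ):
        print(uri, "->", where_spark_looks(uri))
```

If this reading is right, a file that exists only on the host I submit
from would not be visible to a driver pod that is handed a local:// path.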


Mich Talebzadeh,
Lead Solutions Architect/Engineering Lead
Palantir Technologies Limited
United Kingdom

