Thanks, Mich - will check this and update.

Karan Alang

On Sat, Feb 12, 2022 at 1:57 AM Mich Talebzadeh

On Sat, 12 Feb 2022 at 08:24, Mich Talebzadeh wrote:
>> You are trying to access a Google Cloud Storage bucket (gs://) from your
>> local host.
>> spark-submit does not see it because it assumes that the path is a local
>> file system on the host, which it is not.
>> You need to mount the gs:// bucket as a local file system.
>> You can use the tool called gcsfuse. Cloud Storage FUSE is an open source
>> FUSE adapter that allows you to mount Cloud Storage buckets as file
>> systems on Linux or macOS systems. Downloading and installing gcsfuse is
>> pretty simple.
>> It will be installed as /usr/bin/gcsfuse. To mount a bucket, first create
>> a local mount point such as /mnt/gs as root and give other users
>> permission to use it.
>> Then, as a normal user that needs to access the gs:// bucket (not as
>> root), use gcsfuse to mount it. For example, here I am mounting a GCS
>> bucket called spark-jars-karan. Just use the bucket name itself:
>> gcsfuse spark-jars-karan /mnt/gs
>> Then you can refer to it as /mnt/gs in spark-submit from the on-premise
>> host:
>> spark-submit --packages org.apache.spark:spark-sql-kafka-0-10_2.12:3.2.0 \
>> --jars /mnt/gs/spark-bigquery-with-dependencies_2.12-0.23.2.jar
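
For reference, the steps above could be put together roughly as the script
below. This is only a sketch, not a tested setup: the bucket name, mount
point and jar come from this thread, while the DRY_RUN switch, the run helper
and the exact mkdir/chmod commands are assumptions added for illustration.
By default it only prints the commands instead of executing them.

```shell
#!/bin/sh
# Sketch: mount a GCS bucket with gcsfuse and point spark-submit at the mount.

BUCKET="spark-jars-karan"   # bucket name only, without the gs:// prefix
MOUNT_POINT="/mnt/gs"       # local mount point
JAR="spark-bigquery-with-dependencies_2.12-0.23.2.jar"
DRY_RUN="${DRY_RUN:-1}"     # hypothetical switch: 1 = print only, 0 = execute

run() {
  # Print the command; execute it only when DRY_RUN is 0.
  printf '%s\n' "$*"
  if [ "$DRY_RUN" = "0" ]; then "$@"; fi
}

# As root: create the mount point and let other users use it.
# (One possible way to do this; the exact permissions are an assumption.)
run sudo mkdir -p "$MOUNT_POINT"
run sudo chmod a+rwx "$MOUNT_POINT"

# As a normal user (not root): mount the bucket.
run gcsfuse "$BUCKET" "$MOUNT_POINT"

# Refer to the jar through the local mount instead of gs://.
# The application file and its arguments are not shown in the thread,
# so they still need to be appended to this command.
run spark-submit \
  --packages org.apache.spark:spark-sql-kafka-0-10_2.12:3.2.0 \
  --jars "$MOUNT_POINT/$JAR"
```

Running it as is prints the four commands so they can be checked first;
setting DRY_RUN=0 would actually execute them.
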
On Sat, 12 Feb 2022 at 04:31, karan alang
>>> Hello All,
>>> I'm trying to access GCP buckets while running spark-submit from my
>>> local machine, and I'm running into issues.
>>> I'm getting this error:
>>> ```
>>> 22/02/11 20:06:59 WARN NativeCodeLoader: Unable to load native-hadoop 
>>> library for your platform... using builtin-java classes where applicable
>>> Exception in thread "main" 
>>> org.apache.hadoop.fs.UnsupportedFileSystemException: No FileSystem for 
>>> scheme "gs"
>>> ```
>>> I tried adding the --conf
>>> to the spark-submit command, but I am getting a ClassNotFoundException.
>>> Details are on Stack Overflow:
>>> Any ideas on how to fix this?
>>> TIA!

