Re: Disable/Remove datasources in Spark

2022-05-05 Thread wilson
btw, I use drill to query webserver log only, b/c drill has that a storage plugin for httpd server log. but I found spark is also convenient to query webserver log for which I wrote a note: https://notes.4shield.net/how-to-query-webserver-log-with-spark.html Thanks wilson wrote: though

Re: Disable/Remove datasources in Spark

2022-05-05 Thread wilson
though this is off-topic. but Apache Drill can does that. for instance, you can keep only the csv storage plugin in the configuration, but remove all other storage plugins. then users on drill can query csv only. regards Aditya wrote: So, is there a way for me to get a list of "leaf"

Re: Disable/Remove datasources in Spark

2022-05-05 Thread Aditya
My understanding is if I can disable a parquet datasource, the user will get an error when they try spark.read.parquet() To give context my main objective is that I provide a few dataframes to my users, and I don't want them to be able to access any data other than these specific dataframes. So,

Re: Disable/Remove datasources in Spark

2022-05-05 Thread wilson
it's maybe impossible to disable that? user can run spark.read... to read any datasource he can reach. Aditya wrote: 2. But I am not able to figure out how to "disable" all other data sources - To unsubscribe e-mail:

Disable/Remove datasources in Spark

2022-05-05 Thread Aditya
Hi, I am trying to force all users to use only 1 datasource (A custom datasource I plan to write) to read/write data. So, I was looking at the DataSource api in Spark: 1. I was able to figure out how to create my own Datasource (Reference