We are trying to use Apache Drill to query a JSON file in our Azure Blob Storage account. I followed all of the setup steps in the Azure Blob Storage plugin documentation: https://drill.apache.org/docs/azure-blob-storage-plugin/. I've confirmed that the machine can access the file via a SAS URL, so no firewall is blocking Azure Blob Storage. The query runs for around 20-25 minutes and then throws the error below. The Azure key is set up in the core-site.xml file. We're using apache-drill-1.19.0 with the following JAR files: hadoop.azure-2.7.7.jar and azure-storage-8.0.0.jar. Any ideas on how to resolve this issue?
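For reference, the key entry in core-site.xml generally takes the form sketched below, following the property naming used by the hadoop-azure connector. This is only an illustration: STORAGE_ACCOUNT_NAME and ACCESS_KEY are placeholders, not values from this setup.

```xml
<configuration>
  <!-- Placeholder values: substitute the real storage account name and access key. -->
  <property>
    <name>fs.azure.account.key.STORAGE_ACCOUNT_NAME.blob.core.windows.net</name>
    <value>ACCESS_KEY</value>
  </property>
</configuration>
```

The account name in the property name would need to match the account in the plugin's wasbs:// connection string.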
Query:

    SELECT * FROM azureblob.`RSP_DST_AMP_Extract_All.json` LIMIT 1

Error:

    RESOURCE ERROR: Failed to load schema for "azureblob"!
    org.apache.hadoop.fs.azure.AzureException: com.microsoft.azure.storage.StorageException: An unknown failure occurred : Connection timed out: connect

Storage plugin configuration:

    {
      "type": "file",
      "connection": "wasbs://[email protected]",
      "workspaces": {
        "tmp": {
          "location": "/tmp",
          "writable": true,
          "defaultInputFormat": null,
          "allowAccessOutsideWorkspace": false
        },
        "root": {
          "location": "/",
          "writable": false,
          "defaultInputFormat": null,
          "allowAccessOutsideWorkspace": false
        }
      },
      "formats": {
        "parquet": { "type": "parquet" },
        "json": { "type": "json", "extensions": [ "json" ] },
        "sequencefile": { "type": "sequencefile", "extensions": [ "seq" ] },
        "csvh": { "type": "text", "extensions": [ "csvh" ], "extractHeader": true },
        "psv": { "type": "text", "extensions": [ "tbl" ], "fieldDelimiter": "|" },
        "avro": { "type": "avro", "extensions": [ "avro" ] },
        "tsv": { "type": "text", "extensions": [ "tsv" ], "fieldDelimiter": "\t" },
        "csv": { "type": "text", "extensions": [ "csv" ], "extractHeader": true }
      },
      "enabled": true
    }
