We are trying to use Apache Drill to run a query against a JSON file on our 
Azure Blob storage account. I've followed all of the documentation steps for 
setting up the Azure blob access found here: 
https://drill.apache.org/docs/azure-blob-storage-plugin/. I've confirmed the 
machine can access the file via a SAS URL so there is no firewall blocking 
Azure blob. The query runs for around 20-25 minutes and then throws the below 
error. The Azure key is setup in the core-site.xml file. We're using 
apache-drill-1.19.0 along with the following JAR files: hadoop.azure-2.7.7.jar 
and azure-storage-8.0.0.jar.
Any ideas to resolve this issue?

Query:
SELECT * FROM azureblob.`RSP_DST_AMP_Extract_All.json` LIMIT 1

Error:
RESOURCE ERROR: Failed to load schema for "azureblob"!

org.apache.hadoop.fs.azure.AzureException: 
com.microsoft.azure.storage.StorageException: An unknown failure occurred : 
Connection timed out: connect

Storage plugin:
[cid:[email protected]]

Configuration:
{
  "type": "file",
  "connection": 
"wasbs://[email protected]",
  "workspaces": {
    "tmp": {
      "location": "/tmp",
      "writable": true,
      "defaultInputFormat": null,
      "allowAccessOutsideWorkspace": false
    },
    "root": {
      "location": "/",
      "writable": false,
      "defaultInputFormat": null,
      "allowAccessOutsideWorkspace": false
    }
  },
  "formats": {
    "parquet": {
      "type": "parquet"
    },
    "json": {
      "type": "json",
      "extensions": [
        "json"
      ]
    },
    "sequencefile": {
      "type": "sequencefile",
      "extensions": [
        "seq"
      ]
    },
    "csvh": {
      "type": "text",
      "extensions": [
        "csvh"
      ],
      "extractHeader": true
    },
    "psv": {
      "type": "text",
      "extensions": [
        "tbl"
      ],
      "fieldDelimiter": "|"
    },
    "avro": {
      "type": "avro",
      "extensions": [
        "avro"
      ]
    },
    "tsv": {
      "type": "text",
      "extensions": [
        "tsv"
      ],
      "fieldDelimiter": "\t"
    },
    "csv": {
      "type": "text",
      "extensions": [
        "csv"
      ],
      "extractHeader": true
    }
  },
  "enabled": true
}

----------------------------------------------------------------------
CONFIDENTIALITY NOTICE: The information in this message, and any files 
transmitted with it, is confidential, may be legally privileged, and intended 
only for the use of the individual(s) named above. Be aware that the use of any 
confidential or personal information may be restricted by state and federal 
privacy laws. If you are not the intended recipient, do not further disseminate 
this message. If this message was received in error, please notify the sender 
and delete it.

Reply via email to