See

http://drill.apache.org/docs/file-system-storage-plugin/ 

"connection": "hdfs://<IP Address>:<Port>/"


As Ted stated, the connection has to point to the HDFS name node (host and port); a bare "hdfs://" is not enough.
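
For illustration only, here is a small Python sketch of how the connection value is put together and why a bare "hdfs://" is rejected. The host name and the port 8020 are placeholders I made up (check fs.defaultFS in your cluster's core-site.xml for the real values), and the helper names are hypothetical, not part of Drill.

```python
import json
from urllib.parse import urlparse


def hdfs_connection(host: str, port: int) -> str:
    """Build the value for the "connection" key of the dfs storage plugin."""
    return f"hdfs://{host}:{port}/"


def has_authority(connection: str) -> bool:
    """True if the URI carries a host[:port] part after the scheme."""
    return bool(urlparse(connection).netloc)


# Placeholder name node address; substitute your own host and port.
conn = hdfs_connection("namenode.example.internal", 8020)

print(json.dumps({"connection": conn}))
print(has_authority("hdfs://"))  # no name node given, the case that failed
print(has_authority(conn))       # name node host and port present
```

The empty authority in the first check is what the "Expected authority at index 7" message in the original error is complaining about.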


--Andries


> On Oct 13, 2016, at 5:04 AM, David Kincaid <kincaid.d...@gmail.com> wrote:
> 
> Thanks, Ted. The full URL I was using was hdfs://. I'll give your
> suggestion a try when I'm able to work on this again tonight. I guess I
> took the documentation too literally when it said "To query a file on HDFS
> from a node on the cluster, you can simply change the connection from
> file:/// to hdfs:// in the dfs storage plugin."
> Thanks again,
> Dave
> 
> 
> On Thu, Oct 13, 2016 at 12:39 AM, Ted Dunning <ted.dunn...@gmail.com> wrote:
> 
>> What is the full URL you used?
>> 
>> With hdfs://, you need to supply a name node address.
>> 
>> With file://, you don't.
>> 
>> Contrarily, with maprfs:// you don't need an address since it is implied in
>> the client connection.
>> 
>> 
>> 
>> On Wed, Oct 12, 2016 at 6:29 PM, David Kincaid <kincaid.d...@gmail.com>
>> wrote:
>> 
>>> I have an Amazon EMR cluster launched with Drill loaded. I'm trying to
>>> configure the dfs storage plugin to use HDFS. The docs say that I should
>>> simply need to change the "connection" setting from "file:///" to "hdfs://"
>>> in order to use HDFS on the cluster that Drill is running on. However, when
>>> I do this and try to run a query I get an error that says
>>> "org.apache.drill.common.exceptions.UserRemoteException:
>>> SYSTEM ERROR: URISyntaxException: Expected authority at index 7: hdfs://
>>> [Error Id: f9e6c674-4dd7-4c5d-b9a8-95b64b9dbaa3"
>>> 
>>> Am I doing something wrong or is there an issue here?
>>> 
>>> Thanks,
>>> 
>>> Dave
>>> 
>> 
