Laszlo Gaal created IMPALA-13017:
------------------------------------

             Summary: Add configuration option to set S3 endpoint for the 
minicluster
                 Key: IMPALA-13017
                 URL: https://issues.apache.org/jira/browse/IMPALA-13017
             Project: IMPALA
          Issue Type: Improvement
            Reporter: Laszlo Gaal
            Assignee: Laszlo Gaal


As AWS S3 adds functional improvements, some of the required configuration 
items become stricter, or optional items become mandatory. One example is the 
endpoint configuration: while older regions support the legacy endpoint syntax 
of _bucketname_.s3.amazonaws.com,  new regions, where S3 buckets can be created 
require the AWS v4 request signing algorithm, which requires the request URL to 
contain the region as well. In the latter case the bucket URL becomes 
_bucketname.region-code_.amazonaws.com, e.g. 
test-bucket.us-west-2.amazonaws.com.

Bucket configuration for the Hadoop Cloud Connector (for the s3a:// protocol) 
works a bit differently: the bucket name is supplied separately, e.g. 
s3a://landsat-pds, and various other parameters can be supplied via optional 
fields in core-site.xml

When the Impala minicluster is configured to use S3 for its default filesystem, 
it should be possible to supply the endpoint to use so that S3-specific tests 
can be run against endpoints that don't support connecting via the legacy 
generic endpoint.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to