[
https://issues.apache.org/jira/browse/PIG-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239277#comment-15239277
]
Anirudh Beria commented on PIG-4860:
------------------------------------
Thanks for taking a look, Daniel. Here's how I tested it:
{noformat}
X = LOAD 's3://<my_bucket>/data/excite-small.log' USING PigStorage(' ') AS (user, time, query);
STORE X INTO 'hdfs://ec2-54-205-241-48.compute-1.amazonaws.com:9000/orc_file' USING OrcStorage();
STORE X INTO 's3://<my_bucket>/beria/orc_file' USING OrcStorage();
Y = LOAD 's3://<my_bucket>/beria/orc_file' USING OrcStorage();
Z = LOAD 'hdfs://ec2-54-205-241-48.compute-1.amazonaws.com:9000/orc_file' USING OrcStorage();
DUMP Y;
DUMP Z;
{noformat}
I also downloaded the data files ( {{*/orc_file/part-m-00000}} ) from both
locations and verified that they are indeed ORC-format files.
> Loading data using OrcStorage() accepts only default FileSystem path
> --------------------------------------------------------------------
>
> Key: PIG-4860
> URL: https://issues.apache.org/jira/browse/PIG-4860
> Project: Pig
> Issue Type: Bug
> Reporter: Anirudh Beria
> Assignee: Anirudh Beria
> Attachments: PIG-4860.patch
>
>
> If the default file system is HDFS but one wants to load data from S3 (or
> some other storage) using OrcStorage(), the query fails because the loader
> expects a path with the `hdfs://` scheme but receives one with `s3://`.
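
To make the failure mode in the issue description concrete, here is a minimal sketch in plain Java. It is not the code from PIG-4860.patch: the class and method names are hypothetical, the bucket and namenode addresses are made up, and the error text only imitates the kind of scheme-mismatch message such a lookup might produce. It contrasts a lookup pinned to the default file system's scheme with one that takes the scheme from the path itself.

```java
import java.net.URI;

// Hypothetical illustration only; not the code from PIG-4860.patch.
class FsSchemeSketch {

    /** Models a lookup pinned to the default file system: any other scheme is rejected. */
    static String resolveWithDefaultFs(URI defaultFs, URI path) {
        String scheme = path.getScheme();
        if (scheme != null && !scheme.equalsIgnoreCase(defaultFs.getScheme())) {
            // Message format is an assumption made for this sketch.
            throw new IllegalArgumentException(
                "Wrong FS: " + path + ", expected: " + defaultFs.getScheme() + "://");
        }
        return defaultFs.getScheme();
    }

    /** Models a lookup that uses the path's own scheme, falling back to the default. */
    static String resolveFromPath(URI defaultFs, URI path) {
        String scheme = path.getScheme();
        return scheme != null ? scheme : defaultFs.getScheme();
    }

    public static void main(String[] args) {
        // Both URIs are made-up examples.
        URI defaultFs = URI.create("hdfs://namenode:9000");
        URI s3Path = URI.create("s3://example-bucket/beria/orc_file");

        // Scheme taken from the path: the S3 location resolves.
        System.out.println("resolved scheme: " + resolveFromPath(defaultFs, s3Path));

        // Scheme pinned to the default file system: the S3 location is rejected.
        try {
            resolveWithDefaultFs(defaultFs, s3Path);
        } catch (IllegalArgumentException e) {
            System.out.println("default-FS lookup failed: " + e.getMessage());
        }
    }
}
```

In Hadoop terms this is roughly the difference between obtaining a file system with FileSystem.get(conf) and obtaining it from the path with path.getFileSystem(conf). Whether the attached patch uses exactly these calls is not stated in this message.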