[ 
https://issues.apache.org/jira/browse/PIG-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15239277#comment-15239277
 ] 

Anirudh Beria commented on PIG-4860:
------------------------------------

Thanks for taking a look, Daniel. Here's how I tested it:
{noformat}
X = LOAD 's3://<my_bucket>/data/excite-small.log' USING PigStorage(' ') AS (user, time, query);
-- Write the same relation as ORC to both HDFS and S3.
STORE X INTO 'hdfs://ec2-54-205-241-48.compute-1.amazonaws.com:9000/orc_file' USING OrcStorage();
STORE X INTO 's3://<my_bucket>/beria/orc_file' USING OrcStorage();
-- Read both copies back; the S3 load is the case described in the issue.
Y = LOAD 's3://<my_bucket>/beria/orc_file' USING OrcStorage();
Z = LOAD 'hdfs://ec2-54-205-241-48.compute-1.amazonaws.com:9000/orc_file' USING OrcStorage();
DUMP Y;
DUMP Z;
{noformat}
I also downloaded the data files ({{*/orc_file/part-m-00000}}) and verified 
that they are indeed ORC files.
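
The format check on the downloaded part files can be repeated with a few lines of Python. This is only a minimal sketch, not the exact procedure used above: it relies on the fact that an ORC file begins with the three-byte magic {{ORC}}, and it assumes the part file has already been copied to local disk. The helper name {{looks_like_orc}} is made up for illustration.

```python
ORC_MAGIC = b"ORC"  # an ORC file starts with these three bytes


def looks_like_orc(path):
    """Return True if the file at `path` begins with the ORC magic bytes.

    Hypothetical helper: a quick sanity check on the header only. It does
    not validate the stripes, footer, or postscript.
    """
    with open(path, "rb") as f:
        return f.read(len(ORC_MAGIC)) == ORC_MAGIC
```

Calling {{looks_like_orc("part-m-00000")}} on a local copy should return True for output written by OrcStorage() and False for the original space-delimited log.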

> Loading data using OrcStorage() accepts only default FileSystem path
> --------------------------------------------------------------------
>
>                 Key: PIG-4860
>                 URL: https://issues.apache.org/jira/browse/PIG-4860
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Anirudh Beria
>            Assignee: Anirudh Beria
>         Attachments: PIG-4860.patch
>
>
> If the default file system is HDFS and one wants to load data from S3 (or 
> some other storage) using OrcStorage(), the query fails because the path is 
> expected to start with `hdfs://` but starts with `s3://` instead.



