[ 
https://issues.apache.org/jira/browse/HIVE-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12908729#action_12908729
 ] 

He Yongqiang commented on HIVE-1624:
------------------------------------

S3 -> client -> cluster maybe better than directly downloading the script from 
S3 to TaskTracker node.
There may be thousands of concurrent downloading request to S3 for downloading 
a script. (I agree that the script can be cached in local machine, but right 
now hive does not do any cache clean up).
S3 -> client -> cluster will be able to use hadoop distributed cache.

> Patch to allows scripts in S3 location
> --------------------------------------
>
>                 Key: HIVE-1624
>                 URL: https://issues.apache.org/jira/browse/HIVE-1624
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Vaibhav Aggarwal
>         Attachments: HIVE-1624.patch
>
>
> I want to submit a patch which allows user to run scripts located in S3.
> This patch enables Hive to download the hive scripts located in S3 buckets 
> and execute them. This saves users the effort of copying scripts to HDFS 
> before executing them.
> Thanks
> Vaibhav

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to