[ https://issues.apache.org/jira/browse/HIVE-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12908729#action_12908729 ]
He Yongqiang commented on HIVE-1624:
------------------------------------
S3 -> client -> cluster may be better than downloading the script directly
from S3 to each TaskTracker node.
With direct downloads, there may be thousands of concurrent requests to S3 for
the same script. (I agree that the script can be cached on the local machine,
but right now Hive does not do any cache cleanup.)
With S3 -> client -> cluster, we would be able to use the Hadoop distributed
cache.
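
To make the request-count argument concrete, here is a rough sketch that only models the two paths. Nothing in it is Hive or Hadoop code: FakeS3, direct_download, via_client, and the script key are hypothetical names made up for illustration, and it assumes no per-node caching in the direct case.

```python
class FakeS3:
    """Hypothetical stand-in for S3 that only counts GET requests."""

    def __init__(self, objects):
        self.objects = objects
        self.get_count = 0

    def get(self, key):
        self.get_count += 1
        return self.objects[key]


def direct_download(s3, key, num_tasks):
    """S3 -> TaskTracker: every task fetches the script itself."""
    return [s3.get(key) for _ in range(num_tasks)]


def via_client(s3, key, num_tasks):
    """S3 -> client -> cluster: the client fetches once, then the copy
    is shipped with the job (the role the distributed cache would play)."""
    shipped = s3.get(key)
    return [shipped for _ in range(num_tasks)]


def compare(num_tasks=1000):
    """Return (S3 requests with direct download, S3 requests via client)."""
    key = "scripts/transform.py"  # hypothetical script location
    body = b"#!/usr/bin/env python\n"

    direct = FakeS3({key: body})
    direct_download(direct, key, num_tasks)

    staged = FakeS3({key: body})
    via_client(staged, key, num_tasks)

    return direct.get_count, staged.get_count
```

In this model, direct download costs one S3 request per task, while going through the client costs one request in total regardless of the number of tasks. Per-node caching would reduce the first number to one request per node, but that brings back the cache cleanup problem mentioned above.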
> Patch to allows scripts in S3 location
> --------------------------------------
>
> Key: HIVE-1624
> URL: https://issues.apache.org/jira/browse/HIVE-1624
> Project: Hadoop Hive
> Issue Type: New Feature
> Reporter: Vaibhav Aggarwal
> Attachments: HIVE-1624.patch
>
>
> I want to submit a patch which allows users to run scripts located in S3.
> This patch enables Hive to download Hive scripts located in S3 buckets
> and execute them. This saves users the effort of copying scripts to HDFS
> before executing them.
> Thanks
> Vaibhav