[ https://issues.apache.org/jira/browse/HIVE-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12908729#action_12908729 ]
He Yongqiang commented on HIVE-1624: ------------------------------------ S3 -> client -> cluster maybe better than directly downloading the script from S3 to TaskTracker node. There may be thousands of concurrent downloading request to S3 for downloading a script. (I agree that the script can be cached in local machine, but right now hive does not do any cache clean up). S3 -> client -> cluster will be able to use hadoop distributed cache. > Patch to allows scripts in S3 location > -------------------------------------- > > Key: HIVE-1624 > URL: https://issues.apache.org/jira/browse/HIVE-1624 > Project: Hadoop Hive > Issue Type: New Feature > Reporter: Vaibhav Aggarwal > Attachments: HIVE-1624.patch > > > I want to submit a patch which allows user to run scripts located in S3. > This patch enables Hive to download the hive scripts located in S3 buckets > and execute them. This saves users the effort of copying scripts to HDFS > before executing them. > Thanks > Vaibhav -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.