Good catch, Pedro!

Russell: I'm not sure how you can be using Hive 0.9 on EMR, since EMR only supports up to Hive 0.7.1.
Check this for details: http://aws.amazon.com/elasticmapreduce/faqs/#hive-9

Mark

----- Original Message -----
From: "Russell Jurney" <[email protected]>
To: [email protected]
Sent: Wednesday, May 30, 2012 3:52:27 PM
Subject: Re: HIVE and S3 via EMR?

You = Excellent

Russell Jurney
twitter.com/rjurney
[email protected]
datasyndrome.com

On May 29, 2012, at 11:06 PM, Pedro Figueiredo <[email protected]> wrote:

On 30 May 2012, at 02:17, Russell Jurney wrote:

> I've made the bucket - which is derived from the enron emails - available at s3:///rjurney_public_web/from_to_date and a sample is available at http://s3.amazonaws.com/rjurney_public_web/from_to_date/part-m-00004

The problem is that your bucket name contains the '_' character. When Hive (or whatever, really) tries to resolve the hostname rjurney_public_web.s3.amazonaws.com it fails, because '_' is an illegal character in DNS. It's got nothing to do with Hive, or your table definition.

You can have a look at "Rules for bucket naming" in http://docs.amazonwebservices.com/AmazonS3/latest/dev/BucketRestrictions.html

Cheers,
Pedro

Pedro Figueiredo
Skype: pfig.89clouds
http://89clouds.com/ - Big Data Consulting
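
For anyone hitting the same error, here is a rough sketch of a pre-flight check for the bucket-name problem Pedro describes. It only approximates the DNS-compatible naming rules (lowercase letters, digits, hyphens and dots, 3 to 63 characters, not shaped like an IP address); the linked "Rules for bucket naming" page is the authority. The function names and the renamed bucket "rjurney-public-web" are made up for illustration and are not part of any AWS library.

```python
import re

# Simplified approximation of the DNS-compatible bucket naming rules:
# every dot-separated label uses lowercase letters, digits and hyphens,
# and starts and ends with a letter or digit.
_LABEL = re.compile(r"[a-z0-9]([a-z0-9-]*[a-z0-9])?")
_IPV4 = re.compile(r"\d{1,3}(\.\d{1,3}){3}")


def is_dns_compatible_bucket(name: str) -> bool:
    """Return True if `name` can be used as a hostname label sequence."""
    if not 3 <= len(name) <= 63:
        return False
    if _IPV4.fullmatch(name):
        # Names that look like IP addresses are not allowed.
        return False
    # An empty label (e.g. from "a..b") fails the label pattern too.
    return all(_LABEL.fullmatch(label) for label in name.split("."))


def virtual_hosted_host(bucket: str) -> str:
    """Hostname a client resolves when it addresses the bucket by DNS."""
    return f"{bucket}.s3.amazonaws.com"


# "rjurney-public-web" is a hypothetical renamed bucket, not a real one.
for bucket in ("rjurney_public_web", "rjurney-public-web"):
    status = "ok" if is_dns_compatible_bucket(bucket) else "not DNS-compatible"
    print(f"{virtual_hosted_host(bucket)}: {status}")
```

The sample URL in the thread, http://s3.amazonaws.com/rjurney_public_web/..., carries the bucket in the path rather than in the hostname, so no DNS lookup of the bucket name is involved there; the check above only matters when the bucket name becomes part of the hostname.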
