Re: Hive shell code exception, urgent help needed

2014-07-25 Thread Sarfraz Ramay
icro you should really try using a bigger Amazon > instance size. This might probably be a lot more useful than trying > different configurations. > On Jul 24, 2014 7:08 AM, "Sarfraz Ramay" wrote: > >> Can anyone please help with this ? >> >> [image: Inline ima

Re: Hive shell code exception, urgent help needed

2014-07-24 Thread Sarfraz Ramay
the configurated memory size. > > Since you are using a t2.micro you should really try using a bigger Amazon > instance size. This might probably be a lot more useful than trying > different configurations. > On Jul 24, 2014 7:08 AM, "Sarfraz Ramay" wrote: > >> Can any

Re: Hive shell code exception, urgent help needed

2014-07-24 Thread Sarfraz Ramay
Can anyone please help with this ? [image: Inline image 1] i followed the advice here http://stackoverflow.com/questions/20390217/mapreduce-job-in-headless-environment-fails-n-times-due-to-am-container-exceptio and added to mapred-site.xml following properties but still getting the same error.

JOIN query results not printing to cli - HELP please.

2014-07-11 Thread Sarfraz Ramay
Hi, A very strange thing is happening. I am running the TPC-H benchmark. I have loaded the tables on HDFS running in pseudo-distributed mode. When i query one table at a time select * from customer LIMIT 2; OR select * from NATION LIMIT 2; results are printed to the cli but as soon as i try somet

Re: Column selection in Hive

2014-07-01 Thread Sarfraz Ramay
Hi, A workaround could be to create a table with all the columns and then drop the columns you don't need. Hope it helps. Regards, Sarfraz Rasheed Ramay (DIT) Dublin, Ireland. On Tue, Jul 1, 2014 at 5:20 AM, Szehon Ho wrote: > Why dont you just use CTAS (Create table as select), putting the 1

Suggestions for different coding techniques in Hive

2014-06-29 Thread Sarfraz Ramay
Hi, I am doing my MSc thesis on investigation of different coding techniques in Hive. I am looking for suggestions on 9 techniques, 3 easy, 3 medium and 3 hard. I will have to code these techniques up and compare and evaluate them. I have extracted a list of techniques below from the book Programm

Re: Efficient Equality Joins of Large Tables

2014-06-09 Thread Sarfraz Ramay
May be UDF could solve your problem. Regards, Sarfraz Rasheed Ramay (DIT) Dublin, Ireland. On Mon, Jun 9, 2014 at 7:30 PM, Mark Desnoyer wrote: > Hi Furcy, > > Thanks for the reply. I looked at MapJoin but it won't do what I need > because all the tables will be large and actually, explicitly

Re: Hive Vs Pig: Master's thesis

2014-05-03 Thread Sarfraz Ramay
How does the system support plugins, ie, is there support to get data from > mongo or who knows access/ excel? What about user functions to trim data or > reshape xml? etc. What are the pluggable points of both systems? > > > > > On Sat, May 3, 2014 at 1:12 PM, Sarfraz Ramay wrote: &g

Re: Hive Vs Pig: Master's thesis

2014-05-03 Thread Sarfraz Ramay
re > procedural language then a declarative. Look in the world for scripts and > see who is deploying them effectively. > > > > > > On Sat, May 3, 2014 at 4:46 AM, Sarfraz Ramay wrote: > >> Thanks Thejas for your input! These are interesting and very specific >

Re: Hive Vs Pig: Master's thesis

2014-05-03 Thread Sarfraz Ramay
mpressed > - mapreduce vs tez execution engine > - stats optimized queries > > > > On Thu, May 1, 2014 at 5:47 AM, Sarfraz Ramay > wrote: > >> > >> Hi, > >> > >> It seems that both Hive and Pig are used for managing large data sets. >

Re: Hive Vs Pig: Master's thesis

2014-05-01 Thread Sarfraz Ramay
> > > Hi, > > It seems that both Hive and Pig are used for managing large data sets. > Hive is more SQL oriented whereas Pig is more for the data flows. I am > doing a master's thesis on the performance evaluation of both. Can some > please provide a list of tasks that would make for an interesting