the configured memory size.
>
> Since you are using a t2.micro you should really try using a bigger Amazon
> instance size. This would probably be a lot more useful than trying
> different configurations.
> On Jul 24, 2014 7:08 AM, "Sarfraz Ramay" wrote:
>
>> Can any
Can anyone please help with this ?
[image: Inline image 1]
I followed the advice here
http://stackoverflow.com/questions/20390217/mapreduce-job-in-headless-environment-fails-n-times-due-to-am-container-exceptio
and added the following properties to mapred-site.xml, but I am still getting
the same error.
Hi,
A very strange thing is happening. I am running the TPC-H benchmark. I have
loaded the tables on HDFS running in pseudo-distributed mode. When I query
one table at a time
select * from customer LIMIT 2; OR
select * from NATION LIMIT 2; results are printed to the CLI, but as soon as
I try somet
Hi,
A workaround could be to create a table with all the columns and then drop
the columns you don't need. Hope it helps.
Regards,
Sarfraz Rasheed Ramay (DIT)
Dublin, Ireland.
On Tue, Jul 1, 2014 at 5:20 AM, Szehon Ho wrote:
> Why don't you just use CTAS (CREATE TABLE AS SELECT), putting the 1
Hi,
I am doing my MSc thesis on the investigation of different coding techniques in
Hive. I am looking for suggestions on 9 techniques, 3 easy, 3 medium and 3
hard. I will have to code these techniques up and compare and evaluate
them. I have extracted a list of techniques below from the book Programm
Maybe a UDF could solve your problem.
Regards,
Sarfraz Rasheed Ramay (DIT)
Dublin, Ireland.
On Mon, Jun 9, 2014 at 7:30 PM, Mark Desnoyer wrote:
> Hi Furcy,
>
> Thanks for the reply. I looked at MapJoin but it won't do what I need
> because all the tables will be large and actually, explicitly
How does the system support plugins, i.e., is there support to get data from
> Mongo or, who knows, Access/Excel? What about user functions to trim data or
> reshape XML, etc.? What are the pluggable points of both systems?
>
>
>
>
> On Sat, May 3, 2014 at 1:12 PM, Sarfraz Ramay wrote:
>
re
> procedural language than a declarative one. Look in the world for scripts and
> see who is deploying them effectively.
>
>
>
>
>
> On Sat, May 3, 2014 at 4:46 AM, Sarfraz Ramay wrote:
>
>> Thanks Thejas for your input! These are interesting and very specific
>
mpressed
> - mapreduce vs tez execution engine
> - stats optimized queries
>
>
>
> On Thu, May 1, 2014 at 5:47 AM, Sarfraz Ramay
> wrote:
> >>
> >> Hi,
> >>
> >> It seems that both Hive and Pig are used for managing large data sets.
>
>
>
> Hi,
>
> It seems that both Hive and Pig are used for managing large data sets.
> Hive is more SQL oriented whereas Pig is more for the data flows. I am
> doing a master's thesis on the performance evaluation of both. Can someone
> please provide a list of tasks that would make for an interesting