Re: Pig vs hive performance

abhishek dodda Wed, 03 Oct 2012 20:43:45 -0700

On Wed, Oct 3, 2012 at 7:50 PM, Dan Richelson <drichel...@tendrilinc.com> wrote:
> Anecdotally I can say that Pig seems to scale down better than Hive.
> We see this in tests- hive scripts running small amounts of data take
> much longer than similar Pig scripts. Hive parallel settings are
> enabled.


-- Same as in our case, for the small data pig seems to be much faster
than hive.

I think this has to do with the fact that there doesn't seem
> to be a 'local' mode for hive- you have to run it as mapreduce jobs
> (either embedded or on a cluster). Please correct me if I am wrong
> here.

-- I am not very sure whether this makes difference ?

Regards
Abhi
>
>
>
> On Wed, Oct 3, 2012 at 3:52 PM, Abhishek <abhishek.dod...@gmail.com> wrote:
>> Hi all,
>>
>> Can we discuss performance of pig vs hive
>>
>> 1) what hive is good at?
>> 2) what pig is good at?
>> 3) Hive optimizer vs pig optimizer
>> 4) hive limitations vs pig limitations
>>
>> Regards
>> Abhi
>>
>> Sent from my iPhone
>>
>> --
>>
>>
>>
>
>
>
> --
> Dan Richelson, Software Engineer
>
> Tendril
> 2560 55th St. | Boulder, Colorado 80301
> M 303-709-2214
> www.tendrilinc.com
>
> This email and any files transmitted with it are confidential and intended 
> solely for the use of the individual or entity to whom they are addressed.
> If you have received this email in error please notify the sender.
> Please note that any views or opinions presented in this email are solely 
> those of the author and do not necessarily represent those of the company.
> Finally, the recipient should check this email and any attachments for the 
> presence of viruses.
> The company accepts no liability for any damage caused by any virus 
> transmitted by this email.
>
> --
>
>
>

Re: Pig vs hive performance

Reply via email to