Hello,

Hive ORDER BY is known to be slow, but it seems odd that it stays slow even when 
we use a LIMIT under strict mode. I am running an ORDER BY query over 3 billion 
rows with a limit of 20, and it takes an hour to run. I expected the mappers to 
do some sorting and limiting in parallel, so that the reducer load would be 
small, but I don’t think that is what happens. Can you tell me why?
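
To make the expectation above concrete, here is a rough sketch in plain Python of the behaviour I have in mind. It is not Hive code and does not claim to show what Hive actually does. The function names, the integer sort keys, and the way splits are modelled as simple iterables are all assumptions made for illustration.

```python
import heapq
import itertools
from typing import Iterable, List

LIMIT = 20  # the LIMIT from my query


def map_side_top_n(split: Iterable[int], n: int = LIMIT) -> List[int]:
    """Hypothetical mapper step: keep only the n smallest sort keys of one split.

    heapq.nsmallest returns the result already sorted in ascending order.
    """
    return heapq.nsmallest(n, split)


def reduce_top_n(partials: Iterable[List[int]], n: int = LIMIT) -> List[int]:
    """Hypothetical single-reducer step: merge the sorted per-mapper lists
    and apply the limit once more."""
    merged = heapq.merge(*partials)  # each partial list is sorted ascending
    return list(itertools.islice(merged, n))


def order_by_limit(splits: Iterable[Iterable[int]], n: int = LIMIT) -> List[int]:
    """Emulate ORDER BY ... LIMIT n over several input splits."""
    return reduce_top_n([map_side_top_n(s, n) for s in splits], n)
```

With this scheme the reducer would see at most 20 rows per mapper instead of every row in the table, which is why I expected the reduce phase to be cheap.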

Thanks and Regards,
Mainak
