Hi,

I'm just getting started with Hive. I was wondering if anyone can tell me how Hive would translate the following SQL query into MapReduce:

select distinct track, count(track)
from music
where date_listened between '12-10-2008' and '12-11-2008'
group by user
order by count(track) desc

How many MapReduce jobs would it use? What would the map and reduce functions look like? Also, does Hive have a utility that gives you this information?

Thanks,

Shirley


