Thanks Jay so much ! I will try this. ^^ Vào 10:52 Ngày 24 tháng 4 năm 2012, Jay Vyas <jayunit...@gmail.com> đã viết:
> Ahh... Well than the key will be teacher, and the value will simply be > > <-1 * # students, class_id> . > > Then, you will see in the reducer that the first 3 entries will always be > the ones you wanted. > > On Mon, Apr 23, 2012 at 10:17 PM, Lac Trung <trungnb3...@gmail.com> wrote: > > > Hi Jay ! > > I think it's a bit difference here. I want to get 30 classId for each > > teacherId that have most students. > > For example : get 3 classId. > > (File1) > > 1) Teacher1, Class11, 30 > > 2) Teacher1, Class12, 29 > > 3) Teacher1, Class13, 28 > > 4) Teacher1, Class14, 27 > > ... n ... > > > > n+1) Teacher2, Class21, 45 > > n+2) Teacher2, Class22, 44 > > n+3) Teacher2, Class23, 43 > > n+4) Teacher2, Class24, 42 > > ... n+m ... > > > > => return 3 line 1, 2, 3 for Teacher1 and line n+1, n+2, n+3 for Teacher2 > > > > > > Vào 09:52 Ngày 24 tháng 4 năm 2012, Jay Vyas <jayunit...@gmail.com> đã > > viết: > > > > > Its somewhat tricky to understand exactly what you need from your > > > explanation, but I believe you want teachers who have the most students > > in > > > a given class. So for English, i have 10 teachers teaching the class - > > and > > > i want the ones with the highes # of students. > > > > > > You can output key= <classid>, value=<-1*#ofstudent,teacherid> as the > > > values. > > > > > > The values will then be sorted, by # of students. You can thus pick > > > teacher in the the first value of your reducer, and that will be the > > > teacher for class id = xyz , with the highes number of students. > > > > > > You can also be smart in your mapper by running a combiner to remove > the > > > teacherids who are clearly not maximal. > > > > > > On Mon, Apr 23, 2012 at 9:38 PM, Lac Trung <trungnb3...@gmail.com> > > wrote: > > > > > > > Hello everyone ! > > > > > > > > I have a problem with MapReduce [:(] like that : > > > > I have 4 file input with 3 fields : teacherId, classId, > numberOfStudent > > > > (numberOfStudent is ordered by desc for each teach) > > > > Output is top 30 classId that numberOfStudent is max for each > teacher. > > > > My approach is MapReduce like Wordcount example. But I don't know how > > to > > > > determine key for map function. > > > > I run Wordcount example, understand its code but I have no experience > > at > > > > programming MapReduce. > > > > > > > > Can anyone help me to resolve this problem ? > > > > Thanks so much ! > > > > > > > > > > > > -- > > > > Lạc Trung > > > > 20083535 > > > > > > > > > > > > > > > > -- > > > Jay Vyas > > > MMSB/UCHC > > > > > > > > > > > -- > > Lạc Trung > > 20083535 > > > > > > -- > Jay Vyas > MMSB/UCHC > -- Lạc Trung 20083535