I think you can also simulate PageRank Algorithm with hadoop. Simon -
On Sun, Feb 27, 2011 at 9:20 PM, Lance Norskog <goks...@gmail.com> wrote: > This is an exercise that will appeal to undergrads: pull the Craiglist > personals ads from several cities, and do text classification. Given a > training set of all the cities, attempt to classify test ads by city. > (If Peter Harrington is out there, I stole this from you.) > > Lance > > On Sun, Feb 27, 2011 at 4:55 PM, Ted Dunning <tdunn...@maprtech.com> > wrote: > > Ted, > > > > Greetings back at you. It has been a while. > > > > Check out Jimmy Lin and Chris Dyer's book about text processing with > > hadoop: > > > > http://www.umiacs.umd.edu/~jimmylin/book.html > > > > > > On Sun, Feb 27, 2011 at 4:34 PM, Ted Pedersen <tpede...@d.umn.edu> > wrote: > > > >> Greetings all, > >> > >> I'm teaching an undergraduate Computer Science class that is using > >> Hadoop quite heavily, and would like to include some case studies at > >> various points during this semester. > >> > >> We are using Tom White's "Hadoop The Definitive Guide" as a text, and > >> that includes a very nice chapter of case studies which might even > >> provide enough material for my purposes. > >> > >> But, I wanted to check and see if there were other case studies out > >> there that might provide motivating and interesting examples of how > >> Hadoop is currently being used. The idea is to find material that goes > >> beyond simply saying "X uses Hadoop" to explaining in more detail how > >> and why X are using Hadoop. > >> > >> Any hints would be very gratefully received. > >> > >> Cordially, > >> Ted > >> > >> -- > >> Ted Pedersen > >> http://www.d.umn.edu/~tpederse > >> > > > > > > -- > Lance Norskog > goks...@gmail.com > -- Regards, Simon