As far as replication goes, you should look at a project called pastry. Apparently some people have used hadoop mapreduce on top of it. You will need to be clever, however, in how you do your mapreduce because you probably won't want the job to eat all the users cpu time.
On Dec 2, 2009 5:11 PM, "Habermaas, William" <william.haberm...@fatwire.com> wrote: Hadoop isn't going to like losing its datanodes when people shutdown their computers. More importantly, when the datanodes are running, your users will be impacted by data replication. Unlike Seti, Hadoop doesn't know when the user's screensaver is running so it will start doing things when it feels like it. Can someone else comment on whether HOD (hadoop-on-demand) would fit this scenario? Bill -----Original Message----- From: Maciej Trebacz [mailto: maciej.treb...@gmail.com] Sent: Wednesday,...