Hi Julien, Gora doesn't take into account data locality as this is taken care of by each different data store. The one thing that Gora "take" into account is the number of partitions it should use, and that number of partitions are used to run more/less map tasks. This partition number hasn't been implemented by all data stores properly and AFAIK most of them return a single partition, which means we only use a Map task to read as much data as we have to. Planning to work on this coming summer (SouthAmerican summer) ;)
Renato M. 2013/10/2 Julien Nioche <[email protected]> > Hi guys, > > I can't quite remember whether Gora takes data locality into account when > generating the input for a map reduce job. Could someone explain how its is > currently handled and if things differ from one backend to the other then > how? > > Thanks > > Julien > > -- > * > *Open Source Solutions for Text Engineering > > http://digitalpebble.blogspot.com/ > http://www.digitalpebble.com > http://twitter.com/digitalpebble >

