Tim Robertson wrote:
Thanks Alex. This will let me share the shapefile, but I need to read it, parse it, and store the objects in the index only once per job per JVM. Is Mapper.configure() the best place to do this? That is, will it be called only once per job?
In Hadoop 0.19, with HADOOP-249 (JVM reuse), all tasks from a job can run in a single JVM. So yes, you could access a static cache from Mapper.configure().
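
Note that configure() itself runs once per task, not once per job, so the static cache needs a guard so that only the first task in a JVM pays the loading cost. A minimal sketch of that guard follows. The class name ShapeIndexCache, the "shapefile.path" configuration key, and the placeholder load() body are all hypothetical, and the index is modelled as a plain list of strings because the real index type is not specified here. The class is kept free of Hadoop imports so it can be tried on its own.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical per-JVM cache; none of these names are part of the Hadoop API. */
final class ShapeIndexCache {
    // Static, so it lives as long as the JVM and is shared by every task run in it.
    private static List<String> index;

    // Counts how many times the expensive load actually ran (for illustration only).
    static final AtomicInteger LOADS = new AtomicInteger();

    private ShapeIndexCache() {}

    /** Returns the shared index, loading it on the first call in this JVM. */
    static synchronized List<String> get(String shapefilePath) {
        if (index == null) {
            index = Collections.unmodifiableList(load(shapefilePath));
        }
        return index;
    }

    // Placeholder for the real work: reading and parsing the shapefile.
    private static List<String> load(String shapefilePath) {
        LOADS.incrementAndGet();
        List<String> shapes = new ArrayList<>();
        shapes.add("parsed:" + shapefilePath);
        return shapes;
    }
}

// Assumed usage inside an old-API (org.apache.hadoop.mapred) mapper:
//
//   public void configure(JobConf job) {
//       // "shapefile.path" is a made-up key; use whatever your job sets.
//       this.index = ShapeIndexCache.get(job.get("shapefile.path"));
//   }
```

The cache only helps across tasks if JVM reuse is actually turned on for the job; otherwise each task starts a fresh JVM and loads the file again. Treat the returned index as read-only, since later tasks in the same JVM will see the same object.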
Doug