On 1/6/2011 7:53 AM, Matt Pouttu-Clarke wrote:
Hi All,

I am using Derby as an embedded database within a Hadoop job to lookup IP geographic info.
http://mpouttuclarke.wordpress.com/2010/12/10/java-embedded-db-for-ip2location-in-hadoop/

The problem is that Hadoop has an option called JVM sharing where more than one thread may be active in the JVM instance. Since the embedded option only supports one thread at a time, I have had to turn off JVM sharing for my IP lookup job (by setting mapred.job.reuse.jvm.num.tasks to 1).

Embedded definitely supports more than one thread at a time as long as all of those threads are in the same JVM. Each thread should just have it's own separate Connection.

Reply via email to