data locality in logs

2014-02-05 Thread Tsai Li Ming
Hi, In older posts on Google Groups, there was mention of checking the logs on “preferred/non-preferred” for data locality. But I can’t seem to find this on 0.9.0 anymore? Has this been changed to “PROCESS_LOCAL” , like this: 14/02/06 13:51:45 INFO TaskSetManager: Starting task 9.0:50 as TID

Re: data locality in logs

2014-02-05 Thread Andrew Ash
If you have multiple executors running on a single node then you might have data that's on the same server but in different JVMs. Just on the same server is NODE_LOCAL, but being in the same JVM is PROCESS_LOCAL. Yes it was changed to be more specific than just preferred/non-preferred. The new