Hi All,
The data size of my task is about 30mb. It runs smoothly in local mode. However, when I submit it to the cluster, it throws the titled error (Please see below for the complete output). Actually, my output is almost the same with http://stackoverflow.com/questions/24080891/spark-program-hangs-at-job-finished-toarray-workers-throw-java-util-concurren. I also toArray my data, which was the reason of his case. However, how come it runs OK in local but not in the cluster? The memory of each worker is over 60g, and my run command is: "$SPARK_HOME/bin/spark-class org.apache.spark.deploy.Client launch spark://10.196.135.101:7077 $jar_path $programname -Dspark.master=spark://10.196.135.101:7077 -Dspark.cores.max=300 -Dspark.executor.memory=20g -spark.jars=$jar_path -Dspark.default.parallelism=100 -Dspark.hadoop.hadoop.job.ugi=$username,$groupname -Dspark.app.name=$appname $in_path $scala_out_path" Looking for help and thanks a lot! Below please find the complete output: 14/07/31 15:06:53 WARN Configuration: DEPRECATED: hadoop-site.xml found in the classpath. Usage of hadoop-site.xml is deprecated. Instead use core-site.xml, mapred-site.xml and hdfs-site.xml to override properties of core-default.xml, mapred-default.xml and hdfs-default.xml respectively 14/07/31 15:06:53 INFO SecurityManager: Changing view acls to: spark 14/07/31 15:06:53 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(spark) 14/07/31 15:06:53 INFO Slf4jLogger: Slf4jLogger started 14/07/31 15:06:53 INFO Remoting: Starting remoting 14/07/31 15:06:54 INFO Remoting: Remoting started; listening on addresses :[akka.tcp://sparkExecutor@tdw-10-215-140-22:39446] 14/07/31 15:06:54 INFO Remoting: Remoting now listens on addresses: [akka.tcp://sparkExecutor@tdw-10-215-140-22:39446] 14/07/31 15:06:54 INFO CoarseGrainedExecutorBackend: Connecting to driver: akka.tcp://spark@tdw-10-196-135-106:38502/user/CoarseGrainedScheduler 14/07/31 15:06:54 INFO WorkerWatcher: Connecting to worker akka.tcp://sparkWorker@tdw-10-215-140-22:34755/user/Worker 14/07/31 15:06:54 INFO WorkerWatcher: Successfully connected to akka.tcp://sparkWorker@tdw-10-215-140-22:34755/user/Worker 14/07/31 15:06:56 INFO CoarseGrainedExecutorBackend: Successfully registered with driver 14/07/31 15:06:56 INFO SecurityManager: Changing view acls to: spark 14/07/31 15:06:56 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(spark) 14/07/31 15:06:56 INFO Slf4jLogger: Slf4jLogger started 14/07/31 15:06:56 INFO Remoting: Starting remoting 14/07/31 15:06:56 INFO Remoting: Remoting started; listening on addresses :[akka.tcp://spark@tdw-10-215-140-22:56708] 14/07/31 15:06:56 INFO Remoting: Remoting now listens on addresses: [akka.tcp://spark@tdw-10-215-140-22:56708] 14/07/31 15:06:56 INFO SparkEnv: Connecting to MapOutputTracker: akka.tcp://spark@tdw-10-196-135-106:38502/user/MapOutputTracker 14/07/31 15:06:58 INFO SparkEnv: Connecting to BlockManagerMaster: akka.tcp://spark@tdw-10-196-135-106:38502/user/BlockManagerMaster 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data1/sparkenv/local/spark-local-20140731150659-3f12 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data2/sparkenv/local/spark-local-20140731150659-1602 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data3/sparkenv/local/spark-local-20140731150659-d213 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data4/sparkenv/local/spark-local-20140731150659-f42e 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data5/sparkenv/local/spark-local-20140731150659-63d0 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data6/sparkenv/local/spark-local-20140731150659-9003 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data7/sparkenv/local/spark-local-20140731150659-f260 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data8/sparkenv/local/spark-local-20140731150659-6334 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data9/sparkenv/local/spark-local-20140731150659-3af4 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data10/sparkenv/local/spark-local-20140731150659-133d 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at /data11/sparkenv/local/spark-local-20140731150659-ed08 14/07/31 15:06:59 INFO MemoryStore: MemoryStore started with capacity 11.5 GB. 14/07/31 15:06:59 INFO ConnectionManager: Bound socket to port 35127 with id = ConnectionManagerId(tdw-10-215-140-22,35127) 14/07/31 15:06:59 INFO BlockManagerMaster: Trying to register BlockManager 14/07/31 15:07:00 INFO BlockManagerMaster: Registered BlockManager 14/07/31 15:07:00 INFO HttpFileServer: HTTP File server directory is /tmp/spark-0914d215-dd22-4d5e-9ec0-724937dbfd8b 14/07/31 15:07:00 INFO HttpServer: Starting HTTP Server 14/07/31 15:07:26 INFO CoarseGrainedExecutorBackend: Got assigned task 12 14/07/31 15:07:26 INFO CoarseGrainedExecutorBackend: Got assigned task 25 14/07/31 15:07:26 INFO Executor: Running task ID 25 14/07/31 15:07:26 INFO Executor: Running task ID 12 14/07/31 15:07:26 INFO Executor: Fetching hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/adsorption_2.10-1.0.jar with timestamp 1406790410442 14/07/31 15:07:26 INFO Executor: Adding file:/data/home/spark/spark-1.0.0-bin-0.20.2-cdh3u3/work/app-20140731150652-4911/8/./adsorption_2.10-1.0.jar to class loader 14/07/31 15:07:26 INFO HttpBroadcast: Started reading broadcast variable 0 14/07/31 15:07:28 INFO MemoryStore: ensureFreeSpace(102650) called with curMem=0, maxMem=12348240691 14/07/31 15:07:28 INFO MemoryStore: Block broadcast_0 stored as values to memory (estimated size 100.2 KB, free 11.5 GB) 14/07/31 15:07:28 INFO HttpBroadcast: Reading broadcast variable 0 took 1.702291406 s 14/07/31 15:07:28 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:28 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00072:0+197955 14/07/31 15:07:28 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00062:0+218630 14/07/31 15:07:28 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable 14/07/31 15:07:28 WARN LoadSnappy: Snappy native library not loaded 14/07/31 15:07:29 INFO Executor: Serialized size of result for 12 is 887 14/07/31 15:07:29 INFO Executor: Serialized size of result for 25 is 887 14/07/31 15:07:29 INFO Executor: Sending result for 25 directly to driver 14/07/31 15:07:29 INFO Executor: Sending result for 12 directly to driver 14/07/31 15:07:29 INFO Executor: Finished task ID 12 14/07/31 15:07:29 INFO Executor: Finished task ID 25 14/07/31 15:07:30 INFO CoarseGrainedExecutorBackend: Got assigned task 30 14/07/31 15:07:30 INFO Executor: Running task ID 30 14/07/31 15:07:30 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:30 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00084:0+196433 14/07/31 15:07:30 INFO Executor: Serialized size of result for 30 is 887 14/07/31 15:07:30 INFO Executor: Sending result for 30 directly to driver 14/07/31 15:07:30 INFO Executor: Finished task ID 30 14/07/31 15:07:30 INFO CoarseGrainedExecutorBackend: Got assigned task 31 14/07/31 15:07:30 INFO Executor: Running task ID 31 14/07/31 15:07:30 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:30 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00089:0+190194 14/07/31 15:07:30 INFO Executor: Serialized size of result for 31 is 887 14/07/31 15:07:30 INFO Executor: Sending result for 31 directly to driver 14/07/31 15:07:30 INFO Executor: Finished task ID 31 14/07/31 15:07:31 INFO CoarseGrainedExecutorBackend: Got assigned task 54 14/07/31 15:07:31 INFO Executor: Running task ID 54 14/07/31 15:07:31 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:31 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00096:0+153443 14/07/31 15:07:31 INFO CoarseGrainedExecutorBackend: Got assigned task 55 14/07/31 15:07:31 INFO Executor: Running task ID 55 14/07/31 15:07:31 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:31 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00139:0+174726 14/07/31 15:07:31 INFO Executor: Serialized size of result for 54 is 887 14/07/31 15:07:31 INFO Executor: Sending result for 54 directly to driver 14/07/31 15:07:31 INFO Executor: Finished task ID 54 14/07/31 15:07:31 INFO Executor: Serialized size of result for 55 is 887 14/07/31 15:07:31 INFO Executor: Sending result for 55 directly to driver 14/07/31 15:07:31 INFO Executor: Finished task ID 55 14/07/31 15:07:32 INFO CoarseGrainedExecutorBackend: Got assigned task 76 14/07/31 15:07:32 INFO Executor: Running task ID 76 14/07/31 15:07:32 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:32 INFO CoarseGrainedExecutorBackend: Got assigned task 79 14/07/31 15:07:32 INFO Executor: Running task ID 79 14/07/31 15:07:32 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:32 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00149:0+134758 14/07/31 15:07:32 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00157:0+103176 14/07/31 15:07:32 INFO Executor: Serialized size of result for 79 is 887 14/07/31 15:07:32 INFO Executor: Sending result for 79 directly to driver 14/07/31 15:07:32 INFO Executor: Finished task ID 79 14/07/31 15:07:32 INFO Executor: Serialized size of result for 76 is 887 14/07/31 15:07:32 INFO Executor: Sending result for 76 directly to driver 14/07/31 15:07:32 INFO Executor: Finished task ID 76 14/07/31 15:07:34 INFO CoarseGrainedExecutorBackend: Got assigned task 99 14/07/31 15:07:34 INFO Executor: Running task ID 99 14/07/31 15:07:34 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:34 INFO HadoopRDD: Input split: hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00167:0+51005 14/07/31 15:07:34 INFO Executor: Serialized size of result for 99 is 887 14/07/31 15:07:34 INFO Executor: Sending result for 99 directly to driver 14/07/31 15:07:34 INFO Executor: Finished task ID 99 14/07/31 15:07:39 INFO CoarseGrainedExecutorBackend: Got assigned task 181 14/07/31 15:07:39 INFO Executor: Running task ID 181 14/07/31 15:07:39 INFO CoarseGrainedExecutorBackend: Got assigned task 196 14/07/31 15:07:39 INFO Executor: Running task ID 196 14/07/31 15:07:39 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:39 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Updating epoch to 1 and clearing cache 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-24/10.215.140.24] 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-196-135-105/10.196.135.105] 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-21/10.215.140.21] 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-12/10.215.140.12] 46.947: [GC 10486272K->32824K(40196096K), 0.0347340 secs] 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-13/10.215.140.13] 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-23/10.215.140.23] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-13/10.215.140.13:58657] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-23/10.215.140.23:39188] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-21/10.215.140.21:36128] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-12/10.215.140.12:33380] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-196-135-105/10.196.135.105:36859] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-24/10.215.140.24:49100] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-23/10.215.140.23:39188], 2 messages pending 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-13/10.215.140.13:58657], 2 messages pending 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-196-135-105/10.196.135.105:36859], 2 messages pending 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-12/10.215.140.12:33380], 1 messages pending 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-21/10.215.140.21:36128], 1 messages pending 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-24/10.215.140.24:49100], 2 messages pending 14/07/31 15:07:39 INFO CacheManager: Partition rdd_6_10 not found, computing it 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Don't have map outputs for shuffle 0, fetching them 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Doing the fetch; tracker actor = Actor[akka.tcp://spark@tdw-10-196-135-106:38502/user/MapOutputTracker#-128956169] 14/07/31 15:07:39 INFO CacheManager: Partition rdd_6_25 not found, computing it 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Don't have map outputs for shuffle 0, fetching them 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-17/10.215.140.17] 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-25/10.215.140.25] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-17/10.215.140.17:35040] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-17/10.215.140.17:35040], 1 messages pending 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-18/10.215.140.18] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-25/10.215.140.25:50298] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-25/10.215.140.25:50298], 2 messages pending 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-18/10.215.140.18:33575] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-18/10.215.140.18:33575], 1 messages pending 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-14/10.215.140.14] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-14/10.215.140.14:37220] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-14/10.215.140.14:37220], 1 messages pending 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Got the output locations 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 171 non-empty blocks out of 171 blocks 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 171 non-empty blocks out of 171 blocks 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-196-135-107/10.196.135.107:59290] 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-11/10.215.140.11:50302] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-11/10.215.140.11:50302], 1 messages pending 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-196-135-106/10.196.135.106:34128] 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-11/10.215.140.11] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-196-135-107/10.196.135.107:59290], 2 messages pending 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-196-135-106/10.196.135.106:34128], 1 messages pending 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to [tdw-10-215-140-15/10.215.140.15:50069] 14/07/31 15:07:39 INFO SendingConnection: Connected to [tdw-10-215-140-15/10.215.140.15:50069], 1 messages pending 14/07/31 15:07:40 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 14 remote fetches in 48 ms 14/07/31 15:07:40 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-15/10.215.140.15] 14/07/31 15:07:40 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 14 remote fetches in 49 ms 14/07/31 15:07:40 INFO ConnectionManager: Accepted connection from [tdw-10-215-140-16/10.215.140.16] 14/07/31 15:07:40 INFO ConnectionManager: Accepted connection from [tdw-10-196-135-102/10.196.135.102] 14/07/31 15:07:40 INFO SendingConnection: Initiating connection to [tdw-10-215-140-16/10.215.140.16:58648] 14/07/31 15:07:40 INFO SendingConnection: Connected to [tdw-10-215-140-16/10.215.140.16:58648], 1 messages pending 14/07/31 15:07:40 INFO SendingConnection: Initiating connection to [tdw-10-196-135-102/10.196.135.102:45729] 14/07/31 15:07:40 INFO SendingConnection: Connected to [tdw-10-196-135-102/10.196.135.102:45729], 1 messages pending 14/07/31 15:07:42 INFO ConnectionManager: Accepted connection from [tdw-10-196-135-106/10.196.135.106] 14/07/31 15:07:44 INFO ConnectionManager: Accepted connection from [tdw-10-196-135-107/10.196.135.107] 14/07/31 15:07:45 INFO MemoryStore: ensureFreeSpace(1922882) called with curMem=102650, maxMem=12348240691 14/07/31 15:07:45 INFO MemoryStore: Block rdd_6_25 stored as values to memory (estimated size 1877.8 KB, free 11.5 GB) 14/07/31 15:07:46 INFO MemoryStore: ensureFreeSpace(1912396) called with curMem=2025532, maxMem=12348240691 14/07/31 15:07:46 INFO MemoryStore: Block rdd_6_10 stored as values to memory (estimated size 1867.6 KB, free 11.5 GB) 14/07/31 15:07:46 INFO BlockManagerMaster: Updated info of block rdd_6_10 14/07/31 15:07:46 INFO BlockManagerMaster: Updated info of block rdd_6_25 14/07/31 15:07:46 INFO Executor: Serialized size of result for 181 is 421363 14/07/31 15:07:46 INFO Executor: Serialized size of result for 196 is 421522 14/07/31 15:07:46 INFO Executor: Sending result for 181 directly to driver 14/07/31 15:07:46 INFO Executor: Sending result for 196 directly to driver 14/07/31 15:07:46 INFO Executor: Finished task ID 181 14/07/31 15:07:46 INFO Executor: Finished task ID 196 14/07/31 15:07:50 INFO CoarseGrainedExecutorBackend: Got assigned task 219 14/07/31 15:07:50 INFO Executor: Running task ID 219 14/07/31 15:07:50 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:50 INFO CacheManager: Partition rdd_6_48 not found, computing it 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 171 non-empty blocks out of 171 blocks 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 14 remote fetches in 19 ms 14/07/31 15:07:50 INFO CoarseGrainedExecutorBackend: Got assigned task 225 14/07/31 15:07:50 INFO Executor: Running task ID 225 14/07/31 15:07:50 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:50 INFO CacheManager: Partition rdd_6_54 not found, computing it 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 171 non-empty blocks out of 171 blocks 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 14 remote fetches in 15 ms 14/07/31 15:07:51 INFO MemoryStore: ensureFreeSpace(1927469) called with curMem=3937928, maxMem=12348240691 14/07/31 15:07:51 INFO MemoryStore: Block rdd_6_48 stored as values to memory (estimated size 1882.3 KB, free 11.5 GB) 14/07/31 15:07:51 INFO BlockManagerMaster: Updated info of block rdd_6_48 14/07/31 15:07:51 INFO Executor: Serialized size of result for 219 is 424342 14/07/31 15:07:51 INFO Executor: Sending result for 219 directly to driver 14/07/31 15:07:51 INFO Executor: Finished task ID 219 14/07/31 15:07:51 INFO MemoryStore: ensureFreeSpace(1909775) called with curMem=5865397, maxMem=12348240691 14/07/31 15:07:51 INFO MemoryStore: Block rdd_6_54 stored as values to memory (estimated size 1865.0 KB, free 11.5 GB) 14/07/31 15:07:51 INFO BlockManagerMaster: Updated info of block rdd_6_54 14/07/31 15:07:51 INFO Executor: Serialized size of result for 225 is 421546 14/07/31 15:07:51 INFO Executor: Sending result for 225 directly to driver 14/07/31 15:07:51 INFO Executor: Finished task ID 225 14/07/31 15:07:53 INFO CoarseGrainedExecutorBackend: Got assigned task 251 14/07/31 15:07:53 INFO Executor: Running task ID 251 14/07/31 15:07:53 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:53 INFO CacheManager: Partition rdd_6_80 not found, computing it 14/07/31 15:07:53 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:07:53 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 171 non-empty blocks out of 171 blocks 14/07/31 15:07:53 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 14 remote fetches in 15 ms 14/07/31 15:07:53 INFO MemoryStore: ensureFreeSpace(1927469) called with curMem=7775172, maxMem=12348240691 14/07/31 15:07:53 INFO MemoryStore: Block rdd_6_80 stored as values to memory (estimated size 1882.3 KB, free 11.5 GB) 14/07/31 15:07:53 INFO BlockManagerMaster: Updated info of block rdd_6_80 14/07/31 15:07:53 INFO Executor: Serialized size of result for 251 is 424634 14/07/31 15:07:53 INFO Executor: Sending result for 251 directly to driver 14/07/31 15:07:53 INFO Executor: Finished task ID 251 14/07/31 15:07:54 INFO CoarseGrainedExecutorBackend: Got assigned task 259 14/07/31 15:07:54 INFO Executor: Running task ID 259 14/07/31 15:07:54 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:54 INFO CacheManager: Partition rdd_6_88 not found, computing it 14/07/31 15:07:54 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:07:54 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 171 non-empty blocks out of 171 blocks 14/07/31 15:07:54 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 14 remote fetches in 13 ms 14/07/31 15:07:54 INFO MemoryStore: ensureFreeSpace(1921571) called with curMem=9702641, maxMem=12348240691 14/07/31 15:07:54 INFO MemoryStore: Block rdd_6_88 stored as values to memory (estimated size 1876.5 KB, free 11.5 GB) 14/07/31 15:07:54 INFO BlockManagerMaster: Updated info of block rdd_6_88 14/07/31 15:07:54 INFO Executor: Serialized size of result for 259 is 418167 14/07/31 15:07:54 INFO Executor: Sending result for 259 directly to driver 14/07/31 15:07:54 INFO Executor: Finished task ID 259 14/07/31 15:07:56 INFO CoarseGrainedExecutorBackend: Got assigned task 273 14/07/31 15:07:56 INFO Executor: Running task ID 273 14/07/31 15:07:56 INFO CoarseGrainedExecutorBackend: Got assigned task 290 14/07/31 15:07:56 INFO Executor: Running task ID 290 14/07/31 15:07:56 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:56 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:56 INFO BlockManager: Found block rdd_6_10 locally 14/07/31 15:07:56 INFO BlockManager: Found block rdd_6_25 locally 14/07/31 15:07:56 INFO Executor: Serialized size of result for 273 is 887 14/07/31 15:07:56 INFO Executor: Sending result for 273 directly to driver 14/07/31 15:07:56 INFO Executor: Finished task ID 273 14/07/31 15:07:56 INFO Executor: Serialized size of result for 290 is 887 14/07/31 15:07:56 INFO Executor: Sending result for 290 directly to driver 14/07/31 15:07:56 INFO Executor: Finished task ID 290 14/07/31 15:07:57 INFO CoarseGrainedExecutorBackend: Got assigned task 308 14/07/31 15:07:57 INFO Executor: Running task ID 308 14/07/31 15:07:57 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:57 INFO BlockManager: Found block rdd_6_48 locally 14/07/31 15:07:57 INFO CoarseGrainedExecutorBackend: Got assigned task 311 14/07/31 15:07:57 INFO Executor: Running task ID 311 14/07/31 15:07:57 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:57 INFO BlockManager: Found block rdd_6_54 locally 14/07/31 15:07:57 INFO Executor: Serialized size of result for 308 is 887 14/07/31 15:07:57 INFO Executor: Sending result for 308 directly to driver 14/07/31 15:07:57 INFO Executor: Finished task ID 308 14/07/31 15:07:57 INFO Executor: Serialized size of result for 311 is 887 14/07/31 15:07:57 INFO Executor: Sending result for 311 directly to driver 14/07/31 15:07:57 INFO Executor: Finished task ID 311 14/07/31 15:07:58 INFO CoarseGrainedExecutorBackend: Got assigned task 339 14/07/31 15:07:58 INFO Executor: Running task ID 339 14/07/31 15:07:58 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:58 INFO BlockManager: Found block rdd_6_80 locally 14/07/31 15:07:58 INFO CoarseGrainedExecutorBackend: Got assigned task 341 14/07/31 15:07:58 INFO Executor: Running task ID 341 14/07/31 15:07:58 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:58 INFO BlockManager: Found block rdd_6_88 locally 14/07/31 15:07:58 INFO Executor: Serialized size of result for 339 is 887 14/07/31 15:07:58 INFO Executor: Sending result for 339 directly to driver 14/07/31 15:07:58 INFO Executor: Finished task ID 339 14/07/31 15:07:58 INFO Executor: Serialized size of result for 341 is 887 14/07/31 15:07:58 INFO Executor: Sending result for 341 directly to driver 14/07/31 15:07:58 INFO Executor: Finished task ID 341 14/07/31 15:07:59 INFO CoarseGrainedExecutorBackend: Got assigned task 377 14/07/31 15:07:59 INFO Executor: Running task ID 377 14/07/31 15:07:59 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Updating epoch to 2 and clearing cache 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Don't have map outputs for shuffle 1, fetching them 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Doing the fetch; tracker actor = Actor[akka.tcp://spark@tdw-10-196-135-106:38502/user/MapOutputTracker#-128956169] 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Got the output locations 14/07/31 15:07:59 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:07:59 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 100 non-empty blocks out of 100 blocks 14/07/31 15:07:59 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 16 remote fetches in 9 ms 14/07/31 15:08:00 INFO CoarseGrainedExecutorBackend: Got assigned task 393 14/07/31 15:08:00 INFO Executor: Running task ID 393 14/07/31 15:08:00 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:08:00 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:08:00 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 100 non-empty blocks out of 100 blocks 14/07/31 15:08:00 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 16 remote fetches in 8 ms 14/07/31 15:08:00 INFO Executor: Serialized size of result for 377 is 303256 14/07/31 15:08:00 INFO Executor: Sending result for 377 directly to driver 14/07/31 15:08:00 INFO Executor: Finished task ID 377 14/07/31 15:08:00 INFO Executor: Serialized size of result for 393 is 310660 14/07/31 15:08:00 INFO Executor: Sending result for 393 directly to driver 14/07/31 15:08:00 INFO Executor: Finished task ID 393 14/07/31 15:08:01 INFO CoarseGrainedExecutorBackend: Got assigned task 403 14/07/31 15:08:01 INFO Executor: Running task ID 403 14/07/31 15:08:01 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:08:01 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:08:01 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 100 non-empty blocks out of 100 blocks 14/07/31 15:08:01 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 16 remote fetches in 7 ms 14/07/31 15:08:01 INFO Executor: Serialized size of result for 403 is 299667 14/07/31 15:08:01 INFO Executor: Sending result for 403 directly to driver 14/07/31 15:08:01 INFO Executor: Finished task ID 403 14/07/31 15:08:02 INFO CoarseGrainedExecutorBackend: Got assigned task 412 14/07/31 15:08:02 INFO Executor: Running task ID 412 14/07/31 15:08:02 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:08:02 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:08:02 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 100 non-empty blocks out of 100 blocks 14/07/31 15:08:02 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 16 remote fetches in 6 ms 14/07/31 15:08:02 INFO Executor: Serialized size of result for 412 is 301593 14/07/31 15:08:02 INFO Executor: Sending result for 412 directly to driver 14/07/31 15:08:02 INFO Executor: Finished task ID 412 14/07/31 15:08:04 INFO CoarseGrainedExecutorBackend: Got assigned task 437 14/07/31 15:08:04 INFO Executor: Running task ID 437 14/07/31 15:08:04 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 100 non-empty blocks out of 100 blocks 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 16 remote fetches in 6 ms 14/07/31 15:08:04 INFO Executor: Serialized size of result for 437 is 312543 14/07/31 15:08:04 INFO Executor: Sending result for 437 directly to driver 14/07/31 15:08:04 INFO Executor: Finished task ID 437 14/07/31 15:08:04 INFO CoarseGrainedExecutorBackend: Got assigned task 445 14/07/31 15:08:04 INFO Executor: Running task ID 445 14/07/31 15:08:04 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 100 non-empty blocks out of 100 blocks 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 16 remote fetches in 6 ms 14/07/31 15:08:04 INFO Executor: Serialized size of result for 445 is 307049 14/07/31 15:08:04 INFO Executor: Sending result for 445 directly to driver 14/07/31 15:08:04 INFO Executor: Finished task ID 445 14/07/31 15:08:06 INFO CoarseGrainedExecutorBackend: Got assigned task 467 14/07/31 15:08:06 INFO Executor: Running task ID 467 14/07/31 15:08:06 INFO BlockManager: Found block broadcast_0 locally 14/07/31 15:08:06 INFO BlockFetcherIterator$BasicBlockFetcherIterator: maxBytesInFlight: 50331648, targetRequestSize: 10066329 14/07/31 15:08:06 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Getting 100 non-empty blocks out of 100 blocks 14/07/31 15:08:06 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Started 16 remote fetches in 6 ms 14/07/31 15:08:07 INFO Executor: Serialized size of result for 467 is 301177 14/07/31 15:08:07 INFO Executor: Sending result for 467 directly to driver 14/07/31 15:08:07 INFO Executor: Finished task ID 467 14/07/31 15:08:18 INFO ShuffleBlockManager: Deleted all files for shuffle 1 14/07/31 15:09:00 WARN BlockManagerMaster: Error sending message to BlockManagerMaster in 1 attempts java.util.concurrent.TimeoutException: Futures timed out after [30 seconds] at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:219) at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223) at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:107) at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53) at scala.concurrent.Await$.result(package.scala:107) at org.apache.spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:237) at org.apache.spark.storage.BlockManagerMaster.sendHeartBeat(BlockManagerMaster.scala:51) at org.apache.spark.storage.BlockManager.org$apache$spark$storage$BlockManager$$heartBeat(BlockManager.scala:113) at org.apache.spark.storage.BlockManager$$anonfun$initialize$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(BlockManager.scala:158) at org.apache.spark.util.Utils$.tryOrExit(Utils.scala:790) at org.apache.spark.storage.BlockManager$$anonfun$initialize$1.apply$mcV$sp(BlockManager.scala:158) at akka.actor.Scheduler$$anon$9.run(Scheduler.scala:80) at akka.actor.LightArrayRevolverScheduler$$anon$3$$anon$2.run(Scheduler.scala:241) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) 14/07/31 15:09:12 INFO ShuffleBlockManager: Deleted all files for shuffle 0 14/07/31 15:09:12 INFO BlockManager: Removing RDD 6 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_88 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_88 of size 1921571 dropped from memory (free 12338538050) 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_25 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_25 of size 1922882 dropped from memory (free 12340460932) 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_10 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_10 of size 1912396 dropped from memory (free 12342373328) 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_54 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_54 of size 1909775 dropped from memory (free 12344283103) 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_80 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_80 of size 1927469 dropped from memory (free 12346210572) 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_48 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_48 of size 1927469 dropped from memory (free 12348138041)