[ https://issues.apache.org/jira/browse/SYSTEMML-911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15487888#comment-15487888 ]
Glenn Weidner commented on SYSTEMML-911: ---------------------------------------- Updated SystemML.jar on Spark 1.6 test system and successfully ran LinearRegrCG.scala using MLContext from spark-shell for nRows: Int = 1000000 nCols: Int = 100 Also increased to 4096MB following spark-config settings: spark.yarn.driver.memoryOverhead spark.yarn.executor.memoryOverhead Attached console results with final output shown below. scala> res.getDF( sqlContext, "M").sort("__INDEX").show() 16/09/13 08:58:05 INFO SparkContext: Starting job: show at <console>:43 16/09/13 08:58:05 INFO DAGScheduler: Registering RDD 19 (flatMapToPair at RDDConverterUtilsExt.java:501) 16/09/13 08:58:05 INFO DAGScheduler: Got job 6 (show at <console>:43) with 4 output partitions 16/09/13 08:58:05 INFO DAGScheduler: Final stage: ResultStage 9 (show at <console>:43) 16/09/13 08:58:05 INFO DAGScheduler: Parents of final stage: List(ShuffleMapStage 8) 16/09/13 08:58:05 INFO DAGScheduler: Missing parents: List(ShuffleMapStage 8) 16/09/13 08:58:05 INFO DAGScheduler: Submitting ShuffleMapStage 8 (MapPartitionsRDD[19] at flatMapToPair at RDDConverterUtilsExt.java:501), which has no missing parents 16/09/13 08:58:05 INFO MemoryStore: Block broadcast_8 stored as values in memory (estimated size 3.8 KB, free 9.4 KB) 16/09/13 08:58:05 INFO MemoryStore: Block broadcast_8_piece0 stored as bytes in memory (estimated size 2.1 KB, free 11.5 KB) 16/09/13 08:58:05 INFO BlockManagerInfo: Added broadcast_8_piece0 in memory on localhost:51459 (size: 2.1 KB, free: 2.7 GB) 16/09/13 08:58:05 INFO SparkContext: Created broadcast 8 from broadcast at DAGScheduler.scala:1006 16/09/13 08:58:05 INFO DAGScheduler: Submitting 4 missing tasks from ShuffleMapStage 8 (MapPartitionsRDD[19] at flatMapToPair at RDDConverterUtilsExt.java:501) 16/09/13 08:58:05 INFO TaskSchedulerImpl: Adding task set 8.0 with 4 tasks 16/09/13 08:58:05 INFO TaskSetManager: Starting task 0.0 in stage 8.0 (TID 14, localhost, partition 0,PROCESS_LOCAL, 2118 bytes) 16/09/13 08:58:05 INFO TaskSetManager: Starting task 1.0 in stage 8.0 (TID 15, localhost, partition 1,PROCESS_LOCAL, 2118 bytes) 16/09/13 08:58:05 INFO TaskSetManager: Starting task 2.0 in stage 8.0 (TID 16, localhost, partition 2,PROCESS_LOCAL, 2118 bytes) 16/09/13 08:58:05 INFO TaskSetManager: Starting task 3.0 in stage 8.0 (TID 17, localhost, partition 3,PROCESS_LOCAL, 3133 bytes) 16/09/13 08:58:05 INFO Executor: Running task 0.0 in stage 8.0 (TID 14) 16/09/13 08:58:05 INFO Executor: Running task 1.0 in stage 8.0 (TID 15) 16/09/13 08:58:05 INFO Executor: Running task 3.0 in stage 8.0 (TID 17) 16/09/13 08:58:05 INFO Executor: Running task 2.0 in stage 8.0 (TID 16) 16/09/13 08:58:05 INFO Executor: Finished task 1.0 in stage 8.0 (TID 15). 1161 bytes result sent to driver 16/09/13 08:58:05 INFO TaskSetManager: Finished task 1.0 in stage 8.0 (TID 15) in 20 ms on localhost (1/4) 16/09/13 08:58:05 INFO Executor: Finished task 0.0 in stage 8.0 (TID 14). 1161 bytes result sent to driver 16/09/13 08:58:05 INFO TaskSetManager: Finished task 0.0 in stage 8.0 (TID 14) in 30 ms on localhost (2/4) 16/09/13 08:58:05 INFO Executor: Finished task 2.0 in stage 8.0 (TID 16). 1161 bytes result sent to driver 16/09/13 08:58:05 INFO TaskSetManager: Finished task 2.0 in stage 8.0 (TID 16) in 28 ms on localhost (3/4) 16/09/13 08:58:06 INFO Executor: Finished task 3.0 in stage 8.0 (TID 17). 1161 bytes result sent to driver 16/09/13 08:58:06 INFO TaskSetManager: Finished task 3.0 in stage 8.0 (TID 17) in 64 ms on localhost (4/4) 16/09/13 08:58:06 INFO TaskSchedulerImpl: Removed TaskSet 8.0, whose tasks have all completed, from pool 16/09/13 08:58:06 INFO DAGScheduler: ShuffleMapStage 8 (flatMapToPair at RDDConverterUtilsExt.java:501) finished in 0.064 s 16/09/13 08:58:06 INFO DAGScheduler: looking for newly runnable stages 16/09/13 08:58:06 INFO DAGScheduler: running: Set() 16/09/13 08:58:06 INFO DAGScheduler: waiting: Set(ResultStage 9) 16/09/13 08:58:06 INFO DAGScheduler: failed: Set() 16/09/13 08:58:06 INFO DAGScheduler: Submitting ResultStage 9 (MapPartitionsRDD[25] at show at <console>:43), which has no missing parents 16/09/13 08:58:06 INFO MemoryStore: Block broadcast_9 stored as values in memory (estimated size 8.0 KB, free 19.5 KB) 16/09/13 08:58:06 INFO MemoryStore: Block broadcast_9_piece0 stored as bytes in memory (estimated size 3.9 KB, free 23.5 KB) 16/09/13 08:58:06 INFO BlockManagerInfo: Added broadcast_9_piece0 in memory on localhost:51459 (size: 3.9 KB, free: 2.7 GB) 16/09/13 08:58:06 INFO SparkContext: Created broadcast 9 from broadcast at DAGScheduler.scala:1006 16/09/13 08:58:06 INFO DAGScheduler: Submitting 4 missing tasks from ResultStage 9 (MapPartitionsRDD[25] at show at <console>:43) 16/09/13 08:58:06 INFO TaskSchedulerImpl: Adding task set 9.0 with 4 tasks 16/09/13 08:58:06 INFO TaskSetManager: Starting task 0.0 in stage 9.0 (TID 18, localhost, partition 0,NODE_LOCAL, 1947 bytes) 16/09/13 08:58:06 INFO TaskSetManager: Starting task 1.0 in stage 9.0 (TID 19, localhost, partition 1,NODE_LOCAL, 1947 bytes) 16/09/13 08:58:06 INFO TaskSetManager: Starting task 2.0 in stage 9.0 (TID 20, localhost, partition 2,NODE_LOCAL, 1947 bytes) 16/09/13 08:58:06 INFO TaskSetManager: Starting task 3.0 in stage 9.0 (TID 21, localhost, partition 3,NODE_LOCAL, 1947 bytes) 16/09/13 08:58:06 INFO Executor: Running task 1.0 in stage 9.0 (TID 19) 16/09/13 08:58:06 INFO Executor: Running task 0.0 in stage 9.0 (TID 18) 16/09/13 08:58:06 INFO Executor: Running task 2.0 in stage 9.0 (TID 20) 16/09/13 08:58:06 INFO Executor: Running task 3.0 in stage 9.0 (TID 21) 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks out of 4 blocks 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks out of 4 blocks 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in 0 ms 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in 0 ms 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks out of 4 blocks 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in 0 ms 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Getting 1 non-empty blocks out of 4 blocks 16/09/13 08:58:06 INFO ShuffleBlockFetcherIterator: Started 0 remote fetches in 0 ms 16/09/13 08:58:06 INFO Executor: Finished task 1.0 in stage 9.0 (TID 19). 3394 bytes result sent to driver 16/09/13 08:58:06 INFO Executor: Finished task 3.0 in stage 9.0 (TID 21). 3394 bytes result sent to driver 16/09/13 08:58:06 INFO Executor: Finished task 0.0 in stage 9.0 (TID 18). 3394 bytes result sent to driver 16/09/13 08:58:06 INFO Executor: Finished task 2.0 in stage 9.0 (TID 20). 3394 bytes result sent to driver 16/09/13 08:58:06 INFO TaskSetManager: Finished task 3.0 in stage 9.0 (TID 21) in 87 ms on localhost (1/4) 16/09/13 08:58:06 INFO TaskSetManager: Finished task 0.0 in stage 9.0 (TID 18) in 88 ms on localhost (2/4) 16/09/13 08:58:06 INFO TaskSetManager: Finished task 2.0 in stage 9.0 (TID 20) in 89 ms on localhost (3/4) 16/09/13 08:58:06 INFO TaskSetManager: Finished task 1.0 in stage 9.0 (TID 19) in 90 ms on localhost (4/4) 16/09/13 08:58:06 INFO TaskSchedulerImpl: Removed TaskSet 9.0, whose tasks have all completed, from pool 16/09/13 08:58:06 INFO DAGScheduler: ResultStage 9 (show at <console>:43) finished in 0.082 s 16/09/13 08:58:06 INFO DAGScheduler: Job 6 finished: show at <console>:43, took 0.217130 s +-------+-------------------+ |__INDEX| C1| +-------+-------------------+ | 1.0| 76968.98355680963| | 2.0| 61126.760973463985| | 3.0| -62533.46098410971| | 4.0| -74881.44246795037| | 5.0| 55562.459377873136| | 6.0| 132353.84110196662| | 7.0|-44015.332187924614| | 8.0| -73566.43996214279| | 9.0|-12097.392788251369| | 10.0| 93135.01959966512| | 11.0| 140033.21385197915| | 12.0|-18857.254762123357| | 13.0| 83846.40655800412| | 14.0| -38278.92770290728| | 15.0|-108215.28591613166| | 16.0| 32121.895848963155| | 17.0| -95914.97506536992| | 18.0| 110237.73819600634| | 19.0|-108943.07821723573| | 20.0| 28768.465846286177| +-------+-------------------+ only showing top 20 rows > GC overhead limit exceeded running LinearRegressionCG from MLContext > -------------------------------------------------------------------- > > Key: SYSTEMML-911 > URL: https://issues.apache.org/jira/browse/SYSTEMML-911 > Project: SystemML > Issue Type: Bug > Components: APIs > Reporter: Glenn Weidner > Attachments: LinearRegrCG.0.10.scala > > > Running attached scala from spark-shell using original MLContext against > Spark 1.6 (or 2.0) encountered out-of-memory GC overhead limit exceeded: > uncaught exception during compilation: java.lang.AssertionError > org.apache.sysml.runtime.DMLRuntimeException: > org.apache.sysml.runtime.DMLRuntimeException: ERROR: Runtime error in program > block generated from statement block between lines 3 and 9 -- Error > evaluating instruction: > SPARK°rblk°X·MATRIX·DOUBLE°_mVar2·MATRIX·DOUBLE°1000°1000°true > at > org.apache.sysml.runtime.controlprogram.Program.execute(Program.java:152) > at > org.apache.sysml.api.MLContext.executeUsingSimplifiedCompilationChain(MLContext.java:1398) > at > org.apache.sysml.api.MLContext.compileAndExecuteScript(MLContext.java:1257) > at org.apache.sysml.api.MLContext.executeScript(MLContext.java:1146) > at org.apache.sysml.api.MLContext.executeScript(MLContext.java:1136) > at org.apache.sysml.api.MLContext.executeScript(MLContext.java:1131) > at $iwC$$iwC$$iwC$$iwC$$iwC$$iwC$$iwC$$iwC.<init>(<console>:32) > at $iwC$$iwC$$iwC$$iwC$$iwC$$iwC$$iwC.<init>(<console>:37) > at $iwC$$iwC$$iwC$$iwC$$iwC$$iwC.<init>(<console>:39) > at $iwC$$iwC$$iwC$$iwC$$iwC.<init>(<console>:41) > at $iwC$$iwC$$iwC$$iwC.<init>(<console>:43) > at $iwC$$iwC$$iwC.<init>(<console>:45) > at $iwC$$iwC.<init>(<console>:47) > at $iwC.<init>(<console>:49) > at <init>(<console>:51) > at .<init>(<console>:55) > at .<clinit>(<console>) > at .<init>(<console>:7) > at .<clinit>(<console>) > at $print(<console>) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:498) > at > org.apache.spark.repl.SparkIMain$ReadEvalPrint.call(SparkIMain.scala:1065) > at > org.apache.spark.repl.SparkIMain$Request.loadAndRun(SparkIMain.scala:1346) > at > org.apache.spark.repl.SparkIMain.loadAndRunReq$1(SparkIMain.scala:840) > at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:871) > at org.apache.spark.repl.SparkIMain.interpret(SparkIMain.scala:819) > at > org.apache.spark.repl.SparkILoop.reallyInterpret$1(SparkILoop.scala:857) > at > org.apache.spark.repl.SparkILoop.interpretStartingWith(SparkILoop.scala:902) > at org.apache.spark.repl.SparkILoop.command(SparkILoop.scala:814) > at org.apache.spark.repl.SparkILoop.processLine$1(SparkILoop.scala:657) > at org.apache.spark.repl.SparkILoop.innerLoop$1(SparkILoop.scala:665) > at > org.apache.spark.repl.SparkILoop.org$apache$spark$repl$SparkILoop$$loop(SparkILoop.scala:670) > at > org.apache.spark.repl.SparkILoop$$anonfun$org$apache$spark$repl$SparkILoop$$process$1.apply$mcZ$sp(SparkILoop.scala:997) > at > org.apache.spark.repl.SparkILoop$$anonfun$org$apache$spark$repl$SparkILoop$$process$1.apply(SparkILoop.scala:945) > at > org.apache.spark.repl.SparkILoop$$anonfun$org$apache$spark$repl$SparkILoop$$process$1.apply(SparkILoop.scala:945) > at > scala.tools.nsc.util.ScalaClassLoader$.savingContextLoader(ScalaClassLoader.scala:135) > at > org.apache.spark.repl.SparkILoop.org$apache$spark$repl$SparkILoop$$process(SparkILoop.scala:945) > at org.apache.spark.repl.SparkILoop.process(SparkILoop.scala:1059) > at org.apache.spark.repl.Main$.main(Main.scala:31) > at org.apache.spark.repl.Main.main(Main.scala) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:498) > at > org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:731) > at > org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:181) > at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:206) > at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:121) > at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala) > Caused by: org.apache.sysml.runtime.DMLRuntimeException: ERROR: Runtime error > in program block generated from statement block between lines 3 and 9 -- > Error evaluating instruction: > SPARK°rblk°X·MATRIX·DOUBLE°_mVar2·MATRIX·DOUBLE°1000°1000°true > at > org.apache.sysml.runtime.controlprogram.ProgramBlock.executeSingleInstruction(ProgramBlock.java:333) > at > org.apache.sysml.runtime.controlprogram.ProgramBlock.executeInstructions(ProgramBlock.java:222) > at > org.apache.sysml.runtime.controlprogram.ProgramBlock.execute(ProgramBlock.java:166) > at > org.apache.sysml.runtime.controlprogram.Program.execute(Program.java:145) > ... 51 more > Caused by: org.apache.spark.SparkException: Job aborted due to stage failure: > Task 0 in stage 6.0 failed 1 times, most recent failure: Lost task 0.0 in > stage 6.0 (TID 10, localhost): java.lang.OutOfMemoryError: GC overhead limit > exceeded > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$RowToBinaryBlockFunctionHelper.flushBlocksToList(RDDConverterUtilsExt.java:800) > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$RowToBinaryBlockFunctionHelper.convertToBinaryBlock(RDDConverterUtilsExt.java:736) > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$DataFrameToBinaryBlockFunction.call(RDDConverterUtilsExt.java:463) > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$DataFrameToBinaryBlockFunction.call(RDDConverterUtilsExt.java:448) > at > org.apache.spark.api.java.JavaRDDLike$$anonfun$fn$7$1.apply(JavaRDDLike.scala:192) > at > org.apache.spark.api.java.JavaRDDLike$$anonfun$fn$7$1.apply(JavaRDDLike.scala:192) > at > org.apache.spark.rdd.RDD$$anonfun$mapPartitions$1$$anonfun$apply$20.apply(RDD.scala:710) > at > org.apache.spark.rdd.RDD$$anonfun$mapPartitions$1$$anonfun$apply$20.apply(RDD.scala:710) > at > org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38) > at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306) > at org.apache.spark.rdd.RDD.iterator(RDD.scala:270) > at > org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:73) > at > org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:41) > at org.apache.spark.scheduler.Task.run(Task.scala:89) > at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:214) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Driver stacktrace: > at > org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1431) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1419) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1418) > at > scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59) > at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47) > at > org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:1418) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:799) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:799) > at scala.Option.foreach(Option.scala:236) > at > org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:799) > at > org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.doOnReceive(DAGScheduler.scala:1640) > at > org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1599) > at > org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1588) > at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:48) > at > org.apache.spark.scheduler.DAGScheduler.runJob(DAGScheduler.scala:620) > at org.apache.spark.SparkContext.runJob(SparkContext.scala:1832) > at org.apache.spark.SparkContext.runJob(SparkContext.scala:1952) > at org.apache.spark.rdd.RDD$$anonfun$aggregate$1.apply(RDD.scala:1114) > at > org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:150) > at > org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:111) > at org.apache.spark.rdd.RDD.withScope(RDD.scala:316) > at org.apache.spark.rdd.RDD.aggregate(RDD.scala:1107) > at > org.apache.spark.api.java.JavaRDDLike$class.aggregate(JavaRDDLike.scala:411) > at > org.apache.spark.api.java.AbstractJavaRDDLike.aggregate(JavaRDDLike.scala:46) > at > org.apache.sysml.runtime.instructions.spark.utils.SparkUtils.computeNNZFromBlocks(SparkUtils.java:458) > at > org.apache.sysml.runtime.controlprogram.context.SparkExecutionContext.writeRDDtoHDFS(SparkExecutionContext.java:802) > at > org.apache.sysml.runtime.controlprogram.caching.MatrixObject.readBlobFromRDD(MatrixObject.java:612) > at > org.apache.sysml.runtime.controlprogram.caching.MatrixObject.readBlobFromRDD(MatrixObject.java:62) > at > org.apache.sysml.runtime.controlprogram.caching.CacheableData.acquireRead(CacheableData.java:440) > at > org.apache.sysml.hops.recompile.Recompiler.executeInMemoryReblock(Recompiler.java:2067) > at > org.apache.sysml.runtime.instructions.spark.ReblockSPInstruction.processInstruction(ReblockSPInstruction.java:100) > at > org.apache.sysml.runtime.controlprogram.ProgramBlock.executeSingleInstruction(ProgramBlock.java:303) > ... 54 more > Caused by: java.lang.OutOfMemoryError: GC overhead limit exceeded > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$RowToBinaryBlockFunctionHelper.flushBlocksToList(RDDConverterUtilsExt.java:800) > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$RowToBinaryBlockFunctionHelper.convertToBinaryBlock(RDDConverterUtilsExt.java:736) > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$DataFrameToBinaryBlockFunction.call(RDDConverterUtilsExt.java:463) > at > org.apache.sysml.runtime.instructions.spark.utils.RDDConverterUtilsExt$DataFrameToBinaryBlockFunction.call(RDDConverterUtilsExt.java:448) > at > org.apache.spark.api.java.JavaRDDLike$$anonfun$fn$7$1.apply(JavaRDDLike.scala:192) > at > org.apache.spark.api.java.JavaRDDLike$$anonfun$fn$7$1.apply(JavaRDDLike.scala:192) > at > org.apache.spark.rdd.RDD$$anonfun$mapPartitions$1$$anonfun$apply$20.apply(RDD.scala:710) > at > org.apache.spark.rdd.RDD$$anonfun$mapPartitions$1$$anonfun$apply$20.apply(RDD.scala:710) > at > org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38) > at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306) > at org.apache.spark.rdd.RDD.iterator(RDD.scala:270) > at > org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:73) > at > org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:41) > at org.apache.spark.scheduler.Task.run(Task.scala:89) > at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:214) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) -- This message was sent by Atlassian JIRA (v6.3.4#6332)