[
https://issues.apache.org/jira/browse/MAHOUT-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15234707#comment-15234707
]
Hudson commented on MAHOUT-1824:
--------------------------------
FAILURE: Integrated in Mahout-Quality #3324 (See
[https://builds.apache.org/job/Mahout-Quality/3324/])
MAHOUT-1824: Optimize FlinkOpAtA to use upper triangular matrices. (apalumbo:
rev 4fc65d4e26957cfef68eb30e0bf712758e21a5a1)
* flink/src/test/scala/org/apache/mahout/flinkbindings/FailingTestsSuite.scala
* flink/src/main/scala/org/apache/mahout/flinkbindings/blas/FlinkOpAtA.scala
> Optimize FlinkOpAtA to use upper triangular matrices
> -----------------------------------------------------
>
> Key: MAHOUT-1824
> URL: https://issues.apache.org/jira/browse/MAHOUT-1824
> Project: Mahout
> Issue Type: Bug
> Components: Flink
> Affects Versions: 0.11.2
> Reporter: Suneel Marthi
> Assignee: Andrew Palumbo
> Labels: performance
> Fix For: 0.12.0
>
>
> Optimize FlinkOpAtA to use upper triangular matrices (similar to what's being
> done in Spark backend).
> Presently dals fails on FlinkOpAtA computation with an OOM
> {Code}
> 57766 [flink-akka.actor.default-dispatcher-5] ERROR
> akka.actor.ActorSystemImpl - exception on LARS’ timer thread
> java.lang.OutOfMemoryError: GC overhead limit exceeded
> 57770 [flink-akka.actor.default-dispatcher-5] ERROR
> akka.actor.ActorSystemImpl - Uncaught fatal error from thread
> [flink-scheduler-1] shutting down ActorSystem [flink]
> java.lang.OutOfMemoryError: GC overhead limit exceeded
> - dals *** FAILED ***
> org.apache.flink.runtime.client.JobTimeoutException: Timeout while waiting
> for JobManager answer. Job time exceeded 21474835 seconds
> at
> org.apache.flink.runtime.client.JobClient.submitJobAndWait(JobClient.java:136)
> at
> org.apache.flink.runtime.minicluster.FlinkMiniCluster.submitJobAndWait(FlinkMiniCluster.scala:423)
> at
> org.apache.flink.runtime.minicluster.FlinkMiniCluster.submitJobAndWait(FlinkMiniCluster.scala:409)
> at
> org.apache.flink.runtime.minicluster.FlinkMiniCluster.submitJobAndWait(FlinkMiniCluster.scala:401)
> at org.apache.flink.client.LocalExecutor.executePlan(LocalExecutor.java:190)
> at
> org.apache.flink.api.java.LocalEnvironment.execute(LocalEnvironment.java:90)
> at
> org.apache.flink.api.java.ExecutionEnvironment.execute(ExecutionEnvironment.java:855)
> at
> org.apache.flink.api.scala.ExecutionEnvironment.execute(ExecutionEnvironment.scala:638)
> at org.apache.flink.api.scala.DataSet.collect(DataSet.scala:546)
> at
> org.apache.mahout.flinkbindings.blas.FlinkOpAtA$.slim(FlinkOpAtA.scala:53)
> ...
> Cause: akka.pattern.AskTimeoutException: Ask timed out on
> [Actor[akka://flink/user/$a#372851579]] after [21474835000 ms]
> at
> akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:333)
> at akka.actor.Scheduler$$anon$7.run(Scheduler.scala:117)
> at
> akka.actor.LightArrayRevolverScheduler$TaskHolder.run(Scheduler.scala:476)
> at
> akka.actor.LightArrayRevolverScheduler$$anonfun$close$1.apply(Scheduler.scala:282)
> at
> akka.actor.LightArrayRevolverScheduler$$anonfun$close$1.apply(Scheduler.scala:281)
> at scala.collection.Iterator$class.foreach(Iterator.scala:727)
> at scala.collection.AbstractIterator.foreach(Iterator.scala:1157)
> at scala.collection.IterableLike$class.foreach(IterableLike.scala:72)
> at scala.collection.AbstractIterable.foreach(Iterable.scala:54)
> at akka.actor.LightArrayRevolverScheduler.close(Scheduler.scala:280)
> {Code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)