[ https://issues.apache.org/jira/browse/MAHOUT-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15567471#comment-15567471 ]
ASF GitHub Bot commented on MAHOUT-1888:
----------------------------------------
Github user dlyubimov commented on a diff in the pull request:
https://github.com/apache/mahout/pull/260#discussion_r82932511
--- Diff: spark/src/test/scala/org/apache/mahout/sparkbindings/test/DistributedSparkSuite.scala ---
@@ -45,6 +45,7 @@ trait DistributedSparkSuite extends DistributedMahoutSuite with LoggerConfigurat
.set("spark.akka.frameSize", "30")
.set("spark.default.parallelism", "10")
.set("spark.executor.memory", "2G")
+ .set("spark.kryo.registrationRequired", "true")
--- End diff --
This setting is not needed, and it is the reason the tests are failing. We can
enable it temporarily to see which classes are still unregistered, but we don't
have to patch every class that happens to be used in tests. I think the classes
that still fail registration have something to do with IndexedDataSet, which
(in my view) is not part of the algebra engine, so we can ignore the rest.
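
As a rough illustration of "enable it only to see what else is left", the flag could be made opt-in instead of being hard-coded in the suite. The sketch below is not from the pull request: the `StrictKryoConf` object and the `test.kryo.registrationRequired` system property are hypothetical names chosen for this example. Only `SparkConf.set` and the two Spark configuration keys are real Spark API.

```scala
import org.apache.spark.SparkConf

// Hypothetical helper, not part of Mahout: builds a test SparkConf in which
// strict Kryo registration is opt-in rather than always on.
object StrictKryoConf {

  // Hypothetical system property name, chosen only for this sketch.
  val StrictKryoProperty = "test.kryo.registrationRequired"

  def testConf(base: SparkConf = new SparkConf()): SparkConf = {
    // Defaults to "false" so ordinary test runs are unaffected.
    val strict = sys.props.getOrElse(StrictKryoProperty, "false")
    base
      .set("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
      // When "true", Kryo throws on any class that was not registered,
      // which helps find gaps but fails tests that use unregistered classes.
      .set("spark.kryo.registrationRequired", strict)
  }
}
```

Under these assumptions, running the tests with `-Dtest.kryo.registrationRequired=true` would surface the unregistered classes on demand, while the default run would keep passing.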
> Performance Bug with Mahout Vector Serialization
> ------------------------------------------------
>
> Key: MAHOUT-1888
> URL: https://issues.apache.org/jira/browse/MAHOUT-1888
> Project: Mahout
> Issue Type: Bug
> Components: spark
> Affects Versions: 0.12.2
> Reporter: Suneel Marthi
> Assignee: Suneel Marthi
> Fix For: 0.13.0
>
>
> Identified a performance bug with Mahout Vector serialization in
> DistributedSparkSuite.
> Add the following setting to its Spark configuration:
> {Code}
> .set("spark.kryo.registrationRequired", "true")
> {Code}