[
https://issues.apache.org/jira/browse/MAHOUT-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15567472#comment-15567472
]
ASF GitHub Bot commented on MAHOUT-1888:
----------------------------------------
Github user dlyubimov commented on a diff in the pull request:
https://github.com/apache/mahout/pull/260#discussion_r82932383
--- Diff: spark/src/test/scala/org/apache/mahout/sparkbindings/blas/BlasSuite.scala ---
@@ -39,11 +42,11 @@ class BlasSuite extends FunSuite with DistributedSparkSuite {
val drmA = drmParallelize(m = inCoreA, numPartitions = 3)
val drmB = drmParallelize(m = inCoreB, numPartitions = 2)
- val op = new OpABt(drmA, drmB)
+ val op = OpABt(drmA, drmB)
val drm = new CheckpointedDrmSpark(ABt.abt(op, srcA = drmA, srcB = drmB), op.nrow, op.ncol)
- printf("AB' num partitions = %d.\n", drm.rdd.partitions.size)
+ printf("AB' num partitions = %d.\n", drm.rdd.partitions.length)
--- End diff --
In Scala, we use size() to measure collection cardinality. Using .length
may cause conversion to a Java collection.
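For reference, here is a minimal sketch (not taken from the pull request) of the two calls discussed above. It assumes the value being measured is a plain Scala Array, which is what Spark's rdd.partitions returns. The object name and the stand-in array are hypothetical. On an Array, length is the native array member, while size is reached through Scala's implicit array enrichment. Both yield the same count.

```scala
object PartitionCount {
  // Hypothetical stand-in for drm.rdd.partitions (an Array in Spark).
  val partitions: Array[String] = Array("p0", "p1", "p2")

  // `length` is the array's own member.
  val viaLength: Int = partitions.length

  // `size` is supplied by Scala's implicit enrichment of arrays.
  val viaSize: Int = partitions.size

  def main(args: Array[String]): Unit =
    printf("AB' num partitions = %d.\n", viaLength)
}
```

Which spelling to prefer is then mostly a matter of project convention, as the returned value is identical.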
> Performance Bug with Mahout Vector Serialization
> ------------------------------------------------
>
> Key: MAHOUT-1888
> URL: https://issues.apache.org/jira/browse/MAHOUT-1888
> Project: Mahout
> Issue Type: Bug
> Components: spark
> Affects Versions: 0.12.2
> Reporter: Suneel Marthi
> Assignee: Suneel Marthi
> Fix For: 0.13.0
>
>
> Identified a performance bug with Mahout Vector serialization in
> DistributedSparkSuite.
> Add the following
> {Code}
> .set("spark.kryo.registrationRequired", "true")
> {Code}
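As a rough illustration of where such a setting would go, the sketch below builds a SparkConf with Kryo serialization enabled and registration required. The object name, helper name, and app name are hypothetical and not taken from DistributedSparkSuite. The sketch assumes Spark is on the classpath. With this flag set, Kryo throws an exception when asked to serialize a class that has not been registered, which is what makes missing registrations visible in tests.

```scala
import org.apache.spark.SparkConf

object KryoStrictConf {
  // Hypothetical helper: returns a configuration that fails fast on
  // unregistered classes instead of silently writing full class names.
  def strictKryoConf(appName: String): SparkConf =
    new SparkConf()
      .setAppName(appName)
      .set("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
      .set("spark.kryo.registrationRequired", "true")
}
```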