[ 
https://issues.apache.org/jira/browse/MAHOUT-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15567472#comment-15567472
 ] 

ASF GitHub Bot commented on MAHOUT-1888:
----------------------------------------

Github user dlyubimov commented on a diff in the pull request:

    https://github.com/apache/mahout/pull/260#discussion_r82932383
  
    --- Diff: 
spark/src/test/scala/org/apache/mahout/sparkbindings/blas/BlasSuite.scala ---
    @@ -39,11 +42,11 @@ class BlasSuite extends FunSuite with 
DistributedSparkSuite {
         val drmA = drmParallelize(m = inCoreA, numPartitions = 3)
         val drmB = drmParallelize(m = inCoreB, numPartitions = 2)
     
    -    val op = new OpABt(drmA, drmB)
    +    val op = OpABt(drmA, drmB)
     
         val drm = new CheckpointedDrmSpark(ABt.abt(op, srcA = drmA, srcB = 
drmB), op.nrow, op.ncol)
     
    -    printf("AB' num partitions = %d.\n", drm.rdd.partitions.size)
    +    printf("AB' num partitions = %d.\n", drm.rdd.partitions.length)
    --- End diff --
    
    in scala, we use size() to measure collection cardinality. using .length 
may cause conversion to a java collection.


> Performance Bug with Mahout Vector Serialization
> ------------------------------------------------
>
>                 Key: MAHOUT-1888
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1888
>             Project: Mahout
>          Issue Type: Bug
>          Components: spark
>    Affects Versions: 0.12.2
>            Reporter: Suneel Marthi
>            Assignee: Suneel Marthi
>             Fix For: 0.13.0
>
>
> Identified a performance bug with Mahout Vector serialization in 
> DistributedSparkSuite.
> Add the following
> {Code}
> .set("spark.kryo.registrationRequired", "true")
> {Code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to