[ 
https://issues.apache.org/jira/browse/MAHOUT-1653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592641#comment-14592641
 ] 

ASF GitHub Bot commented on MAHOUT-1653:
----------------------------------------

Github user andrewpalumbo commented on a diff in the pull request:

    https://github.com/apache/mahout/pull/136#discussion_r32784888
  
    --- Diff: 
spark/src/main/scala/org/apache/mahout/sparkbindings/drm/CheckpointedDrmSpark.scala
 ---
    @@ -165,7 +168,14 @@ class CheckpointedDrmSpark[K: ClassTag](
           else if (classOf[Writable].isAssignableFrom(ktag.runtimeClass)) (x: 
K) => x.asInstanceOf[Writable]
           else throw new IllegalArgumentException("Do not know how to convert 
class tag %s to Writable.".format(ktag))
     
    -    rdd.saveAsSequenceFile(path)
    +    // rdd.saveAsSequenceFile(path)
    +    // this is a (working) deprecated method used as a stop-gap while we 
investigate the shell issues
    +    SparkContext.rddToSequenceFileRDDFunctions(rdd.asInstanceOf[RDD[(K, 
Vector)]]).saveAsSequenceFile(path)
    +    // discussion here: http://search-lucene.com/m/rcu7o1J1Z7mvnGmP1
    +    // The correct function to use is here: 
https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/rdd/RDD.scala#L1671
    +    // The commit message for this change is here: 
https://github.com/apache/spark/tree/master/core/src/test/scala/org/apache/sparktest
    +    // The key and value factories will be determined using reflection on 
the RDD if the right classes are set to null
    +    // see this example: 
https://github.com/apache/spark/blob/branch-1.4/core/src/main/scala/org/apache/spark/SparkContext.scala#L2236
       }
    --- End diff --
    
    I think that it should be something like:
    ```
    implicit val keyWritableFactory = 
spark.WritableFactory.writableWritableFactory[IntWritable]
    implicit val valueWritableFactory = 
spark.WritableFactory.writableWritableFactory[VectorWritable] 
    ```
    For an Int-keyed Drm. Similarly with String-keyed, and Long-keyed.



> Spark 1.3
> ---------
>
>                 Key: MAHOUT-1653
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1653
>             Project: Mahout
>          Issue Type: Dependency upgrade
>            Reporter: Andrew Musselman
>            Assignee: Andrew Palumbo
>             Fix For: 0.11.0
>
>
> Support Spark 1.3



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to