[ 
https://issues.apache.org/jira/browse/JENA-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15474191#comment-15474191
 ] 

Itsuki Toyota commented on JENA-1233:
-------------------------------------

> Which objects?
The RDD-wrapped object what I intend to address is the following
1) A Triple that includes blank nodes (i.e. RDD[Triple])
2) A Node that is the blank node (i.e RDD[Node])

> Have you looked at jena-elephas?
No, I haven't.

> Make RDF primitives Serializable
> --------------------------------
>
>                 Key: JENA-1233
>                 URL: https://issues.apache.org/jira/browse/JENA-1233
>             Project: Apache Jena
>          Issue Type: Improvement
>          Components: Elephas
>    Affects Versions: Jena 3.1.0
>            Reporter: Itsuki Toyota
>
> I always use Jena when I handle RDF data with Apache Spark.
> However, when I want to store resulting RDD data (ex. RDD[Triple]) in binary 
> format, I can't call RDD.saveAsObjectFile method.
> It's because RDD.saveAsObjectFile requires java.io.Serializable interface.
> See the following code. 
> https://github.com/apache/spark/blob/v1.6.0/core/src/main/scala/org/apache/spark/rdd/RDD.scala#L1469
> https://github.com/apache/spark/blob/v1.6.0/core/src/main/scala/org/apache/spark/util/Utils.scala#L79-L86
> You can see that 
> 1) RDD.saveAsObjectFile calls Util.serialize method
> 2) Util.serialize method requires the RDD-wrapped object implementing 
> java.io.Serializable interface. For example, if you want to save a 
> RDD[Triple] object, Triple must implements java.io.Serializable.
> So why not implement java.io.Serializable ?
> I think it will improve the usability in Apache Spark.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to