[
https://issues.apache.org/jira/browse/TINKERPOP-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yang Xia closed TINKERPOP-1309.
-------------------------------
Resolution: Won't Do
Closing given
[discussion|https://lists.apache.org/thread/om2m0phg25s83529p9w0gldmcxz7578h] -
it can be reopened if there is expectation that there will be active work on
this item.
> Memory output in HadoopGraph is too strongly tied to MapReduce and should be
> generalized.
> -----------------------------------------------------------------------------------------
>
> Key: TINKERPOP-1309
> URL: https://issues.apache.org/jira/browse/TINKERPOP-1309
> Project: TinkerPop
> Issue Type: Improvement
> Components: hadoop, process
> Affects Versions: 3.2.0-incubating
> Reporter: Marko A. Rodriguez
> Priority: Major
> Labels: breaking
>
> The {{Memory}} object is not being written to disk in {{SparkGraphComputer}}
> unless its being updated within a {{MapReduce}} job. That is no bueno. We
> should really have the computed {{Memory}} be written as such:
> {code}
> hdfs.ls("output")
> ==>~g
> ==>~memory
> {code}
> Moreover, {{~g}} should be {{~graph}} :) but that is a different story...
> Then:
> {code}
> hdfs.ls("output/~memory")
> ==>gremlin.traversalVertexProgram.haltedTraversals
> ==>a
> ==>x
> {code}
> Note that every {{GraphComputer}} job yields a {{ComputerResult}} which is
> basically {{Pair<Graph,Memory>}}. The {{Graph}} reference denotes the
> adjacency list of vertices and on all those vertices, if there are
> HALTED_TRAVERSERS, they will be on those vertices. This is a distributed
> representation. Next, the {{Memory}} reference denotes data that is no longer
> "attached to the graph" -- like maps, counts, sums, etc. In general,
> reduction barriers. This data is not tied to any one vertex anymore an thus
> exists at the "master traversal" via {{Memory}}. Thus, "graph is
> distributed/workers" and "memory is local/master." We need to make sure that
> the {{Memory}} data is serialized to disk appropriately for
> {{HadoopGraph}}-based implementations...
--
This message was sent by Atlassian Jira
(v8.20.10#820010)