[
https://issues.apache.org/jira/browse/ODFTOOLKIT-458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Svante Schubert updated ODFTOOLKIT-458:
---------------------------------------
Attachment: Gephi-GraphML-Odf-Table-View.png
table_table.graphml
After building generator/schema2template of the codegenerator branch of my
GitHub fork, there will be GraphML files for all XML element of PuzzlePieces
(see https://incubator.apache.org/odftoolkit/0.6.2-incubating/schema2template/)
under
<TOOLKIT_ROOT>/generator/schema2template/target/graphML/<SCHEMA_FILE_NAME>
For instance, OpenDocument-v1_2-os-schema_rng/ as <SCHEMA_FILE_NAME>
The table_table.graphml file is attached similar a screenshot from the rendered
GraphML in Gephi, after using the forceLayout and several layout extensions.
(Gephi is basically an Application build on top of Netbeans). A description
how you tune the layout can be found at
https://gephi.org/tutorials/gephi-tutorial-visualization.pdf
Note: The graphml screenshot is of the previous version, the current one alrady
marks graph edges representing a sequence green and providing an order
attribute.
> Map the ODF XML RelaxNG schema into a GraphDB for Analysis
> ----------------------------------------------------------
>
> Key: ODFTOOLKIT-458
> URL: https://issues.apache.org/jira/browse/ODFTOOLKIT-458
> Project: ODF Toolkit
> Issue Type: Wish
> Reporter: Svante Schubert
> Assignee: Svante Schubert
> Attachments: Gephi-GraphML-Odf-Table-View.png, edge.properties,
> odf12-graph.xml, table_table.graphml, vertex.properties
>
>
> *PROBLEM*
> The ODF XML (RelaxNG) schema is too big to easily read or be analysed by
> humans.
> In version ODF 1.2 it has 598 elements and 1300 attributes.
> *SOLUTION*
> Therefore I would love to load the ODF XML RelaxNG schema into a GraphDB (for
> instance Neo4J) and do some basic analysis (sanity checks) on it.
> For instance, I am curious on query questions as:
> a) is a certain ODF element able to become nested (e.g. <text: p>)
> b) is every ODF element with an ID allowed to exist more than once (this
> issue occurred)
> c) what is the minimum mandatory ODF XML document
> etc.
> These queries could help a lot to understand and test the XML schema.
> Certainly, I would love to have afterwards more tooling.
> For instance, to be able to add metadata to the nodes to categorise nodes
> (which are meant for metadata, styles, text container, which are just plain
> boilerplate (e.g. office:body)
> The idea is to improve the generation of ODFDOM source code to allow easier
> maintainability.
> *DESIGN IDEA*
> Instead of reading plain RelaxNG, I thought it might be a better idea to read
> already a 'normalised' document the dumped internal model from MSV. You may
> find the dump for each ODF version as test references from
> <ODFTOOLKIT_ROOT>/generator/schema2template/src/test/resources/examples/odf
> e.g.
> http://svn.apache.org/viewvc/incubator/odf/trunk/generator/schema2template/src/test/resources/examples/odf/odf12-msvtree.ref?revision=1167972&view=co
>
> NOTE:
> You may find more about the information on the dump and the MSV model in:
> <ODFTOOLKIT_ROOT>/generator/schema2template/src/main/java/schema2template/example/odf/OdfHelper.java
> and
> <ODFTOOLKIT_ROOT>/generator/schema2template/target/apidocs/index.html
> https://incubator.apache.org/odftoolkit/0.6.2-incubating/schema2template/
> I would love to have a discussion on further thoughts of yours on the list.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)