If you are comfortable using XProc, a *very* robust solution is transpect: https://transpect.github.io/
We have used it for years for round-tripping among DOCX, InDesign, HTML, and XML (we have converted 10^5+ pages from DOCX/IDML/XML), and we are *very* happy with it. The idea is to use a central format (Hub XML, which is DocBook plus CSS attributes) to convert to and from. The docx2hub module is quite easy to configure and run, and it gives you a flat DocBook that you can transform further (e.g. to add sectioning, semantic tagging, or cleanup) via the evolve-hub module. For a complete list of the modules, see https://github.com/transpect.

Regards,
__peppo

On Wed, Apr 14, 2021 at 5:02 PM Michel van den Burg <[email protected]> wrote:

> Hello everyone,
>
> I would like to know your opinion about this question: What would you
> consider the best way to get from Microsoft .docx to DocBook these days? We
> have to work with styled Word documents (every element has been styled,
> fortunately). We know XMLMind has a converter, there are some Github
> projects (https://github.com/albertjuhe/word2docbook and
> https://github.com/oxygenxml/oxygen-resources-converter) and I would say
> oXygen can do this.
>
> And, can it be done in one transformation, or should we first convert to
> HTML, for example?
>
> I hope this question isn’t inappropriate. Thanks in advance!
>
> Kind regards,
>
> Michel van den Burg
> digital publishing | Uitgeverij Coutinho
> 06 - 10 75 75 14
> LinkedIn <https://www.linkedin.com/in/michelvandenburg/>
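
P.S. To illustrate what "adding sectioning" to a flat DocBook means in the reply above, here is a minimal Python sketch. It is not how transpect does it (transpect is built on XProc and XSLT); it only shows the idea of grouping flat paragraphs under nested sections. The root element name `hub`, the `role="Heading1"`/`"Heading2"` naming, and the function name `add_sections` are assumptions made for the illustration, and XML namespaces are ignored for brevity.

```python
import re
import xml.etree.ElementTree as ET

# Assumed naming convention: heading paragraphs carry role="Heading1", "Heading2", ...
HEADING = re.compile(r"Heading([1-9])$")


def add_sections(flat_root):
    """Return a copy of a flat document with paragraphs grouped into nested <section>s."""
    new_root = ET.Element(flat_root.tag, dict(flat_root.attrib))
    # Stack of (heading level, container element); the root acts as level 0.
    stack = [(0, new_root)]
    for child in flat_root:
        match = None
        if child.tag == "para":
            match = HEADING.match(child.get("role", ""))
        if match:
            level = int(match.group(1))
            # Close every open section at the same or a deeper level.
            while stack[-1][0] >= level:
                stack.pop()
            section = ET.SubElement(stack[-1][1], "section")
            title = ET.SubElement(section, "title")
            title.text = child.text
            stack.append((level, section))
        else:
            # Ordinary content goes into the innermost open section.
            stack[-1][1].append(child)
    return new_root


if __name__ == "__main__":
    flat = ET.fromstring(
        '<hub>'
        '<para role="Heading1">Introduction</para>'
        '<para>Some text.</para>'
        '<para role="Heading2">Background</para>'
        '<para>More text.</para>'
        '</hub>'
    )
    print(ET.tostring(add_sections(flat), encoding="unicode"))
```

In a real pipeline the style names come from the Word template, so the pattern that recognizes headings would need to match whatever paragraph styles the documents actually use.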
