If you are comfortable using XProc, a *very* robust solution is transpect:

https://transpect.github.io/

We have used it for years to round-trip between docx, InDesign, HTML and
XML (we have converted 10^5+ pages from docx/IDML/XML) and we are *very*
happy with it.

The idea is to use a central format (Hub XML, which is DocBook plus CSS
attributes) to convert to/from.

The docx2hub module is quite easy to configure and run, and gives you a
flat DocBook that you can transform further (e.g. to add sectioning,
semantic tagging, cleanup, etc.) via the evolve-hub module.
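
To give an idea of what "adding sectioning" to a flat DocBook means, here
is a minimal sketch in Python. It is not how evolve-hub is implemented; it
only illustrates the principle. The role names ("Heading1", "Heading2"),
the HEADING_LEVELS mapping and the sample document are assumptions made up
for the example, and namespaces are left out to keep it short.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from paragraph role (the Word style name) to a
# heading level. Real documents will use their own style names.
HEADING_LEVELS = {"Heading1": 1, "Heading2": 2}


def add_sections(flat_root):
    """Return a new tree in which heading paragraphs open nested sections."""
    new_root = ET.Element(flat_root.tag, flat_root.attrib)
    # Stack of (level, element) for the open containers; the root is level 0.
    stack = [(0, new_root)]
    for child in flat_root:
        level = None
        if child.tag == "para":
            level = HEADING_LEVELS.get(child.get("role"))
        if level is None:
            # Ordinary content goes into the innermost open section.
            stack[-1][1].append(child)
            continue
        # A heading closes every open section at the same or a deeper level.
        while stack[-1][0] >= level:
            stack.pop()
        section = ET.SubElement(stack[-1][1], "section")
        title = ET.SubElement(section, "title")
        # Simplification: only the leading text is copied, not inline markup.
        title.text = child.text
        stack.append((level, section))
    return new_root


# Made-up flat input, standing in for the output of a docx conversion.
FLAT = """<article>
  <para role="Heading1">Intro</para>
  <para>First paragraph.</para>
  <para role="Heading2">Details</para>
  <para>Second paragraph.</para>
  <para role="Heading1">Outlook</para>
  <para>Third paragraph.</para>
</article>"""

nested = add_sections(ET.fromstring(FLAT))
print(ET.tostring(nested, encoding="unicode"))
```

With this input, "Details" ends up as a section nested inside "Intro", and
"Outlook" becomes a second top-level section.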

For a complete list of the modules, see
https://github.com/transpect.

Regards,
__peppo



On Wed, Apr 14, 2021 at 5:02 PM Michel van den Burg <[email protected]>
wrote:

> Hello everyone,
>
>
>
> I would like to know your opinion about this question: What would you
> consider the best way to get from Microsoft .docx to DocBook these days? We
> have to work with styled Word documents (every element has been styled,
> fortunately). We know XMLMind has a converter, there are some Github
> projects (https://github.com/albertjuhe/word2docbook and
> https://github.com/oxygenxml/oxygen-resources-converter) and I would say
> oXygen can do this.
>
>
>
> And, can it be done in one transformation, or should we first convert to
> HTML, for example?
>
>
>
> I hope this question isn’t inappropriate. Thanks in advance!
>
>
>
> Kind regards,
>
>
>
> Michel van den Burg
>
> digital publishing | Uitgeverij Coutinho
>
> 06 - 10 75 75 14
>
> LinkedIn <https://www.linkedin.com/in/michelvandenburg/>
>
>
>
