https://bugs.documentfoundation.org/show_bug.cgi?id=149216

--- Comment #4 from V Stuart Foote <vstuart.fo...@utsa.edu> ---
Yes, MS Word 2019 does a fair job of structuring its PDF import, and it can
save the result as a functional ODF Text document (.odt) if you need one.

At the moment, Justin Luth's work on bug 118370, which cleans up LibreOffice
Text boxes and provides a means to merge a selection of multiple Text boxes
into a new single Text box on the Draw canvas, is a functional if manual
process, available via the UNO command .uno:TextCombine.
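
For anyone wanting to script that step, here is a minimal sketch of a Python
macro that dispatches the command against the current selection in Draw. The
function names are made up for illustration; only the command URL
.uno:TextCombine comes from this report. It assumes the macro runs inside
LibreOffice (where XSCRIPTCONTEXT is provided) and that the Text boxes to
merge are already selected. What the dispatch returns has not been verified
here.

```python
# Hypothetical helper names; only the command URL is taken from the bug report.
TEXT_COMBINE = ".uno:TextCombine"


def combine_selected_text_boxes(frame, dispatcher):
    """Dispatch .uno:TextCombine on the given frame.

    `dispatcher` is expected to be a com.sun.star.frame.DispatchHelper
    instance; executeDispatch takes (provider, URL, target frame name,
    search flags, arguments).
    """
    return dispatcher.executeDispatch(frame, TEXT_COMBINE, "", 0, ())


def combine_in_current_draw_document(script_context):
    """Look up the dispatcher and frame, then run the command.

    Inside a LibreOffice Python macro, pass XSCRIPTCONTEXT here.
    Assumes the Text boxes to merge are already selected on the canvas.
    """
    ctx = script_context.getComponentContext()
    dispatcher = ctx.ServiceManager.createInstanceWithContext(
        "com.sun.star.frame.DispatchHelper", ctx
    )
    frame = script_context.getDocument().getCurrentController().getFrame()
    return combine_selected_text_boxes(frame, dispatcher)
```

From a macro this would be called as
combine_in_current_draw_document(XSCRIPTCONTEXT). Resizing the merged Text box
is still a manual step afterwards.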

The resulting Text Box does not resize on the Draw canvas to match the margin
layout of other imported PDF elements, but it can be resized manually as
needed when composing the Draw document.

There is no corresponding command when the PDF is filter imported to Writer or
to Impress. It is not clear that implementing the same UNO TextCombine command
for text boxes in Writer is even feasible.

More general handling for extracting text runs from PDF is tracked in bug
32249, but in sum the LibreOffice filter offerings and object framework are not
suited to parsing PDF layout back into meaningful, editable ODF documents with
acceptable fidelity. PDF is a presentation format; it is not intended to be
"edited".
