> On 10 Mar 2016, at 12:20, Cornelis Hoeflake <c.hoefl...@postex.com> wrote:
> 
> Hi,
> 
> When we use for example PdfTextStripperByArea, is it required in that case
> to have all non embedded fonts? Could we use a default (fallback) font in
> stead of providing the correct fonts? Now we have a global (ThreadLocal)
> font provider which is used for rendering and tasks like position based
> text extraction. But skipping the font provider for text based text
> extraction would simplify our code.

All fonts used in a PDF are supposed to embed their widths, whether or not the
font file itself gets embedded. However, sometimes they don’t, and then you
need the missing font to get an accurate text extraction. But for well-formed
PDFs you don’t need the fonts.

— John

> Kind regards,
> Cornelis


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to