Text extraction slow and /tmp fills up with AWT font files
----------------------------------------------------------

                 Key: PDFBOX-959
                 URL: https://issues.apache.org/jira/browse/PDFBOX-959
             Project: PDFBox
          Issue Type: Bug
          Components: Text extraction
    Affects Versions: 1.4.0
            Reporter: Kevin Jackson
            Priority: Critical
             Fix For: 1.5.0


Text extraction does not need AWT fonts at all.
However, the current Type1C font code always creates the AWT font during
initialization.

This has several bad side effects:
1. Time is wasted creating an AWT font that is never used.
2. The font files are copied into /tmp, which fills up after a few thousand
text extractions.
3. The AWT font is created in a synchronized region, so that part of
initialization is effectively single-threaded.

The patch is quite simple: delay creation of the AWT font until it is actually
required.
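
The idea can be sketched roughly as follows. This is not the attached patch or
PDFBox's actual code: the class name LazyFontSketch, its methods, and the
factory parameter are hypothetical, and the type parameter F stands in for
java.awt.Font so the sketch runs without AWT. The point is only that the
constructor does no AWT work and the expensive factory runs at most once, on
first use by the rendering path.

```java
import java.util.function.Supplier;

/** Hypothetical stand-in for a Type1C font class; not PDFBox's real API. */
final class LazyFontSketch<F> {
    // Assumed to be the expensive step (temp files, shared lock).
    private final Supplier<F> awtFontFactory;
    // Stays null until a caller actually asks for the AWT font.
    private volatile F awtFont;

    LazyFontSketch(Supplier<F> awtFontFactory) {
        // Only remember how to build the AWT font; do not build it here.
        this.awtFontFactory = awtFontFactory;
    }

    /** Text extraction path: placeholder that never touches the AWT font. */
    String extractText(String decoded) {
        return decoded;
    }

    /** Rendering path: builds the AWT font on first call, then reuses it. */
    F getAwtFont() {
        F font = awtFont;
        if (font == null) {
            synchronized (this) {
                font = awtFont;
                if (font == null) {
                    font = awtFontFactory.get();
                    awtFont = font;
                }
            }
        }
        return font;
    }
}
```

With this shape, a caller that only extracts text never triggers the factory,
so it creates no temporary font file and never enters the synchronized block.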


        
