[ 
https://issues.apache.org/jira/browse/PDFBOX-1296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13274571#comment-13274571
 ] 

Tilman Hausherr commented on PDFBOX-1296:
-----------------------------------------

After reading the PDF spec,
http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/PDF32000_2008.pdf
I found that "+" means that it is a subset of a font, see page 258, or 9.6.4.

While waiting for a real solution (might be related to issues PDFBOX-490 and 
PDFBOX-412) here's an updated workaround code, which now also cuts off the "+".

    /**
     * Font.canDisplayUpTo() is buggy:
     * http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6623219
     * 
http://stackoverflow.com/questions/1754697/displaying-chinese-text-in-an-applet
     *
     * @param font
     * @param s
     * @return -1 when all is good
     *
     * This method returns an offset into the String s which is the first
     * character this Font cannot display without using the missing glyph code.
     * If the Font can display all characters, -1 is returned.
     */
    int canDisplayUpTo(Font font, String s)
    {
        if (!font.canDisplay(' ') || !font.canDisplay('a') || 
!font.canDisplay('A'))
        {
            return 0; //TH font.canDisplay does not always tell the truth
        }
        int len = s.length();
        int index = 0;
        while (index < len)
        {
            int codePoint = s.codePointAt(index);
            if (!font.canDisplay(codePoint))
            {
                return index;
            }
            index += Character.charCount(codePoint);
        }
        return -1;
    }

    /**
     * {@inheritDoc}
     */
    @Override
    public void drawString(String string, Graphics g, float fontSize,
            AffineTransform at, float x, float y) throws IOException
    {
        Font _awtFont = getawtFont();

        // mdavis - fix fontmanager.so/dll on sun.font.FileFont.getGlyphImage
        // for font with bad cmaps?
        // Type1 fonts are not affected as they don't have cmaps
//        if (!isType1Font() && _awtFont.canDisplayUpTo(string) != -1)
        if (!isType1Font() && canDisplayUpTo(_awtFont, string) != -1)
        {
            Font _awtFontOld = _awtFont;
            String name = null;
            String style;
            if (_awtFont.getName().indexOf('+') > 0)
            {
                name = 
_awtFont.getName().substring(_awtFont.getName().indexOf('+') + 1);
            }
            else if (_awtFont.getName().startsWith("DejaVu") || 
_awtFont.getName().startsWith("Liberation"))
            {
                if (_awtFont.getName().startsWith("DejaVu Sans"))
                {
                    name = "Lucida Sans";
                }
                else if (_awtFont.getName().startsWith("Liberation Serif"))
                {
                    name = "Times New Roman";
                }
                else if (_awtFont.getName().startsWith("Liberation Sans"))
                {
                    name = "Arial";
                }
                else if (_awtFont.getName().startsWith("Liberation Mono"))
                {
                    name = "Courier New";
                }
                else
                {
                    name = "Lucida Sans";
                }
            }

            if (name == null)
            {
                _awtFont = Font.decode(null);
            }
            else
            {
                switch (_awtFont.getStyle())
                {
                    case Font.BOLD:
                        style = "BOLD";
                        break;
                    case Font.ITALIC:
                        style = "ITALIC";
                        break;
                    case Font.PLAIN:
                        style = "PLAIN";
                        break;
                    default:
                        if (_awtFont.getStyle() == (Font.BOLD | Font.ITALIC))
                        {
                            style = "BOLDITALIC";
                        }
                        else
                        {
                            style = "PLAIN";
                        }
                }
                if (_awtFont.getName().endsWith("Bold"))
                {
                    style = "BOLD";
                }
                else if (_awtFont.getName().endsWith("Bold Italic"))
                {
                    style = "BOLDITALIC";
                }
                else if (_awtFont.getName().endsWith("Italic"))
                {
                    style = "ITALIC";
                }
                _awtFont = Font.decode(name + "-" + style + "-" + 
Integer.toString(_awtFont.getSize()));
            }
            log.warn("Changing font on <" + string + "> from <" + _awtFontOld + 
"> to <" + _awtFont + ">");
        }

        Graphics2D g2d = (Graphics2D) g;
        g2d.setRenderingHint(RenderingHints.KEY_ANTIALIASING, 
RenderingHints.VALUE_ANTIALIAS_ON);
        writeFont(g2d, at, _awtFont, x, y, string);
    }


                
> Warnung: Changing font on < > from <AMAKEA+TimesNewRoman> to the default font
> -----------------------------------------------------------------------------
>
>                 Key: PDFBOX-1296
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1296
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: XP, JDK 1.7
>            Reporter: Tilman Hausherr
>         Attachments: outside-in-01.png, outside-in.pdf, shortcuts-01.png, 
> shortcuts.pdf
>
>
> Pdfbox does not produce the correct fonts in the PNG file created with the 
> following code and I get a lot of warnings:
>             PDDocument document = null; 
>             try 
>             { 
>                 document = PDDocument.load(pdfFile); 
>                 List pages = document.getDocumentCatalog().getAllPages(); 
>                 int p = 0; 
>                 for (Object pobj : pages) 
>                 { 
>                     PDPage page = (PDPage) pobj; 
>                     ++p; 
>                     BufferedImage bim = page.convertToImage(); 
>                     // Test with output in memory, to see the size 
>                     ByteArrayOutputStream memout = new 
> ByteArrayOutputStream(); 
>                     boolean memoutok = ImageIO.write(bim, "png", memout); 
>                     if (!memoutok) 
>                         System.err.println ("mem write failed for " + p); 
>                     memout.reset(); 
>                     memout.close(); 
>                     // Test with output to png file 
>                     String fname = String.format("%s-%02d.png", prefix, p); 
>                     boolean foutok = ImageIO.write(bim, "png", new 
> File(fname)); 
>                     if (!foutok) 
>                         System.err.println ("file write failed for " + p); 
> ....
> Apr 26, 2012 2:41:11 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> Information: unsupported/disabled operation: i
> Apr 26, 2012 2:41:12 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> Information: unsupported/disabled operation: ri
> Apr 26, 2012 2:41:12 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
> Warnung: Changing font on < > from <AMAKEA+TimesNewRoman> to the default font
> Apr 26, 2012 2:41:13 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
> Warnung: Changing font on < > from <AMAKEA+TimesNewRoman> to the default font
> Apr 26, 2012 2:41:13 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
> Warnung: Changing font on <O> from <AMAKME+Arial,Bold> to the default font
> Apr 26, 2012 2:41:13 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to