https://bz.apache.org/bugzilla/show_bug.cgi?id=60952

            Bug ID: 60952
           Summary: Figure out how to map font to runs/text pieces in Word
                    6.0 files
           Product: POI
           Version: 3.16-dev
          Hardware: PC
            Status: NEW
          Severity: enhancement
          Priority: P2
         Component: HWPF
          Assignee: dev@poi.apache.org
          Reporter: talli...@mitre.org
  Target Milestone: ---

Created attachment 34897
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=34897&action=edit
triggering file

In bug 50955, I found that the hack of using the first non-default/non-symbol
font in the font table in Word 6.0 files worked fairly well.  There was one
file out of ~1300 for which this failed.

I'm attaching that file.  The issue in this file is that cp1257 comes before
cp1251 in the font table, so the hack picks cp1257 even though cp1251 should
be the default.
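
To make the failure mode concrete, here is a minimal sketch of the "first
non-default/non-symbol font" hack. It is not the actual HWPF code: FontEntry,
guessCodepage and codepageFor are hypothetical names, the font names in main
are made up, and treating both ANSI_CHARSET and DEFAULT_CHARSET as "default"
is an assumption. The charset numbers are the standard Windows LOGFONT
values (186 = Baltic/cp1257, 204 = Russian/cp1251).

```java
import java.util.Arrays;
import java.util.List;

public class FirstFontCharsetGuess {

    // Windows LOGFONT charset identifiers.
    public static final int ANSI_CHARSET = 0;
    public static final int DEFAULT_CHARSET = 1;
    public static final int SYMBOL_CHARSET = 2;
    public static final int BALTIC_CHARSET = 186;   // windows-1257
    public static final int RUSSIAN_CHARSET = 204;  // windows-1251

    /** Hypothetical stand-in for one font table entry; not a POI class. */
    public static final class FontEntry {
        public final String name;
        public final int charset;

        public FontEntry(String name, int charset) {
            this.name = name;
            this.charset = charset;
        }
    }

    /** Returns the codepage of the first font that is neither default nor symbol. */
    public static String guessCodepage(List<FontEntry> fontTable) {
        for (FontEntry font : fontTable) {
            boolean skip = font.charset == ANSI_CHARSET
                    || font.charset == DEFAULT_CHARSET
                    || font.charset == SYMBOL_CHARSET;
            if (!skip) {
                return codepageFor(font.charset);
            }
        }
        // Assumption: fall back to cp1252 when no other charset is present.
        return "windows-1252";
    }

    /** Maps only the two charsets relevant to this bug; everything else is cp1252. */
    public static String codepageFor(int charset) {
        switch (charset) {
            case BALTIC_CHARSET:
                return "windows-1257";
            case RUSSIAN_CHARSET:
                return "windows-1251";
            default:
                return "windows-1252";
        }
    }

    public static void main(String[] args) {
        // Font order modeled on the attached file: Baltic listed before Russian.
        List<FontEntry> table = Arrays.asList(
                new FontEntry("Times New Roman", ANSI_CHARSET),
                new FontEntry("Symbol", SYMBOL_CHARSET),
                new FontEntry("Times New Roman Baltic", BALTIC_CHARSET),
                new FontEntry("Times New Roman Cyr", RUSSIAN_CHARSET));
        // The hack returns windows-1257 although the text needs windows-1251.
        System.out.println(guessCodepage(table));
    }
}
```

With the fonts in this order the sketch prints windows-1257; swapping the last
two entries gives windows-1251. The font table order alone therefore does not
say which encoding the text actually uses.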

I wasn't able to figure out how to 1) determine that cp1251 should be the
default or 2) map the font encodings to runs/text pieces.

The test file comes from Common Crawl.
