https://bz.apache.org/bugzilla/show_bug.cgi?id=60952
Bug ID: 60952
Summary: Figure out how to map font to runs/text pieces in Word 6.0 files
Product: POI
Version: 3.16-dev
Hardware: PC
Status: NEW
Severity: enhancement
Priority: P2
Component: HWPF
Assignee: dev@poi.apache.org
Reporter: talli...@mitre.org
Target Milestone: ---

Created attachment 34897
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=34897&action=edit
triggering file

In bug 50955, I found that the hack of using the first non-default/non-symbol
font in the font table in Word 6.0 files worked fairly well. There was one file
out of ~1300 for which this failed. I'm attaching that file.

The issue in this file is that cp1257 comes before cp1251 in the font table. I
wasn't able to figure out how to 1) determine that cp1251 should be the default
or 2) map the font encodings to runs/text pieces.

The test file comes from Common Crawl.
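
For readers unfamiliar with the heuristic described above, here is a minimal sketch of "use the first non-default/non-symbol font in the font table". It is not the HWPF code: the class and method names are hypothetical, the font table is reduced to a plain list of charset bytes, and skipping the ANSI charset as well as the default and symbol charsets is an assumption. The charset-to-code-page pairs are the standard Windows charset identifiers.

```java
import java.util.List;
import java.util.Map;

/** Hypothetical illustration of the heuristic; not the HWPF implementation. */
final class FontTableCodePageGuess {
    // Standard Windows charset identifiers that carry no encoding hint.
    static final int ANSI_CHARSET = 0;
    static final int DEFAULT_CHARSET = 1;
    static final int SYMBOL_CHARSET = 2;

    // Windows charset identifier -> Windows code page number.
    static final Map<Integer, Integer> CHARSET_TO_CODE_PAGE = Map.of(
            161, 1253,  // GREEK_CHARSET
            162, 1254,  // TURKISH_CHARSET
            177, 1255,  // HEBREW_CHARSET
            178, 1256,  // ARABIC_CHARSET
            186, 1257,  // BALTIC_CHARSET
            204, 1251,  // RUSSIAN_CHARSET
            238, 1250); // EASTEUROPE_CHARSET

    /**
     * Returns the code page of the first font whose charset is not
     * ANSI, default, or symbol, falling back to cp1252 when none matches.
     */
    static int guessCodePage(List<Integer> fontCharsets) {
        for (int chs : fontCharsets) {
            if (chs == ANSI_CHARSET || chs == DEFAULT_CHARSET || chs == SYMBOL_CHARSET) {
                continue; // no useful encoding information in this entry
            }
            Integer codePage = CHARSET_TO_CODE_PAGE.get(chs);
            if (codePage != null) {
                return codePage; // first informative entry wins
            }
        }
        return 1252; // assumed fallback for this sketch
    }
}
```

With a made-up font table of charsets 0, 2, 186, 204 (Baltic listed before Russian), `guessCodePage` picks cp1257. That is the failure mode reported here: the order of the font table alone does not say that cp1251 should win, so the heuristic would need per-run font information to do better.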