https://bz.apache.org/bugzilla/show_bug.cgi?id=66422

--- Comment #4 from PJ Fanning <fannin...@yahoo.com> ---
The sample in the original description appears incorrect in that the sample
code does not use getParagraphText().

I can confirm that getParagraphText() does not capitalize the text. It works a
completely different way from getText(). getParagraphText() ignores the
character runs. I don't know much about the HWPF code but the H is for Horrible
(check the history of the POI and the HWPF API). Someone else might have a look
but in 2023, I no longer care about .doc format and the POI support for it. It
is an anachronism as far as I am concerned.

XWPFWordExtractor does not expose a getParagraphText() method so this issue
affects only the HWPF WordExtractor.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org

Reply via email to