https://bz.apache.org/bugzilla/show_bug.cgi?id=66422

--- Comment #4 from PJ Fanning <[email protected]> ---
The sample in the original description appears incorrect in that the sample
code does not use getParagraphText().

I can confirm that getParagraphText() does not capitalize the text. It works a
completely different way from getText(). getParagraphText() ignores the
character runs. I don't know much about the HWPF code but the H is for Horrible
(check the history of the POI and the HWPF API). Someone else might have a look
but in 2023, I no longer care about .doc format and the POI support for it. It
is an anachronism as far as I am concerned.

XWPFWordExtractor does not expose a getParagraphText() method so this issue
affects only the HWPF WordExtractor.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to