https://bz.apache.org/bugzilla/show_bug.cgi?id=66422
Bug ID: 66422
Summary: WordExtractor.getParagraphText() - capitalized text
Product: POI
Version: unspecified
Hardware: PC
Status: NEW
Severity: normal
Priority: P2
Component: HWPF
Assignee: [email protected]
Reporter: [email protected]
Target Milestone: ---
The fix for Bug 63576 does not completely resolve the issue with capitalized text.
The method WordExtractor.getParagraphText() still returns the text in lowercase.
You can use the example .doc file from Bug 63576.
---------
import java.io.FileInputStream;
import java.io.IOException;
import org.apache.poi.hwpf.extractor.WordExtractor;

public class WordTextExtractorDoc {
    public static void main(String[] args) {
        try {
            WordExtractor wordExtDoc = new WordExtractor(new FileInputStream("capitalized.doc"));
            System.out.println(wordExtDoc.getText());
            wordExtDoc.close();
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
}
---------
Output is:
---------
The following word is: CAPITALIZED.
--
The following word is: capitalized.
---------
I expect the last line to contain "CAPITALIZED" as well.
Tested with version 5.3.2.