https://bz.apache.org/bugzilla/show_bug.cgi?id=66422

            Bug ID: 66422
           Summary: WordExtractor.getParagraphText() - capitalized text
           Product: POI
           Version: unspecified
          Hardware: PC
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HWPF
          Assignee: [email protected]
          Reporter: [email protected]
  Target Milestone: ---

Bug 63576 doesn't completely fix the issue with capitalized text.

The method WordExtractor.getParagraphText() still returns the text in lowercase letters.

You can use the example doc file from Bug 63576.


---------
import java.io.FileInputStream;
import java.io.IOException;

import org.apache.poi.hwpf.extractor.WordExtractor;

public class WordTextExtractorDoc {

        public static void main(String[] args) {
                try {
                        WordExtractor wordExtDoc = new WordExtractor(new FileInputStream("capitalized.doc"));
                        System.out.println(wordExtDoc.getText());
                        wordExtDoc.close();
                } catch (IOException e) {
                        e.printStackTrace();
                }

        }

}
---------

Output is:
---------
The following word is: CAPITALIZED.


--
The following word is: capitalized.
---------

I expect the last line to contain "CAPITALIZED" as well.
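
The expectation could be written as a small check. This is only a sketch: CaseMismatchCheck and differsOnlyInCase are hypothetical names, not part of POI, and the strings are copied from the output above rather than read from the doc file.

```java
// Hypothetical helper, not part of Apache POI: flags extracted text that
// matches the expected string except for letter case.
final class CaseMismatchCheck {

    // Returns true when 'actual' differs from 'expected' only by letter case.
    static boolean differsOnlyInCase(String expected, String actual) {
        return !expected.equals(actual) && expected.equalsIgnoreCase(actual);
    }

    public static void main(String[] args) {
        String expected = "The following word is: CAPITALIZED.";
        // Assumption: in a real reproduction this value would be taken from
        // the extractor's output; here it is hard-coded from the report.
        String actual = "The following word is: capitalized.";
        if (differsOnlyInCase(expected, actual)) {
            System.out.println("Case mismatch: " + actual);
        }
    }
}
```

A check of this kind separates the capitalization problem from any other difference in the extracted text, since it only fires when the two strings are equal apart from case.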

Tested with version 5.3.2.
