https://issues.apache.org/bugzilla/show_bug.cgi?id=53556

          Priority: P2
            Bug ID: 53556
          Assignee: [email protected]
           Summary: Mispositioned Textboxes In Reading Doc Files Through
                    HWPF
          Severity: major
    Classification: Unclassified
                OS: Linux
          Reporter: [email protected]
          Hardware: PC
            Status: NEW
           Version: 3.8
         Component: HWPF
           Product: POI

Created attachment 29070
  --> https://issues.apache.org/bugzilla/attachment.cgi?id=29070&action=edit
This is the document that I was unable to read properly.

I tried reading .doc and .docx files using Apache POI 3.8. It worked fine until
I encountered textboxes.

If the format of the document is like this: 
paragraph 1 
textbox 1 
paragraph 2 
textbox 2 
paragraph 3 

Then the output should be: 
paragraph 1 textbox 1 paragraph 2 textbox 2 paragraph 3 
But HWPF reads such a .doc file as:
paragraph 1 paragraph 2 paragraph 3 textbox 1 textbox 2 

HWPF seems to append the textboxes at the end of the text instead of placing
them where they belong, i.e. between the paragraphs.
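
To make the two orderings concrete, here is a minimal Java sketch. It does not
use the POI API and says nothing about how HWPF works internally: the Block
class, its anchor field, and the method names are hypothetical and exist only
to model the example above. It assumes each textbox has a known anchor
position in the main text flow.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical model for illustration only; none of this is POI API.
final class TextboxOrderSketch {

    /** A piece of text plus the (assumed) position where it is anchored. */
    static final class Block {
        final String text;
        final int anchor;       // assumed position in the main document flow
        final boolean textbox;  // true for textbox content, false for body text

        Block(String text, int anchor, boolean textbox) {
            this.text = text;
            this.anchor = anchor;
            this.textbox = textbox;
        }
    }

    /** The sample document from this report: paragraphs stored first, textboxes after. */
    static List<Block> sample() {
        List<Block> blocks = new ArrayList<>();
        blocks.add(new Block("paragraph 1", 0, false));
        blocks.add(new Block("paragraph 2", 2, false));
        blocks.add(new Block("paragraph 3", 4, false));
        blocks.add(new Block("textbox 1", 1, true));
        blocks.add(new Block("textbox 2", 3, true));
        return blocks;
    }

    /** Expected output: every block emitted at its anchor position. */
    static String expectedOrder(List<Block> blocks) {
        List<Block> sorted = new ArrayList<>(blocks);
        sorted.sort(Comparator.comparingInt(b -> b.anchor));
        return join(sorted);
    }

    /** Observed output: all body text first, then all textboxes appended. */
    static String observedOrder(List<Block> blocks) {
        List<Block> out = new ArrayList<>();
        for (Block b : blocks) {
            if (!b.textbox) out.add(b);
        }
        for (Block b : blocks) {
            if (b.textbox) out.add(b);
        }
        return join(out);
    }

    private static String join(List<Block> blocks) {
        StringBuilder sb = new StringBuilder();
        for (Block b : blocks) {
            if (sb.length() > 0) sb.append(' ');
            sb.append(b.text);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Block> blocks = sample();
        System.out.println("expected: " + expectedOrder(blocks));
        System.out.println("observed: " + observedOrder(blocks));
    }
}
```

In this model the two results differ only in where the textbox blocks land,
which is the difference reported here.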

In the case of .docx files, XWPF did not read the textboxes at all.

I tried the methods getText(), getTextFromPieces(), extractText(), and
getParagraphText(), but none of them helped.
