[Bug 58654] Extraction fails when data is inserted using Microsoft Field Control

bugzilla Mon, 07 Dec 2015 09:02:54 -0800

https://bz.apache.org/bugzilla/show_bug.cgi?id=58654


--- Comment #7 from [email protected] ---
Hi,

We have updated our POI to the latest version
poi-3.13.jar
poi-ooxml-3.13
poi-ooxml-schemas-3.13.jar
poi-scratchpad-3.13.jar

we are using the following java codes to extract the attached docx file, it
extracts the texts inside the form controls successfully, however it fails to
extract the name and address on the top of word file (which also had Content
Control Enabled simlar to thhe actually body that was successfully extracted on
Page3)



Please advise how to fix this.

/*
* To change this template, choose Tools | Templates
* and open the template in the editor.
*/

package com.brainhunter.frontoffice.biz.util.extract;

import com.brainhunter.frontoffice.biz.exception.UnableExtractException;
import org.apache.poi.xwpf.extractor.XWPFWordExtractor;
import org.apache.poi.xwpf.usermodel.XWPFDocument;

/**
*
* @author Mohankumars
*/
public class DocxExtractor  extends TextExtractor{

    /** Creates a new instance of DocExtractor */
    public DocxExtractor() {
    }

    public String getText( byte[] contents ) throws UnableExtractException{

        try {
            XWPFDocument doc = new XWPFDocument(getInputStream(contents));
            XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
            return  extractor.getText();
        }
        catch( Exception e ) {
            throw new UnableExtractException( e );
        }
    }

}

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[Bug 58654] Extraction fails when data is inserted using Microsoft Field Control

Reply via email to