https://bz.apache.org/bugzilla/show_bug.cgi?id=58654
--- Comment #7 from [email protected] ---
Hi,

We have updated our POI to the latest version:

poi-3.13.jar
poi-ooxml-3.13
poi-ooxml-schemas-3.13.jar
poi-scratchpad-3.13.jar

We are using the following Java code to extract text from the attached docx
file. It extracts the text inside the form controls successfully, but it fails
to extract the name and address at the top of the Word file (which also have
Content Control enabled, similar to the actual body that was successfully
extracted on page 3).

Please advise how to fix this.

/*
 * To change this template, choose Tools | Templates
 * and open the template in the editor.
 */
package com.brainhunter.frontoffice.biz.util.extract;

import com.brainhunter.frontoffice.biz.exception.UnableExtractException;
import org.apache.poi.xwpf.extractor.XWPFWordExtractor;
import org.apache.poi.xwpf.usermodel.XWPFDocument;

/**
 *
 * @author Mohankumars
 */
public class DocxExtractor extends TextExtractor {

    /** Creates a new instance of DocxExtractor */
    public DocxExtractor() {
    }

    public String getText(byte[] contents) throws UnableExtractException {
        try {
            XWPFDocument doc = new XWPFDocument(getInputStream(contents));
            XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
            return extractor.getText();
        } catch (Exception e) {
            throw new UnableExtractException(e);
        }
    }
}
