Hi,

Do you have a sample file that reproduces this? Without one it is hard to say much, other than that it sounds like a bug in the text extraction. It would be best to open a bug entry at https://bz.apache.org/bugzilla/enter_bug.cgi and attach the sample file and any additional information, so we can track a potential fix there.

Dominik.

On Wed, Jan 13, 2016 at 10:04 AM, makadefia <[email protected]> wrote:
> the way I'm getting the text is like this
>
> HWPFDocument doc = new HWPFDocument(inputStream);
> WordExtractor ex = new WordExtractor(doc);
> String toReturn = ex.getText();
> ex.close();
> return toReturn;
