On Thu, 7 Mar 2013, A Z wrote:
I also notice that you are building on POI (presumably 3.9).

-POI has shortfalls around HWPFDocument objects; Microsoft Word
 .doc files. One may not really easily insert

org.apache.poi.hwpf.usermodel.Picture

Apache Tika only reads files in through the various libraries it uses, so write/change support in libraries like Apache POI don't affect Tika.

If these limitations in POI do affect you, then the best bet is to ask for advice from the Apache POI community, and work up patches to add in the missing features!


-Similarly, I want more support for dealing with *.rtf files. Particularly
 to insert text and images, and not simply append them.

Again, Tika is only interested in reading data out of RTF formats, not making changes to them, so that sort of thing is out of scope

Nick

Reply via email to