On Thu, 7 Mar 2013, A Z wrote:
I also notice that you are building on POI (presumably 3.9).
-POI has shortfalls around HWPFDocument objects; Microsoft Word
.doc files. One may not really easily insert
org.apache.poi.hwpf.usermodel.Picture
Apache Tika only reads files in through the various libraries it uses, so
write/change support in libraries like Apache POI don't affect Tika.
If these limitations in POI do affect you, then the best bet is to ask for
advice from the Apache POI community, and work up patches to add in the
missing features!
-Similarly, I want more support for dealing with *.rtf files. Particularly
to insert text and images, and not simply append them.
Again, Tika is only interested in reading data out of RTF formats, not
making changes to them, so that sort of thing is out of scope
Nick