Attempting to do this on nutch causes an error. Apparently the data should
be pretty easy to extract though, according to here:
http://computercranium.com/programming/java/howto-extract-text-from-docx-file-word-2007
I'm not qualified to build this into the source but if anybody is, please
do! :-)

Reply via email to