On Thu, 9 Oct 2014, imyuka wrote:
> I skimmed the book "Tika in Action" and found the instructions for
> converting a document to an XHTML file from the command line, but I
> have no idea how to do the same thing in Java code. Are there any
> instructions or tutorials I can refer to?

We have quite a few examples available in svn:
https://svn.apache.org/repos/asf/tika/trunk/tika-example/src/main/java/org/apache/tika/example
You probably want ContentHandlerExample.parseToHTML()
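
For reference, here is a minimal sketch of what a method along those lines
might look like. It is not a copy of the code in svn: the class name
TikaToXhtml, the method name parseToXhtml and the command-line handling are
made up for illustration, and it assumes tika-core and tika-parsers are on
the classpath. The idea is to let AutoDetectParser pick a parser for the
input and send its SAX events to a ToXMLContentHandler, which serialises
them as XHTML.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.tika.exception.TikaException;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.sax.ToXMLContentHandler;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;

// Hypothetical helper class, named for illustration only.
public class TikaToXhtml {

    // Parses the stream with an auto-detected parser and returns the
    // XHTML that Tika generates for it.
    public static String parseToXhtml(InputStream stream)
            throws IOException, SAXException, TikaException {
        // Collects the parser's SAX events and serialises them as XML.
        ContentHandler handler = new ToXMLContentHandler();
        AutoDetectParser parser = new AutoDetectParser();
        // Filled in by the parser (content type, title, ...); unused here.
        Metadata metadata = new Metadata();
        parser.parse(stream, handler, metadata, new ParseContext());
        return handler.toString();
    }

    // Usage (assumed): java TikaToXhtml /path/to/document
    public static void main(String[] args) throws Exception {
        try (InputStream stream = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(parseToXhtml(stream));
        }
    }
}
```

If you only want the body of the document rather than the whole XHTML page,
the other methods in ContentHandlerExample are the place to look.
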
Longer term, we're planning to get those automatically included on the
website, along with supporting text. See
http://tika.apache.org/1.7/examples.html for the work in progress on that.
Nick