On Thu, 9 Oct 2014, imyuka wrote:
I skimmed the book "Tika in Action" and found instructions for converting a document to an XHTML file from the command line, but I have no idea how to do the same thing in Java code. Are there any instructions or tutorials I can refer to?

We have quite a few examples, all available in svn:
https://svn.apache.org/repos/asf/tika/trunk/tika-example/src/main/java/org/apache/tika/example

You probably want ContentHandlerExample.parseToHTML().
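
For anyone who just wants the general shape of that approach, here is a rough sketch rather than a copy of the example itself. It assumes Tika 1.x with tika-parsers on the classpath. The class name XhtmlExtractor, the method name toXhtml and the command-line file argument are made up for illustration; AutoDetectParser, ToXMLContentHandler and Metadata are the Tika classes doing the work.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.tika.exception.TikaException;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.sax.ToXMLContentHandler;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;

// Hypothetical helper class, not part of Tika or of tika-example.
class XhtmlExtractor {

    // Parses the stream with a type-detecting parser and returns the
    // SAX events Tika emits, serialized as an XHTML string.
    static String toXhtml(InputStream stream)
            throws IOException, SAXException, TikaException {
        ContentHandler handler = new ToXMLContentHandler();
        AutoDetectParser parser = new AutoDetectParser();
        Metadata metadata = new Metadata();
        parser.parse(stream, handler, metadata);
        return handler.toString();
    }

    public static void main(String[] args) throws Exception {
        // args[0] is assumed to be the path of the document to convert.
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(toXhtml(in));
        }
    }
}
```

Writing the returned string to a file gives roughly what the command-line XHTML output produces. Swapping in a different ContentHandler (for plain text, say) changes the output format without touching the parsing code.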

Longer term, we're planning to get those automatically included in the website, along with supporting text. See http://tika.apache.org/1.7/examples.html for the work in progress on that.

Nick
