Thanks very much for your suggestion. As for the XHTML output, though, I believe it is produced only once, while extraction is being done. That means I would also have to store/index that XHTML output text for later use. Is this correct, or am I missing something?
Regards

--
View this message in context: http://lucene.472066.n3.nabble.com/Preview-of-Rich-Documents-tp3270554p3274458.html
Sent from the Apache Tika - Development mailing list archive at Nabble.com.