First of all thanks again Mike for helping me out. Yes, i have seen that, some text do get stripped out sometimes. Any idea as to why this could be happening?
I am using the bundled Solr 3.3.0 which comes with Tika 0.8. Should i move to 0.9? if so how? Also i am storing this text only which i am trying to display. If the xhtml produces the correct text, how do i store it instead? Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/Issue-in-text-extraction-in-Solr-Tika-tp3267810p3269982.html Sent from the Apache Tika - Development mailing list archive at Nabble.com.