I am using the /update/extract request handler to push documents into Solr,
but some text documents that are encoded as windows-1255 (Arabic texts) are
not extracted properly: the extracted text is not readable.

I searched the web and the Solr documentation and found nothing. If possible,
I need to send the file encoding as a parameter so that the Tika parser knows
which encoding to use.
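
To make the idea concrete, here is a minimal sketch of the kind of request I
have in mind. It only builds the request and does not send it. The host, core
name (mycore) and document id are made up for illustration, and putting the
charset into the Content-Type header is my assumption about how the encoding
could be passed; I have not confirmed that the extract handler or Tika uses it.

```python
import urllib.parse
import urllib.request


def build_extract_request(base_url, doc_id, body, charset):
    """Build (but do not send) a POST to /update/extract.

    Assumption: the charset declared in the Content-Type header is the
    hint the parser would need. Whether it is honored is unconfirmed.
    """
    params = urllib.parse.urlencode({"literal.id": doc_id, "commit": "true"})
    return urllib.request.Request(
        url=f"{base_url}/update/extract?{params}",
        data=body,
        headers={"Content-Type": f"text/plain; charset={charset}"},
        method="POST",
    )


# Hypothetical values: in practice the body would be the raw file bytes.
sample_bytes = b"raw bytes read from the text file"
request = build_extract_request(
    "http://localhost:8983/solr/mycore", "doc1", sample_bytes, "windows-1255"
)
# urllib.request.urlopen(request)  # would actually send it to a running Solr
```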

Is there a way to achieve that?


