*Hello*.

I've found that Tika doesn't recognize CP866 (the Russian DOS encoding).

As I understand it, I should add a new static inner class to
org.apache.tika.parser.txt.CharsetRecog_sbcs (because it's a single-byte
encoding) and update
org.apache.tika.parser.txt.CharsetDetector#createRecognizers.
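
To show which encoding I mean, here is a small standalone sketch. It is not Tika code and does not touch CharsetRecog_sbcs. It only assumes that the JDK exposes CP866 under the charset name "IBM866"; the class name Cp866Sketch and the sample bytes are my own illustration.

```java
import java.nio.charset.Charset;

// Standalone sketch, not part of Tika: it only shows how CP866 bytes map to text.
class Cp866Sketch {
    // Assumption: the JDK registers CP866 under the name "IBM866".
    static final String CP866 = "IBM866";

    // Decode raw bytes with the named charset.
    static String decode(byte[] bytes, String charsetName) {
        return new String(bytes, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        // A Russian greeting encoded in CP866 (six single-byte Cyrillic letters).
        byte[] sample = {
            (byte) 0x8F, (byte) 0xE0, (byte) 0xA8,
            (byte) 0xA2, (byte) 0xA5, (byte) 0xE2
        };
        System.out.println("supported: " + Charset.isSupported(CP866));
        System.out.println("decoded:   " + decode(sample, CP866));
    }
}
```

A new recognizer would need to tell bytes like these apart from the other single-byte Cyrillic encodings, which place the same letters at different byte values.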

Is that enough to add support for this encoding? Would it be useful to the community?
And, if so, what should I do to contribute such an addition?

*Best regards*,
gross aka Kostya Gribov.
