*Hello*. I've found, that tika doesn't recognize CP866 (Russian DOS encoding).
As I understand, I should add new static inner class in org.apache.tika.parser.txt.CharsetRecog_sbcs (because it's single-byte encoding), change org.apache.tika.parser.txt.CharsetDetector#createRecognizers. Is it enought to add support for this encoding? Is it useful for community? And, if so, what I should do to contribute such addition? *Best regards*, gross aka Kostya Gribov.