[jira] Updated: (TIKA-574) Support for IBM866 (CP866) encoding in TXTParser

2010-12-17 Thread Maxim Valyanskiy (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Valyanskiy updated TIKA-574: -- Attachment: TIKA-574.patch Thank you. I added unit-test for this issue Support for IBM866

[jira] Updated: (TIKA-574) Support for IBM866 (CP866) encoding in TXTParser

2010-12-16 Thread gross (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] gross updated TIKA-574: --- Attachment: tika-0.8-cp866.patch I've used ngrams from cp1251 and wrote custom byteMap. All russian letters, used in