[
https://issues.apache.org/jira/browse/TIKA-573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12971994#action_12971994
]
Maxim Valyanskiy commented on TIKA-573:
---
I'm not sure that this patch fits correctly
[
https://issues.apache.org/jira/browse/TIKA-573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maxim Valyanskiy updated TIKA-573:
--
Attachment: 0001-TIKA-573-add-MimeType.getExtension.patch
new version of patch
[
https://issues.apache.org/jira/browse/TIKA-573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12972156#action_12972156
]
Maxim Valyanskiy commented on TIKA-573:
---
Thank you, Jukka. I found that file extensions
Support for IBM866 (CP866) encoding in TXTParser
Key: TIKA-574
URL: https://issues.apache.org/jira/browse/TIKA-574
Project: Tika
Issue Type: Improvement
Components: parser
[
https://issues.apache.org/jira/browse/TIKA-574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
gross updated TIKA-574:
---
Attachment: tika-0.8-cp866.patch
I've used ngrams from cp1251 and wrote custom byteMap. All russian letters,
used in