https://bugs.kde.org/show_bug.cgi?id=472692
Maik Qualmann changed:
What|Removed |Added
Resolution|--- |FIXED
Status|REPORTED
https://bugs.kde.org/show_bug.cgi?id=472692
--- Comment #5 from Maik Qualmann ---
Ok, encoding is fine on Windows now. We still have to fix the writing of the
OCR text in the metadata. At the moment it is only written to the DB, which at
the end restores the original caption text with a rescan. A
https://bugs.kde.org/show_bug.cgi?id=472692
--- Comment #4 from Maik Qualmann ---
Git commit cc42ef72e33356f66ec132e96cbb684d3c8d28bc by Maik Qualmann.
Committed on 28/07/2023 at 08:08.
Pushed by mqualmann into branch 'master'.
according to Tesseract doc the output encoding should be UTF8
M +1
https://bugs.kde.org/show_bug.cgi?id=472692
--- Comment #3 from Maik Qualmann ---
Ok, we're a big step further, the language setting works, we get a text with
German umlauts, but in the Windows codepage format and not UTF8. This is
correct when we view the text file in the Windows text editor, bu
https://bugs.kde.org/show_bug.cgi?id=472692
--- Comment #2 from Maik Qualmann ---
Git commit 5918439aafb5b2f7387490cb2abc9178fe33f374 by Maik Qualmann.
Committed on 27/07/2023 at 20:49.
Pushed by mqualmann into branch 'master'.
fix language parameter for Tesseract OCR on Windows
M +11 -0
https://bugs.kde.org/show_bug.cgi?id=472692
Maik Qualmann changed:
What|Removed |Added
CC||metzping...@gmail.com
--- Comment #1 from Maik