I am using only the LSTM "accurate" training file for the Russian language,
but my project does not recognize this training file.

App log errors:
E/Tesseract(native): Could not initialize Tesseract API with language=rus!
A/libc: Fatal signal 11 (SIGSEGV), code 1 (SEGV_MAPERR), fault addr 0x8 in tid 27912 (tesseractsample), pid 27912 (tesseractsample)
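
One thing that may be worth ruling out for an error like the one above is a
missing or empty data file. The sketch below assumes the usual layout in which
the data path contains a "tessdata" subfolder holding <lang>.traineddata. The
class and method names (TrainedDataCheck, trainedDataFile, hasTrainedData) are
made up for illustration and are not part of tess-two.

```java
import java.io.File;

// Hypothetical helper, not part of tess-two: checks that the traineddata
// file is where Tesseract is expected to look for it.
final class TrainedDataCheck {

    // Assumed layout: <dataPath>/tessdata/<lang>.traineddata
    static File trainedDataFile(String dataPath, String lang) {
        return new File(new File(dataPath, "tessdata"), lang + ".traineddata");
    }

    // True only if the file exists, is a regular file, and is not empty.
    static boolean hasTrainedData(String dataPath, String lang) {
        File f = trainedDataFile(dataPath, lang);
        return f.isFile() && f.length() > 0;
    }

    public static void main(String[] args) {
        String dataPath = args.length > 0 ? args[0] : ".";
        String lang = args.length > 1 ? args[1] : "rus";
        System.out.println(trainedDataFile(dataPath, lang)
                + " present: " + hasTrainedData(dataPath, lang));
    }
}
```

In the app this would be called as hasTrainedData(DATA_PATH, lang) before
init. It only confirms that the file is present; it says nothing about whether
the engine can actually load an LSTM-only file.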

private String extractText(Bitmap bitmap) {
    try {
        tessBaseApi = new TessBaseAPI();
    } catch (Exception e) {
        Log.e(TAG, e.getMessage());
        if (tessBaseApi == null) {
            Log.e(TAG, "TessBaseAPI is null. TessFactory not returning tess object.");
        }
    }

    tessBaseApi.init(DATA_PATH, lang);

    Log.d(TAG, "Training file loaded");
    tessBaseApi.setImage(bitmap);
    String extractedText = "empty result";
    try {
        extractedText = tessBaseApi.getUTF8Text();
    } catch (Exception e) {
        Log.e(TAG, "Error in recognizing text.");
    }
    tessBaseApi.end();
    return extractedText;
}


After taking a photo, the program crashes. The application stops on this
line: extractedText = tessBaseApi.getUTF8Text();
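
The log suggests that initialization had already failed before recognition
was attempted, so the native crash is probably a consequence of continuing
after that failure. A minimal sketch of guarding against it follows. As far as
I know, TessBaseAPI.init in tess-two returns a boolean, and the code above
ignores it. OcrEngine is a hypothetical stand-in for the three TessBaseAPI
calls involved, so that the sketch compiles without Android; SafeOcr and
INIT_FAILED are likewise invented names.

```java
// Hypothetical stand-in for the TessBaseAPI methods used in extractText().
interface OcrEngine {
    boolean init(String dataPath, String lang); // assumed to mirror TessBaseAPI.init
    String getUTF8Text();
    void end();
}

final class SafeOcr {
    // Placeholder result returned when the engine could not be initialized.
    static final String INIT_FAILED = "init failed";

    static String recognize(OcrEngine engine, String dataPath, String lang) {
        // Stop here if init reports failure; no recognition call is made
        // on an engine that was never initialized.
        if (!engine.init(dataPath, lang)) {
            return INIT_FAILED;
        }
        try {
            return engine.getUTF8Text();
        } finally {
            // Release the engine once it has been initialized successfully.
            engine.end();
        }
    }
}
```

With the real class, the same shape would be an if around
tessBaseApi.init(DATA_PATH, lang) that logs the failure and returns early.
This avoids the crash but does not make the LSTM-only file load.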

I am using implementation 'com.rmtheis:tess-two:9.0.0'.

I know that it works with the 4.1 + 3.2 training data, but I need to work
with the 4.1 LSTM data only. Is there a way to do that?

When I extracted the training data from 4.1 accurate, it contained only LSTM
files, but when I extracted 4.1 + 3.2 there were more files.

