Hi all,
I am trying to OCR some code wrote in Python. I ve read the Tesseract doc
many times and applied 3 pre processing script with Image Magick. The
result image is attached.
I then send it to Tesseract with ```--psm 4``` which seems to be the more
adapted segmentation mode for what I am trying to do. The result is quite
ok but I don't have indentations and I think it could be still improved.
I would be glad to have some adivce to improve the result. Thanks a lot
Best,
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/c07b4f66-7e6e-4634-a4ee-b8a8db003f20n%40googlegroups.com.
knport bases
import copy
[1mport json
import os
from abc Import abstractmethod
import. dpath.util as dp
tmport. requests
fron requests.adapters 1Aport WTTPAGapter
Fron simplejson daport ISoNDecodeError
>From url1i63 tmport Retry
>From comnon. except ion. sfm_worker.oxcoptions inport LoginFailedExce|
>From conmon,utils. Logaing iaport get_log
Togger = got_Log(__nane__)
class Baseservice(object):
Interface 1n charge of overy request of API.
a _instances < ()
headers = {
Content-Type's “application/json,
"XUI0'; 05 goteny(“INTERPOD_USER", **)
3
sensitive payload_field - (“users,
base_url
taTget_api - Nome
use_budy_token « False
session_token_ustue - Nane
session_token_1d - None
apnane = "comon_aps~
tineout vatue = 55. 4 pere
max_retry - 3
retry_on = [4m, 403, 500]
auth = (*b16790%, “quar)
PASSUOGaTE - Mone
“usernane’, “password®]
the defantt value of Anaee