I cannot send you the TIFF because it contains sensitive company data. But I scanned another document, and the result is still a one-page PDF. I think something is wrong with my Tesseract version or installation.
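
Since the file itself cannot be shared, one way to narrow this down is to check whether the TIFF really holds more than one page before blaming the install. The following is only a minimal sketch under stated assumptions: `count_tiff_pages` and `build_command` are hypothetical helper names (not part of Tesseract), the counter handles classic TIFF only (not BigTIFF), and it counts top-level image file directories, which for a scanned document normally means pages. `build_command` just assembles the command suggested earlier in this thread, without `tessedit_page_number`.

```python
import struct


def count_tiff_pages(data: bytes) -> int:
    """Count top-level IFDs (normally one per page) in classic TIFF data."""
    if len(data) < 8:
        raise ValueError("too short to be a TIFF file")
    # The first two bytes give the byte order: "II" little, "MM" big endian.
    if data[:2] == b"II":
        endian = "<"
    elif data[:2] == b"MM":
        endian = ">"
    else:
        raise ValueError("not a TIFF file")
    # Next come the magic number (42) and the offset of the first IFD.
    magic, offset = struct.unpack(endian + "HI", data[2:8])
    if magic != 42:
        raise ValueError("only classic TIFF (magic 42) is handled here")

    pages = 0
    seen = set()
    while offset != 0:
        if offset in seen or offset + 2 > len(data):
            raise ValueError("corrupt IFD chain")
        seen.add(offset)
        # Each IFD: 2-byte entry count, 12 bytes per entry,
        # then a 4-byte offset to the next IFD (0 means last one).
        (entries,) = struct.unpack_from(endian + "H", data, offset)
        next_ptr = offset + 2 + 12 * entries
        if next_ptr + 4 > len(data):
            raise ValueError("truncated IFD")
        (offset,) = struct.unpack_from(endian + "I", data, next_ptr)
        pages += 1
    return pages


def build_command(tiff_path: str, out_base: str, langs: str = "rus+eng") -> list:
    """Assemble the command from this thread as an argument list."""
    return ["tesseract", tiff_path, out_base, "--psm", "3", "-l", langs, "pdf"]
```

For example, `count_tiff_pages(open(r"In\SPTest.tif", "rb").read())` would show how many pages the input contains, and the list from `build_command(r"In\SPTest.tif", r"Out\Test")` can be passed to `subprocess.run`. If the count is already 1, the problem is in the scan rather than in Tesseract.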
On Thursday, May 16, 2019 at 11:11:00 UTC+2, zdenop wrote:
> So provide your TIFF for reproducing the problem.
>
> Zdenko
>
> On Thu, May 16, 2019 at 11:06, András Jeszenkovits <[email protected]> wrote:
>
>> The OS is Windows 10. I use Tesseract OCR engine v4.0.0.20190314. I tried
>> English, Russian, and Hungarian. I tried the 32-bit and 64-bit versions, and
>> I tried a JPG file too, with the same result (a 1-page PDF).
>>
>> On Thursday, May 16, 2019 at 10:31:33 UTC+2, shree wrote:
>>>
>>> What is your version of tesseract? Which O/S?
>>>
>>> Have you tried it with just one language?
>>>
>>> On Thu, May 16, 2019 at 1:32 PM András Jeszenkovits <[email protected]> wrote:
>>>
>>>> I thought that too, but Tesseract creates a one-page PDF.
>>>>
>>>> On Wednesday, May 15, 2019 at 17:29:36 UTC+2, shree wrote:
>>>>>
>>>>> tesseract In\SPTest.tif Out\Test --psm 3 -l rus+eng pdf
>>>>>
>>>>> This should be enough to create a multi page pdf from a multi page
>>>>> tiff.
>>>>>
>>>>> On Wed, May 15, 2019 at 7:27 PM András Jeszenkovits <[email protected]> wrote:
>>>>>
>>>>>> Here: tesseract In\SPTest.tif Out\Test --psm 3 -l rus+eng *-c tessedit_page_number=-1* pdf
>>>>>>
>>>>>> On Wednesday, May 15, 2019 at 15:51:31 UTC+2, zdenop wrote:
>>>>>>>
>>>>>>> Why are you using tessedit_page_number ?
>>>>>>>
>>>>>>> Zdenko
>>>>>>>
>>>>>>> On Wed, May 15, 2019 at 15:43, András Jeszenkovits <[email protected]> wrote:
>>>>>>>
>>>>>>>> Hello!
>>>>>>>>
>>>>>>>> Can you help me with this problem? I'm testing the Tesseract OCR
>>>>>>>> engine. The input is a scanned multipage TIFF file. I tried to create
>>>>>>>> a PDF from that, but the result is always one page.
>>>>>>>> I used this cmd line:
>>>>>>>> tesseract In\Test.tif Out\TestOutput -l rus+eng -c tessedit_page_number=-1 pdf
>>>>>>>> I found an option to create a multipage PDF with this part:
>>>>>>>> " -c tessedit_page_number=-1" but it doesn't work. I tried to get txt
>>>>>>>> data, but I only found text from the first page.
>>>>>>>> Can you help me with that?
>>>>>
>>>>> --
>>>>> ____________________________________________________________
>>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/d4862d60-71fc-41eb-9838-235154797b3c%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

