FYI: i have found that by giving different fonts in jtessbox editor you will got below above error. so now i am creating TIFF by giving "Monospaced" font as per default jtessbox editor settings.
On Wednesday, 12 November 2014 15:46:57 UTC+5, iram akbar wrote: > > Hi, > > i am able to generate the required files with jtessbox editor. i want to > use Serak for training but i am getting attached error.Debugging give me no > solution. according to my knowledge you don't need to generate the files > like "frequent words file" in Serak. You just need to train the image and > then combine the tessdata and you will get the required output file. > note: i am training Arabic. > > Question: why i am getting the attached error although i am training > simple Arabic 1 line sentence . please share the solution. > > On Tuesday, 11 November 2014 03:48:36 UTC+5, Quan Nguyen wrote: >> >> You can edit the letters by manually typing in the Character textbox or >> in the Char table cells. >> >> On Monday, November 10, 2014 6:31:40 AM UTC-6, iram akbar wrote: >>> >>> thank you for your help regarding training data. one more thing there is >>> a icon near character box (see attachment). it is not functional on my >>> side. i was expecting by clicking the icon it will give you correction >>> option. >>> Question: is it active or not active? >>> >>> On Monday, 10 November 2014 15:30:21 UTC+5, shree wrote: >>>> >>>> Look under jtessboxeditor/samples/vie folder >>>> >>>> and create similar files for your language >>>> >>>> ShreeDevi >>>> ____________________________________________________________ >>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>>> >>>> On Mon, Nov 10, 2014 at 1:10 PM, iram akbar <irama...@gmail.com> wrote: >>>> >>>>> Quan, >>>>> i am able to generate some files with jtess ox editor but i am having >>>>> an issue, when i select "Train with existing box" or "Train from Scratch" >>>>> under the *Traine*r tab i am getting this attached message. >>>>> Question: How i can generate the Arabic.font_properties, >>>>> Arabic.frequent_word_list and Arabic.words_list files using jtessbox >>>>> editor? >>>>> >>>>> On Friday, 7 November 2014 19:42:37 UTC+5, Quan Nguyen wrote: >>>>>> >>>>>> Look in samples folder for a working example. You can start out from >>>>>> a UTF-8 text file about 2-page long, generate TIFF/Box from it, and >>>>>> prepare >>>>>> other necessary input files for training. You can train entirely in >>>>>> jTessBoxEditor. >>>>>> >>>>>> On Thursday, November 6, 2014 6:19:53 AM UTC-6, iram akbar wrote: >>>>>>> >>>>>>> thank you for your help but my issue still exits. if i need to >>>>>>> generate the Tiff of an image text i am unable to generate the TIFF as >>>>>>> it >>>>>>> only ask to load the text file not image file. second if i have a lots >>>>>>> of >>>>>>> documents i need to copy paste first then generate the TIFF. Any one >>>>>>> else >>>>>>> can help me in this. >>>>>>> Question: how can i Input the Arabic text image in jtessbox editor >>>>>>> to generate Tiff (as attached). >>>>>>> >>>>>>> On Thursday, 6 November 2014 16:38:25 UTC+5, shree wrote: >>>>>>>> >>>>>>>> Click on the 'generate' box - with some devanagri fonts I have >>>>>>>> found that text does not display but the tiff/box are generated. Maybe >>>>>>>> same >>>>>>>> for the arabic font you are using. Give it a try. >>>>>>>> >>>>>>>> You can also try to copy and paste the text, sometimes that works. >>>>>>>> >>>>>>>> >>>>>>>> ShreeDevi >>>>>>>> ____________________________________________________________ >>>>>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-ocr" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to tesseract-oc...@googlegroups.com. >>>>> To post to this group, send email to tesser...@googlegroups.com. >>>>> Visit this group at http://groups.google.com/group/tesseract-ocr. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/tesseract-ocr/d7396d3d-c4d1-4fcc-a58d-6cc02927989c%40googlegroups.com >>>>> >>>>> <https://groups.google.com/d/msgid/tesseract-ocr/d7396d3d-c4d1-4fcc-a58d-6cc02927989c%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> >>>> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/df51a29e-6051-4e3e-a1bf-e1936f135181%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.