Dear Mr. Smith Hope you passing a lovely day. I had post a FAQ about the jpn.traineddata which thread is as below:
http://groups.google.com/group/tesseract-ocr/browse_thread/thread/54c96de802d6911a/4924f545668cbaac?lnk=gst&q=jpn#4924f545668cbaac I put the contents here for your quick convenience: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> I am interested to get all the tif files that used for creating the *jpn*.traindata. I just want to see how many characters are supported in that file. Because I have some other Japanese characters that can't be recognized by the tesseract OCR. Does anybody know, where are those tif files ? <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< Could you please help me to get from out of here ? Regards Mostafa 2011/5/19 Dmitri Silaev <[email protected]> > Did you contact Ray Smith, this forum's owner? > > Warm regards, > Dmitri Silaev > www.CustomOCR.com > > > > > > 2011/5/19 Mostafa <[email protected]>: > > Hi Again, > > > > Seems no body knows where it is hiding. > > Should I contact with CIA agent ? lol > > But I am kinda serious about the data. > > > > Mostafa > > > > On May 18, 2:43 am, Илья <[email protected]> wrote: > >> He need for table that contains all supported alphabetics characters. > >> Also, Parts of scanned books could not be protected by copyright. > >> > >> Can you give any contacts of "jpn.traindata" dev team? > >> > >> -- > >> Best regards, > >> Ilia. > >> > >> В Втр, 17/05/2011 в 18:24 +0200, zdenko podobny пишет: > >> > >> > >> > >> > >> > >> > >> > >> > >> > >> > On Tue, May 17, 2011 at 5:01 PM, Илья <[email protected]> wrote: > >> > IMHO alphabets can't be protected by copyright. > >> > >> > Mostafa did not asked for an alphabets. He asked for 'all the tif > >> > files that used for creating...' and content of tiff file (e.g. > >> > scanned books) could be protected by copyright. > >> > >> > -- > >> > Best regards, > >> > Ilia. > >> > >> > В Втр, 17/05/2011 в 09:24 -0400, Dmitri Silaev пишет: > >> > >> > > I think copyright issues are preventing the dev team from > >> > publishing > >> > > these source files. However you can try to contact this > >> > forum's > >> > > moderator directly - he probably can take decision to share. > >> > >> > > -- > >> > > Dmitri > >> > >> > > On Tue, May 17, 2011 at 4:58 AM, Mostafa > >> > <[email protected]> wrote: > >> > > > Hi, > >> > >> > > > I am interested to get all the tif files that used for > >> > creating the > >> > > >jpn.traindata. > >> > > > I just want to see how many characters are supported in > >> > that file. > >> > > > Because I have some other Japanese characters that can't > >> > be recognized > >> > > > by > >> > > > the tesseract OCR. > >> > >> > > > Does anybody know, where are those tif files ? > >> > >> > > > Thanks > >> > >> > > > -- > >> > > > You received this message because you are subscribed to > >> > the Google > >> > > > Groups "tesseract-ocr" group. > >> > > > To post to this group, send email to > >> > [email protected] > >> > > > To unsubscribe from this group, send email to > >> > > > [email protected] > >> > > > For more options, visit this group at > >> > > >http://groups.google.com/group/tesseract-ocr?hl=en > >> > >> > -- > >> > You received this message because you are subscribed to the > >> > Google > >> > Groups "tesseract-ocr" group. > >> > To post to this group, send email to > >> > [email protected] > >> > To unsubscribe from this group, send email to > >> > [email protected] > >> > For more options, visit this group at > >> > http://groups.google.com/group/tesseract-ocr?hl=en > >> > >> > -- > >> > You received this message because you are subscribed to the Google > >> > Groups "tesseract-ocr" group. > >> > To post to this group, send email to [email protected] > >> > To unsubscribe from this group, send email to > >> > [email protected] > >> > For more options, visit this group at > >> >http://groups.google.com/group/tesseract-ocr?hl=en > > > > -- > > You received this message because you are subscribed to the Google > > Groups "tesseract-ocr" group. > > To post to this group, send email to [email protected] > > To unsubscribe from this group, send email to > > [email protected] > > For more options, visit this group at > > http://groups.google.com/group/tesseract-ocr?hl=en > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

