Someone more familiar with git and github can suggest whether submodules would be a good option for langdata and tessdata,
https://git-scm.com/book/en/v2/Git-Tools-Submodules ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Wed, Aug 17, 2016 at 7:06 PM, Zdenko Podobný <zde...@gmail.com> wrote: > If there is other solution how to separate "must" part of the project with > "optional" data on github.com, please share it. > > Zdenko > > On Wed, Aug 17, 2016 at 3:32 PM, John Muccigrosso <jmucc...@gmail.com> > wrote: > >> On Wednesday, August 17, 2016 at 3:54:29 AM UTC-4, zdenop wrote: >>> >>> tesseract library/engine[1] is separated from language trained data[2]. >>> Main reason for this split is size of trained data and users need only >>> few of them. >>> Trained data should be placed to the same tessdata directory where >>> tesseract looks for config files (well config files are not needed if user >>> use API of popper command line options) >>> >>> [1] https://github.com/tesseract-ocr/tesseract >>> [2] https://github.com/tesseract-ocr/tessdata >>> >> >> Thanks. So it was as I had thought. >> >> I understand the motive, but I think it's worth noting that this means >> it's not possible to just point tessdata to a local clone of the >> repository. I'll probably symlink the data files into mine, but that'll >> mean re-building those every time there's an update. Pesky. >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-ocr+unsubscr...@googlegroups.com. >> To post to this group, send email to tesseract-ocr@googlegroups.com. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit https://groups.google.com/d/ms >> gid/tesseract-ocr/f4303176-bbbd-40e1-82b2-bf31d3127198%40googlegroups.com >> <https://groups.google.com/d/msgid/tesseract-ocr/f4303176-bbbd-40e1-82b2-bf31d3127198%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> >> For more options, visit https://groups.google.com/d/optout. >> > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To post to this group, send email to tesseract-ocr@googlegroups.com. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/CAJbzG8z-PryWZzHjeSeSn%3D6f21_sDUo% > 3DD5wn_rrEkdFY5BJz3g%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8z-PryWZzHjeSeSn%3D6f21_sDUo%3DD5wn_rrEkdFY5BJz3g%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUGVBoCGrFM0cFKjg%2BvdFMqVqLW8fQ464PvvrVKUTOQLw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.