text2image is a great script shipped with Tesseract. It is used to generate synthetic data to produce images from text files. It has a few control parameters to make the generated images similar to scanned images.
But, I have lately learned that the images generated by text2image are nowhere realistic as the ones generated by https://github.com/Belval/TextRecognitionDataGenerator. The latter tool has more powerful controls to produce the exact type of image you want to generate. - has anyway found a way of making tesseract work with other text generation tools such as TextRecognitionDataGenerator? - if so, what is the experience? - and for the developers, is there anyways to replace text2image with TextRecognitionDataGenerator? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/cf9202d6-ab25-4a54-aaf7-51eebc3d50can%40googlegroups.com.