Re: [tesseract-ocr] Remove certain characters while fine tuning (training) tesseract

2021-03-10 Thread Murtuza Dahodwala
I guess that would be manual work. I want them not to be detected during inference. On Wed, 10 Mar 2021, 11:20 pm Greg Dunkel wrote: > Would it be easier to remove these characters from the output using > editing tools? > > On Tue, Mar 9, 2021, 2:30 AM Murtuza Dahodwala > wr
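
Greg's suggestion to strip the characters from the output need not be manual; it can be sketched in a few lines of Python. This is a minimal sketch and was not posted in the thread: the function name `strip_unwanted` and the sample string are hypothetical, and the character set is the one named in the original question (₹, & and |).

```python
# Characters to drop, taken from the original question; adjust as needed.
UNWANTED = "₹&|"

# str.maketrans with a third argument maps each listed character to None,
# so translate() deletes it.
_DELETE_TABLE = str.maketrans("", "", UNWANTED)


def strip_unwanted(ocr_text: str) -> str:
    """Return ocr_text with every character in UNWANTED removed."""
    return ocr_text.translate(_DELETE_TABLE)


if __name__ == "__main__":
    sample = "Total | ₹ 1,250 & tax"  # hypothetical OCR output
    print(strip_unwanted(sample))
```

This only cleans the text after recognition. It does not change what the model detects, which is what the reply above asks for.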

[tesseract-ocr] Remove certain characters while fine tuning (training) tesseract

2021-03-08 Thread Murtuza Dahodwala
Hello, Currently, my OCR model detects certain characters such as ₹, & and |. Is it possible to remove these characters by correcting my LSTM bounding box dataset and then fine-tuning the model so that it does not detect these symbols in my test images?
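
One option that avoids retraining is Tesseract's `tessedit_char_blacklist` config variable, which can be set on the command line with `-c`. The sketch below is not from the thread and only builds the command. The helper name `build_tesseract_command` and the file name `page.png` are hypothetical, and it assumes a Tesseract version in which the LSTM engine honours the blacklist (believed to be 4.1 or later), so treat it as something to test rather than a confirmed fix.

```python
from typing import List


def build_tesseract_command(image_path: str, blacklist: str) -> List[str]:
    """Build a tesseract CLI call that writes text to stdout.

    Assumption: the installed Tesseract applies tessedit_char_blacklist
    for the engine in use; verify this on your own version.
    """
    return [
        "tesseract",
        image_path,
        "stdout",  # output base; "stdout" prints the recognised text
        "-c",
        f"tessedit_char_blacklist={blacklist}",
    ]


if __name__ == "__main__":
    # Hypothetical image name; the characters are those from the question.
    print(build_tesseract_command("page.png", "₹&|"))
```

Passing the result as a list to `subprocess.run` (rather than as a shell string) keeps `&` and `|` from being interpreted by the shell.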

[tesseract-ocr] Re: Data in Excel sheet

2021-02-18 Thread Murtuza Dahodwala
Thank you for your response, @kostas. I have already tried these approaches, and they do not work for me because my tables have no grid lines to separate the cells. On Wednesday, February 17, 2021 at 12:25:58 AM UTC+5:30 Kostas wrote: > I just read the documentation, perhaps goes like that: > > Tables

[tesseract-ocr] Re: Data in Excel sheet

2021-02-16 Thread Murtuza Dahodwala
+1 On Wednesday, October 9, 2019 at 10:33:34 PM UTC+5:30 myquest wrote: > Hi Friends, > > Please advise me how to get the table data from image in csv format using > tesseract? > > Inam > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To un

[tesseract-ocr] Re: Table Detection using Tesseract

2021-02-16 Thread Murtuza Dahodwala
You can do this easily with YOLOv4. On Wednesday, February 19, 2020 at 8:05:17 PM UTC+5:30 mit wrote: > Hi, > > Just wanted to know if there is any way to detect tables using > Tesseract (both with borders and borderless), i.e. whether it is possible to train > Tesseract to recognise tables. > > TIA >

Re: [tesseract-ocr] Re: How can I do the training using my own image in Tesseract 4.0

2021-01-11 Thread Murtuza Dahodwala
> Kay > > On Fri, Jan 8, 2021 at 9:32 AM Murtuza Dahodwala > wrote: > > > > > > It is now 2 years since this answer was posted. Is it possible to train > Tesseract 4 on real images now? > > On Thursday, January 11, 2018 at 2:27:43 PM UTC+5:30 shree wrote: >

[tesseract-ocr] Re: Training Tesseract 4 on real images

2021-01-08 Thread Murtuza Dahodwala
I would also like to know how we can train on real images that are not single lines. On Thursday, October 8, 2020 at 1:37:02 PM UTC+5:30 smn...@gmail.com wrote: > Hello, > > I would like to train *Tesseract 4* to recognize certain > scripts/languages based on real images rather than synthetic o

Re: [tesseract-ocr] Re: How can I do the training using my own image in Tesseract 4.0

2021-01-08 Thread Murtuza Dahodwala
It is now 2 years since this answer was posted. Is it possible to train Tesseract 4 on real images now? On Thursday, January 11, 2018 at 2:27:43 PM UTC+5:30 shree wrote: > Currently, Ray/Google has NOT released info on how to train Tesseract 4 > (LSTM) with real life images. The only supported