If you can test on these
daily bulletins to automate tabulation, that will be very helpful.
https://cpcb.nic.in/AQI_Bulletin.php
Sample file attached

With best wishes,
Sarath

--
*Dr. Sarath Guttikunda*

*http://www.urbanemissions.info <http://www.urbanemissions.info>*


On Wed, May 26, 2021 at 6:58 PM Dilawar Singh <[email protected]>
wrote:

> Dear all,
>
> I am developing a tool to extract a table from an image. It is a big
> undertaking but I hope to release a beta version soon.
>
> The input to the tool is a PNG/JPG/PDF image and output is a CSV/ODT/XLS
> table.
>
> I have some simple tables extracted from PDF. If there are formats which
> govt uses often and people often need/want to digitize them, I'd like to
> have some samples. I am thinking of census data, GIS data etc..
>
> There is no plan to support multi-page tables. I can use some advice on
> the OCR backend (I am using pytesseract from google for now).
>
> best,
>     Dilawar
>
> --
> Dilawar Singh, Ph.D.
> LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID
> <https://orcid.org/0000-0002-4645-3211> Github
> <https://github.com/dilawar>
>
> --
> Datameet is a community of Data Science enthusiasts in India. Know more
> about us by visiting http://datameet.org
> ---
> You received this message because you are subscribed to the Google Groups
> "datameet" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/datameet/CAM72-Zs9PT7CNZONjCUWM3%3D%3DiNDyfhVPg7Yhko1ALJ_Cmp25%2Bw%40mail.gmail.com
> <https://groups.google.com/d/msgid/datameet/CAM72-Zs9PT7CNZONjCUWM3%3D%3DiNDyfhVPg7Yhko1ALJ_Cmp25%2Bw%40mail.gmail.com?utm_medium=email&utm_source=footer>
> .
>

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/datameet/CAAj%2BwWrV_brtZkpkPZFKgTTgcdYKKT%2B_1E7U-CM-bd4qrSfQJw%40mail.gmail.com.

Attachment: AQI_Bulletin_20210526.pdf
Description: Adobe PDF document

Reply via email to