Re: [Wikisource-l] Fwd: [Wikimediaindia-l] Google's Optical Character Recognition software now works with all South Asian languages
Federico Leva (Nemo), 30/08/2015 09:38: Note that Google terms of use are very restrictive. It seems the Google Drive API doesn't have specific terms of use, only the generic ones at https://developers.google.com/terms/ Relevant passages may be: 5e. Prohibitions on Content Unless expressly permitted by the content owner or by applicable law, you will not, and will not permit your end users or others acting on your behalf to, do the following with content returned from the APIs: Scrape, build databases, or otherwise create permanent copies of such content, or keep cached copies longer than permitted by the cache header; [...] 8b. Your Obligations Post-Termination Upon any termination of the Terms or discontinuation of your access to an API, you will immediately stop using the API, cease all use of the Google Brand Features, and delete any cached or stored content that was permitted by the cache header under Section 5. ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
Re: [Wikisource-l] Fwd: [Wikimediaindia-l] Google's Optical Character Recognition software now works with all South Asian languages
Thanks for forwarding. Jayanta Nath, 30/08/2015 00:50: Any how can it be implemented in Proofread extension? With manual copy and paste only, as far as I can see. I don't think Google Drive has any API to extract text from uploaded files, though someone should check https://developers.google.com/google-apps/realtime/drive better than I just did. Note that Google terms of use are very restrictive. Nemo ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l
[Wikisource-l] Fwd: [Wikimediaindia-l] Google's Optical Character Recognition software now works with all South Asian languages
Hi All, I have checked for Bengali Images, its works fine with 100% accuracy. Any how can it be implemented in Proofread extension? Regards, Jayanta -- Forwarded message -- From: Subhashish Panigrahi Date: Sat, Aug 29, 2015 at 3:22 PM Subject: [Wikimediaindia-l] Google's Optical Character Recognition software now works with all South Asian languages To: wikimediaindi...@lists.wikimedia.org -BEGIN PGP SIGNED MESSAGE- Hash: SHA256 Google's OCR which apparently is most accurate OCR we have seen so far, works really good for all the major South Asian scripts: http://globalvoicesonline.org/2015/08/29/googles-optical-character-recog nition-software-now-works-with-all-south-asian-languages Here are test cases of many Indian scripts: https://goo.gl/3X75iR. Except Gurmukhi most scripts are working really good. This could be really useful for Indian language Wikimedians and will come handy for digitization of printed and scanned text. Here is an animated tutorial for Wikimedians to use this tool for Wikisource/Wikipedia: https://commons.wikimedia.org/wiki/File:Tutorial_to_use_Google_Optical_C haracter_Recognition.gif Please write to me if anyone wants to localize this tutorial in your language. - -- Best! Subhashish Panigrahi Programme Officer, Access To Knowledge Centre for Internet and Society @subhapa / https://cis-india.org -BEGIN PGP SIGNATURE- Version: GnuPG v2 iQIcBAEBCAAGBQJV4YD0AAoJEHThehXZGxGO9ywP/RcJOXB3tFHJNF03X23x1jkY vffu+1Iob6kLMZt/JD3nTmpXasXDlme6pbGzaT7/YZsC0VouN+4NE9HoEmZAksJF 3nn7HoEive4mDalXH5qyATOilezqIEYOG2c32LVYHnX6Co+fXPVa5WqsHn5js957 OionIc5t0V9zlGB6e5RLOacPWXsAhXyVunaeY6Ma33cOWHFdVnu1XpUGphJ+miVj EWszTzjDOPlFiMsSsVonjWHvuz7hYPKXxvVXViXY1QAsoOT7wztvOepzM/hAPmYM kGiODSaN8fU/e/2l4xdnMRymAt8hsz61hdye2UYx7xRjlda/23BKNZz0hiuWiqgO FBntHycaHyqR8+fUK5EPE0vnqLp/7XdtRtQkRficuEDYlHz4PlMW8oiVEGhSZOaG fdpgg02sojU1iMOGOs3h/ODWxkRrE3qpG+eT8n1mWJp6Tq7ZLEaQGxW1P6ytlPFF qOz8JKl94D/MI7ybAtp+IsuUQk160H9wUPmaLxgemDRom7220xV6BysbmaMEWwww hgO4fBNG6dPUMp825pTSxx18rY/Kw53sgHmUasixCL6Zv6xnM3rRuTxjZh8j77TR gq2sKgoU+JkYt9eBpVRjrFO90xS5MxPrvL/lGH6P1smAODPull3o0tR681+NGKRp C8vU5vJOlmL+HlNXBSh9 =lwbI -END PGP SIGNATURE- ___ Wikimediaindia-l mailing list wikimediaindi...@lists.wikimedia.org To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l 0xD91B118E.asc Description: application/pgp-keys 0xD91B118E.asc.sig Description: Binary data ___ Wikisource-l mailing list Wikisource-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikisource-l