Re: [Wikisource-l] Assessing OCR quality

2019-03-12 Thread scann
I don't know how it works inside Wikisource, but at the very least Tesseract has a confidence value (also called confidence score or level) that will score how well it did OCR on a word (it also works at character level). But for assessing that you normally need the hOCR result. cheers, El mar.,

Re: [Wikisource-l] (no subject)

2018-10-01 Thread scann
share the videos in the openglam account if you haven't yet :) congrats! El lun., 1 oct. 2018 11:40, Bodhisattwa Mandal escribió: > Hi Alex, > > Glad you liked the initiative. > > Sure, we will gather the data and submit in the grant report on 30 > October, 2018. > > Regards, > Bodhisattwa > >

Re: [Wikisource-l] EAP books

2018-07-27 Thread scann
l case around it). The email is: endangeredarchi...@bl.uk. When I reached them out they were quite responsive. best, scann 2018-07-25 18:08 GMT-04:00 Bodhisattwa Mandal : > Hi, > > During Wikimania hackathon, User:Pmlineditor created two python scripts > for British Library Endangered Archive

Re: [Wikisource-l] Scanner for you?

2017-05-08 Thread scann
I know this tends to be a major bottleneck (specially if you are using Linux and the library is used to work with Windows), but it can be sorted it out. Best, Scann 2017-05-08 9:50 GMT-03:00 Nicolas VIGNERON : > Hi, > > Thanks for the proposal but Wikimédia France already has

Re: [Wikisource-l] New release of unpaper

2014-10-29 Thread scann
I think that there are better options to install in ToolLabs, like ScanTailor or Spreads, which actually have functions such as dewarping, which unpaper doesn't have. BTW, those tools can be tested in Windows or Mac. 2014-10-29 11:59 GMT-03:00 Alex Brollo : > Could it be installed into ToolLabs? I

Re: [Wikisource-l] Scanner purchase

2014-09-09 Thread scann
2014-09-09 12:01 GMT-03:00 David Cuenca : > > Have you tried the Book Uploader Bot? It is a project created by Rohit (in > CC) for his GSoC. > http://tools.wmflabs.org/bub/index > > Wow, thank you! This seems like a tool that we could use for the project, definitely. Do you know if it's possible

Re: [Wikisource-l] Scanner purchase

2014-09-08 Thread scann
ght one of these scanners (and they're using it), CC Uruguay, that has a close relationship with Wikimedia Uruguay, also bought one of these, and now Wikimedia Brazil. Well, that's pretty much it. I can answer any questions about the DIY Book scanners, and specially the way I see the proje