Most of the Bookshare volunteers do use Word.

On 12/29/2011 7:10 PM, Edward Betts wrote:
> What's the system they're using for corrections? Have they built a web
> application, or are they fixing the text in something like Word?
>
> On 2011-12-29 12:54, Roger Loran Bailey wrote:
>> As I said, Bookshare uses human proofreaders. The books are scanned and,
>> of course, there are scanning errors. Then human volunteers download the
>> scanned copy and proofread it correcting the errors. The corrected copy
>> is then uploaded and that is what goes into the Bookshare collection.
>>
>> On 12/29/2011 3:50 PM, Edward Betts wrote:
>>> Does the bookshare correction software maintain word page coordinates?
>>>
>>> On 2011-12-29 12:23, Roger Loran Bailey wrote:
>>>> I have an idea. It might run into a problem with copyright issues, but I
>>>> am not sure because I think it might come under the provisions in the
>>>> copyright act that covers preparing books for use by the print impaired.
>>>> I am not sure what it is, but I think Open Library has a relationship
>>>> with Bookshare. Bookshare has human volunteers who proofread scans of
>>>> books. I am one of them. Might it be possible for Open Library and
>>>> Bookshare to share scanned books? That is, copies of books that are held
>>>> by Bookshare could be turned over to Open Library to be posted as
>>>> protected Daisy books and the scans that come from the Internet Archive
>>>> could be turned over to Bookshare to be proofread by Bookshare
>>>> volunteers. Then after they have been proofread and enter the Bookshare
>>>> collection they could be copied and returned to Open Library to be
>>>> posted as better copies of what was there before. Am I just fantasizing
>>>> or might something like this be possible?
>>>>
>>>> On 12/29/2011 3:03 PM, Edward Betts wrote:
>>>>> We don't currently have a system for recording the quality of the OCR or
>>>>> correcting mistakes.
>>>>>
>>>>> As you point out the OCR doesn't properly handle blackletter type.
>>>>>
>>>>> A system for correcting OCR is often requested, conceptually it is quite
>>>>> simple. Just a web page that shows the page image and a way to edit the
>>>>> text. We keen to maintain page coordinate information for each word so
>>>>> that we can highlight words in the book reader and search inside. This
>>>>> makes the problem more difficult.
>>>>>
>>>>> We would like to build a correction system, but we don't have the 
>>>>> resources.
>>>>>
>>>> _______________________________________________
>>>> Ol-discuss mailing list
>>>> [email protected]
>>>> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
>>>> To unsubscribe from this mailing list, send email to 
>>>> [email protected]
>>> _______________________________________________
>>> Ol-discuss mailing list
>>> [email protected]
>>> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
>>> To unsubscribe from this mailing list, send email to 
>>> [email protected]
>> _______________________________________________
>> Ol-discuss mailing list
>> [email protected]
>> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
>> To unsubscribe from this mailing list, send email to 
>> [email protected]
> _______________________________________________
> Ol-discuss mailing list
> [email protected]
> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
> To unsubscribe from this mailing list, send email to 
> [email protected]
_______________________________________________
Ol-discuss mailing list
[email protected]
http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
To unsubscribe from this mailing list, send email to 
[email protected]

Reply via email to