For some words, it is easy to tell that the penultimate syllable is long,
and should therefore be accented (e.g., adventus because -ven- ends in a
consonant, and if the penultimate vowel were a dipthong (au, æ, œ) that
would make the syllable long as well.)  The real trick would be to have a
list of words whose penultimate syllable is never long, and one of words
that always have a long vowel in the penultimate syllable (e.g., advenit is
ambiguous because has a long e if it is in the perfect tense, and a short e
in the present tense).  If anyone could get such lists of Latin words
together, I could write a script to add accents to all the words whose
accent is unambiguous, and then list all the 3+ syllable words whose accent
would need to be determined by the context.

Does anyone have an accented Latin word list of any kind, though?  Even if
it were just a list of every Latin word with accent marked, or with vowel
lengths marked, I could write a script to extract the 3+ syllable words
into their proper lists when they are not ambiguously accented words like
advenit.

I could probably figure out a way to download a list of all the Latin words
contained in Wiktionary, but I'm not sure how accurate or complete that
would be.

*Benjamin Bloomfield*


On Wed, Feb 19, 2014 at 6:40 AM, Innocent Smith <[email protected]>wrote:

> Dear Gregorio Users,
>
> I'm experimenting with using an OCR program to extract liturgical texts
> from a PDF of a Latin Missal, for various purposes including setting texts
> with Gregorio. With the software I have available, I am having difficulty
> doing the OCR in a way that preserves the accents accurately.
>
> Is anyone aware of automated ways to take a Latin text that does not have
> accents and to add them in?
>
> Yours,
>
> bro. Innocent, op
>
> _______________________________________________
> Gregorio-users mailing list
> [email protected]
> https://mail.gna.org/listinfo/gregorio-users
>
>
_______________________________________________
Gregorio-users mailing list
[email protected]
https://mail.gna.org/listinfo/gregorio-users

Reply via email to