Re: Walter's Famous German Language Essentials Guide

Chris via Digitalmars-d Fri, 06 May 2016 03:26:59 -0700

On Thursday, 5 May 2016 at 23:47:15 UTC, H. S. Teoh wrote:

Rule-based letter-to-sound systems don't work too well forEnglish precisely because you have to basically reproduce 500years' worth of sound change plus all the exceptions introducedby words borrowed from other contemporous languages across thecenturies. A rule-based system possibly could work, providedthe rules were extensive enough (and multi-layered, to accountfor borrowed exceptions and other oddities). But there comes apoint where even the most industrious programmer would throw uphis hands and say, forget this exercise in futility, let's justhave the machine teach itself instead.

It's not just sound changes, English is just weird from anon-native speaker's point of view. As Kurt Tucholsky, one of thebest German writers ever, once said, English is a simple and adifficult language at the same time. It consists of foreign wordsthat are pronounced wrongly. English pronunciation makes anyspeaker of a Latin language cringe. In many European languages,and certainly in Latin languages, the letter-to-soundcorrespondence is more or less one-to-one: <a> is /a/, <e> is /e/etc. In English it's often /ei/ and /i:/. <i> is often /ai/ (offor f**k's sake!): "emeritus", a Latin word, is pronounced/e.'me(:).ri.tus/, in English it's /em@.'rai.d@s/. This justmakes you cringe. Native speakers of English often don't realizehow weird their pronunciation sounds to those who natively speakthe language they borrowed the words from (around 60% of thewords). Makes me laugh when I hear English speakers who say "Oh,there is no Irish word for 'afterhours'!?" - Well, what's theEnglish for "restaurant", "evict", "condone", "depot", "deposit"... and what's the English for "language"?

Rule-based systems work better for Spanish because theorthography is much closer to actual pronunciation, and otherparameters such as stress is more predictable. I'd venture toguess that rule-based systems might not work as well forRussian, in spite of the orthography being almost 1-to-1 withactual pronunciation, because of unpreditable stress positionswhich can fundamentally alter vowel values. At best, you'd needa database of stress patterns for various words so that theaccent would fall in the correct places. Plus a set ofexceptions for certain archaic word combinations that haveunusual stress. If you had a database of English stresspositions, I think half the battle is already won.
French would have the same problem as English, except that youcould just do as a first approximation:
        if (rand() > someFactor)
                word = word[0 .. $/2];

and then touch it up with a small set of exceptions.  :-P


T

Are Russian stress-rules based on context? Long vs. short vowels,palatalized vs. velarized consonants etc.? If yes, you canprogram rules.

Re: Walter's Famous German Language Essentials Guide

Reply via email to