Some languages have large varieties in orthography, as a result of
spelling reforms that can be adopted in parts on an individual
basis. For example, after the 1996 spelling reform, a German
writer can prefer to adopt the new spelling Fotografie (with F,
rather than Ph), while declining to abandon ß. That writer would
like her spell checker to report an error for new spelling "dass"
and for old spelling Photographie.
What technical solutions exist to parameterize the generation of
spelling dictionaries?
One could imagine a parameterized German dictionary:
#!/bin/sh
ss=ß
ph=f
Ph=F
cat <<EOF
Boot
da${ss}
Fu${ss}
Katze
${Ph}otogra${ph}ie
EOF
where the official new orthography has ss=ss, ph=f; and the old
was ss=ß, ph=ph. Users who want to abandon capitalization of
nouns could even set Ph=f or Ph=ph.
Has anything such been implemented? Has this functionality been
considered as a part of Hunspell?
As the next step, would it be possible to design such a spell
checker that automatically adopts to the user's personal
orthography, without the user first having to declare her
preferences?
The same could be useful for distinguishing British from American
English. If the user starts to write "colour", then the spell
checker would assume this user prefers British spelling, and would
warn against the use of "aluminum" and other Americanisms.
I guess Czech/Slovak and Serbian/Croatian could provide more
examples. My closest reference is Swedish/Danish/Norwegian.
--
Lars Aronsson ([EMAIL PROTECTED])
Aronsson Datateknik - http://aronsson.se
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]