Some languages have large varieties in orthography, as a result of 
spelling reforms that can be adopted in parts on an individual 
basis.  For example, after the 1996 spelling reform, a German 
writer can prefer to adopt the new spelling Fotografie (with F, 
rather than Ph), while declining to abandon ß.  That writer would 
like her spell checker to report an error for new spelling "dass" 
and for old spelling Photographie.

What technical solutions exist to parameterize the generation of 
spelling dictionaries?

One could imagine a parameterized German dictionary:

#!/bin/sh
ss=ß
ph=f
Ph=F
cat <<EOF
 Boot
 da${ss}
 Fu${ss}
 Katze
 ${Ph}otogra${ph}ie
EOF

where the official new orthography has ss=ss, ph=f; and the old 
was ss=ß, ph=ph.  Users who want to abandon capitalization of 
nouns could even set Ph=f or Ph=ph.

Has anything such been implemented?  Has this functionality been 
considered as a part of Hunspell?

As the next step, would it be possible to design such a spell 
checker that automatically adopts to the user's personal 
orthography, without the user first having to declare her 
preferences?

The same could be useful for distinguishing British from American 
English.  If the user starts to write "colour", then the spell 
checker would assume this user prefers British spelling, and would 
warn against the use of "aluminum" and other Americanisms.

I guess Czech/Slovak and Serbian/Croatian could provide more 
examples.  My closest reference is Swedish/Danish/Norwegian.


-- 
  Lars Aronsson ([EMAIL PROTECTED])
  Aronsson Datateknik - http://aronsson.se

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to