Re: Other European languages on a US keyboard [OT]

2006-07-25 Thread A.J.Mechelynck

Yakov Lerner wrote:

On 7/24/06, A.J.Mechelynck [EMAIL PROTECTED] wrote:

The French have
used accented letters since (IIUC) before Gutenberg invented printing.


While Antonie helps us with bits of history, I thought I'd ask this. I 
was on

irc chat, and somehow the issue of French using a lot of silent
letters came up.

For example, Peugeot is 7 letters but 4 sounds. I don't speak French, but
the tendency is there.

Somebody explained that in middle ages, literacy was rare, and scribes
were paid by letters written; and scribes would artificially inflate
number of letter.
Does this explanation hold water ? I know that generally in other 
languages,
the silent letters are artifact of past real sounds that are preserved 
due to

conservatism of orthography. Was if different in French ?

Yakov




In the Middle Ages, scribes weren't paid commercially (I think): they 
were monks and had their share in their convent's table and bed. Payment 
was to the convent, maybe by the thickness of the book, maybe according 
to the beauty of illustrations (and decorated letters), maybe according 
to how long it took to read the story aloud, I don't rightly know. Or 
else maybe some copists were attached to some nobleman's court, and 
wrote, copied or decorated books in return for being lodged, fed, clad, 
and generally cared for. I guess many small letters wouldn't have 
fetched very much more than slightly fewer, but slightly bigger, letters.


French orthography has been largely fixed at some point of the Middle 
Ages (12th century?), but French pronunciation, like English 
pronunciation, has continued to evolve since then. Just like English 
spelling does not reflect the Great Vowel Shift, French spelling still 
uses letters that belong in the etymology, ceased to be pronounced at 
some point in the past, but reappear in the feminine or in liaison:


un grand monsieur -- no liaison, the d in grand is silent
un grand homme -- liaison, the d is sounded [t]
une grande femme -- feminine, before unvoiced consonant, -de is sounded [t]
une grande maison -- feminine, before voiced consonant, -de is sounded [d]
elle est grande -- feminine, before pause, [d]
une grand-mère -- compound, frozen before the adjective (from Lat. 
grandis in both genders) acquired -e in the feminine: no liaison, -d 
is silent.

la grandeur -- derived word, with [d] pronounced.

There are many such cases, also e.g. with verbs, or with plurals where 
the -s (from the Latin accusative plural) has remained sounded only in 
liaison. IIUC, at least some linguists say that those silent letters are 
not artifacts, but reflect phonemes (is that the word?) that are still 
present in the mental representation of the language, but are not always 
pronounced depending on context.


I don't know where the proper name Peugeot came from, but -eu- is the 
standard French graphy for a sound unknown in English (except as the 
first part of a diphtong in some recent forms of upper-class British 
English long O) but represented in German as ö; the e after the g is a 
diacritic, meaning g is pronounced soft before a,o,u; and -ot is a 
common diminutive ending where the -t (from, I guess, Latin -otus, 
-otum, as in factotum maybe?) used to be pronounced but isn't anymre. 
(About -ge- : The spelling of British gaol is much more surprising 
than its French counterpart geôle when considering how they are 
pronounced respectively.)



Best regards,
Tony.


Re: Other European languages on a US keyboard [OT]

2006-07-25 Thread Russell Bateman

[more way off topic comments]

...some linguists say that those silent letters are not artifacts, but 
reflect __phonemes__ (is that the word?) that are still present in the 
mental representation of the language...


--__morphemes__, actually, from a written point of view (you did say 
letters). They are what's left of the word eroded from Latin and, like 
night in English, demonstrate (just as you have pointed out) the 
word's philology (etymology is somewhat correct, but focuses more on 
semantics than the morphemic transformation). Note: I studied Latin, 
Greek and Linguistics at Université de Paris X in the late 70s.


Many French nouns, for example, evolve from the accusative (direct 
object) singular form in Latin with further erosion, rosam  rose, 
templum  temple, calculus  calcule. This also accounts partially for 
what others see as French's tendency to accentuate the final syllable of 
a word; actually, there isn't really a tonic accent in French (I won't 
go into this), but the final of the word was often the penultimate in 
Latin, often the part receiving the tonic accent in that language (just 
as now in Italian, Spanish, etc.) and therefore could not be lost 
without compromising the word itself.


Phonemes are (very) roughly equivalent to syllables and exist at the 
oral or phonetic level. French has the peculiarity, more than most other 
Western languages in my observation, of its end of word phonemes being 
greatly ambiguous due to the erosion from Latin already mentioned. 
Hence, it's easier to find rhymes both rich and otherwise in French even 
across gender boundaries (whereas Italian and Spanish have kept the o/a 
alternance when French erodes both feminine am and masculine um to 
silent e). The resulting explosion in jeux de mots (puns), so looked 
down upon or at least smirked at in English, is inexplicably prized in 
French (where it is so much more common in the first place): Le  
_saint_ père, _sain_ de corps et d'esprit, _ceint_ de vertu, couvait le 
mal dans son _sein_. (The _holy_ father, while _healthy_ in body and 
spirit, and _girded_ with virtue, nourished evil in his _breast_. All 
these underlined words are pronounced identically. There's yet another 
word or two in French pronounced the same way, but it's been too many 
years and I can't seem to conjure them up at the moment.


If Linguistics paid a decent wage, I probably wouldn't be writing C code 
for a living.


Is this off-topic or what?

Russ



A.J.Mechelynck wrote:

Yakov Lerner wrote:

On 7/24/06, A.J.Mechelynck [EMAIL PROTECTED] wrote:

The French have
used accented letters since (IIUC) before Gutenberg invented printing.


While Antonie helps us with bits of history, I thought I'd ask this. 
I was on

irc chat, and somehow the issue of French using a lot of silent
letters came up.

For example, Peugeot is 7 letters but 4 sounds. I don't speak French, 
but

the tendency is there.

Somebody explained that in middle ages, literacy was rare, and scribes
were paid by letters written; and scribes would artificially inflate
number of letter.
Does this explanation hold water ? I know that generally in other 
languages,
the silent letters are artifact of past real sounds that are 
preserved due to

conservatism of orthography. Was if different in French ?

Yakov




In the Middle Ages, scribes weren't paid commercially (I think): 
they were monks and had their share in their convent's table and bed. 
Payment was to the convent, maybe by the thickness of the book, maybe 
according to the beauty of illustrations (and decorated letters), 
maybe according to how long it took to read the story aloud, I don't 
rightly know. Or else maybe some copists were attached to some 
nobleman's court, and wrote, copied or decorated books in return for 
being lodged, fed, clad, and generally cared for. I guess many small 
letters wouldn't have fetched very much more than slightly fewer, but 
slightly bigger, letters.


French orthography has been largely fixed at some point of the Middle 
Ages (12th century?), but French pronunciation, like English 
pronunciation, has continued to evolve since then. Just like English 
spelling does not reflect the Great Vowel Shift, French spelling still 
uses letters that belong in the etymology, ceased to be pronounced at 
some point in the past, but reappear in the feminine or in liaison:


un grand monsieur -- no liaison, the d in grand is silent
un grand homme -- liaison, the d is sounded [t]
une grande femme -- feminine, before unvoiced consonant, -de is 
sounded [t]
une grande maison -- feminine, before voiced consonant, -de is sounded 
[d]

elle est grande -- feminine, before pause, [d]
une grand-mère -- compound, frozen before the adjective (from Lat. 
grandis in both genders) acquired -e in the feminine: no liaison, -d 
is silent.

la grandeur -- derived word, with [d] pronounced.

There are many such cases, also e.g. with verbs, or with plurals where 
the -s (from the Latin accusative 

Re: Other European languages on a US keyboard [OT]

2006-07-25 Thread cga2000
On Tue, Jul 25, 2006 at 10:27:58AM EDT, Russell Bateman wrote:
 [more way off topic comments]
 
[...]
 Phonemes are (very) roughly equivalent to syllables and exist at the 
 oral or phonetic level. French has the peculiarity, more than most other 
 Western languages in my observation, of its end of word phonemes being 
 greatly ambiguous due to the erosion from Latin already mentioned. 
 Hence, it's easier to find rhymes both rich and otherwise in French even 
 across gender boundaries (whereas Italian and Spanish have kept the o/a 
 alternance when French erodes both feminine am and masculine um to 
 silent e). The resulting explosion in jeux de mots (puns), so looked 
 down upon or at least smirked at in English, is inexplicably prized in 
 French (where it is so much more common in the first place): Le  
 _saint_ père, _sain_ de corps et d'esprit, _ceint_ de vertu, couvait le 
 mal dans son _sein_. (The _holy_ father, while _healthy_ in body and 
 spirit, and _girded_ with virtue, nourished evil in his _breast_. All 
 these underlined words are pronounced identically. There's yet another 
 word or two in French pronounced the same way, but it's been too many 
 years and I can't seem to conjure them up at the moment.

seing .. as in blanc-seing
 
 If Linguistics paid a decent wage, I probably wouldn't be writing C
 code for a living.
 
 Is this off-topic or what?
 
 Russ
 


Re: Other European languages on a US keyboard

2006-07-25 Thread cga2000
On Tue, Jul 25, 2006 at 03:09:29AM EDT, A.J.Mechelynck wrote:
[..]
 
 and it can change fonts on-the-fly (change 
 the font from Courier to Lucida to whatever, only through Vim keyboard 
 commands). 
 
 I would never want do that.. but just out of curiosity.. why would that
 not be possible in an xterm?
 
 because console Vim has no control over the xterm's fonts.
 
ok.  a bit more flexible than toggling the xterm's font.
 
 It can do real boldface and italics, as well as straight or 
 curly underlining. 
 
 That would be for highlighting stuff, right? So the same functionality
 can be achieved with colors.  And in a more pleasing manner IMHO.. the
 color schemes that I have seen that use italics have not convinced me.
 
 I do html a lot, and it helps me to see iitalics, bbold italics, 
 ubold underlined italics,/u/b uunderlined 
 italics,/u/iuunderlined/u text all displayed like they should. 

.. meaning you can toggle between the source version and the rendered
version of the document in Vim?

 Of course colours can do it, but console vim has only 16 bg and 16 fg 
 colours: the list is soon over.

.. my mistake.. I never counted them and I thought that console Vim on
a 256-color xterm was capable of displaying 256 colors simultaneoulsy.

[..]
 
 Note: :setl fenc=latin9 follows by :setl fenc? returns 
 iso-8859-15. This is normal, they are two names for the same thing.
 
 
 So this should help clarify the issue.

 For more details, see
   :help mbyte.txt
   http://vim.sourceforge.net/tips/tip.php?tip_id=246
   section 37 (last) of the Vim FAQ 
 http://vimdoc.sourceforge.net/htmldoc/vimfaq.html
   http://www.unicode.org/
   http://www.cl.cam.ac.uk/~mgk25/unicode.html

Thanks

cga


Re: Other European languages on a US keyboard [OT]

2006-07-25 Thread A.J.Mechelynck

Russell Bateman wrote:

[more way off topic comments]

...some linguists say that those silent letters are not artifacts, but 
reflect __phonemes__ (is that the word?) that are still present in the 
mental representation of the language...


--__morphemes__, actually, from a written point of view (you did say 
letters). They are what's left of the word eroded from Latin and, like 
night in English, demonstrate (just as you have pointed out) the 
word's philology (etymology is somewhat correct, but focuses more on 
semantics than the morphemic transformation). Note: I studied Latin, 
Greek and Linguistics at Université de Paris X in the late 70s.


I haven't studied linguistics, other than as an amateur, so I have to 
defer to you. By etymology I meant here (as you understood) what might 
be called the word's linguistic history: night is the English form 
of the Germanic word which gave Dutch and German nacht where the -ch- 
is still pronounced as a guttural consonant. Similarly light (NL/DE 
licht), though (DE doch), through (DE durch), etc.




Many French nouns, for example, evolve from the accusative (direct 
object) singular form in Latin with further erosion, rosam  rose, 
templum  temple, calculus  calcule. This also accounts partially for 
what others see as French's tendency to accentuate the final syllable of 
a word; actually, there isn't really a tonic accent in French (I won't 
go into this), but the final of the word was often the penultimate in 
Latin, often the part receiving the tonic accent in that language (just 
as now in Italian, Spanish, etc.) and therefore could not be lost 
without compromising the word itself.


Phonemes are (very) roughly equivalent to syllables and exist at the 
oral or phonetic level. French has the peculiarity, more than most other 
Western languages in my observation, of its end of word phonemes being 
greatly ambiguous due to the erosion from Latin already mentioned. 
Hence, it's easier to find rhymes both rich and otherwise in French even 
across gender boundaries (whereas Italian and Spanish have kept the o/a 
alternance when French erodes both feminine am and masculine um to 
silent e). The resulting explosion in jeux de mots (puns), so looked 
down upon or at least smirked at in English, is inexplicably prized in 
French (where it is so much more common in the first place): Le  
_saint_ père, _sain_ de corps et d'esprit, _ceint_ de vertu, couvait le 
mal dans son _sein_. (The _holy_ father, while _healthy_ in body and 
spirit, and _girded_ with virtue, nourished evil in his _breast_. All 
these underlined words are pronounced identically. There's yet another 
word or two in French pronounced the same way, but it's been too many 
years and I can't seem to conjure them up at the moment.


Gentlemen, that reminds me:
Un _sot_ à la potrine étriquée chevauchait un âne. Il tenait dans sa 
main droite le _sceau_ destiné à marquer le _seau_ qu'il tenait de sa 
main gauche. Subitement, l'âne fait un écart et [letrwasotõb]. Comment 
écrivez-vous [letrwaso]?











Réponse: l'étroit sot.

-- Again, not really translatable.
A narrow-chested _fool_ [so] was riding an ass. He was holding in his 
right hand the _seal_ [so] with which to mark the _bucket_ [so] he was 
holding in his left. Suddenly, the ass shies and (the three [so] fall). 
How do you write (the three [so])? Answer: the narrow fool (because the 
three and the narrow are also homonymous, and so are the 3rd persons 
sg.  pl. of to fall in the indicative present).




If Linguistics paid a decent wage, I probably wouldn't be writing C code 
for a living.


:D



Is this off-topic or what?

Russ


Yes, it is, but it's fun, isn't it?


Best regards,
Tony.


Re: Other European languages on a US keyboard

2006-07-25 Thread A.J.Mechelynck

cga2000 wrote:

On Tue, Jul 25, 2006 at 03:09:29AM EDT, A.J.Mechelynck wrote:
[..]
and it can change fonts on-the-fly (change 
the font from Courier to Lucida to whatever, only through Vim keyboard 
commands). 

I would never want do that.. but just out of curiosity.. why would that
not be possible in an xterm?

because console Vim has no control over the xterm's fonts.


ok.  a bit more flexible than toggling the xterm's font.
It can do real boldface and italics, as well as straight or 
curly underlining. 

That would be for highlighting stuff, right? So the same functionality
can be achieved with colors.  And in a more pleasing manner IMHO.. the
color schemes that I have seen that use italics have not convinced me.
I do html a lot, and it helps me to see iitalics, bbold italics, 
ubold underlined italics,/u/b uunderlined 
italics,/u/iuunderlined/u text all displayed like they should. 


.. meaning you can toggle between the source version and the rendered
version of the document in Vim?

[...]

No, I still see the tags, and font color=redblah blah blah/font 
doesn't make blah blah blah show up red in gvim, but the HTML syntax 
has groups htmlItalic, htmlBold, htmlUnderlined, htmlBoldItalic, etc., 
which appear as italic, bold, underlined, bold-italic, etc., in the GUI 
(unless, of course, a colorscheme changes them, but why would it want to?).



Similarly, the built-in spell checker uses (by default) curly 
underlining, so the background and foreground colours of the underlying 
text are not altered.



Best regards,
Tony.


Re: Other European languages on a US keyboard

2006-07-24 Thread cga2000
On Mon, Jul 24, 2006 at 03:36:54AM EDT, Matthew Winn wrote:
 On Sun, Jul 23, 2006 at 06:41:09PM -0400, cga2000 wrote:
  Avoid words such as coeur.. boeuf.. etc.  Rather amazing that the
  French who are so picky about anything that concerns their language
  never came up with a codepage.. or whatever it's called that features
  this particular character.  
 
 I think it dates from the days when typewriters were popular.  The US
 dominance of the market for office equipment prompted many European
 languages to manage without combinations like oe, ae and ij where the
 characters can be approximated by typing separate letters.  It's easier
 to change typing habits than to manufacture a new range of typewriters
 just to deal with one special letter.

.. hmm.. as far as I know only France and Germany went to the trouble of
designing their own typewriter keyboard layouts separate from the QWERTY
model.  I think Polish keyboards are derived from the German layout..  I
would assume variations of the French layout are used in other
French-speaking countries and some African countries..  As to other
European countries - ie. the ones that speak neither French nor German -
I believe that you are correct and that they use derivatives of the US
keyboard.  

Therefore, since the French went so far as building keyboards that have
the basic letters arranged differently (AZERTY instead of QWERTY) it
would not have been such a major enhancement to provide an oe some
place on that keyboard..? 

I have a feeling it is more a question of whoever designed the original
French typewriter keyboard just did not think it worth bothering with
such typographic niceties as providing an o dans l'e (or is it the
other way round?) when the end result with fixed-width characters was
going to be light-years removed from the refinements of traditional
typesetting anyway..

But I would agree that the absence of the oe on French keyboards
(typewriters and computers alike) probably accounts for the fact that
you can't find it anywhere in the latin* charsets.
 
 Prior to computers many keyboards didn't even have separate keys for
 the digits 1 and 0, typists using the letters l and O instead.

I was aware of the l/1 thing.. sometimes use it when I feel lazy.

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-24 Thread Russell Bateman

As you say, warning: off-topic post. Read at your own risk.

This discussion underlines all the more strongly why I don't attempt to 
produce final documents using vim: I sometimes use an actual word 
processor like Open Office Writer, but mostly I write in HTML and, of 
course, the best HTML editor on the planet is...


...vim!

Russ

P.S. Yes, typing eacute; , oelig; and uuml; is painful, but I'm one 
of those perfectionists who would have used half-spacing back in the old 
days if I had been in need of such things. My father used a non-electric 
typewriter, but I was 19 before I moved to France from the US and needed 
what wasn't on the keyboard. After coming back at 25 (some 26+ years ago 
now), I never lost the need to communicate and product documents of with 
accents, digraphs, etc. in fact, I added the need to compose classical 
Greek texts while in France, but that's a whole other mess.



A.J.Mechelynck wrote:


Warning: off-topic post. Read at your own risk.

Before computers, I used a French typewriter keyboard (AZERTY type). 
Nowadays I use a Belgian computer keyboard (also AZERTY but with 
special characters arranged differently). My father has an old 
typewriter he bought in Switzerland when he was a student, and it uses 
a QWERTZ layout. (Switzerland has four official languages, viz. 
German, French, Italian and Romanche; and I don't know how many 
different keyboards they use.)


On a mechanical typewriter, it was possible to use half-spacing by 
holding the space bar down. So, if one wanted to produce the oe 
digraph on a French typewriter (not an electric one though), it was 
possible -- for a perfectionist. Let's say I wanted to type boeuf (= 
beef/ox):


1. press and hold spacebar. This advances the carriage by one half space
2. hit b. This prints b without moving the carriage.
3. release, press and hold spacebar.
4. hit o
5. release spacebar. The carriage is now over the right half of the o.
6. hit e u f in succession.

The oe digraph is called o, e dans l'o and the ae digraph is called 
in French a, e dans l'a. The latter as in Serge Gainsbourg's song 
elaeudanla téitéia (which spells the name Laetitia).


French typewriters indeed seldom had the digits one and zero: 
small-ell and big-oh were used insted. But it even carried over to 
computers: Several decades ago (before the merger with Honeywell), the 
(French) Bull computer company used on its computers a charset where 
the same character could mean either zero or O-for-Oscar depending on 
context -- and another one, I think, could mean one or I-for-India. 
(Few computers had lowercase in those days.) This, of course, caused 
headaches without end when trying to convert those computers' magnetic 
tapes to IBM's BCD and EBCDIC standards or to (whose? PDP? CDC? 
other?) ASCII.


I'm not sure non-English non-French non-German speaking countries all 
use a US-derived keyboard, even if we limit ourselves to those that 
use variants of the Latin alphabet. Typewriters, after all, date back 
to (I think) before World War I, a time when English was much less 
dominant internationally than it is now. At the courts of 
St-Petersburg and Potsdam, French was spoken; Germany and Austria 
together covered (or had recently covered) a territory that went from 
Alsace to Silesia and from Schleswig-Holstein to the plain of the Po. 
I suspect that most of Central Europe would have adopted a 
German-derived (or maybe French-derived) keyboard regardless of 
whether the majority language was Czech, Slovak, Italian, Hungarian, 
Croatian...


I agree that the lack of oe OE digraphs in the Latin charsets is 
probably due to their absence on French typewriter keyboards. (AE ae 
were kept because they are used in Danish.) There is more than a 
single-letter difference with English though: not only the layout is 
different but there are several accented letters. The French (and 
Belgian) keyboards have a dead key for circumflex and 
trema/diaeresis/umlaut, but à ç é è ù and sometimes 
uppercase-C-cedilla each have their own glyphs. (In French, uppercase 
letters with the exception of C-cedilla and sometimes E-acute were 
usually left unaccented. I believe computers are slowly pushing back 
the trend.)



Best regards,
Tony.




Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck

Russell Bateman wrote:

As you say, warning: off-topic post. Read at your own risk.

This discussion underlines all the more strongly why I don't attempt to 
produce final documents using vim: I sometimes use an actual word 
processor like Open Office Writer, but mostly I write in HTML and, of 
course, the best HTML editor on the planet is...


...vim!


;-)



Russ

P.S. Yes, typing eacute; , oelig; and uuml; is painful, but I'm one 
of those perfectionists who would have used half-spacing back in the old 
days if I had been in need of such things. My father used a non-electric 
typewriter, but I was 19 before I moved to France from the US and needed 
what wasn't on the keyboard. After coming back at 25 (some 26+ years ago 
now), I never lost the need to communicate and product documents of with 
accents, digraphs, etc. in fact, I added the need to compose classical 
Greek texts while in France, but that's a whole other mess.


... and I bought an IBM ball typewriter so I could type not only Latin 
but also Greek and Russian. I even lent it to the World Esperanto 
Congress of 1982 in Antwerp, so journalists could write home in their 
respective languages. Don't know if they used it (the keyboard layout 
must have felt weird to them).


Somewhere at vim-online I have a script htmlmap.vim which 
auto-converts ç to ccedil; É to Eacute; etc. as you type; and F12 o e 
to #339; etc. (not all browsers understand oelig; -- or did when I 
started: Netscape 4, e.g., didn't).  (Useful when the browser doesn't 
know if it's reading UTF-8 or some flavour of Latin.) Source it from 
%HOME%\vimfiles\after\ftplugin\html.vim or ~/.vim/after/plugin/html.vim. 
I'm not sure if it works when 'encoding' is set to UTF-8 though.



Best regards,
Tony.


RE: Other European languages on a US keyboard

2006-07-24 Thread Gene Kwiecinski
Rare enough .. but besides oeuf is also occurs in such very common
words as voeu [wish] and coeur [heart] and it really bothers me
when
I see them incorrectly spelled in web pages for instance.  I spot it
and
after that I tend to lose focus and not be able to take in what I'm
reading for a short while.

How're they misspelled?


Or, if none of the distributed keymaps is exactly what you want, you
can 
write your own. It isn't hard. See :help :loadkeymap for the
theory, 
and look at the contents of Bram's $VIMRUNTIME/keymap/accents.vim
and my 
$VIMRUNTIME/keymap/esperanto_utf8.vim for a couple of simple
examples. 

Already started on this:  copied accents.vim to ~/.vim/keymap/ ..
renamed it to foreign.vim and added the Spanish inverted question /
exclamation marks - an for now I have mapped to !! and ??. 

Come to think of it, French would appear to have the most annoying
spelling system of the West European languages that I have some degree
of familiarity with.  Spanish, Italian, and German seem to use fewer
non-ASCII characters.  

In order to set up my foreign language keymap correctly I would really
need tables of all the characters that occur in these languages, decide
which ones are common enough to be worth adding to the keymap, and make
sure I build a scheme that's coherent before I get my fingers to
memorize it.  I'll scour the Wiki's later today.. see if I can find
anything useful.

If you wouldn't mind, definitely keep me in the loop on this one, as
I've got something of an interest.

Offhand, some contributions and questions:

beta-looking SS (German)
slashed 'l' (Polish)
slashed 'o' (Scandinavian or thereabouts, not sure if Dutch or other)
AElig/aelig/OElig/oelig (Latin, etc.)
ccedil/Ccedil (how done, ,C?)
ecedil(?) (also Polish, possibly other vowels, 'though don't recall
offhand)

Oh, someone on the list is native Polish, so might ask him.  Was it
Mikolaj?

Dunno anyone Dutch who'd recall the slashed-'o'.

How to enter Aring (eg, Aring;ngstrom)?  oA??  Synonymous with aa
(eg, Haas == Haring;s?)


Oh, well...


Re: Other European languages on a US keyboard

2006-07-24 Thread Charles E Campbell Jr

cga2000 wrote:


I sometimes need to write text in other languages such as French,
Spanish and occasionally German or Italian. ..snip..

I would like to do this in Vim.

Unfortunately I only have a US keyboard.
 



Have you considered EasyAccents.vim?

 http://vim.sourceforge.net/scripts/script.php?script_id=451

It doesn't use the spelling checker in vim 7.0, but it accepts
a'
a`
a^
a:
etc to generate accented characters.  Easy to turn on and off, too: \eza 
toggles.


Regards,
Chip Campbell



Re: Other European languages on a US keyboard

2006-07-24 Thread Mikolaj Machowski
 Schleswig-Holstein to the plain of the Po. I suspect that most of
 Central Europe would have adopted a German-derived (or maybe
 French-derived) keyboard regardless of whether the majority language was
 Czech, Slovak, Italian, Hungarian, Croatian...

In fact Polish traditional keyboard is modelled after German layout
(with addition of Polish diacritics on separate keys). Only very fast
spreading of computers in the beginning of '90 connected with lack of
production of those keyboards lead to wide adoption of Polish programmer
keyboard (US 101 with diacritics by AltGr not separate keys).

m.



Re: Other European languages on a US keyboard

2006-07-24 Thread cga2000
On Mon, Jul 24, 2006 at 10:50:33AM EDT, Gene Kwiecinski wrote:
 Rare enough .. but besides oeuf is also occurs in such very common
 words as voeu [wish] and coeur [heart] and it really bothers me
 when
 I see them incorrectly spelled in web pages for instance.  I spot it
 and
 after that I tend to lose focus and not be able to take in what I'm
 reading for a short while.
 
 How're they misspelled?
 
well .. can't give you a sample of the correct spelling - ie. the
single oe character instead of the o .. a space and the e.. -
since I am not running with an UTF8 locale.. and even that might not
work if you are running latin1 for instance.

so to get my meaning you need to go to 

www.wordreference.com

.. enter egg in the Enter Word field and check the English to
French radio button.. Enter .. and a few lines down you should see
a bunch of oeuf correctly spelled .. bad egg - oeuf pourri, for
example.. provided your browser is set up to display UTF8, naturally..

otherwise.. I'll have to put up a screenshot some place..
 
[...]

 In order to set up my foreign language keymap correctly I would really
 need tables of all the characters that occur in these languages, decide
 which ones are common enough to be worth adding to the keymap, and make
 sure I build a scheme that's coherent before I get my fingers to
 memorize it.  I'll scour the Wiki's later today.. see if I can find
 anything useful.
 
 If you wouldn't mind, definitely keep me in the loop on this one, as
 I've got something of an interest.

Not much joy where finding tables of all characters used in typesetting
for different languages, I'm afraid.

I did find this regarding keyboard layouts though:

http://www-306.ibm.com/software/globalization/topics/keyboards/registry_index.jsp

.. as usual I had a very simplified vision of the problem.
 
 Offhand, some contributions and questions:
 
 beta-looking SS (German)

.. used to know this as an s-tset (phonetic rendering..)

 slashed 'l' (Polish)

no knowledge of slavonic languages here.. sorry.

looks like Poland has somewhat switched to a US-derived keyboard where
computers are concerned.  And Tony was right assuming that most central
European countries - Czech.. Slovak.. Slovenian/Croatian.. Moldovan..
Hungarian.. even Rumanian have keyboards that are derived from the
German layout.

 slashed 'o' (Scandinavian or thereabouts, not sure if Dutch or other)

don't think Dutch has this.

 AElig/aelig/OElig/oelig (Latin, etc.)

Danes  Norwegians have a key for AElig right on the keyboard.

 ccedil/Ccedil (how done, ,C?)

Some keyboards - French.. Italian.. Portuguese.. have a ccedil key

 ecedil(?) (also Polish, possibly other vowels, 'though don't recall
 offhand)
 
Doesn't appear to be one on the Polish keyboard as pictured at the IBM
site.

 Oh, someone on the list is native Polish, so might ask him.  Was it
 Mikolaj?
 
 Dunno anyone Dutch who'd recall the slashed-'o'.
 
Can't think a town in Holland that has this .. so I would assume it's
not indigenous (?)

 How to enter Aring (eg, Aring;ngstrom)?  oA??  Synonymous with aa
 (eg, Haas == Haring;s?)
 
 
 Oh, well...

Yes, I could say I agree with that last remark..

Vast subject.. but quite fascinating.. :-)

Real glad I started this thread.. Learned some useful stuff here..

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck
Warning: this email is in UTF-8. U+ below (where  is in hex) 
is the Unicode notation for a character (Unicode codepoint) given by value.


Gene Kwiecinski wrote:

Rare enough .. but besides oeuf is also occurs in such very common
words as voeu [wish] and coeur [heart] and it really bothers me

when

I see them incorrectly spelled in web pages for instance.  I spot it

and

after that I tend to lose focus and not be able to take in what I'm
reading for a short while.


How're they misspelled?


the above should all have œ (oe digraph, as one character) not o 
followed by e (two characters)






Or, if none of the distributed keymaps is exactly what you want, you
can 

write your own. It isn't hard. See :help :loadkeymap for the
theory, 

and look at the contents of Bram's $VIMRUNTIME/keymap/accents.vim
and my 

$VIMRUNTIME/keymap/esperanto_utf8.vim for a couple of simple
examples. 


Already started on this:  copied accents.vim to ~/.vim/keymap/ ..
renamed it to foreign.vim and added the Spanish inverted question /
exclamation marks - an for now I have mapped to !! and ??. 



Come to think of it, French would appear to have the most annoying
spelling system of the West European languages that I have some degree
of familiarity with.  Spanish, Italian, and German seem to use fewer
non-ASCII characters.  


Spanish and Italian both can accent any vowel (grave accent in Italian, 
acute accent in Italian), and Spanish also has inverted ? and !


Catalan has some grave-accented vowels, some acute-accented ones, some 
which can take either, the c-cedilla, and also a lettergroup which I've 
seen in no other language: ll with a dot at mid-height between them, to 
mean geminated l (pron. as French ll) as opposed to palatalized l 
(pron. more or less as Spanish ll). ŀl, U+0140 U+006C, decimal 320 108; 
K l . followed by plain l. Examples: geminated in coŀlinear, palatalized 
in coll (mountain pass). The Spanish inverted ? and ! are optional in 
Catalan but can be used in the middle of a sentence to mark the start of 
the question or exclamation. The uppercase equivalent (ĿL, used only in 
full-capitals titles since the geminated ells must be part of different 
syllables, has Ŀ = U+013F, dec. 319, Ctrl-K L .





In order to set up my foreign language keymap correctly I would really
need tables of all the characters that occur in these languages, decide
which ones are common enough to be worth adding to the keymap, and make
sure I build a scheme that's coherent before I get my fingers to
memorize it.  I'll scour the Wiki's later today.. see if I can find
anything useful.


If you wouldn't mind, definitely keep me in the loop on this one, as
I've got something of an interest.

Offhand, some contributions and questions:

beta-looking SS (German)


ß eszet, U+00DF, decimal 223. Don't know if the accents keymap has it. 
With digraphs: Ctrl-K s s



slashed 'l' (Polish)


ł -- don't know the name; I'd guess hard l (pronounced more or less as 
w in Polish but etymologically related to what the Russian pronounce 
like the ll in English bell). Don't think accents has it as it is 
255. U+0142, decimal 322, Ctrl-K l / -- Uppercase: Ł U+0141, dec. 321, 
Ctrl-K L /



slashed 'o' (Scandinavian or thereabouts, not sure if Dutch or other)


Danish and Norwegian, not Swedish, equivalent of German/Swedish ö. ø, 
U+00F8, decimal 248, Ctrl-K o / -- Uppercase: Ø, U+00D8, decimal 216, 
Ctrl-K O / -- I guess /O and /o with the accents keymap. In other 
languages, a similar or identical glyph is used to mean diameter.



AElig/aelig/OElig/oelig (Latin, etc.)


Among modern languages:
  Danish
Æ  AE  U+00C6  dec.198   Ctrl-K A E
æ  ae  U+00E6  dec.230   Ctrl-K a e
  French
Π OE  U+0152  dec.338   Ctrl-K O E
œ  oe  U+0153  dec.339   Ctrl-K o e


ccedil/Ccedil (how done, ,C?)


in the accents keymap, Ithink so. With digraphs, Ctrl-K c , or Ctrl-K 
C , -- U+00C7, decimal 199 (Ç) and U+00E7, decimal 231 (ç).



ecedil(?) (also Polish, possibly other vowels, 'though don't recall
offhand)


ę, it's not a cedilla, (it's turned the other way), it's called an 
ogonek (thus: e-ogonek). If your keymap hasn't got it, it's Ctrl-K e ; 
-- U+0119, decimal 281.




Oh, someone on the list is native Polish, so might ask him.  Was it
Mikolaj?

Dunno anyone Dutch who'd recall the slashed-'o'.


Its not Dutch, its Danish and Norwegian. Dutch only has ij IJ (usually 
typed as two letters, it's up to the browser to fetch the presentation 
form if deemed appropriate, like with French fi ffi fl ffl etc. IJ is 
used e.g. in IJsland (Iceland), IJszee (the Glacial Ocean, usually the 
Artic, but there is also a Zuidelijke IJszee around the Antartic 
continent), IJ (a river in the Netherlands, with the town of IJmuiden at 
its mouth IIRC), IJssel (two other rivers: Hollandse IJssel and Gelderse 
IJssel; the latter ends up in the IJsselmeer); ijs (ice) becomes IJs at 
the start of a sentence, etc. Dutch also has some accented vowels in 

Re: Other European languages on a US keyboard

2006-07-24 Thread cga2000
On Mon, Jul 24, 2006 at 09:02:34AM EDT, Russell Bateman wrote:
 As you say, warning: off-topic post. Read at your own risk.

.. don't see this as OT.. Being lazy I skipped the .. in Vim in the
subject.. 
 
 This discussion underlines all the more strongly why I don't attempt to 
 produce final documents using vim: I sometimes use an actual word 
 processor like Open Office Writer, but mostly I write in HTML and, of 
 course, the best HTML editor on the planet is...

Maybe I should dump LaTeX and use HTML.. I printed some of the
TeX-gen'd stuff and it just looks too beautiful.. I end up with
correspondence that looks more like pages torn out of an expensive
book..  
 
 ...vim!
 
agreed.. natch'..

 Russ
 
 P.S. Yes, typing eacute; , oelig; and uuml; is painful, but I'm one 
 of those perfectionists who would have used half-spacing back in the old 
 days if I had been in need of such things. My father used a non-electric 
 typewriter, but I was 19 before I moved to France from the US and needed 
 what wasn't on the keyboard. After coming back at 25 (some 26+ years ago 
 now), I never lost the need to communicate and product documents of with 
 accents, digraphs, etc. 

yes.. have a feeling only the folks at the  Académie Française and
enlightened foreigners are really concerned about this these days.. I'm
sure the French don't take notice or give a damn whether you write
laetitia or lætitia.. 

Hope you get my accents.. cedillas.. and e dans l'a above..

 in fact, I added the need to compose classical 
 Greek texts while in France, but that's a whole other mess.
 
[..]

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-24 Thread cga2000
On Mon, Jul 24, 2006 at 08:37:47AM EDT, A.J.Mechelynck wrote:
 Warning: off-topic post. Read at your own risk.
 
[..]
 Before computers, I used a French typewriter keyboard (AZERTY type). 
 Nowadays I use a Belgian computer keyboard (also AZERTY but with 
 special characters arranged differently). My father has an old 
 typewriter he bought in Switzerland when he was a student, and it uses a 
 QWERTZ layout. (Switzerland has four official languages, viz. German, 
 French, Italian and Romanche; and I don't know how many different 
 keyboards they use.)
 
 On a mechanical typewriter, it was possible to use half-spacing by 
 holding the space bar down. So, if one wanted to produce the oe digraph 
 on a French typewriter (not an electric one though), it was possible -- 
 for a perfectionist. Let's say I wanted to type boeuf (= beef/ox):
 
 1. press and hold spacebar. This advances the carriage by one half space
 2. hit b. This prints b without moving the carriage.
 3. release, press and hold spacebar.
 4. hit o
 5. release spacebar. The carriage is now over the right half of the o.
 6. hit e u f in succession.
 
.. makes my mouth water.. I should try Ebay .. see if I can find an
affordable high-end typewriter that does such fancy stuff.

 The oe digraph is called o, e dans l'o and the ae digraph is called in 
 French a, e dans l'a. The latter as in Serge Gainsbourg's song 
 elaeudanla téitéia (which spells the name Laetitia).

phew.. this one took me a couple of minutes to figure out.. !!
 
 French typewriters indeed seldom had the digits one and zero: small-ell 
 and big-oh were used insted. But it even carried over to computers: 
 Several decades ago (before the merger with Honeywell), the (French) 
 Bull computer company used on its computers a charset where the same 
 character could mean either zero or O-for-Oscar depending on context -- 
 and another one, I think, could mean one or I-for-India. (Few computers 
 had lowercase in those days.) This, of course, caused headaches without 
 end when trying to convert those computers' magnetic tapes to IBM's BCD 
 and EBCDIC standards or to (whose? PDP? CDC? other?) ASCII.

Hehe.. maybe a bit of OT at one point in the designers' career
wouldn't have hurt.. Sounds like the year 2000 business.. but worse..

What did they do?  Hired a few thousand data entry folks to do the
conversion..  Not sure regex's had been invented at the time.

.. anyway .. as I always say we should all go back to writing in Latin 
Roman numerals..
 
 I'm not sure non-English non-French non-German speaking countries all
 use a US-derived keyboard, even if we limit ourselves to those that
 use variants of the Latin alphabet. Typewriters, after all, date back
 to (I think) before World War I, a time when English was much less
 dominant internationally than it is now. At the courts of
 St-Petersburg and Potsdam, French was spoken; Germany and Austria
 together covered (or had recently covered) a territory that went from
 Alsace to Silesia and from Schleswig-Holstein to the plain of the Po.
 I suspect that most of Central Europe would have adopted a
 German-derived (or maybe French-derived) keyboard regardless of
 whether the majority language was Czech, Slovak, Italian, Hungarian,
 Croatian...

quick googling for keyboard layout shows that you are correct. 

I don't trust Wiki's 100% but this page has some useful keyboard
layouts:

http://en.wikipedia.org/wiki/Keyboard_layout
 
 I agree that the lack of oe OE digraphs in the Latin charsets is 
 probably due to their absence on French typewriter keyboards. (AE ae 
 were kept because they are used in Danish.) There is more than a 
 single-letter difference with English though: not only the layout is 
 different but there are several accented letters. The French (and 
 Belgian) 

.. somewhat to my surprise the Belgian keyboard uses the AZERTY layout
while the Netherlands use QWERTY.  But then this would make sense since
as far as I recall Dutch/Flemish is pretty limited to the ASCII charset
and that's obviously available on AZERTY keyboards. So they only needed
to accomodate the French-speaking community. But doesn't Belgium also
have a German-speaking community?  Ah.. maybe it was just that most
businesses were owned by French-speaking Belgians at the time the layout
was adopted.. 

 keyboards have a dead key for circumflex and trema/diaeresis/umlaut,
 but à ç é è ù and sometimes uppercase-C-cedilla each have their own
 glyphs. (In French, uppercase letters with the exception of C-cedilla
 and sometimes E-acute were usually left unaccented. I believe
 computers are slowly pushing back the trend.)
 
Actually I found that there is such a thing as a US International
Keyboard and maybe I could acquire one of those since it all the fancy
characters that I would want..   
 
 Best regards, Tony.

Thanks much for all this pre-computer days lore..!

Doesn't hurt to know a little something about where we came from..

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-24 Thread Russell Bateman
Of course, we all realize that the original difference between AZERTY 
and QWERTY was the analyzed solutions to the problem of the likelihood 
of two typewriter hammers striking the platen in close enough succession 
that they would jam together and get stuck. Accents arose as a 
distinction only because the French decided, based presumably on a 
letter frequency analysis that AZERTY was the optimal key arrangement 
based on letter frequency in French words while Americans  (I've never 
noticed what they type on in England) chose QWERTY. It was always a 
puzzle to me in my childhood as to why the keys weren't arranged in a 
more obvious fashion. It wasn't answered until, as I was acquainted with 
the Dvorak key layout, it was explained to me why typewriter keys had 
been arranged like that in the first place. Of course, now, it's all 
just tradition--strong enough that the Dvorak guys haven't carried the 
day and the chording guys are just a lone voice in the wilderness.




cga2000 wrote:

On Mon, Jul 24, 2006 at 08:37:47AM EDT, A.J.Mechelynck wrote:
  

Warning: off-topic post. Read at your own risk.



[..]
  
Before computers, I used a French typewriter keyboard (AZERTY type). 
Nowadays I use a Belgian computer keyboard (also AZERTY but with 
special characters arranged differently). My father has an old 
typewriter he bought in Switzerland when he was a student, and it uses a 
QWERTZ layout. (Switzerland has four official languages, viz. German, 
French, Italian and Romanche; and I don't know how many different 
keyboards they use.)


[snip]



  




RE: Other European languages on a US keyboard [OT]

2006-07-24 Thread Max Dyckhoff
This thread reminded me of an experiment I saw a couple of years ago
that really interested me, given my background in AI.

http://www.visi.com/~pmk/evolved.html

To summarize, a guy is trying to evolve a good keyboard layout by
deriving interesting metrics. A use for genetic algorithms at last!
Regrettably, he didn't have much luck, with the Dvorak layout winning
out against all his genetically created layouts, which implies that
either his heuristics weren't strong enough, or he didn't let it evolve
for long enough.

When I have the time I intend to attempt his experiment myself to see if
I can produce any useful results.

Max

 -Original Message-
 From: Russell Bateman [mailto:[EMAIL PROTECTED]
 Sent: Monday, July 24, 2006 1:26 PM
 To: vim@vim.org
 Subject: Re: Other European languages on a US keyboard
 
 Of course, we all realize that the original difference between AZERTY
 and QWERTY was the analyzed solutions to the problem of the likelihood
 of two typewriter hammers striking the platen in close enough
succession
 that they would jam together and get stuck. Accents arose as a
 distinction only because the French decided, based presumably on a
 letter frequency analysis that AZERTY was the optimal key arrangement
 based on letter frequency in French words while Americans  (I've never
 noticed what they type on in England) chose QWERTY. It was always a
 puzzle to me in my childhood as to why the keys weren't arranged in a
 more obvious fashion. It wasn't answered until, as I was acquainted
with
 the Dvorak key layout, it was explained to me why typewriter keys had
 been arranged like that in the first place. Of course, now, it's all
 just tradition--strong enough that the Dvorak guys haven't carried the
 day and the chording guys are just a lone voice in the wilderness.
 
 
 
 cga2000 wrote:
  On Mon, Jul 24, 2006 at 08:37:47AM EDT, A.J.Mechelynck wrote:
 
  Warning: off-topic post. Read at your own risk.
 
 
  [..]
 
  Before computers, I used a French typewriter keyboard (AZERTY
type).
  Nowadays I use a Belgian computer keyboard (also AZERTY but with
  special characters arranged differently). My father has an old
  typewriter he bought in Switzerland when he was a student, and it
uses
 a
  QWERTZ layout. (Switzerland has four official languages, viz.
German,
  French, Italian and Romanche; and I don't know how many different
  keyboards they use.)
 
  [snip]
 
 
 



Re: Other European languages on a US keyboard

2006-07-24 Thread cga2000
On Mon, Jul 24, 2006 at 12:05:02PM EDT, Charles E Campbell Jr wrote:
 cga2000 wrote:
 
 I sometimes need to write text in other languages such as French,
 Spanish and occasionally German or Italian. ..snip..
 
 I would like to do this in Vim.
 
 Unfortunately I only have a US keyboard.
  
 
 
 Have you considered EasyAccents.vim?
 
  http://vim.sourceforge.net/scripts/script.php?script_id=451

That's exactly the sort of thing I initially had in mind but since I've
already spent some time getting familiar with the standard (?) Vim
solution (:set keymap=) and it's a breeze to customize I'll probably
stick with that.  

What I like abut the :set keymap solution is that if you wait a
fraction of a second between ' and e for instance Vim realizes that
you want an apostrophe followed by an e.. not an e with acute accent
and moves the cusor to the next position..  Another way to achieve this
is to map two apostrophes to the (single) apostrophe in your keymap
description.. so you type two ''s in quick succession when you want an
apostrophe rather than an accented letter..  But I find the former
method more natural.

Thanks all the same. Appreciate your help.
 
 It doesn't use the spelling checker in vim 7.0, but it accepts
 a'
 a`
 a^
 a:
 etc to generate accented characters.  Easy to turn on and off, too: \eza 
 toggles.
 
For some bizarre reason .. something in my Vim setup .. I've never
managed to get this '\' leader escape character to work.  So for the
vimspell plugin for instance I have to type the :* commands.

 Regards,
 Chip Campbell


Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck

cga2000 wrote:

On Mon, Jul 24, 2006 at 08:37:47AM EDT, A.J.Mechelynck wrote:

Warning: off-topic post. Read at your own risk.

[...]
On a mechanical typewriter, it was possible to use half-spacing by 
holding the space bar down. So, if one wanted to produce the oe digraph 
on a French typewriter (not an electric one though), it was possible -- 
for a perfectionist. Let's say I wanted to type boeuf (= beef/ox):


1. press and hold spacebar. This advances the carriage by one half space
2. hit b. This prints b without moving the carriage.
3. release, press and hold spacebar.
4. hit o
5. release spacebar. The carriage is now over the right half of the o.
6. hit e u f in succession.


.. makes my mouth water.. I should try Ebay .. see if I can find an
affordable high-end typewriter that does such fancy stuff.


You might have to buy an antique: only mechanical (not electrical) 
typewriters behaved this way.


[...]

.. somewhat to my surprise the Belgian keyboard uses the AZERTY layout
while the Netherlands use QWERTY.  But then this would make sense since
as far as I recall Dutch/Flemish is pretty limited to the ASCII charset
and that's obviously available on AZERTY keyboards. So they only needed
to accomodate the French-speaking community. But doesn't Belgium also
have a German-speaking community?  Ah.. maybe it was just that most
businesses were owned by French-speaking Belgians at the time the layout
was adopted.. 


German, or German together with French, is spoken in parts of three 
cantons (i.e., in part of the territory of three Justices of the 
Peace), near the German border. The Brussels area is bilingual 
French/Dutch; a number of municipalities near the French/Dutch language 
border have, at least in theory, a protected minority of the opposite 
linguistic persuasion. Said border runs approximately East-West from 
somewhere between Dunkirk and Lille to the vicinity of Maastricht where 
it hits the German language area.


In the 19th century, the Belgian high classes (nobility, liberal 
professions, rich merchants, etc.) spoke French as their mother language 
all over the country. In 1830 (15 years after the merger) the Catholic 
Belgians revolted against the Calvinist Dutch, then (after rejecting 
several other candidates) accepted as king Leopold of Saxony-Coburg, a 
Protestant who was the uncle of Queen Victoria, and married to a 
daughter of Louis-Philippe Premier, roi des Français (or did he marry 
her for the occasion?). Louis-Philippe's dynasty was ended for good 
after only 18 years but Leopold's has endured (with some ups and downs) 
to this day. The Flemish nationalistic movement was not necessarily 
created, but at least sponsored by the Germans during both world wars as 
a possible use of the Divide et impera maxim. The Dutch language has 
gained equality with French over the Kingdom in general and a 
quasi-monopole north of the language border, but there is still a strong 
feeling of resentment against the French-speaking oppressor among part 
of the Flemish. The far-right racist and nationalistic Vlaams Belang 
(formerly Vlaams Blok) party has, IIRC, some 20% of the votes in the 
harbour city of Antwerp... especially in rich boroughs where hardly any 
Muslim, Jew or French-speaker have ever been sighted... But I digress.





keyboards have a dead key for circumflex and trema/diaeresis/umlaut,
but à ç é è ù and sometimes uppercase-C-cedilla each have their own
glyphs. (In French, uppercase letters with the exception of C-cedilla
and sometimes E-acute were usually left unaccented. I believe
computers are slowly pushing back the trend.)


Actually I found that there is such a thing as a US International
Keyboard and maybe I could acquire one of those since it all the fancy
characters that I would want..   


Oh, golly! Good to know. Hope you won't get stuck with a keyboard that 
your OS doesn't recognise.



Best regards, Tony.


Thanks much for all this pre-computer days lore..!

Doesn't hurt to know a little something about where we came from..

Thanks

cga




My pleasure.


Best regards,
Tony.


Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck

Russell Bateman wrote:
Of course, we all realize that the original difference between AZERTY 
and QWERTY was the analyzed solutions to the problem of the likelihood 
of two typewriter hammers striking the platen in close enough succession 
that they would jam together and get stuck. Accents arose as a 
distinction only because the French decided, based presumably on a 
letter frequency analysis that AZERTY was the optimal key arrangement 
based on letter frequency in French words while Americans  (I've never 
noticed what they type on in England) chose QWERTY. It was always a 
puzzle to me in my childhood as to why the keys weren't arranged in a 
more obvious fashion. It wasn't answered until, as I was acquainted with 
the Dvorak key layout, it was explained to me why typewriter keys had 
been arranged like that in the first place. Of course, now, it's all 
just tradition--strong enough that the Dvorak guys haven't carried the 
day and the chording guys are just a lone voice in the wilderness.


I don't follow what you're saying about accents (typo?). The French have 
used accented letters since (IIUC) before Gutenberg invented printing. 
Yes, the various typewriter keyboards are supposed to be the result of 
some ergology research, but I don't know the details. IIRC the QWERTY 
keyboard was invented in England at a time when that was the leading 
industrial country in the world. W is quite rare in French while -ez is 
part of the universal second-person-plural ending of verbs; but wouldn't 
the latter make one think that placing z next to e (well, next on the 
keyboard, and separated by only S 3 X in the machinery) would cause 
_more_ jamming in mechanical typewriters? The q is safely away from the u...



Best regards,
Tony.


Re: Other European languages on a US keyboard

2006-07-24 Thread Christian Ebert
* A.J.Mechelynck on Saturday, July 22, 2006 at 22:40:45 +0200:
 The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
 neither in capitals (as for titles or if the word oeuf [egg] is the 
 first of a sentence), nor in lowercase. You need UTF-8 for it,

No. Just latin9 or ISO8859-15 (Look at the header of this mail).

Mon cœur.

This is on a Mac with a German keyboard, but using actually an
American keyboard layout. I enter the œ with Alt-q (the Alt
key on Mac keyboard corresponds to the Modifier key on other
keyboards I believe).

$ echo $LANG
en_US.ISO8859-15

Alain Bench @ mutt-users pointed me to this site:

http://www.in-ulm.de/~mascheck/locale/

where you can get also a little utility at

http://www.in-ulm.de/~mascheck/locale/checklocale.c

It gives the following output here:


[Latin1/9] If there's no real copyrightsymbol at the end of this sentence,
then your terminal/terminalemulator/font is not ISO8859-1/15 ready: ©

- Current environment settings:
  LANG= en_US.ISO8859-15

- Implicitly setting all locale categories with LANG succeeded.

  Testing LC_CTYPE with isprint():
  # # # # # # # # # # # # # # # # 
  # # # # # # # # # # # # # # # # 
!  # $ %  ' ( ) * + , - . / 
  0 1 2 3 4 5 6 7 8 9 : ;  =  ? 
  @ A B C D E F G H I J K L M N O 
  P Q R S T U V W X Y Z [ \ ] ^ _ 
  ` a b c d e f g h i j k l m n o 
  p q r s t u v w x y z { | } ~ # 
  # # # # # # # # # # # # # # # # 
  # # # # # # # # # # # # # # # # 
    ¡ ¢ £ € ¥ Š § š © ª « ¬ ­ ® ¯ 
  ° ± ² ³ Ž µ ¶ · ž ¹ º » Œ œ Ÿ ¿ 
  À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï 
  Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß 
  à á â ã ä å æ ç è é ê ë ì í î ï 
  ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ 
  
- Testing LC_MESSAGES with perror(), but it's a libc message.

- Implicitly setting LC_CTYPE by LANG succeeded.
- Implicitly setting   LC_NUMERIC by LANG succeeded.
- Implicitly setting  LC_TIME by LANG succeeded.
- Implicitly setting   LC_COLLATE by LANG succeeded.
- Implicitly setting  LC_MONETARY by LANG succeeded.
- Implicitly setting  LC_MESSAGES by LANG succeeded.


Mind you, what you will actually see might depend eventual broken
mailer settings, missing characters in fonts etc.

But I can write cedille, ae, oe, a-ring, several vowels with
trema and even € EURO.

c
-- 
_B A U S T E L L E N_ lesen!  --- http://www.blacktrash.org/baustellen.html


Re: Other European languages on a US keyboard

2006-07-24 Thread Russell Bateman
My text was I think misleading. I meant to say that accents were neither 
here nor there in the arrangement of the keys. The French didn't choose 
the AZERTY arrangement of the keyboard on the basis of using or not 
using accents, but only because, presumably, it was the best solution to 
the stuck hammer problem. At least, that is what I surmised.


And, unacquainted with day-to-day English life as most Americans, I 
don't always know what was decided there and what here (in the US), 
though I'm never arrogant enough to think that it's all been decided here.


(Actually, and despite living in Paris for 6 years, I'm something of an 
Anglophile, but that's another story--check out my Inspector Morse site 
at http://www.windofkeltia.com/morse and my Allo, Allo page at 
http://www.windofkeltia.com/allo .)


Russ

A.J.Mechelynck wrote:

Russell Bateman wrote:
Of course, we all realize that the original difference between AZERTY 
and QWERTY was the analyzed solutions to the problem of the 
likelihood of two typewriter hammers striking the platen in close 
enough succession that they would jam together and get stuck. Accents 
arose as a distinction only because the French decided, based 
presumably on a letter frequency analysis that AZERTY was the optimal 
key arrangement based on letter frequency in French words while 
Americans  (I've never noticed what they type on in England) chose 
QWERTY. It was always a puzzle to me in my childhood as to why the 
keys weren't arranged in a more obvious fashion. It wasn't answered 
until, as I was acquainted with the Dvorak key layout, it was 
explained to me why typewriter keys had been arranged like that in 
the first place. Of course, now, it's all just tradition--strong 
enough that the Dvorak guys haven't carried the day and the chording 
guys are just a lone voice in the wilderness.


I don't follow what you're saying about accents (typo?). The French 
have used accented letters since (IIUC) before Gutenberg invented 
printing. Yes, the various typewriter keyboards are supposed to be the 
result of some ergology research, but I don't know the details. IIRC 
the QWERTY keyboard was invented in England at a time when that was 
the leading industrial country in the world. W is quite rare in French 
while -ez is part of the universal second-person-plural ending of 
verbs; but wouldn't the latter make one think that placing z next to e 
(well, next on the keyboard, and separated by only S 3 X in the 
machinery) would cause _more_ jamming in mechanical typewriters? The q 
is safely away from the u...



Best regards,
Tony.






Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck

Christian Ebert wrote:

* A.J.Mechelynck on Saturday, July 22, 2006 at 22:40:45 +0200:
The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
neither in capitals (as for titles or if the word oeuf [egg] is the 
first of a sentence), nor in lowercase. You need UTF-8 for it,


No. Just latin9 or ISO8859-15 (Look at the header of this mail).

Mon cœur.

This is on a Mac with a German keyboard, but using actually an
American keyboard layout. I enter the œ with Alt-q (the Alt
key on Mac keyboard corresponds to the Modifier key on other
keyboards I believe).

$ echo $LANG
en_US.ISO8859-15

[...]

Good to know that the Euro sign wasn't the only missing glyph added in 
ISO 8859-15.


There is an Alt key left of the spacebar on i86 machine's keyboards, but 
I guess you mean the Alt-Gr which is right of the spacebar. 
(Alt-something is used for menu shortcuts here.) AltGr-q gives me æ 
(aelig;), with shift Æ (AElig;).


I think I'm going to experiment with this AltGr key, apparently it gives 
a lot of new characters not always mentioned on the keys; and different 
ones depending on whether it is used alone or with Shift. [after trying] 
I can't find oelig;, I will have to continue pasting it from Vim when I 
want it in an email.



Best regards,
Tony.


RE: Other European languages on a US keyboard

2006-07-24 Thread Max Dyckhoff
I haven't been following this thread in its entirety, but there are the 
Windows Alt Keycodes that can solve your entry of the œ symbol, and many 
others. To enter œ all you need to do is HOLD Alt, and then enter 0156 on the 
keypad, and then release Alt.

Hardly a stylish solution, but easier than copy/pasting from Vim, I'm sure.

Max


 -Original Message-
 From: A.J.Mechelynck [mailto:[EMAIL PROTECTED]
 Sent: Monday, July 24, 2006 3:58 PM
 To: vim@vim.org
 Subject: Re: Other European languages on a US keyboard
 
 Christian Ebert wrote:
  * A.J.Mechelynck on Saturday, July 22, 2006 at 22:40:45 +0200:
  The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding,
  neither in capitals (as for titles or if the word oeuf [egg] is the
  first of a sentence), nor in lowercase. You need UTF-8 for it,
 
  No. Just latin9 or ISO8859-15 (Look at the header of this mail).
 
  Mon cœur.
 
  This is on a Mac with a German keyboard, but using actually an
  American keyboard layout. I enter the œ with Alt-q (the Alt
  key on Mac keyboard corresponds to the Modifier key on other
  keyboards I believe).
 
  $ echo $LANG
  en_US.ISO8859-15
 [...]
 
 Good to know that the Euro sign wasn't the only missing glyph added in
 ISO 8859-15.
 
 There is an Alt key left of the spacebar on i86 machine's keyboards, but
 I guess you mean the Alt-Gr which is right of the spacebar.
 (Alt-something is used for menu shortcuts here.) AltGr-q gives me æ
 (aelig;), with shift Æ (AElig;).
 
 I think I'm going to experiment with this AltGr key, apparently it gives
 a lot of new characters not always mentioned on the keys; and different
 ones depending on whether it is used alone or with Shift. [after trying]
 I can't find oelig;, I will have to continue pasting it from Vim when I
 want it in an email.
 
 
 Best regards,
 Tony.


Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck

Max Dyckhoff wrote:

I haven't been following this thread in its entirety, but there are the Windows Alt 
Keycodes that can solve your entry of the œ symbol, and many others. To enter œ 
all you need to do is HOLD Alt, and then enter 0156 on the keypad, and then release Alt.

Hardly a stylish solution, but easier than copy/pasting from Vim, I'm sure.

Max


Except that I never remember those numbers.

-Vim is intuitive:
To enter œ in Vim, I hit ^Koe (where ^K is Ctrl-K). No weird numbers to 
remember, just one ctrl-key for all digraphs. And is it possible even on 
Windows to use codes above 255 (in this case, Alt-0339 I suppose)?


-Vim is customizable:
When I want to type Russian, I use gvim with my own 
russian-phonetic_utf-8.vim keymap, where a maps to ah, b to beh, v to 
veh, g to gheh, etc. No weird keystrokes to remember.


-Vim is cross-platform:
On this SuSE Linux system, Alt-keypad codes just don't work. Vim, OTOH, 
works the same on both Windows and Linux.



Morality: Don't underrate Vim, especially not on its own mailing list. :-)


Best regards,
Tony.


Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck

cga2000 wrote:

On Mon, Jul 24, 2006 at 05:59:42PM EDT, Christian Ebert wrote:

* A.J.Mechelynck on Saturday, July 22, 2006 at 22:40:45 +0200:
The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
neither in capitals (as for titles or if the word oeuf [egg] is the 
first of a sentence), nor in lowercase. You need UTF-8 for it,

No. Just latin9 or ISO8859-15 (Look at the header of this mail).

Mon coeur.

This is on a Mac with a German keyboard, but using actually an
American keyboard layout. I enter the oe with Alt-q (the Alt
key on Mac keyboard corresponds to the Modifier key on other
keyboards I believe).


Could this be Mac-specific? 


I switched to encoding=latin9.

When I do a Ctrl-K o e and a Ctrl-K O E this is what I get:

½ ¼ 


confirmed by the :dig command.

I looked carefully at the output of :dig and I couldn't see our elusive
e dans l'o either.

So I switched to the French ISO-08859-15, then the US version of
latin9.. still can't find that o dans l'e.

Strange thing is that the font I use on terminals does have these two
characters (upper/lower case E dans l'O..) in the exact same spot Vim
displays the above fractions.. 


Try the following (in gvim):

 :echo has(multi_byte)

the answer should be 1. If it is zero, your version of gvim cannot 
handle UTF-8.


 :if tenc== | let tenc = enc | endif | set enc=utf-8
 :new

then i (set Insert mode) and ^Vu0153 (where ^V is Ctrl-V, unless you use 
Ctrl-V to paste, in which case it is Ctrl-Q).


If you see anything other than the oe digraph, then your 'guifont' is 
plain wrong. See http://vim.sourceforge.net/tips/tip.php?tip_id=632 
about how to choose a better one.




Best regards,
Tony.


Re: Other European languages on a US keyboard

2006-07-24 Thread cga2000
On Mon, Jul 24, 2006 at 08:29:10PM EDT, A.J.Mechelynck wrote:
 cga2000 wrote:
 On Mon, Jul 24, 2006 at 05:59:42PM EDT, Christian Ebert wrote:
 * A.J.Mechelynck on Saturday, July 22, 2006 at 22:40:45 +0200:
 The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
 neither in capitals (as for titles or if the word oeuf [egg] is the 
 first of a sentence), nor in lowercase. You need UTF-8 for it,
 No. Just latin9 or ISO8859-15 (Look at the header of this mail).
 
 Mon coeur.
 
 This is on a Mac with a German keyboard, but using actually an
 American keyboard layout. I enter the oe with Alt-q (the Alt
 key on Mac keyboard corresponds to the Modifier key on other
 keyboards I believe).
 
 Could this be Mac-specific? 
 
 I switched to encoding=latin9.
 
 When I do a Ctrl-K o e and a Ctrl-K O E this is what I get:
 
 ½ ¼ 
 
 confirmed by the :dig command.
 
 I looked carefully at the output of :dig and I couldn't see our elusive
 e dans l'o either.
 
 So I switched to the French ISO-08859-15, then the US version of
 latin9.. still can't find that o dans l'e.
 
 Strange thing is that the font I use on terminals does have these two
 characters (upper/lower case E dans l'O..) in the exact same spot Vim
 displays the above fractions.. 
 
 Try the following (in gvim):
 
.. with all the goings-on in this thread I never had a chance to
mention the fact that I do not use gvim. I try to do everything in a
terminal (under gnu/screen) because text-mode apps were designed for
the keyboard so they work a lot better than gui's for those of us who
prefer not to use mice.

  :echo has(multi_byte)
 
 the answer should be 1. If it is zero, your version of gvim cannot
 handle UTF-8.
 
Works fine if I switch my locale to UTF-8.  Vim automatically figures
what I want and :dig displays the o dans l'e (both the lower and upper
case versions) among a gazillon other digraphs. Then I can use the
ususal Ctrl-K oe .. save the file.. pass this on to LaTeX and provided I
have the correct LaTeX statements to activate UTF-8 (that's what took
forever to figure out the other day..) I get my coeurs, voeux and
boeufs rendered correctly in xdvi/gv .. *and* the the ensuing
printout looks great too.

The problem with this is that I haven't found a comfortable way to
run Vim in UTF-8 mode and the rest of my stuff in 8-bit mode.

Over the week-end I found that I can run Vim in a separate unicode
xterm but that's not what I want because I lose screen's copy/paste and
more importantly it destroys my attempt at running a fully integrated
desktop.

Other problems that I have run into is that text files created when in
UTF-8 mode are a mess when browsed in latin1/9 mode.  I also have
problems when I print unicode files.. I once created a nice table
with those box-drawing characters that were available in UTF-8 mode and
it was really nice on-screen.. but when I tried to print it, all I got
was rows and columns of questiion marks.

So I switched back to latin1 pending better internationalization support
in some applications (slrn, ELinks.. mutt should workd but it's tricky)
and maybe more importantly until I acquire a better understanding of
running a unicode locale in X/linux and the implications thereof..

  :if tenc== | let tenc = enc | endif | set enc=utf-8 :new
 
 then i (set Insert mode) and ^Vu0153 (where ^V is Ctrl-V, unless you
 use Ctrl-V to paste, in which case it is Ctrl-Q).
 
 If you see anything other than the oe digraph, then your 'guifont' is
 plain wrong. See http://vim.sourceforge.net/tips/tip.php?tip_id=632
 about how to choose a better one.
 
Well.. actually.. I ran some tests in latin-9 earlier.. trying to figure
out this o dans l'e business.. that was on a linux console..  and
that's where I realized that I was still running a unicode font.. both
on the linux console and in 'X'.. :-) .. It seems I never switched back
after my brief incursion into unicode territory..  and since I haven't
had any problems displaying and printing text since I switched back.. I
would say that the font is ok..  And that UTF-8 stuff is indeed
backward-compatible?

The font is called terminus and I like it a lot because it looks like
a fixed-width version of MS's Verdana, which is my favorite screen font.

see http://.geocities.com/cga/wee.png for an excellent
screenshot.

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-24 Thread A.J.Mechelynck

cga2000 wrote:

On Mon, Jul 24, 2006 at 08:29:10PM EDT, A.J.Mechelynck wrote:

cga2000 wrote:

On Mon, Jul 24, 2006 at 05:59:42PM EDT, Christian Ebert wrote:

* A.J.Mechelynck on Saturday, July 22, 2006 at 22:40:45 +0200:
The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
neither in capitals (as for titles or if the word oeuf [egg] is the 
first of a sentence), nor in lowercase. You need UTF-8 for it,

No. Just latin9 or ISO8859-15 (Look at the header of this mail).

Mon coeur.

This is on a Mac with a German keyboard, but using actually an
American keyboard layout. I enter the oe with Alt-q (the Alt
key on Mac keyboard corresponds to the Modifier key on other
keyboards I believe).
Could this be Mac-specific? 


I switched to encoding=latin9.

When I do a Ctrl-K o e and a Ctrl-K O E this is what I get:

½ ¼ 


confirmed by the :dig command.

I looked carefully at the output of :dig and I couldn't see our elusive
e dans l'o either.

So I switched to the French ISO-08859-15, then the US version of
latin9.. still can't find that o dans l'e.

Strange thing is that the font I use on terminals does have these two
characters (upper/lower case E dans l'O..) in the exact same spot Vim
displays the above fractions.. 

Try the following (in gvim):


.. with all the goings-on in this thread I never had a chance to
mention the fact that I do not use gvim. I try to do everything in a
terminal (under gnu/screen) because text-mode apps were designed for
the keyboard so they work a lot better than gui's for those of us who
prefer not to use mice.


Gvim can use keyboard commands just like console Vim, or mice-addicted 
people can use that too. It has a lot more different coulours (typically 
16 million rather than 16) and it can change fonts on-the-fly (change 
the font from Courier to Lucida to whatever, only through Vim keyboard 
commands). It can do real boldface and italics, as well as straight or 
curly underlining. And it can use Unicode: see further down.





 :echo has(multi_byte)

the answer should be 1. If it is zero, your version of gvim cannot
handle UTF-8.


Works fine if I switch my locale to UTF-8.  Vim automatically figures
what I want and :dig displays the o dans l'e (both the lower and upper
case versions) among a gazillon other digraphs. Then I can use the
ususal Ctrl-K oe .. save the file.. pass this on to LaTeX and provided I
have the correct LaTeX statements to activate UTF-8 (that's what took
forever to figure out the other day..) I get my coeurs, voeux and
boeufs rendered correctly in xdvi/gv .. *and* the the ensuing
printout looks great too.

The problem with this is that I haven't found a comfortable way to
run Vim in UTF-8 mode and the rest of my stuff in 8-bit mode.


Well, in an xterm (or konsole, or Windows Dos Box), console Vim is 
dependent on xhatever charset the console is using. If you xterm (or 
whatever) is in Latin1, you cannot use French oe anymore than you can 
use Cyrillic or Greek. Gvim, on the other hand, can display anything for 
which you have a glyph in a font.




Over the week-end I found that I can run Vim in a separate unicode
xterm but that's not what I want because I lose screen's copy/paste and
more importantly it destroys my attempt at running a fully integrated
desktop.

Other problems that I have run into is that text files created when in
UTF-8 mode are a mess when browsed in latin1/9 mode.  I also have
problems when I print unicode files.. I once created a nice table
with those box-drawing characters that were available in UTF-8 mode and
it was really nice on-screen.. but when I tried to print it, all I got
was rows and columns of questiion marks.

So I switched back to latin1 pending better internationalization support
in some applications (slrn, ELinks.. mutt should workd but it's tricky)
and maybe more importantly until I acquire a better understanding of
running a unicode locale in X/linux and the implications thereof..


 :if tenc== | let tenc = enc | endif | set enc=utf-8 :new

then i (set Insert mode) and ^Vu0153 (where ^V is Ctrl-V, unless you
use Ctrl-V to paste, in which case it is Ctrl-Q).

If you see anything other than the oe digraph, then your 'guifont' is
plain wrong. See http://vim.sourceforge.net/tips/tip.php?tip_id=632
about how to choose a better one.


Well.. actually.. I ran some tests in latin-9 earlier.. trying to figure
out this o dans l'e business.. that was on a linux console..  and
that's where I realized that I was still running a unicode font.. both
on the linux console and in 'X'.. :-) .. It seems I never switched back
after my brief incursion into unicode territory..  and since I haven't
had any problems displaying and printing text since I switched back.. I
would say that the font is ok..  And that UTF-8 stuff is indeed
backward-compatible?

The font is called terminus and I like it a lot because it looks like
a fixed-width version of MS's Verdana, which is my favorite screen font.

see 

Re: Other European languages on a US keyboard

2006-07-23 Thread cga2000
On Sat, Jul 22, 2006 at 04:40:45PM EDT, A.J.Mechelynck wrote:
 cga2000 wrote:
 On Fri, Jul 21, 2006 at 03:19:25PM EDT, A.J.Mechelynck wrote:
 [...]
 Only minor glitch seems to be that text doesn't wrap when in INSERT
 (lang) mode.. haven't figured out why yet.. so I just escape out of
 insert mode and do a gqip once in a while.  Could be unrelated
 though..
 
 Check your options:
 
for auto-insert of linebreaks
   :verbose set textmode? wrapmargin?
   :verbose set formatoptions? formatexpr?
 
about display of long lines
   :verbose set wrap? linebreak? breakat?
 
:verbose set textmode? wrapmargin?

textmode
  wrapmargin=0
  
:verbose set formatoptions? formatexpr?

  formatoptions=tcql
Last set from /usr/share/vim/vim63/ftplugin/mail.vim
E518: Unknown option: formatexpr?

:verbose set wrap? linebreak? breakat?

  wrap
nolinebreak
  breakat= [EMAIL PROTECTED];:,./?

Since this non-wrapping behavior only occurs after I issue the
:setlocal keymap=accents command, I ran these commands both before
and after but the output was strictly identical.

As to to he E518: error message, I run Vim 6.3 - that's the version
available on debian stable, so I assume that this option was
implemented in later versions.

But I am writing this message in Vim and I set the keymap to accents
and right now everything is wrapping correctly.  Don't know if this
makes sense but I would appear that the problem only occurs when working
on a .tex file?  So maybe this is a bug/feature that's related to syntax
checking or highlighting?  I need to run another test and try to figure
out what I was doing.  Issue the same above commands while editing latex
stuff and see if it makes a difference.  I will update the thread if I
find something interesting.

  
 Or, if none of the distributed keymaps is exactly what you want, you can 
 write your own. It isn't hard. See :help :loadkeymap for the theory, 
 and look at the contents of Bram's $VIMRUNTIME/keymap/accents.vim and my 
 $VIMRUNTIME/keymap/esperanto_utf8.vim for a couple of simple examples. 

 You might want to write something more extensive but this will show you 
 how to do it.

Already started on this:  copied accents.vim to ~/.vim/keymap/ ..
renamed it to foreign.vim and added the Spanish inverted question /
exclamation marks - an for now I have mapped to !! and ??. 

 
 Doesn't look like much is missing.. Maybe the French o+e .. but then my
 screen font doesn't have it either.. 
 
 The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
 neither in capitals (as for titles or if the word oeuf [egg] is the 
 first of a sentence), nor in lowercase. You need UTF-8 for it, and for 
 this relatively rare character you can still use Ctrl-K o e ; or else 

Rare enough .. but besides oeuf is also occurs in such very common
words as voeu [wish] and coeur [heart] and it really bothers me when
I see them incorrectly spelled in web pages for instance.  I spot it and
after that I tend to lose focus and not be able to take in what I'm
reading for a short while. 

I'm pretty sure there is also a similar a+e / A+E in French as well,
though I could not think of one common word that has this.  I'm pretty
sure that it's the correct spelling for names such as Aegisthe or
Laetitia but I don't have any printed material where I could check
that for sure.  Only words I can think of right now are caecum and
caetera (as in  et caetera) but there's bound to be others.

 you can add the following to your vimrc (after setting 'encoding' to
 UTF-8):
 
   lmap OE Char-338 lmap oe Char-339
 
The problem with switching to UTF8 is that practically all the other
applications that I use on a daily basis do not support it correctly -
mutt, slrn, ELinks.. at least not the versions that I am running. I had
a go at it a couple of months back but I ran into so many problems that
I had to switch back to latin1.  Naturally, I could try to run Vim in a
UTF8-capable terminal like uxterm while staying with a latin1 locale - I
understand that this might work - but now that I have gotten used to the
convenience of running all these apps under gnu/screen under a single
xterm I don't think I would want to fire up Vim in a separate xterm on a
regular basis. 

Printing was another area where I ran into problems.

So, due to my ignorance of such matters and lack of time I'll have to
keep this on the back burner until either the apps are more mature and
do unicode out-of-the-box..  or until I have the time to look into
this and acquire a better understanding of the issues.

 I'm using language mappings here so they will be turned on and off 
 together with the keymap.
 

Come to think of it, French would appear to have the most annoying
spelling system of the West European languages that I have some degree
of familiarity with.  Spanish, Italian, and German seem to use fewer
non-ASCII characters.  

In order to set up my foreign language keymap correctly I would really
need 

Re: Other European languages on a US keyboard

2006-07-23 Thread A.J.Mechelynck

cga2000 wrote:

On Sat, Jul 22, 2006 at 04:40:45PM EDT, A.J.Mechelynck wrote:

cga2000 wrote:

On Fri, Jul 21, 2006 at 03:19:25PM EDT, A.J.Mechelynck wrote:

[...]

Only minor glitch seems to be that text doesn't wrap when in INSERT
(lang) mode.. haven't figured out why yet.. so I just escape out of
insert mode and do a gqip once in a while.  Could be unrelated
though..

Check your options:

 for auto-insert of linebreaks
:verbose set textmode? wrapmargin?
:verbose set formatoptions? formatexpr?

 about display of long lines
:verbose set wrap? linebreak? breakat?


:verbose set textmode? wrapmargin?

textmode
  wrapmargin=0


...oops... verbose set textwidth? wrapmargin?

and (seeing what you write below) do it with with the active cursor in a 
Latex file. Maybe the Latex ftplugin changes textwidth? (A value of zero 
means no auto-reformatting.)


  
:verbose set formatoptions? formatexpr?


  formatoptions=tcql
Last set from /usr/share/vim/vim63/ftplugin/mail.vim


:help fo-table will tell you what these flags mean.


E518: Unknown option: formatexpr?

:verbose set wrap? linebreak? breakat?

  wrap
nolinebreak
  breakat= [EMAIL PROTECTED];:,./?

Since this non-wrapping behavior only occurs after I issue the
:setlocal keymap=accents command, I ran these commands both before
and after but the output was strictly identical.

As to to he E518: error message, I run Vim 6.3 - that's the version
available on debian stable, so I assume that this option was
implemented in later versions.


'formatexpr' is new in version 7. BTW, you can get both 6.4 and 7.0 from 
the Vim FTP site. The current version is 7.0.039. Compiling Vim is not 
really hard -- see my 
http://users.skynet.be/antoine.mechelynck/vim/compunix.htm for details.




But I am writing this message in Vim and I set the keymap to accents
and right now everything is wrapping correctly.  Don't know if this
makes sense but I would appear that the problem only occurs when working
on a .tex file?  So maybe this is a bug/feature that's related to syntax
checking or highlighting?  I need to run another test and try to figure
out what I was doing.  Issue the same above commands while editing latex
stuff and see if it makes a difference.  I will update the thread if I
find something interesting.



Or, if none of the distributed keymaps is exactly what you want, you can 
write your own. It isn't hard. See :help :loadkeymap for the theory, 
and look at the contents of Bram's $VIMRUNTIME/keymap/accents.vim and my 
$VIMRUNTIME/keymap/esperanto_utf8.vim for a couple of simple examples. 


You might want to write something more extensive but this will show you 
how to do it.


Already started on this:  copied accents.vim to ~/.vim/keymap/ ..
renamed it to foreign.vim and added the Spanish inverted question /
exclamation marks - an for now I have mapped to !! and ??. 


Doesn't look like much is missing.. Maybe the French o+e .. but then my
screen font doesn't have it either.. 
The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
neither in capitals (as for titles or if the word oeuf [egg] is the 
first of a sentence), nor in lowercase. You need UTF-8 for it, and for 
this relatively rare character you can still use Ctrl-K o e ; or else 


Rare enough .. but besides oeuf is also occurs in such very common
words as voeu [wish] and coeur [heart] and it really bothers me when
I see them incorrectly spelled in web pages for instance.  I spot it and
after that I tend to lose focus and not be able to take in what I'm
reading for a short while. 


I'm pretty sure there is also a similar a+e / A+E in French as well,
though I could not think of one common word that has this.  I'm pretty
sure that it's the correct spelling for names such as Aegisthe or
Laetitia but I don't have any printed material where I could check
that for sure.  Only words I can think of right now are caecum and
caetera (as in  et caetera) but there's bound to be others.


you can add the following to your vimrc (after setting 'encoding' to
UTF-8):

lmap OE Char-338 lmap oe Char-339


The problem with switching to UTF8 is that practically all the other
applications that I use on a daily basis do not support it correctly -
mutt, slrn, ELinks.. at least not the versions that I am running. I had
a go at it a couple of months back but I ran into so many problems that
I had to switch back to latin1.  Naturally, I could try to run Vim in a
UTF8-capable terminal like uxterm while staying with a latin1 locale - I
understand that this might work - but now that I have gotten used to the
convenience of running all these apps under gnu/screen under a single
xterm I don't think I would want to fire up Vim in a separate xterm on a
regular basis. 


If your other applications don't support UTF-8 you will have to do 
without the OE and oe digraphs.




Printing was another area where I ran into problems.

So, due to my ignorance of 

Re: Other European languages on a US keyboard

2006-07-22 Thread cga2000
On Fri, Jul 21, 2006 at 03:19:25PM EDT, A.J.Mechelynck wrote:
 cga2000 wrote:
 I sometimes need to write text in other languages such as French,
 Spanish and occasionally German or Italian.
 
 I would like to do this in Vim.
 
 Unfortunately I only have a US keyboard.
 
[..]
 I.
 
 Since you've already used digraphs, and they're too cumbersome for you, 
 you could try a keymap.
 
 There are some keymaps in $VIMRUNTIME/keymap which you can apply by just 
 doing :setlocal keymap=keymapname (where keymapname is the 
 filename without the encoding and .vim endings) or by using the Edit - 
 Keymap menu. Then you can toggle between US-QWERTY mode and keymap 
 mode by hitting Ctrl-^ in Insert mode, or by toggling 'iminsert' between 
 zero and 1 in any mode. Basically, what a keymap does is establish a set 
 of language-mappings, i.e., insert-mode mappings which can be turned on 
 an off. Try the accents keymap, it might be just what you want.
 
This works great..! I tried that on a short text in French and within a
couple of minutes I was almost as comfortable as I am when typing in
English.  The accents are all perfectly intuitive - ` ' ^ etc. so I
didn't even need to look at the keymap.  Just need to be a little
patient when entering an apostrophe.

Only minor glitch seems to be that text doesn't wrap when in INSERT
(lang) mode.. haven't figured out why yet.. so I just escape out of
insert mode and do a gqip once in a while.  Could be unrelated
though..

 Or, if none of the distributed keymaps is exactly what you want, you can 
 write your own. It isn't hard. See :help :loadkeymap for the theory, 
 and look at the contents of Bram's $VIMRUNTIME/keymap/accents.vim and my 
 $VIMRUNTIME/keymap/esperanto_utf8.vim for a couple of simple examples. 
 You might want to write something more extensive but this will show you 
 how to do it.
 
Doesn't look like much is missing.. Maybe the French o+e .. but then my
screen font doesn't have it either.. 

 If and when you write your own keymap, place it in the keymap/
 subdirectory of a directory listed early in 'runtimepath' but not in
 $VIMRUNTIME/keymap itself because any upgrade can silently change
 anything there.
 
 
 
 II. What you are suggesting looks like setting 'spelllang' (with three
 ells) to whatever means French and then spellchecking your
 US-ASCII-only text. But beware: the Vim spellchecker (which I don't
 use because of my good innate spelling) might not be clever enough to
 mark words which have accented homographs, such as a (has) vs. à
 (at), de (of) vs. dé (thimble), du (of the) vs. dû
 (owed), cru (believed or raw) vs. crû (grown) etc.: so the
 cure might be worse than the ill, owing to the necessity of looking
 for unmarked spelling mistakes even after running the spell checker.
 
I think you're right.  Considering how effective the keymap solution is
there's just no point.  Anyway, I don't even have an active spellchecker
at this point. aspell segfaults for some reason and I haven't had time
to research that yet.

Just need to figure out how I can get latex to handle these non-ASCII
characters. They disappear from the .dvi file.

In case I can't figure it out, there's probably a latex user list
somewhere.

Great Tip..!

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-22 Thread cga2000
On Fri, Jul 21, 2006 at 05:19:33PM EDT, Yakov Lerner wrote:
 On 7/21/06, cga2000 [EMAIL PROTECTED] wrote:
[..]
 
 I think the easiest apporach is to craete mappings.
 You could use ctrl-(a-z), ctrl-shift(a-z), ctrl-alt-(a-z),
 then f1-f12 + ctrl/alt/del combinations.
 
Thanks, but not for me.. I touch type and as a result Tony's solution
is much preferable than key combos.

 Linux has a keyboard mode where ' is used to
 modify letters to create diactitics. This works for
 all X applications, not just vim.

I have used the Compose key in the past - usually mapped to menu.. I
think I remapped it to the Windows key on this laptop at one point but
I wasn't aware of this X keyboard mode.

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-22 Thread cga2000
On Fri, Jul 21, 2006 at 02:50:46PM EDT, Gene Kwiecinski wrote:
 Unfortunately I am only able to type the US keyboard, so remapping
 the keyboard might be a better solution than entering digraphs in the
 long run but will not be painless..  And since I do not do this on a
 regular basis, I am unsure whether it's really worth going to all the
 trouble.
 
 Would it be impractical to map, eg, ^e to whatever the code is for
 'ê', ie, use prefix notation of [^'`~,], etc., as a prefix for
 [aeioucnAEIOUCN] as needed?  Wouldn't be *all* those combinations,
 but, eg, would only need ,C for Ccedil;, ~N for Ntilde; (and
 their lowercase counterparts, natch), but the rest would just be
 whatever accented chars you normally use, for grave, acute,
 circumflex, etc.
 
Unless I'm mistaken you have actually reinvented the loadkeymap solution
discussed by Tony..?  

Don't take me wrong.. I am *not* being ironical..  

Quite the contrary.

 I'm not sure how a non-US keyboard does such things, so I can't
 suggest a more transparent way of doing it.
 
 One other possibility would be the way my phone does multiple chars
 per key, eg, you'd hit '1' to get the generic '.', then '*' would
 cycle through different punctuation, and so on, 'til it'd get back to
 '.' again.  Maybe hitting alt-A would get you an 'a' and put you into
 a loop, then multiple hits of an F-key would cycle through the 3-4
 other chars and then back.  Any other key would escape the loop.
 Arrange them in the order you expect their occurrence, most
 commonly-used ones first.
 
That, I had thought of but I discarded it as impractical.  That's also
the method that's commonly used to input languages that have too many
characters to fit on a reasonably-sized keyboard.. Chinese, eg.

Entering the accent, tilde, cedilla.. etc. followed by the letter is a
lot more in keeping with the wiring of my cortex..

 Eg, if you arrange them in the order acute/grave/circumflex/ring,
 simply hitting M-a would get you aacute;.  Hit F2, and it gets you
 agrave;.  Hit F2 again, circumflex.  Again, ring.  Again, acute.
 Lather, rinse, repeat.
 
 *Implementing* this would for now be beyond my ken, or my barbie, but
 I'm sure someone might have some ideas how to best do it.  No?

Thanks

cga


Re: Other European languages on a US keyboard

2006-07-22 Thread A.J.Mechelynck

cga2000 wrote:

On Fri, Jul 21, 2006 at 03:19:25PM EDT, A.J.Mechelynck wrote:

[...]

Only minor glitch seems to be that text doesn't wrap when in INSERT
(lang) mode.. haven't figured out why yet.. so I just escape out of
insert mode and do a gqip once in a while.  Could be unrelated
though..


Check your options:

 for auto-insert of linebreaks
:verbose set textmode? wrapmargin?
:verbose set formatoptions? formatexpr?

 about display of long lines
:verbose set wrap? linebreak? breakat?



Or, if none of the distributed keymaps is exactly what you want, you can 
write your own. It isn't hard. See :help :loadkeymap for the theory, 
and look at the contents of Bram's $VIMRUNTIME/keymap/accents.vim and my 
$VIMRUNTIME/keymap/esperanto_utf8.vim for a couple of simple examples. 
You might want to write something more extensive but this will show you 
how to do it.



Doesn't look like much is missing.. Maybe the French o+e .. but then my
screen font doesn't have it either.. 


The French oe (o, e-dans-l'o) is not defined in the Latin1 encoding, 
neither in capitals (as for titles or if the word oeuf [egg] is the 
first of a sentence), nor in lowercase. You need UTF-8 for it, and for 
this relatively rare character you can still use Ctrl-K o e ; or else 
you can add the following to your vimrc (after setting 'encoding' to UTF-8):


lmap OE Char-338
lmap oe Char-339

I'm using language mappings here so they will be turned on and off 
together with the keymap.





If and when you write your own keymap, place it in the keymap/
subdirectory of a directory listed early in 'runtimepath' but not in
$VIMRUNTIME/keymap itself because any upgrade can silently change
anything there.



II. What you are suggesting looks like setting 'spelllang' (with three
ells) to whatever means French and then spellchecking your
US-ASCII-only text. But beware: the Vim spellchecker (which I don't
use because of my good innate spelling) might not be clever enough to
mark words which have accented homographs, such as a (has) vs. à
(at), de (of) vs. dé (thimble), du (of the) vs. dû
(owed), cru (believed or raw) vs. crû (grown) etc.: so the
cure might be worse than the ill, owing to the necessity of looking
for unmarked spelling mistakes even after running the spell checker.


I think you're right.  Considering how effective the keymap solution is
there's just no point.  Anyway, I don't even have an active spellchecker
at this point. aspell segfaults for some reason and I haven't had time
to research that yet.


Vim 7 has a built-in spell checker. I don't use it, but I know it is 
there. I guess :helpgrep spell might tell you about it if you're 
interested.




Just need to figure out how I can get latex to handle these non-ASCII
characters. They disappear from the .dvi file.

In case I can't figure it out, there's probably a latex user list
somewhere.

Great Tip..!

Thanks

cga




My pleasure.

Best regards,
Tony.


Other European languages on a US keyboard

2006-07-21 Thread cga2000
I sometimes need to write text in other languages such as French,
Spanish and occasionally German or Italian.

I would like to do this in Vim.

Unfortunately I only have a US keyboard.

Using Ctrl-K to enter the various digraphs becomes somewhat cumbersome
for anything larger than a short paragraph.

Unfortunately I am only able to type the US keyboard, so remapping the
keyboard might be a better solution than entering digraphs in the long
run but will not be painless..  And since I do not do this on a regular
basis, I am unsure whether it's really worth going to all the trouble.

I was thinking of writing the text without accents, tildes, cedillas,
etc. using the letters on the US keyboard and then feeding the result to
some advanced functionality of a spellchecker that would replace all the
words that can only be spelled one way by their correctly spelled
version - say French 'épeler (to spell) can only be spelled this way..
there is  no 'epeler' or 'èpeler' or 'êpeler'.

On the other hand, for those words where the accent differs depending
on the semantics such as French 'a' vs. 'à', the script/plugin would
leave them untouched and  - ideally - highlight them, thus leaving me
with only a handful of manual corrections.

Is there anything in Vim that does something like this?

Or is there any other 'smart' way to achieve something like the above?

Thanks

cga


RE: Other European languages on a US keyboard

2006-07-21 Thread Gene Kwiecinski
Unfortunately I am only able to type the US keyboard, so remapping the
keyboard might be a better solution than entering digraphs in the long
run but will not be painless..  And since I do not do this on a regular
basis, I am unsure whether it's really worth going to all the trouble.

Would it be impractical to map, eg, ^e to whatever the code is for 'ê', ie, 
use prefix notation of [^'`~,], etc., as a prefix for [aeioucnAEIOUCN] as 
needed?
Wouldn't be *all* those combinations, but, eg, would only need ,C for 
Ccedil;, ~N for Ntilde; (and their lowercase counterparts, natch), but 
the rest would just be whatever accented chars you normally use, for grave, 
acute, circumflex, etc.

I'm not sure how a non-US keyboard does such things, so I can't suggest a more 
transparent way of doing it.

One other possibility would be the way my phone does multiple chars per key, 
eg, you'd hit '1' to get the generic '.', then '*' would cycle through 
different punctuation, and so on, 'til it'd get back to '.' again.  Maybe 
hitting alt-A would get you an 'a' and put you into a loop, then multiple hits 
of an F-key would cycle through the 3-4 other chars and then back.  Any other 
key would escape the loop.  Arrange them in the order you expect their 
occurrence, most commonly-used ones first.

Eg, if you arrange them in the order acute/grave/circumflex/ring, simply 
hitting M-a would get you aacute;.  Hit F2, and it gets you agrave;.  Hit 
F2 again, circumflex.  Again, ring.  Again, acute.  Lather, rinse, repeat.

*Implementing* this would for now be beyond my ken, or my barbie, but I'm sure 
someone might have some ideas how to best do it.  No?


Re: Other European languages on a US keyboard

2006-07-21 Thread A.J.Mechelynck

cga2000 wrote:

I sometimes need to write text in other languages such as French,
Spanish and occasionally German or Italian.

I would like to do this in Vim.

Unfortunately I only have a US keyboard.

Using Ctrl-K to enter the various digraphs becomes somewhat cumbersome
for anything larger than a short paragraph.

Unfortunately I am only able to type the US keyboard, so remapping the
keyboard might be a better solution than entering digraphs in the long
run but will not be painless..  And since I do not do this on a regular
basis, I am unsure whether it's really worth going to all the trouble.

I was thinking of writing the text without accents, tildes, cedillas,
etc. using the letters on the US keyboard and then feeding the result to
some advanced functionality of a spellchecker that would replace all the
words that can only be spelled one way by their correctly spelled
version - say French 'épeler (to spell) can only be spelled this way..
there is  no 'epeler' or 'èpeler' or 'êpeler'.

On the other hand, for those words where the accent differs depending
on the semantics such as French 'a' vs. 'à', the script/plugin would
leave them untouched and  - ideally - highlight them, thus leaving me
with only a handful of manual corrections.

Is there anything in Vim that does something like this?

Or is there any other 'smart' way to achieve something like the above?

Thanks

cga




I.

Since you've already used digraphs, and they're too cumbersome for you, 
you could try a keymap.


There are some keymaps in $VIMRUNTIME/keymap which you can apply by just 
doing :setlocal keymap=keymapname (where keymapname is the 
filename without the encoding and .vim endings) or by using the Edit - 
Keymap menu. Then you can toggle between US-QWERTY mode and keymap 
mode by hitting Ctrl-^ in Insert mode, or by toggling 'iminsert' between 
zero and 1 in any mode. Basically, what a keymap does is establish a set 
of language-mappings, i.e., insert-mode mappings which can be turned on 
an off. Try the accents keymap, it might be just what you want.


Or, if none of the distributed keymaps is exactly what you want, you can 
write your own. It isn't hard. See :help :loadkeymap for the theory, 
and look at the contents of Bram's $VIMRUNTIME/keymap/accents.vim and my 
$VIMRUNTIME/keymap/esperanto_utf8.vim for a couple of simple examples. 
You might want to write something more extensive but this will show you 
how to do it.


If and when you write your own keymap, place it in the keymap/ 
subdirectory of a directory listed early in 'runtimepath' but not in 
$VIMRUNTIME/keymap itself because any upgrade can silently change 
anything there.




II. What you are suggesting looks like setting 'spelllang' (with three 
ells) to whatever means French and then spellchecking your 
US-ASCII-only text. But beware: the Vim spellchecker (which I don't use 
because of my good innate spelling) might not be clever enough to mark 
words which have accented homographs, such as a (has) vs. à 
(at), de (of) vs. dé (thimble), du (of the) vs. dû 
(owed), cru (believed or raw) vs. crû (grown) etc.: so the 
cure might be worse than the ill, owing to the necessity of looking for 
unmarked spelling mistakes even after running the spell checker.



Best regards,
Tony.


Re: Other European languages on a US keyboard

2006-07-21 Thread Yakov Lerner

On 7/21/06, cga2000 [EMAIL PROTECTED] wrote:

I sometimes need to write text in other languages such as French,
Spanish and occasionally German or Italian.

I would like to do this in Vim.

Unfortunately I only have a US keyboard.

Using Ctrl-K to enter the various digraphs becomes somewhat cumbersome
for anything larger than a short paragraph.

Unfortunately I am only able to type the US keyboard, so remapping the
keyboard might be a better solution than entering digraphs in the long
run but will not be painless..  And since I do not do this on a regular
basis, I am unsure whether it's really worth going to all the trouble.

I was thinking of writing the text without accents, tildes, cedillas,
etc. using the letters on the US keyboard and then feeding the result to
some advanced functionality of a spellchecker that would replace all the
words that can only be spelled one way by their correctly spelled
version - say French 'épeler (to spell) can only be spelled this way..
there is  no 'epeler' or 'èpeler' or 'êpeler'.

On the other hand, for those words where the accent differs depending
on the semantics such as French 'a' vs. 'à', the script/plugin would
leave them untouched and  - ideally - highlight them, thus leaving me
with only a handful of manual corrections.

Is there anything in Vim that does something like this?

Or is there any other 'smart' way to achieve something like the above?


I think the easiest apporach is to craete mappings.
You could use ctrl-(a-z), ctrl-shift(a-z), ctrl-alt-(a-z),
then f1-f12 + ctrl/alt/del combinations.

Linux has a keyboard mode where ' is used to
modify letters to create diactitics. This works for
all X applications, not just vim.

Yakov