In our previous episode, Michael Schnell said: > > found by a dumb byte/char scan; only few encodings have to be > > recognized and handled, based on the char size: MBCS (UTF-8...), > > WideChars (UTF-16/UCS2) and UTF-32. > > > In fact I suppose that for UTF-8 ("pure UTF-8" without surrogates) pos() > works for all strings and an UTF-8 "character" is a string.
Either you have UTF-8 with surrogates, or you have ASCII (since UTF-8 without surrogates means that only char 0..127 are valid, which is ASCII) _______________________________________________ fpc-devel maillist - fpc-devel@lists.freepascal.org http://lists.freepascal.org/mailman/listinfo/fpc-devel