Re: [patch] fix plain text output

Abdelrazak Younes Wed, 16 Aug 2006 10:19:48 -0700

Georg Baum wrote:

Am Mittwoch, 16. August 2006 18:41 schrieb Abdelrazak Younes:
Hum... I am not I follows everything but let me summarize what Iunderstand from current code. The std::vectors I am talking about are:
* vector<char>: could be replaced by std::basic_string<char>
* vector<unsigned char>: that is ucs2 right? That could be replaced bystd::basic_string<unsigned char>* vector<boost::uint32_t>: I guess that is ucs4 and that could bereplaced by std::basic_string<unsigned char>
aka lyx::docstring

So, IIUC, we could switch to basic_string for char, ucs2 and ucs4without any problem. The utf8 case is an entirely different problem.

Internally we should just use one of those three types.
IMO only the last one. ucs2 is only for talking to qt, but that can easilybe wrapped in fromqstr/toqstr, so we don't really need a ucs2 string type.


Yes.

The conversionto this complicate utf8 encoding should happen on input/output only.Handling a multi-byte encoding internally is just a recipe for a buggyfuture IMHO.
So what I do not get right here?
multibyte != variable-byte. Multibyte is not bad per se.


Yes, that's what I meant.

Both ucs2 and ucs4use a fixed number of bytes for one character (2 and 4, respectively,surprise, surprise!). The problem is a variable-byte encoding such asutf8.

Yes I understood that far, sorry for "quiproquo". IMHO, the only codethat should refer to the utf8 encoding is a code that handles writing orreading a file.


Abdel.

Re: [patch] fix plain text output

Reply via email to