Re: Practical problems with custom .ucm based encoding

2002-04-25 Thread Nick Ing-Simmons
Dan Kogai <[EMAIL PROTECTED]> writes: >On Wednesday, April 24, 2002, at 09:25 , Bart Schuller wrote: >> Hello, >> >> The cool Encoding support in 5.8 to be enables me to properly solve a >> very common task: making HTML entities out of utf-8 data. >> >> I generated a ucm file with entries like thi

Re: Practical problems with custom .ucm based encoding

2002-04-24 Thread Dan Kogai
On Wednesday, April 24, 2002, at 10:07 , Bart Schuller wrote: > On Wed, Apr 24, 2002 at 09:56:29PM +0900, Dan Kogai wrote: >> Shoot! I've just implemented FB_HTMLENT ! (quick, wasn't it?) Okay, >> be >> it CHARREF (or isn't there a good short abbreviation for that?). Let >> me > > CHARREF is

Re: Practical problems with custom .ucm based encoding

2002-04-24 Thread Dan Kogai
On Wednesday, April 24, 2002, at 09:43 , Bart Schuller wrote: > Character Reference is the proper term, for entities you'd need my whole > module. > Please go completely overboard and have FB_XMLCHARREF in addition to > FB_HTMLCHARREF, the difference being that the XML version would make it > € S

Re: Practical problems with custom .ucm based encoding

2002-04-24 Thread Bart Schuller
On Wed, Apr 24, 2002 at 09:32:37PM +0900, Dan Kogai wrote: > As a matter of fact, I was thinking of adding FB_HTMLENT or something > like that. It seems trivial; Unless jhi whips me for the sin of > Feeping Creaturism, I'll do so. Character Reference is the proper term, for entities you'd nee

Re: Practical problems with custom .ucm based encoding

2002-04-24 Thread Dan Kogai
On Wednesday, April 24, 2002, at 09:25 , Bart Schuller wrote: > Hello, > > The cool Encoding support in 5.8 to be enables me to properly solve a > very common task: making HTML entities out of utf-8 data. > > I generated a ucm file with entries like this: > > \x26\x6E\x62\x73\x70\x3B

Practical problems with custom .ucm based encoding

2002-04-24 Thread Bart Schuller
Hello, The cool Encoding support in 5.8 to be enables me to properly solve a very common task: making HTML entities out of utf-8 data. I generated a ucm file with entries like this: \x26\x6E\x62\x73\x70\x3B |0 # nbsp The resulting Encode::HTMLEntities encoding works perfec