Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Nick Bastin
On Mar 8, 2004, at 3:12 PM, [EMAIL PROTECTED] wrote: Yes, I was confused by the fact you said and XML to XML tranformation worked correctly, but XML to HTML did not. Clearly, they must have beeen with different data sets, so the comparison was not relevant. Well, *we* didn't think they were differ

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread david_n_bertoni
> On Mar 8, 2004, at 2:18 PM, [EMAIL PROTECTED] wrote: > > > Yes, I was confused by the fact you said and XML to XML tranformation > > worked correctly, but XML to HTML did not. Clearly, they must have > > beeen > > with different data sets, so the comparison was not relevant. > > Well, *we* d

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Nick Bastin
On Mar 8, 2004, at 2:18 PM, [EMAIL PROTECTED] wrote: Yes, I was confused by the fact you said and XML to XML tranformation worked correctly, but XML to HTML did not. Clearly, they must have beeen with different data sets, so the comparison was not relevant. Well, *we* didn't think they were diffe

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread david_n_bertoni
> On Mar 8, 2004, at 11:39 AM, [EMAIL PROTECTED] wrote: > > >"The html output method may output a character using a character > > entity > > reference, if one is defined for it in the version of HTML that the > > output > > method is using." > > > > Many XSLT processors do this, not just Xa

XML/XSLT editors

2004-03-08 Thread Nick Bastin
We're looking at new tools for XSLT editing and debugging, and I was wondering if anybody had used ActiveState Komodo, and what they thought of it. We're also looking at XMLSpy, but it only works on Windows, so that's a downside for us (it would be best if we could use the same tool on Solaris

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Nick Bastin
On Mar 8, 2004, at 11:39 AM, [EMAIL PROTECTED] wrote: "The html output method may output a character using a character entity reference, if one is defined for it in the version of HTML that the output method is using." Many XSLT processors do this, not just Xalan-C, so I'm not sure why you th

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Keith Rogers
Actually, David, it did apply - if you think you've got UTF-8, but really don't then the output won't be what you expect.   We're just talking semantics here.  I've had a [u]string class (for about 4 years now) that encapsulates both UTF-8 and UTF-16, because it seemed like the X/X transcoders leak

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread david_n_bertoni
> > This just recently happened when I was creating a Xerces text node, > > and the DOM_String (Xerces 1.6!) was constructed with a char* that > > pointed to UTF-8, instead of a wchar_t* pointing to UTF-16.  What > > happens is that Xerces interprets char* as a *multibyte* character > > set, an

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread david_n_bertoni
Hi Nick, The use of entities is allowed by the XSLT recommendation and expected by many browers: http://www.w3.org/TR/xslt#section-HTML-Output-Method "The html output method may output a character using a character entity reference, if one is defined for it in the version of HTML that

RE: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Fish Christopher G Contr ESC/ACU OL1
last time I checked there were some differences between Microsoft Unicode and ISO Unicode.  it was some years ago, by some I mean 4, but at that point in time it was necessary to download special fonts to properly render Russian and the characters above that because there Unicode was offse

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Keith Rogers
No question - XMLSpy.  Been using it for about 3 1/2 years.  You need a Unicode font installed, of course.   Don't know if this is your problem, but what happens in Windows UTF-8, e.g., in Notepad, is that the file starts with the UTF-8 encoding bytes (actually, UTF-16 flags FFFE or FEFF encoded as

XSL- Transform

2004-03-08 Thread Pankaj Bhatia
Hi,    I am trying to generate a HTML file by applying schema transform on a schema file. My xsl works fine with .xsd when I use some XML IDE but not able to do the same prgrammetically.  I am taking help from "simple transform" sample. here is the code snippet of my project... I am using

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Nick Bastin
On Mar 7, 2004, at 7:42 PM, Keith Rogers wrote: All of our file XML is UTF-8 input, and I haven't seen any problems with direct file transforms using Xerces 1.6/Xalan 1.3 or Xerces 2.3/Xalan 1.6.  I never saw a reason for ICU, since all of our stuff is UTF-8 (or UTF-16), so don't build with it. 

Re: Avoiding the escaping UTF-8 unicode text

2004-03-08 Thread Keith Rogers
All of our file XML is UTF-8 input, and I haven't seen any problems with direct file transforms using Xerces 1.6/Xalan 1.3 or Xerces 2.3/Xalan 1.6.  I never saw a reason for ICU, since all of our stuff is UTF-8 (or UTF-16), so don't build with it.  Like I said, the only time I saw (what should have