It may be that the parser does not validate the characters against the
declared encoding, and that the encoding you set only determines how the
parser interprets the bytes.
Have you tried printing the text content of your document element?
If the characters do not come out as you entered them, then this may be
the case.
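
A minimal sketch of that check, assuming the JAXP parser bundled with the
JDK rather than Xerces-J 1.4.3. The class name EncodingCheck, its helper
methods and the one-byte sample document are made up for illustration. It
prints the code points the parser delivers for the byte 0xA4, then shows how
the same byte decodes under each charset. ISO-8859-1 assigns a character to
every byte value, so the decoder has nothing to reject.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.helpers.DefaultHandler;

class EncodingCheck {

    // Hypothetical input: a document declared as ISO-8859-1 whose only
    // text byte is 0xA4 (the euro sign in ISO-8859-15).
    static byte[] sampleDocument() {
        byte[] head = "<?xml version='1.0' encoding='ISO-8859-1'?><document>"
                .getBytes(StandardCharsets.US_ASCII);
        byte[] tail = "</document>".getBytes(StandardCharsets.US_ASCII);
        ByteArrayOutputStream doc = new ByteArrayOutputStream();
        doc.write(head, 0, head.length);
        doc.write(0xA4);
        doc.write(tail, 0, tail.length);
        return doc.toByteArray();
    }

    // Parses the bytes and returns the text content as "U+XXXX" code units.
    static String dumpText(byte[] xml) {
        final StringBuilder out = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                for (int i = start; i < start + length; i++) {
                    out.append(String.format("U+%04X ", (int) ch[i]));
                }
            }
        };
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new ByteArrayInputStream(xml), handler);
        } catch (Exception e) {
            throw new IllegalStateException("parse failed", e);
        }
        return out.toString().trim();
    }

    // Decodes a single byte with the named charset.
    static String decode(byte b, String charsetName) {
        return new String(new byte[] { b }, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        // What the parser saw for the byte 0xA4 under the declared encoding.
        System.out.println(dumpText(sampleDocument()));
        // The same byte under each charset: both decode without error.
        System.out.println(decode((byte) 0xA4, "ISO-8859-1"));
        System.out.println(decode((byte) 0xA4, "ISO-8859-15"));
    }
}
```

If the parser reports U+00A4 (the currency sign) where you typed a euro
sign, the bytes were simply read as ISO-8859-1, which is consistent with no
error being raised.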

Christophe Prévost <[EMAIL PROTECTED]perf.com>
21/11/2001 10:45
Please respond to xerces-j-user

To: "Xerces-J" <[EMAIL PROTECTED]>
cc:
Subject: problem with iso-8859-1 ...

Hello everybody,

I use Xerces to parse documents via SAX. I have an ErrorHandler, and my
parser is configured like this:

XMLReader parser = XMLReaderFactory.createXMLReader(
        "org.apache.xerces.parsers.SAXParser" );
parser.setErrorHandler( errHandler );
parser.setFeature( "http://xml.org/sax/features/validation", true );
parser.setFeature( "http://xml.org/sax/features/namespaces", false );

When I parse a document like this one, the Xerces ErrorHandler reports no error:
<?xml version='1.0' encoding='ISO-8859-1' standalone="yes"?>
<document>€ Š, š, Ž, ž, Œ, œ, and Ÿ.</document>
However, none of those characters belongs to the ISO-8859-1 character set;
they belong to ISO-8859-15!

Maybe I've done something wrong or forgotten something... Can you help me?
I can't see the problem.

Thanks

PS1: using Xerces-J 1.4.3
PS2: there are no entities in the DTD


