Hi Fabrizo,

I think you're in the right place to talk about that.

I haven't been able to reproduce your error.
I added a test:

XMLParserTest>>testNonUTF8Characters

        self shouldnt: [XMLDOMParser parseDocumentFrom:
'<foo>Bean BLABLABLA Eidgenössisches Institut für BLABLALBLA</foo>' readStream] raise: Error.

It goes green in my image. Do you have a different way to get the readStream from the String?

Cheers,
Alexandre

On 26 Mar 2010, at 12:14, Fabrizio Perin wrote:

Hi,
I was parsing an XML File with the last version of XML Parser (XML- Parser-JAAyer.68) and i get an error related to a not UTF-8 character that the parser found into the document. The XML document contains some german character:

<![CDATA[SES: Bean BLABLABLA Eidgenössisches Institut für BLABLALBLA]]>

Actually i'm not sure if the error is which is in the UTF8TextConverter or something is wrong in the invokation from the parser. Anyway i parse several time the same document with older versions of the XML-Parser (XML-Parser-JAAyer.57) and it always works well. I'm not sure if the mailing list of Pharo is the right place to report this problem in the case i'm i'm sorry.

Here the trace from the log:

Error: Invalid utf8 input detected
26 March 2010 4:14:07 pm

VM: Mac OS - intel - 1062 - Squeak3.8.1 of '28 Aug 2006' [latest update: #6747] Squeak VM 4.2.2b1
Image: Pharo-1.0-10515-rc3 [Latest update: #10515]

SecurityManager state:
Restricted: false
FileAccess: true
SocketAccess: true
Working Dir /Users/fabrizioperin/development/Pharo/WORKINGONNOW/ MooseJEE_64
Trusted Dir /foobar/tooBar/forSqueak/bogus
Untrusted Dir /Users/fabrizioperin/Library/Preferences/Squeak/ Internet/My Squeak

UTF8TextConverter(Object)>>error:
        Receiver: an UTF8TextConverter
        Arguments and temporary variables:
                aString:        'Invalid utf8 input detected'
        Receiver's instance variables:
an UTF8TextConverter

UTF8TextConverter>>errorMalformedInput
        Receiver: an UTF8TextConverter
        Arguments and temporary variables:

        Receiver's instance variables:
an UTF8TextConverter

UTF8TextConverter>>nextFromStream:
        Receiver: an UTF8TextConverter
        Arguments and temporary variables:
aStream: MultiByteFileStream: '/Users/fabrizioperin/development/ Pharo/WORKINGON...etc...
                character1:     $¶
                value1:         182
                character2:     $s
                value2:         115
                unicode:        nil
                character3:     $s
                value3:         115
                character4:     nil
                value4:         nil
        Receiver's instance variables:
an UTF8TextConverter

MultiByteFileStream>>next
Receiver: MultiByteFileStream: '/Users/fabrizioperin/development/ Pharo/WORKINGONNOW/MooseJEE_64/src/...etc...
        Arguments and temporary variables:
                char:   nil
                secondChar:     nil
                state:  nil
        Receiver's instance variables:


XMLStreamReader>>basicNext
        Receiver: a XMLStreamReader
        Arguments and temporary variables:
                nextChar:       nil
        Receiver's instance variables:
stream: MultiByteFileStream: '/Users/fabrizioperin/development/ Pharo/WORKINGONN...etc...
                nestedStreams:  nil
                peekChar:       nil
buffer: a WriteStream 'SES: Bean zum Einlesen und updaten der Stako relevanten ...etc...

XMLStreamReader>>next
        Receiver: a XMLStreamReader
        Arguments and temporary variables:
                nextChar:       nil
        Receiver's instance variables:
stream: MultiByteFileStream: '/Users/fabrizioperin/development/ Pharo/WORKINGONN...etc...
                nestedStreams:  nil
                peekChar:       nil
buffer: a WriteStream 'SES: Bean zum Einlesen und updaten der Stako relevanten ...etc...

XMLStreamReader>>upToAll:
        Receiver: a XMLStreamReader
        Arguments and temporary variables:
                aDelimitingString:      ']]>'
        Receiver's instance variables:
stream: MultiByteFileStream: '/Users/fabrizioperin/development/ Pharo/WORKINGONN...etc...
                nestedStreams:  nil
                peekChar:       nil
buffer: a WriteStream 'SES: Bean zum Einlesen und updaten der Stako relevanten ...etc...

SAXDriver(XMLTokenizer)>>nextCDataContent
        Receiver: a SAXDriver
        Arguments and temporary variables:
                cdata:  nil
        Receiver's instance variables:
                streamReader:   a XMLStreamReader
                streamWriter:   a XMLStreamWriter
                entities:       nil
                externalEntities:       nil
                parameterEntities:      nil
                isValidating:   false
                parsingMarkup:  false
                saxHandler:     an OPOpaxHandler
                openTags:       <ejb-jar>, <enterprise-beans>, <session>, 
<description>
                nestedScopes:   nil
                useNamespaces:  false
                validateAttributes:     nil
                languageEnvironment:    nil

SAXDriver(XMLTokenizer)>>nextCDataOrConditional
        Receiver: a SAXDriver
        Arguments and temporary variables:
                nextChar:       $C
                conditionalKeyword:     nil
        Receiver's instance variables:
                streamReader:   a XMLStreamReader
                streamWriter:   a XMLStreamWriter
                entities:       nil
                externalEntities:       nil
                parameterEntities:      nil
                isValidating:   false
                parsingMarkup:  false
                saxHandler:     an OPOpaxHandler
                openTags:       <ejb-jar>, <enterprise-beans>, <session>, 
<description>
                nestedScopes:   nil
                useNamespaces:  false
                validateAttributes:     nil
                languageEnvironment:    nil

SAXDriver(XMLTokenizer)>>nextMarkupToken
        Receiver: a SAXDriver
        Arguments and temporary variables:
                nextChar:       $[
        Receiver's instance variables:
                streamReader:   a XMLStreamReader
                streamWriter:   a XMLStreamWriter
                entities:       nil
                externalEntities:       nil
                parameterEntities:      nil
                isValidating:   false
                parsingMarkup:  false
                saxHandler:     an OPOpaxHandler
                openTags:       <ejb-jar>, <enterprise-beans>, <session>, 
<description>
                nestedScopes:   nil
                useNamespaces:  false
                validateAttributes:     nil
                languageEnvironment:    nil

SAXDriver(XMLTokenizer)>>nextToken
        Receiver: a SAXDriver
        Arguments and temporary variables:
                whitespace:     ''
        Receiver's instance variables:
                streamReader:   a XMLStreamReader
                streamWriter:   a XMLStreamWriter
                entities:       nil
                externalEntities:       nil
                parameterEntities:      nil
                isValidating:   false
                parsingMarkup:  false
                saxHandler:     an OPOpaxHandler
                openTags:       <ejb-jar>, <enterprise-beans>, <session>, 
<description>
                nestedScopes:   nil
                useNamespaces:  false
                validateAttributes:     nil
                languageEnvironment:    nil

OPOpaxHandler(SAXHandler)>>parseDocument
        Receiver: an OPOpaxHandler
        Arguments and temporary variables:

        Receiver's instance variables:
                driver:         a SAXDriver
                eod:    false
                stack:  an OrderedCollection(<?xml version="1.0" 
encoding="utf-8"?>
<ejb-jar id=...etc...
_______________________________________________
Pharo-project mailing list
Pharo-project@lists.gforge.inria.fr
http://lists.gforge.inria.fr/cgi-bin/mailman/listinfo/pharo-project

--
_,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:
Alexandre Bergel  http://www.bergel.eu
^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;._,.;:~^~:;.






_______________________________________________
Pharo-project mailing list
Pharo-project@lists.gforge.inria.fr
http://lists.gforge.inria.fr/cgi-bin/mailman/listinfo/pharo-project

Reply via email to