You can try libxml2 as the parser instead.

On Fri, Jan 30, 2009 at 12:57 PM, Catalina Caloian <
catalina.calo...@quintiq.com> wrote:

> Hi
>
> By non-ASCII I meant characters with a value greater than 127. The response
> that Guththila couldn't parse completely had the French "small letter e with
> acute accent"; when retrieved from Guththila's internal buffers (and stored
> in an 'int'), it would yield a negative value.
>
> While googling Guththila, I found that it was supposed to support UTF-8 and
> UTF-16; according to that, a response containing special French characters
> should have been properly parsed.
>
> Should I conclude that in order to handle UTF-8 encoded requests and
> responses I should use a different parser?
>
> Thank you.
> Catalina Caloian
>
> >>>
> From:   Supun Kamburugamuva <supu...@gmail.com>
> To:     Apache AXIS C Developers List <axis-c-dev@ws.apache.org>
> Date:   1/30/2009 6:33 AM
> Subject:        Re: guththila_next_char clarification needed
>
> Guththila is written assuming ASCII characters, so it is not possible to get
> negative characters. If Guththila gets negative characters, it means the XML
> stream has ended or contains invalid characters.
>
> Anyway what do you mean by non-ASCII characters? Do you mean binary?
>
> Supun.
>
> On Thu, Jan 29, 2009 at 8:02 PM, Catalina Caloian <
> catalina.calo...@quintiq.com> wrote:
>
> > Hi
> >
> > In the 'guththila_next_char' method, whenever a character is being
> > retrieved from Guththila's internal char buffers, a check is being made to
> > see if it has a non-negative value:
> >        e.g.  'return c >= 0 ? c : -1;' (guththila_xml_parser.c, lines 1522,
> > 1537, 1616, 1632)
> > -1 is interpreted as an error code by the caller and the parser basically
> > stops, deeming the input as invalid.
> >
> > I encountered this situation when dealing with a response that contained
> > some non-ASCII characters. Those non-ASCII characters got stored in
> > Guththila's buffers of char after a wrap-around of their values, so as to
> > fit in a char. When 'guththila_next_char' got to those characters, it would
> > encounter negative values and return -1. As a consequence, Axis2C would
> > treat the response as invalid.
> >
> > Replacing the aforementioned 'return' statements with a simple 'return c;'
> > made Guththila parse the entire response, although the VS XML Visualizer
> > doesn't display the non-ASCII characters as expected.
> >
> > I'm wondering what's the rationale behind Guththila's current way of doing
> > things. I feel as if I'm missing something, so any help would be highly
> > appreciated.
> >
> > Thank you.
> >
> >
> > Catalina-Georgiana Caloian
> > Software Engineer
> >
> > Quintiq
> >
> > T +31 (0) 73 691 07 39
> > F +31 (0) 73 691 07 54
> > M +31 (0) 65 247 63 99
> > E catalina.calo...@quintiq.com
> > I www.quintiq.com
> >
> >
> > Quintiq Conference "Quintessence 09" Tuesday May 12th, 2009, Country Estate
> > Duin & Kruidberg, near Amsterdam Schiphol, The Netherlands - for more
> > information visit www.quintiq.com
> >
> > This message contains information that may be privileged or confidential
> > and is the property of Quintiq. It is only intended for the person to whom
> > it is addressed. If you are not the intended recipient, you are not
> > authorized to read, print, retain, copy, disseminate, distribute or use this
> > message or any part thereof. If you have received this message in error,
> > please notify the sender immediately and delete all copies of this message.
> > Please note that e-mails are susceptible to change, therefore they are not
> > binding.
> >
>
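
For reference, here is a minimal sketch of the behaviour described in the
quoted messages. It is not Guththila's code: byte_reader, next_char_signed
and next_char_unsigned are hypothetical names made up for illustration, and
the (signed char) cast is only there to reproduce what happens on platforms
where plain char is signed. The second function shows the getc()-style
convention of converting through unsigned char, so that bytes above 127 come
back as 128..255 and -1 stays reserved for end of input.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a parser's read cursor (not Guththila's struct). */
typedef struct {
    const char *data; /* raw bytes of the XML stream */
    size_t len;       /* number of valid bytes in data */
    size_t pos;       /* index of the next byte to read */
} byte_reader;

/* Same shape as the check quoted in the thread. The byte is forced through
 * signed char to mimic a platform where plain char is signed, so any byte
 * above 127 turns negative and is reported as -1. */
static int next_char_signed(byte_reader *r)
{
    assert(r != NULL);
    if (r->pos >= r->len)
        return -1; /* end of input */
    int c = (signed char)r->data[r->pos++];
    return c >= 0 ? c : -1;
}

/* getc()-style alternative: every byte maps to 0..255, so -1 can only
 * mean end of input. */
static int next_char_unsigned(byte_reader *r)
{
    assert(r != NULL);
    if (r->pos >= r->len)
        return -1; /* end of input */
    return (unsigned char)r->data[r->pos++];
}
```

With the two UTF-8 bytes of the e with acute accent (0xC3 0xA9) as input, the
first function reports -1 on the very first byte, while the second returns
0xC3, then 0xA9, then -1 at the end of the buffer. Note that this only keeps
bytes above 127 from being mistaken for an error; it does not by itself
decode multi-byte UTF-8 sequences.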



-- 
Software Engineer, WSO2 Inc
http://wso2.org
supunk.blogspot.com
