Ahh, sorry bout that, i think the mbstring module is indeed enabled. I found
this in my php.ini:
mbstring.internal_encoding    = UTF-8  ; Set internal encoding to UTF-8

So this module will transparently encode my data? Do i need to disable the
entire module, or does it have an option to leave my data alone? I think
this is my own fault, because i activated this a while back...in case i
needed to manipulate UTF-8 data in php. But i didnt know it had other
effects except for the extra multibyte functions.

Thank you for the help (and sorry to bother you with this ;-),

Merijn

----- Original Message -----
From: "Marcus Börger" <[EMAIL PROTECTED]>
To: "Merijn van den Kroonenberg" <[EMAIL PROTECTED]>
Cc: "Wez Furlong" <[EMAIL PROTECTED]>; <[EMAIL PROTECTED]>
Sent: Tuesday, August 27, 2002 2:35 PM
Subject: Re: [PHP-DEV] curl and UTF-8, random encoding?


> Wez also mentioned mbstring. This module works transparently,
> so please check if that module is available.
>
> marcus
>
> At 12:25 27.08.2002, Merijn van den Kroonenberg wrote:
> >Thanks for the answer,
> >
> >It doesnt make sense indeed ;-) But unfortunately It hasn't been
processed
> >by an xml parser in the php script. When i check the output direct after
the
> >curl_exec its already converted (tested by echoing $result, and by
setting
> >RETURNTRANSFER to 0). The perl script that creates the xml does it right,
> >its easely to check, i just warn the xml at the moment that i print it
out,
> >so i can see the xml that is sent back to the php script in the webserver
> >logs.
> >
> >So i am 100% sure that the conversion takes place after it is sent to the
> >webserver, and before its processed by the rest of the php script.
> >
> >The only other thing that i can think up is that the webserver itself
might
> >do something to the data? I am running apache on linux, is that a
> >possibility?
> >
> >Merijn
> >
> >----- Original Message -----
> >From: "Wez Furlong" <[EMAIL PROTECTED]>
> >To: "Merijn van den Kroonenberg" <[EMAIL PROTECTED]>
> >Cc: <[EMAIL PROTECTED]>
> >Sent: Tuesday, August 27, 2002 11:49 AM
> >Subject: Re: [PHP-DEV] curl and UTF-8, random encoding?
> >
> >
> > > Hey,
> > >
> > > That does not make sense, since neither curl nor PHP do any
> > > kind of conversion like that.
> > > Are you sure that you're not looking at the output from an XML
> > > processor that has mangled utf-8 -> iso-8859-1 ??
> > > (expat has source and target encodings that can be set separately),
> > > And are you using something like mbstring with transparent encoding
> > > translation turned on?
> > >
> > > --Wez.
> > >
> > >
> > > On 08/27/02, "Merijn van den Kroonenberg" <[EMAIL PROTECTED]>
wrote:
> > > > Hello List,
> > > >
> > > > I have a problem with the php CURL module and UTF-8 data.
> > > > My php script uses curl to do a post to a perl/cgi script. This perl
> >script
> > > > returns UTF-8 encoded XML. The perl script returns utf-8, i have
> >verified
> > > > that using the webserver logfiles, but the data that i receive in
> >$result
> > > > (see below) is decoded to ISO-8859-1.
> > > >
> > > >     $ch = curl_init();
> > > >     curl_setopt($ch, CURLOPT_URL, $post_url);
> > > >     curl_setopt($ch, CURLOPT_HEADER, 0);
> > > >     curl_setopt($ch, CURLOPT_VERBOSE, 0);
> > > >     curl_setopt($ch, CURLOPT_POST, 1);
> > > >     curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
> > > >     curl_setopt($ch, CURLOPT_POSTFIELDS, $postfields);
> > > >     $result = curl_exec ($ch);// #### UTF compatible?
> > > >     curl_close ($ch);
> > > >
> > > > I did some further testing, and i found that this behaviour is not
> > > > consistent. Actually i am pretty puzzled about this.
> > > >
> > > > I was testing with a xml document that
> > > > contained only the following multi byte utf chacracter:
> > > > \303\253    (octal utf8) (LATIN SMALL LETTER E WITH DIAERESIS)
> > > > The output from CURL got automatically decoded to latin1.
> > > >
> > > > Then after that i tested with another xml document that
> > > > contained the following multi byte utf character:
> > > > \342\202\254 (octal utf8) (EURO SIGN)
> > > > I was suprised to see that the output was now correct UTF-8.
> > > >
> > > > Now i modified the first document and inserted the EURO SIGN in this
> > > > document. When i process this document again, the CURL output is
UTF-8.
> >So
> > > > it seems the output of CURL depends on what it detects on its imput,
and
> >it
> > > > will try to convert the data to latin1 if possible??
> > > >
> > > > Does anyone know how i can disable this behaviour? For me, CURL
should
> >not
> > > > do any en/de-coding of my data.
> > > >
> > > > I also looked around at the cURL library site (http://curl.haxx.se/)
of
> >the
> > > > developer of CURL. In message
> > > > http://curl.haxx.se/mail/curlphp-2001-02/0005.html the cURL
developer
> > > > indicates that the libraries do not care about character sets, and
that
> >it
> > > > might have something to do with the implementation into PHP.
> > > >
> > > > If this is true, then theres probably not much i can do about it. If
its
> >the
> > > > case, please let me know, so i can find an alternative.
> > >
> > >
> >
> >
> >--
> >PHP Development Mailing List <http://www.php.net/>
> >To unsubscribe, visit: http://www.php.net/unsub.php
>
>
> --
> PHP Development Mailing List <http://www.php.net/>
> To unsubscribe, visit: http://www.php.net/unsub.php
>


-- 
PHP Development Mailing List <http://www.php.net/>
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to