php-i18n Digest 22 Sep 2008 11:58:23 -0000 Issue 411

Topics (messages 1287 through 1290):

Re: utf-8 characters delivered as question marks involuntary
        1287 by: Darren Cook
        1288 by: Nash Nipples
        1289 by: Nash Nipples

A good place to get started?
        1290 by: jsahleen

Administrivia:

To subscribe to the digest, e-mail:
        [EMAIL PROTECTED]

To unsubscribe from the digest, e-mail:
        [EMAIL PROTECTED]

To post to the list, e-mail:
        [EMAIL PROTECTED]


----------------------------------------------------------------------
--- Begin Message ---
>     input => output
> A: 3313 => 3313;  
> B: 3313 => 3?13;  
> -------------------------
> 3 -> a variable-length character;
> 1 -> a one byte ascii character;
> ? -> a one byte ascii question mark (literaly)
> ...
> Sometimes (probability 1/(2-20)) i get a question mark instead of a
> particular variable-length character when the other variable-length
> characters are displayed correctly within the same page.

Question mark is often used for characters that have no code point in
the charset. So, what is the code point of the character that turns into
a question mark? Does that same code point output okay some of the time?

Of course, this explanation is a bit dubious if input is UTF-8, internal
encoding is UTF-8 and output is UTF-8.

> More details for an instant: - Only characters read from a utf-8
> encoded file with a function file_get_contents() seem to be displayed
> correctly at all times; - Characters that are read by the php.exe
> (please excuse me here) do fail once in a while

Ah, could you be reading a partial multi-byte character and processing
it? E.g. taking an extreme example if you read 8 bytes, process them,
then repeat with the next 8 bytes, and they are an equal mix of
characters 1, 2 and 3 bytes long then there is a high probability you
will processing garbage.

Darren


-- 
Darren Cook, Software Researcher/Developer
http://dcook.org/mlsn/ (English-Japanese-German-Chinese-Arabic
open source dictionary/semantic network)
http://dcook.org/work/ (About me and my work)
http://dcook.org/blogs.html (My blogs and articles)

--- End Message ---
--- Begin Message ---
> > Do you think i should try
> > to read from a
> > better sector?
> 
> Unless you think the hardware is particularly old and
> suspect (e.g.
> unstable on all applications) I think the chance of it
> being a software
> bug or configuration issue is 99%.
> 
> Darren
> 

What i'm doing now is reading the file in a bad sector with the function 
file_get_contents() and sending it to the screen. It does not fail from the 
file_get_contents() side when it does fail from the php.exe side.

Nash


      

--- End Message ---
--- Begin Message ---
> > > Do you think i should try
> > > to read from a
> > > better sector?
> > 
> > Unless you think the hardware is particularly old and
> > suspect (e.g.
> > unstable on all applications) I think the chance of it
> > being a software
> > bug or configuration issue is 99%.
> > 
> > Darren
> > 
> 
> What i'm doing now is reading the file in a bad sector
> with the function file_get_contents() and sending it to the
> screen. It does not fail from the file_get_contents() side
> when it does fail from the php.exe side.
> 
> Nash
> 

moved to a different virtual server with a different executable version, same 
hard drive phpinfo() now clearly states Multibyte Support enabled
no optimization engine here

lets wait untill the question marks start getting into databases

Nash



      

--- End Message ---
--- Begin Message ---
Hello everyone,

My name is Joel Sahleen and I am a PHP developer living in the Salt Lake
City, UT area. My background is in Chinese and Japanese, both of which I
studied at Stanford University as part of my graduate program in Asian
Languages. I have been programming in PHP for about 2.5 years now (read
career change) and recently received my ZCE certification.

I am currently interviewing with a large web analytics company located in
Utah who want me to help localize their products for the East Asian market.
Although I have the linguistic background and the necessary programming
skills, I have never implemented a localization project before and I was
hoping that people on the list could point me toward some good resources to
help get me started. I understand that the new intl implementation of ICU is
probably the best way to go for doing conversions, but how does this work
together with other technologies such as gettext? My potential employers
primary concerns are performance and manageability. Which strategies would
best satisfy those requirements?

Any help or suggestions would be greatly appreciated!!

Thanks.

Joel D. Sahleen

-- 
View this message in context: 
http://www.nabble.com/A-good-place-to-get-started--tp19606544p19606544.html
Sent from the Php - Internationalization (i18n) mailing list archive at 
Nabble.com.


--- End Message ---

Reply via email to