In message <009801c1a257$9dee9e40$[EMAIL PROTECTED]>, "Hardeep Si ngh" writes: >But, how can it get a value larger than 255? I mean, even if there is a >value greater than one byte, it should be interpreted as two consecutive >characters, not just one. The problem at hand requires the speed. Hence, >what can I do to make it either just ignore unicode files or ignore the >higher bit (this shud work correctly for UTF 8).
If you are reading your data from a file or other I/O source, set the character encoding for byte-to-char-translation to US-ASCII or ISO-LATIN-1 or some other encoding that will limit the values to <= 255. You can do this with an InputStreamReader. daniel -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
