RE: UTF-16 -> UTF-8

2001-11-22 Thread Edward Cherlin
> Sent: Thursday, November 22, 2001 5:29 AM > To: Edward Cherlin; [EMAIL PROTECTED] > Subject: RE: UTF-16 -> UTF-8 > > > Hi Edward, > > > > You can tell Access about the encoding when you import a file. > > > > In Access 2000, open the File menu, and on

Broken fonts (was RE: UTF-16 -> UTF-8)

2001-11-22 Thread Edward Cherlin
EMAIL PROTECTED] > Subject: Re: UTF-16 -> UTF-8 > > > > On Wed, 21 Nov 2001 10:27:11 -0800 > Edward Cherlin <[EMAIL PROTECTED]> wrote: > > > > Your problem with '\' is that MS Mincho is trying to be > both a Unicode font > > and a Japanese fon

Re: UTF-16 -> UTF-8

2001-11-22 Thread Philip Newton
On Wed, 21 Nov 2001 22:04:52 -, in perl.unicode you wrote: > When adding the unicode value to the Sql string in > $sql="INSERT INTO Tipo_Referencia ( Descricao ) > VALUES ('$palavra_utf16');"; > there is an implicit conversion from the Unicode::String object > to a common Pe

RE: UTF-16 -> UTF-8

2001-11-22 Thread Rui Ribeiro
Hi Edward, > You can tell Access about the encoding when you import a file. > > In Access 2000, open the File menu, and on the Get External Data submenu, > select Import. The file browser dialog box will open. The problem with this is that I cannot simply import the file, because I have struct

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
: Wednesday, 21 November 2001 23:15 To: [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Subject: RE: UTF-16 -> UTF-8 Yes Tim, I see your point. There is probably a relation with my problem. But it seems a bit strange that that happened using UTF-8, because Perl 5.6 seems to treat UT

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
AIL PROTECTED]] Sent: Wednesday, 21 November 2001 22:37 To: Rui Ribeiro; Philip Newton Cc: [EMAIL PROTECTED] Subject: RE: UTF-16 -> UTF-8 I don't know if this will help / is related or whatever, but I did find that when processing UTF8 data in an Oracle database PERL *s

RE: UTF-16 -> UTF-8

2001-11-21 Thread Tim Scott
value is actually written. I guess I'll have to handle my special chars outside Perl. It's less elegant, but probably easier to solve. Once again your insights have been very instructive. Thank you so much for your help. Best regards. Rui > -Original Message- > From: Philip Newton [mailt

RE: UTF-16 -> UTF-8

2001-11-21 Thread Edward Cherlin
. > -Original Message- > From: Rui Ribeiro [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, November 21, 2001 10:47 AM > To: Philip Newton > Cc: [EMAIL PROTECTED] > Subject: RE: UTF-16 -> UTF-8 > > > Philip, > > I can read the file properly on Word,no

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
Dear Martin, I can use perl 5.6. In fact I'm using it. I thank you for your code, but I need to write the converted words in UTF-16 to a database and not to a text file. We were using the text file for output only to see if the conversion was being properly done. But our true objective was (it

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
gards. Rui > -Original Message- > From: Philip Newton [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, 21 November 2001 18:29 > To: Rui Ribeiro > Cc: [EMAIL PROTECTED] > Subject: Re: UTF-16 -> UTF-8 > > > On Wed, 21 Nov 2001 16:34:48 -, in perl.unicode y

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
Philip, I can read the file properly in Word now. I just had to force it to have me confirm the conversion performed when opening the file. So when opening, if I "force" Word to treat it as Unicode, it will read the file properly. I just have to make Access recognize the encoding now. I'm feeling

Re: UTF-16 -> UTF-8

2001-11-21 Thread Tim Scott
ink you could use any Unicode font in an application that works with UTF-16, UTF-8, SCSU, UTF-7 or Latin-1. What do you mean with "look slightly different"? Different from what? You get the character for "south" when you try to output U+5317 (which is "north"? You get charac

Re: UTF-16 -> UTF-8

2001-11-21 Thread Philip Newton
is defined in the TTF spec. I would think you could use any Unicode font in an application that works with UTF-16, UTF-8, SCSU, UTF-7 or Latin-1. What do you mean with "look slightly different"? Different from what? You get the character for "south" when you try to output U+5317

Re: UTF-16 -> UTF-8

2001-11-21 Thread Philip Newton
On Wed, 21 Nov 2001 16:05:06 -, in perl.unicode you wrote: > now I can write to the DB, but the values are not properly recognized. If > you try to open the file I attached to my prior mail in Word, you'll > see exactly what I see in the DB record. In Word, I see ĨĩŨũ, but when I open it in

Re: UTF-16 -> UTF-8

2001-11-21 Thread Philip Newton
On Wed, 21 Nov 2001 16:34:48 -, in perl.unicode you wrote: > Don't lose more time over this. It seems there is some kind of problem with > the recognition of the encoding from other Office apps. > It's rather surprising that Notepad recognizes the characters properly and > Word and Access don'

RE: UTF-16 -> UTF-8

2001-11-21 Thread Edward Cherlin
ovember 21, 2001 8:42 AM To: Martin Duerst Cc: [EMAIL PROTECTED] Subject: Re: UTF-16 -> UTF-8 Martin, Thanks - MS Mincho looks interesting. What I found, though, was that some of the punctuation doesn't appear as expected. For example a ')' appears as a centralised dot and a '\

Re: UTF-16 -> UTF-8

2001-11-21 Thread Tim Scott
Martin, Thanks - MS Mincho looks interesting. What I found, though, was that some of the punctuation doesn't appear as expected. For example a ')' appears as a centralised dot and a '\' appears as a Yen symbol. Not terribly good for writing PERL ! Also - the glyphs looked slightly different : do yo

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
> -Original Message- > From: Philip Newton [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, 21 November 2001 15:45 > To: Rui Ribeiro > Cc: [EMAIL PROTECTED] > Subject: Re: UTF-16 -> UTF-8 > > > On Wed, 21 Nov 2001 15:14:38 -, in perl.unicode you wrote:

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
Philip, There was indeed a mistake in the invocation of the execute method of the connection object (it's quite stupid indeed. Allow me not to go into details... The stupid thing is that the instruction works OK if you write literals). I've changed that and now I can write to the DB, but the va

Re: UTF-16 -> UTF-8

2001-11-21 Thread Philip Newton
On Wed, 21 Nov 2001 15:14:38 -, in perl.unicode you wrote: > Still can't write to the DB though. The append SQL instruction has no effect. It looks wrong to me, too. > use Unicode::String qw(utf8 latin1); You don't need to import 'latin1' if you're not going to use it. (It's not going to h

RE: UTF-16 -> UTF-8

2001-11-21 Thread Rui Ribeiro
); > -Original Message- > From: Philip Newton [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, 21 November 2001 7:14 > To: Rui Ribeiro > Cc: [EMAIL PROTECTED] > Subject: Re: UTF-16 -> UTF-8 > > > On Wed, 21 Nov 2001 00:22:04 -, in perl.unicode you wrote

RE: UTF-16 -> UTF-8

2001-11-21 Thread Martin_Hosken
Dear Rui, I probably missed the start of this thread where you said that you couldn't use Perl 5.6. But if you could use Perl 5.6, then something like this would work: open(INFILE, "<$ARGV[0]") || die "Can't read $ARGV[0]"; open(OUTFILE, ">$ARGV[1]") || die "Can't write $ARGV[1]"; binmode OUTFI
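
Martin's snippet is cut off in this preview, so what follows is not his code. As a rough sketch of the same file-to-file idea, assuming Perl 5.8 or later (where PerlIO encoding layers exist) and a hypothetical helper name convert_utf16_file, the conversion could look like this:

```perl
use strict;
use warnings;

# Assumption: Perl 5.8 or later, where PerlIO encoding layers are available.
# convert_utf16_file is a hypothetical name, not something from the thread.
sub convert_utf16_file {
    my ($in_path, $out_path) = @_;

    # The UTF-16 layer reads the byte-order mark to choose the endianness.
    open(my $in, '<:raw:encoding(UTF-16)', $in_path)
        or die "Can't read $in_path: $!\n";
    # The UTF-8 layer encodes every character written to the handle.
    open(my $out, '>:raw:encoding(UTF-8)', $out_path)
        or die "Can't write $out_path: $!\n";

    # Copy line by line; the layers do the decoding and encoding.
    while (my $line = <$in>) {
        print {$out} $line;
    }

    close($in);
    close($out) or die "Can't finish writing $out_path: $!\n";
    return 1;
}
```

It would be called as convert_utf16_file($ARGV[0], $ARGV[1]). Letting the I/O layers do the work keeps the loop free of explicit conversion calls, at the cost of requiring a BOM on the input.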

Re: UTF-16 -> UTF-8

2001-11-20 Thread Philip Newton
On Wed, 21 Nov 2001 00:22:04 -, in perl.unicode you wrote: > Thank you for your help. Hope it was of some help :) > > But you said you wanted to convert from UTF-8 to UTF-16. So you probably > > want something like > > > > $palavra_objeito = utf8($_); > > $palavra_em_utf16 = $palavr

Re: UTF-16 -> UTF-8

2001-11-20 Thread Martin Duerst
At 17:25 01/11/20 +0100, Philip Newton wrote: > > PS: Does anyone know of - even an odd looking It would look really, really, odd. >- Fixed pitch Unicode font > > including Western European, CJK, Cyrillic and Greek glyphs (ie: most Left > > to Right data) ? It's not for an end-user, it's for te

RE: UTF-16 -> UTF-8

2001-11-20 Thread Rui Ribeiro
Philip, Thank you for your help. This work is being done by a couple of students of mine, so I just sent you one of the results of the experiments. But they have tried other things. So I'll make some localized comments below. > On Tue, 20 Nov 2001 16:35:25 -, in perl.unicode you wrote: >

Re: UTF-16 -> UTF-8

2001-11-20 Thread Tim Scott
Philip, Thank you - that's solved my problems. Regards, Tim Philip Newton <[EMAIL PROTECTED]> wrote: On Tue, 20 Nov 2001 16:49:38 + (GMT), in perl.unicode you wrote: > binmode STDIN; > while(<>) > { > $u = utf16($_); > $u->byteswap2 if defined $swap; # $swap defined based on command line options T

Re: UTF-16 -> UTF-8

2001-11-20 Thread Philip Newton
On Tue, 20 Nov 2001 16:49:38 + (GMT), in perl.unicode you wrote: > binmode STDIN; > while(<>) > { > $u = utf16($_); > $u->byteswap2 if defined $swap; # $swap defined based on command line options This looks strange. The way I read the manpage, byteswap2 is meant to be called as a functio

Re: UTF-16 -> UTF-8

2001-11-20 Thread Philip Newton
On Tue, 20 Nov 2001 16:35:25 -, in perl.unicode you wrote: > open(FICH1,"fich1.txt")||die"Nao foi possivel abrir o ficheiro fich1.txt"; > open(FICH3,">fich3.txt")||die"Nao foi possivel abrir o ficheiro fich3.txt"; Good that you check for success, but you should also include the reason -- it'
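
For what it's worth, Perl keeps the operating system's reason for a failed open in $!. A minimal sketch of including it in the die message follows; open_for_reading is a hypothetical helper name, and the three-argument open is my habit rather than something from the quoted code:

```perl
use strict;
use warnings;

# Hypothetical helper: open a file for reading, or die with the OS reason.
sub open_for_reading {
    my ($path) = @_;
    # $! holds the system error text (for example "No such file or directory").
    open(my $fh, '<', $path)
        or die "Cannot open $path: $!\n";
    return $fh;
}
```

With the reason in the message, a missing file and a permissions problem no longer look the same in the output.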

RE: UTF-16 -> UTF-8

2001-11-20 Thread Rui Ribeiro
utf8 function. Regards. Rui Ribeiro > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED]] On Behalf Of Gisle Aas > Sent: Tuesday, 20 November 2001 16:41 > To: [EMAIL PROTECTED] > Cc: Rui Ribeiro; Philip Newton; [EMAIL PROTECTED] > S

RE: UTF-16 -> UTF-8

2001-11-20 Thread Rui Ribeiro
"\n"; $conn->execute($sql,,,adExecuteNoRecords); } > -Original Message- > From: Philip Newton [mailto:[EMAIL PROTECTED]] > Sent: Tuesday, 20 November 2001 16:26 > To: Rui Ribeiro > Cc: Philip Newton; [EMAIL PROTECTED] > Subject: Re: UTF-16

Re: UTF-16 -> UTF-8

2001-11-20 Thread Tim Scott
Philip, Here's the first 256 bytes of each file for which the conversion produced unexpected results. FF FE 03 00 01 00 0A 00 18 00 6A 00 5A 00 01 00 00 00 61 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4B 0
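
The dump above starts with FF FE, which is the byte-order mark of little-endian UTF-16 (big-endian data would start with FE FF). As a small sketch, with utf16_byte_order being a hypothetical helper name, the first two bytes of a file can be checked like this before deciding whether any swapping is needed:

```perl
use strict;
use warnings;

# Hypothetical helper: report the UTF-16 byte order implied by a leading BOM.
sub utf16_byte_order {
    my ($bytes) = @_;
    my $bom = substr($bytes, 0, 2);
    return 'little-endian' if $bom eq "\xFF\xFE";
    return 'big-endian'    if $bom eq "\xFE\xFF";
    return 'unknown';    # no BOM: the byte order has to be known some other way
}
```

This only looks at the mark itself; it says nothing about whether the rest of the file is really text.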

Re: UTF-16 -> UTF-8

2001-11-20 Thread Philip Newton
On Tue, 20 Nov 2001 15:02:53 -, in perl.unicode you wrote: > I saw your reference to the use of Unicode::String. Excuse me for > asking, but does it work? I believe so. It's been a while since I used it, but I think it did what I wanted it to back then. > We've tried several functions fro

Re: UTF-16 -> UTF-8

2001-11-20 Thread Philip Newton
On Tue, 20 Nov 2001 15:59:07 + (GMT), in perl.unicode you wrote: > b. One file worked fine, but for another it converted the Chinese > data to different Chinese data. Did you see any correlation between the code points? Like, say, turning 4567 into 6745? Can you give an example of "before"
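
To make the 4567 versus 6745 pattern concrete: that is what you get when every 16-bit unit is read with the wrong byte order. A minimal sketch using only the built-in pack and unpack, with swap_utf16_bytes as a hypothetical helper name:

```perl
use strict;
use warnings;

# Hypothetical helper: swap the two bytes of every 16-bit unit in a byte string.
sub swap_utf16_bytes {
    my ($bytes) = @_;
    # Read each unit as little-endian ('v'), write it back big-endian ('n').
    # A trailing odd byte, if any, is dropped.
    return pack('n*', unpack('v*', $bytes));
}
```

For example, unpack('H*', swap_utf16_bytes("\x45\x67")) gives "6745". If "before" and "after" code points are related this way, the data was probably swapped once too often, or not at all when it should have been.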

RE: UTF-16 -> UTF-8

2001-11-20 Thread Tim Scott
> Sent: Friday, 16 November 2001 19:41 > To: [EMAIL PROTECTED] > Cc: [EMAIL PROTECTED] > Subject: Re: UTF-16 -> UTF-8 > > > On Fri, 16 Nov 2001 17:41:52 + (GMT), in perl.unicode you wrote: > > > I'm wanting to convert a file from UTF16 into UTF8. I be

RE: UTF-16 -> UTF-8

2001-11-20 Thread Rui Ribeiro
ton [mailto:[EMAIL PROTECTED]] > Sent: Friday, 16 November 2001 19:41 > To: [EMAIL PROTECTED] > Cc: [EMAIL PROTECTED] > Subject: Re: UTF-16 -> UTF-8 > > > On Fri, 16 Nov 2001 17:41:52 + (GMT), in perl.unicode you wrote: > > > I'm wanting to co

Re: UTF-16 -> UTF-8

2001-11-16 Thread Philip Newton
On Fri, 16 Nov 2001 17:41:52 + (GMT), in perl.unicode you wrote: > I'm wanting to convert a file from UTF16 into UTF8. I believe I've > identified the tools to do it and all but installed them, apart > from Unicode::Map8 (v0.10). > > Can anyone help me with the build errors (below) or advise

UTF-16 -> UTF-8

2001-11-16 Thread Tim Scott
Hi, I've scanned the archive, but couldn't find anything relevant so .. I'm wanting to convert a file from UTF16 into UTF8. I believe I've identified the tools to do it and all but installed them, apart from Unicode::Map8 (v0.10). Can anyone help me with the build errors (below) or advise me of alt