#31327 [Asn]: chinese char and word problem
ID: 31327 User updated by: vtsuper1 at mail dot hongkong dot com Reported By: vtsuper1 at mail dot hongkong dot com Status: Assigned Bug Type: COM related Operating System: win32 PHP Version: 5CVS-2005-08-30 Assigned To: wez New Comment: I have see the lastest version of php 5.0.5 and 5.1 but the change log haven't mention about this bug has been fixed. So would you tell me the developer doesn't think that this is a bug so they will not fix it or they are try to fixing it but didn't finish yet? Previous Comments: [2005-08-16 04:53:35] vtsuper1 at mail dot hongkong dot com After yours information, I have try to modified my code and make it support the correct code_page. But finially the result is exactly the same with my first script. There are some meaningless square appears after my Chinese word. (Actually I'm quite confused that why only Word has this problem but excel didn't?) here are some link related to this problem: http://msdn.microsoft.com/library/default.asp?url=/library/en-us/act/htm/actml_ref_scpg.asp http://bugs.php.net/bug.php?id=31327 http://bugs.php.net/bug.php?id=28117 http://hk.php.net/manual/en/class.com.php here are my new scripts: ?php $code_page=array('950','0','1','2','3','65001'); for ($i=0;$icount($code_page);$i++){ // starting word $word = new COM(word.application,NULL,$code_page[$i]) or die(Unable to instantiate Word); echo Loaded Word, version {$word-Version}\n; //bring it to front $word-Visible = 1; //open an empty document $word-Documents-Add(); //do some weird stuff $word-Selection-TypeText(ÄãºÃá); $word-Documents[1]-SaveAs(D:\AppServ\www\word\\.$code_page[$i]..doc); //closing word $word-Quit(); //free the object $word = null; } ? [2005-08-11 16:11:55] [EMAIL PROTECTED] Make sure you're setting up the code page properly. From the manual: com.code_page It controls the default character set code-page to use when passing strings to and from COM objects. If set to an empty string, PHP will assume that you want CP_ACP, which is the default system ANSI code page. If the text in your scripts is encoded using a different encoding/character set by default, setting this directive will save you from having to pass the code page as a parameter to the COM class constructor. Please note that by using this directive (as with any PHP configuration directive), your PHP script becomes less portable; you should use the COM constructor parameter whenever possible. Consult MSDN for more information on code pages. [2005-08-08 11:21:52] bmcrowley at lcwarriormail dot com I am experiencing the same bug with Arabic - I pull a UTF-8 character from a database, and push it into Word with UTF-8 for the characterset. I get meaningless characters (the box) appearing after (to the right of) the arabic text. Also, the arabic text gets mangled - words out of order... I would try to post the input/output here, but it would be meaningless... [2005-08-03 07:51:25] [EMAIL PROTECTED] Assigned to the maintainer of COM extension. [2005-08-03 05:17:34] vtsuper1 at mail dot hongkong dot com Dear sir, Thanks for your help. I have test it in a php 5.1.X version. But the bug doesn't fixed. The meaningless space change to a square characters only... Anyway, thanks for your help. (This problem hasn't exist in Excel, it only happened in Word only) From Victor The remainder of the comments for this report are too long. To view the rest of the comments, please view the bug report online at http://bugs.php.net/31327 -- Edit this bug report at http://bugs.php.net/?id=31327edit=1
#31327 [Fbk-Opn]: chinese char and word problem
ID: 31327 User updated by: vtsuper1 at mail dot hongkong dot com Reported By: vtsuper1 at mail dot hongkong dot com -Status: Feedback +Status: Open Bug Type: COM related Operating System: windowXP PHP Version: 5.1 Assigned To: wez New Comment: After yours information, I have try to modified my code and make it support the correct code_page. But finially the result is exactly the same with my first script. There are some meaningless square appears after my Chinese word. (Actually I'm quite confused that why only Word has this problem but excel didn't?) here are some link related to this problem: http://msdn.microsoft.com/library/default.asp?url=/library/en-us/act/htm/actml_ref_scpg.asp http://bugs.php.net/bug.php?id=31327 http://bugs.php.net/bug.php?id=28117 http://hk.php.net/manual/en/class.com.php here are my new scripts: ?php $code_page=array('950','0','1','2','3','65001'); for ($i=0;$icount($code_page);$i++){ // starting word $word = new COM(word.application,NULL,$code_page[$i]) or die(Unable to instantiate Word); echo Loaded Word, version {$word-Version}\n; //bring it to front $word-Visible = 1; //open an empty document $word-Documents-Add(); //do some weird stuff $word-Selection-TypeText(ÄãºÃá); $word-Documents[1]-SaveAs(D:\AppServ\www\word\\.$code_page[$i]..doc); //closing word $word-Quit(); //free the object $word = null; } ? Previous Comments: [2005-08-11 16:11:55] [EMAIL PROTECTED] Make sure you're setting up the code page properly. From the manual: com.code_page It controls the default character set code-page to use when passing strings to and from COM objects. If set to an empty string, PHP will assume that you want CP_ACP, which is the default system ANSI code page. If the text in your scripts is encoded using a different encoding/character set by default, setting this directive will save you from having to pass the code page as a parameter to the COM class constructor. Please note that by using this directive (as with any PHP configuration directive), your PHP script becomes less portable; you should use the COM constructor parameter whenever possible. Consult MSDN for more information on code pages. [2005-08-08 11:21:52] bmcrowley at lcwarriormail dot com I am experiencing the same bug with Arabic - I pull a UTF-8 character from a database, and push it into Word with UTF-8 for the characterset. I get meaningless characters (the box) appearing after (to the right of) the arabic text. Also, the arabic text gets mangled - words out of order... I would try to post the input/output here, but it would be meaningless... [2005-08-03 07:51:25] [EMAIL PROTECTED] Assigned to the maintainer of COM extension. [2005-08-03 05:17:34] vtsuper1 at mail dot hongkong dot com Dear sir, Thanks for your help. I have test it in a php 5.1.X version. But the bug doesn't fixed. The meaningless space change to a square characters only... Anyway, thanks for your help. (This problem hasn't exist in Excel, it only happened in Word only) From Victor [2005-08-02 23:49:10] [EMAIL PROTECTED] Please try using this CVS snapshot: http://snaps.php.net/php5-latest.tar.gz For Windows: http://snaps.php.net/win32/php5-win32-latest.zip The remainder of the comments for this report are too long. To view the rest of the comments, please view the bug report online at http://bugs.php.net/31327 -- Edit this bug report at http://bugs.php.net/?id=31327edit=1
#31327 [NoF-Opn]: chinese char and word problem
ID: 31327 User updated by: vtsuper1 at mail dot hongkong dot com Reported By: vtsuper1 at mail dot hongkong dot com -Status: No Feedback +Status: Open Bug Type: COM related Operating System: windowXP -PHP Version: 5.0.3 +PHP Version: 5.0.4 New Comment: I'm sorry, becuase I haven'r report to this bug for a long time. I have try the same code in php 5.0.4, but it having the same problem. In honest I don't know what is CVSso...I don;t know what can I do Previous Comments: [2005-03-08 01:00:21] php-bugs at lists dot php dot net No feedback was provided for this bug for over a week, so it is being suspended automatically. If you are able to provide the information that was originally requested, please do so and change the status of the bug back to Open. [2005-02-28 21:06:01] [EMAIL PROTECTED] Please try using this CVS snapshot: http://snaps.php.net/php5-latest.tar.gz For Windows: http://snaps.php.net/win32/php5-win32-latest.zip [2004-12-28 20:46:49] vtsuper1 at mail dot hongkong dot com Description: I have try this code in php 5.0.3 and 4.3 I'm using Chinese winXP and Chinese Office XP apache 2 both version of php also have the same problem When I use Chinese character in the string, the output of the word file will have some meaningless spaces after the words I expected(my expected string is ok, no problem). There are no problem if I using English. And this problem will not occurs in excel. Reproduce code: --- ?php // starting word $word = new COM(word.application) or die(Unable to instantiate Word); echo Loaded Word, version {$word-Version}\n; //bring it to front $word-Visible = 1; //open an empty document $word-Documents-Add(); //do some weird stuff $word-Selection-TypeText(§A¦n¶Ü); $word-Documents[1]-SaveAs(Useless test.doc); //closing word $word-Quit(); //free the object $word = null; ? Expected result: §A¦n¶Ü Actual result: -- §A¦n¶Ü (there are some spaces occurs, I use the underline to repersent the spaces) -- Edit this bug report at http://bugs.php.net/?id=31327edit=1
#31327 [Fbk-Opn]: chinese char and word problem
ID: 31327 User updated by: vtsuper1 at mail dot hongkong dot com Reported By: vtsuper1 at mail dot hongkong dot com -Status: Feedback +Status: Open Bug Type: COM related Operating System: windowXP -PHP Version: 5.0.4 +PHP Version: 5.1 New Comment: Dear sir, Thanks for your help. I have test it in a php 5.1.X version. But the bug doesn't fixed. The meaningless space change to a square characters only... Anyway, thanks for your help. (This problem hasn't exist in Excel, it only happened in Word only) From Victor Previous Comments: [2005-08-02 23:49:10] [EMAIL PROTECTED] Please try using this CVS snapshot: http://snaps.php.net/php5-latest.tar.gz For Windows: http://snaps.php.net/win32/php5-win32-latest.zip [2004-12-28 20:46:49] vtsuper1 at mail dot hongkong dot com Description: I have try this code in php 5.0.3 and 4.3 I'm using Chinese winXP and Chinese Office XP apache 2 both version of php also have the same problem When I use Chinese character in the string, the output of the word file will have some meaningless spaces after the words I expected(my expected string is ok, no problem). There are no problem if I using English. And this problem will not occurs in excel. Reproduce code: --- ?php // starting word $word = new COM(word.application) or die(Unable to instantiate Word); echo Loaded Word, version {$word-Version}\n; //bring it to front $word-Visible = 1; //open an empty document $word-Documents-Add(); //do some weird stuff $word-Selection-TypeText(§A¦n¶Ü); $word-Documents[1]-SaveAs(Useless test.doc); //closing word $word-Quit(); //free the object $word = null; ? Expected result: §A¦n¶Ü Actual result: -- §A¦n¶Ü (there are some spaces occurs, I use the underline to repersent the spaces) -- Edit this bug report at http://bugs.php.net/?id=31327edit=1