ID: 47984 Updated by: w...@php.net Reported By: js at mcs dot be -Status: Assigned +Status: Open Bug Type: COM related Operating System: Windows XP PHP Version: 5.2.9 Assigned To: wez
Previous Comments: ------------------------------------------------------------------------ [2009-04-16 09:22:47] js at mcs dot be Description: ------------ When using the attached script (saved as UTF-8 without BOM) to generate a word document, at the end of the text typed programmatically into word a "ÿ" character gets added. This character is only visible if I press the inverted "P" button on the toolbar. This is a reduced test case, but in real-world scenarios, visibly garbled text is inserted into word. Reproduce code: --------------- <?php unlink(dirname(__FILE__)."\\test.doc"); $word = new COM("word.application", NULL, CP_UTF8); $word->Visible = false; // hidden $word->Documents->Add(); $word->Selection->TypeText("This is a tést..."); $word->Documents[1]->SaveAs(dirname(__FILE__)."\\test.doc"); $word->Quit(); $word = null; header("Content-type: text/plain"); echo "wrote file to ".dirname(__FILE__)."\\test.doc"; ?> Expected result: ---------------- The COM automation should accept UTF-8 formatted PHP strings without malforming the string ending when the CP_UTF8 parameter is added. Actual result: -------------- Broken characters at the end of every string containing non-ASCII characters sent via COM automation. ------------------------------------------------------------------------ -- Edit this bug report at http://bugs.php.net/?id=47984&edit=1