ID: 47984
Updated by: [email protected]
Reported By: js at mcs dot be
-Status: Open
+Status: Assigned
Bug Type: COM related
Operating System: Windows XP
PHP Version: 5.2.9
-Assigned To:
+Assigned To: wez
Previous Comments:
------------------------------------------------------------------------
[2009-04-16 09:22:47] js at mcs dot be
Description:
------------
When using the attached script (saved as UTF-8 without BOM) to generate
a word document, at the end of the text typed programmatically into word
a "ÿ" character gets added. This character is only visible if I press
the inverted "P" button on the toolbar. This is a reduced test case, but
in real-world scenarios, visibly garbled text is inserted into word.
Reproduce code:
---------------
<?php
unlink(dirname(__FILE__)."\\test.doc");
$word = new COM("word.application", NULL, CP_UTF8);
$word->Visible = false; // hidden
$word->Documents->Add();
$word->Selection->TypeText("This is a tést...");
$word->Documents[1]->SaveAs(dirname(__FILE__)."\\test.doc");
$word->Quit();
$word = null;
header("Content-type: text/plain");
echo "wrote file to ".dirname(__FILE__)."\\test.doc";
?>
Expected result:
----------------
The COM automation should accept UTF-8 formatted PHP strings without
malforming the string ending when the CP_UTF8 parameter is added.
Actual result:
--------------
Broken characters at the end of every string containing non-ASCII
characters sent via COM automation.
------------------------------------------------------------------------
--
Edit this bug report at http://bugs.php.net/?id=47984&edit=1