ID:               47984
 Updated by:       w...@php.net
 Reported By:      js at mcs dot be
-Status:           Assigned
+Status:           Open
 Bug Type:         COM related
 Operating System: Windows XP
 PHP Version:      5.2.9
 Assigned To:      wez


Previous Comments:
------------------------------------------------------------------------

[2009-04-16 09:22:47] js at mcs dot be

Description:
------------
When using the attached script (saved as UTF-8 without BOM) to generate
a word document, at the end of the text typed programmatically into word
a "ÿ" character gets added. This character is only visible if I press
the inverted "P" button on the toolbar. This is a reduced test case, but
in real-world scenarios, visibly garbled text is inserted into word.

Reproduce code:
---------------
<?php
   unlink(dirname(__FILE__)."\\test.doc");
   $word = new COM("word.application", NULL, CP_UTF8);
   $word->Visible = false; // hidden
   $word->Documents->Add();
   $word->Selection->TypeText("This is a tést...");
   $word->Documents[1]->SaveAs(dirname(__FILE__)."\\test.doc");
   $word->Quit();
   $word = null;
   header("Content-type: text/plain");
   echo "wrote file to ".dirname(__FILE__)."\\test.doc";
?>

Expected result:
----------------
The COM automation should accept UTF-8 formatted PHP strings without
malforming the string ending when the CP_UTF8 parameter is added.

Actual result:
--------------
Broken characters at the end of every string containing non-ASCII
characters sent via COM automation.


------------------------------------------------------------------------


-- 
Edit this bug report at http://bugs.php.net/?id=47984&edit=1

Reply via email to