[qooxdoo-devel] Unicode characters and qx.lang.String.trim

John Spackman Fri, 02 Mar 2012 07:39:08 -0800

Hi

I came across an odd bug today  after getting some text from DOM nodes and
then passing it to qx.lang.String.trim(), what I thought was an empty string
had a length of 1.  It turned out to contain unicode character u+ffef, also
known as "Zero Width No-Break Space" - I found an article [1] that goes some
way to explaining it and also mentions other unicode space characters
u+FEFF, u+200B, u+200D, and u+2060.


At the moment qx.lang.String.trim, trimLeft, trimRight, and clean only
remove characters that match the \s regex, but should this be expanded to
include other Unicode whitespace characters?  Would it be detrimental to Qx
users if it was changed?

[1] http://blogs.msdn.com/b/michkap/archive/2005/01/20/357028.aspx

John

------------------------------------------------------------------------------
Virtualization & Cloud Management Using Capacity Planning
Cloud computing makes use of virtualization - but cloud computing 
also focuses on allowing computing to be delivered as a service.
http://www.accelacomm.com/jaw/sfnl/114/51521223/

_______________________________________________
qooxdoo-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/qooxdoo-devel

[qooxdoo-devel] Unicode characters and qx.lang.String.trim

Reply via email to