Re; http://www.nelsonmullins.com/rss/rss_newsletters.cfm

Finally found a function that seems to clean most of the MS Word control 
characters and other crap out that was causing me probems. Using two 
filters on the body text seems to be taking care of my problems now..


<cfscript>
function ReplaceMicrosoftChars(arg_str) {
return ReplaceList(arg_str, 
"#Chr(19)#,#Chr(20)#,#Chr(25)#,#chr(8216)#,#chr(8217)#,#Chr(8211)#,#Chr(8212)#,#Chr(145)#,#Chr(146)#,#Chr(147)#,#chr(8220)#,#chr(8221)#,#Chr(148)#,#Chr(29)#,#Chr(28)#,#Chr(150)#,#Chr(151)#,#Chr(8230)#",
 
"--,--,',',',--,--,',',"","","","","","",-,-,...");
}
</cfscript>


<!--- CLEAN HTML --->
<cfset request.bodynohtml = 
"#rereplacenocase(stories.body,"<[^>]*>","","all")#" >

<!--- CLEAN WORD --->
<cfset request.msclean="#ReplaceMicrosoftChars(request.bodynohtml)#">




Feed seems to be working now, until the client finds something else to 
throw in there that the above doesn't cover!!!

Of course, a better way to do this would be to create valid XML text 
right from the start, but I've got hundreds of records of legacy data to 
deal with.



~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
ColdFusion MX7 and Flex 2 
Build sales & marketing dashboard RIA’s for your business. Upgrade now
http://www.adobe.com/products/coldfusion/flex2?sdid=RVJT

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:277074
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4

Reply via email to