Re: Need RegEx help for unknown/odd characters.
How about deAccent() ? http://cflib.org/udf/deAccent nathan strutz [www.dopefly.com] [hi.im/nathanstrutz] On Wed, May 15, 2013 at 9:20 AM, Che Vilnonis ch...@asitv.com wrote: Good morning. Occasionally when parsing a RSS feed, I get RSS data I cannot parse. Click for screenshot below (text highlighted in green). In this example, Montreal should return Montréal but does not. What regex could I use to remove or replace such odd characters? http://www.asitv.com/images/_funkychar.jpg Thanks, Che ~| Order the Adobe Coldfusion Anthology now! http://www.amazon.com/Adobe-Coldfusion-Anthology/dp/1430272155/?tag=houseoffusion Archive: http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355757 Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm
RE: Need RegEx help for unknown/odd characters.
Nathan, the problem is that within the feed itself, the characters don't have accents. When output to a browser, they funky chars display on screen with odd geometric shapes like an upright rectangle. They're like odd ascii chars that I have only seen a few time before. In the end, I can't insert them into the MySQL db. Ché -Original Message- How about deAccent() ? http://cflib.org/udf/deAccent On Wed, May 15, 2013 at 9:20 AM, Che Vilnonis ch...@asitv.com wrote: Good morning. Occasionally when parsing a RSS feed, I get RSS data I cannot parse. Click for screenshot below (text highlighted in green). In this example, Montreal should return Montréal but does not. What regex could I use to remove or replace such odd characters? http://www.asitv.com/images/_funkychar.jpg ~| Order the Adobe Coldfusion Anthology now! http://www.amazon.com/Adobe-Coldfusion-Anthology/dp/1430272155/?tag=houseoffusion Archive: http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355762 Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm
Re: Need RegEx help for unknown/odd characters.
How about making sure the charset is set properly. I have seen this kind of thing when not setting to UTF-8. This can be a problem either on the ingesting, or when the feed is written. On Fri, May 17, 2013 at 2:25 PM, Che Vilnonis ch...@asitv.com wrote: Nathan, the problem is that within the feed itself, the characters don't have accents. When output to a browser, they funky chars display on screen with odd geometric shapes like an upright rectangle. They're like odd ascii chars that I have only seen a few time before. In the end, I can't insert them into the MySQL db. Ché -Original Message- How about deAccent() ? http://cflib.org/udf/deAccent On Wed, May 15, 2013 at 9:20 AM, Che Vilnonis ch...@asitv.com wrote: Good morning. Occasionally when parsing a RSS feed, I get RSS data I cannot parse. Click for screenshot below (text highlighted in green). In this example, Montreal should return Montréal but does not. What regex could I use to remove or replace such odd characters? http://www.asitv.com/images/_funkychar.jpg ~| Order the Adobe Coldfusion Anthology now! http://www.amazon.com/Adobe-Coldfusion-Anthology/dp/1430272155/?tag=houseoffusion Archive: http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355763 Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm