Orange is my favorite color

MotorsportReg.com retrieves feeds of events from various sources, typically in CSV. These are usually straight dumps from a database and we have no control over the output. I received a bug report the other day: our RSS feed wasn’t validating as UTF-8, and this was breaking many feed readers.

I checked it out and narrowed it down to this:

2006 SCCA National Championship Runoffs�

What you should see is the word “Runoffs” followed by either a question mark or a black diamond with a question mark in it. Either way, character #65533 (U+FFFD) is the Unicode replacement character, which a decoder substitutes for bytes it can’t interpret: the Unicode version of “What the…?” In this case, the original seems to have been a (TM) or (R) symbol from another character set.

In ColdFusion, you can safely replace these characters with a space, since they won’t display properly and they will break your RSS feed:

replaceNoCase(text, chr(65533), " ", "ALL")
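
The same thing can be reproduced outside ColdFusion. Below is a minimal Java sketch, assuming the stray byte was 0xAE (the (R) symbol in ISO-8859-1); I don't know the real encoding of the feed, so treat that as a guess. The class and method names are made up for illustration. It shows the character appearing when the bytes are decoded as UTF-8, the same bytes surviving when decoded with the assumed source charset, and the same replace-with-a-space fallback as above.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class ReplacementCharDemo {
    // U+FFFD, the same code point as chr(65533) in ColdFusion.
    static final char REPLACEMENT = '\uFFFD';

    // Decode raw bytes with an explicit charset rather than a platform default.
    static String decode(byte[] raw, Charset charset) {
        return new String(raw, charset);
    }

    // Fallback for text that is already damaged: swap each U+FFFD for a space.
    static String scrub(String text) {
        return text.replace(REPLACEMENT, ' ');
    }

    public static void main(String[] args) {
        // Assumed input: "Runoffs" followed by the single byte 0xAE.
        byte[] raw = {'R', 'u', 'n', 'o', 'f', 'f', 's', (byte) 0xAE};

        // 0xAE on its own is not valid UTF-8, so the decoder substitutes U+FFFD.
        String wrong = decode(raw, StandardCharsets.UTF_8);
        System.out.println(wrong.indexOf(REPLACEMENT)); // prints 7

        // Decoded as ISO-8859-1, the same byte becomes U+00AE, the (R) symbol.
        String right = decode(raw, StandardCharsets.ISO_8859_1);
        System.out.println((int) right.charAt(7)); // prints 174

        System.out.println("[" + scrub(wrong) + "]"); // prints [Runoffs ]
    }
}
```

If the source encoding is known, decoding with it keeps the symbol intact; scrubbing only hides the damage after the original byte is already lost.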

1 Comment

  1. mmm said:

    on May 21, 2007 at 3:07 am

    ColdFusion? I have this problem with Java and Apache HttpClient, what's up?
