[wp-trac] [WordPress Trac] #2743: RSS feeds not cleaning html entities properly

WordPress Trac wp-trac at lists.automattic.com
Thu May 25 14:48:15 GMT 2006


#2743: RSS feeds not cleaning html entities properly
-----------------------+----------------------------------------------------
       Id:  2743       |      Status:  new                     
Component:  General    |    Modified:  Thu May 25 14:48:15 2006
 Severity:  blocker    |   Milestone:  2.0.3                   
 Priority:  normal     |     Version:  2.0.2                   
    Owner:  anonymous  |    Reporter:  VxJasonxV               
-----------------------+----------------------------------------------------
 Example Feed:[[BR]]
 http://el-tramo.be/feed/ [[BR]]

 Error:[[BR]]
 XML Parsing Error: undefined entity[[BR]]
 Location: http://el-tramo.be/feed/ [[BR]]
 Line Number 12, Column 25:      <description>Remko Tron&ccedil;on's
 Homepage</description>

 The owner of this blog has put the ç ( &ccedil; ) character into his blog
 literally, and wordpress is not cleaning it properly.[[BR]]
 I see that in functions-formatting.php line 795:[[BR]]
 '&Ccedil;' => '&#199;',[[BR]]

 However, this function (ent2ncr) is not getting called during feed
 generation.[[BR]]
 Unfortunately, this function may not be able to be adapted to simple parse
 RSS feeds, and allowed RSS named entities are:[[BR]]
 &lt; for <,[[BR]]
 &amp; for &,[[BR]]
 &gt; for >,[[BR]]
 ' for &apos;,[[BR]]
 and &quot; for ".[[BR]]

 However, all of those entities are being transformed into their numerical
 equivalent ( '&quot;' => '&#34;', '&amp;' => '&#38;', etc. ) with this
 function as well.[[BR]]

-- 
Ticket URL: <http://trac.wordpress.org/ticket/2743>
WordPress Trac <http://wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list