[wp-xmlrpc] WP doesn't identify xml document text encoding

Daniel Jalkut jalkut at red-sweater.com
Tue Aug 21 21:26:02 GMT 2007


I just reported an issue against WP's xml documents:

http://trac.wordpress.org/ticket/4794

The issue is that a non-UTF8 blog will vend xml documents with no  
implied encoding, but which contain for instance ISO-8859-1 encoded  
characters. This leaves client XML parsers in a tough situation,  
because some common accented characters are legal in ISO-8859-1 but  
are outside the legal range for UTF-8.

I bring this up on the mailing lost both to help Joseph get more  
opinions about whether it's a good idea to advertise the encoding on  
the XML document, and to bring the attention to other client  
developers, in case their code would be affected by the issue.

The symptom "in the wild" is that parsing XML-RPC responses for some  
blogs will fail if your parser assumes the content is UTF8 and comes  
upon one of these accented characters.

Daniel


More information about the wp-xmlrpc mailing list