[wp-xmlrpc] WP doesn't identify xml document text encoding
Daniel Jalkut
jalkut at red-sweater.com
Tue Aug 21 21:26:02 GMT 2007
I just reported an issue against WP's XML documents:
http://trac.wordpress.org/ticket/4794
The issue is that a non-UTF-8 blog will vend XML documents that
declare no encoding, but which contain, for instance, ISO-8859-1
encoded characters. This leaves client XML parsers in a tough
situation: with no declaration a parser will typically assume UTF-8,
and the single bytes that ISO-8859-1 uses for some common accented
characters are not valid UTF-8 sequences.
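
To make the byte-level mismatch concrete, here is a minimal Python
sketch. The sample word "café" and the helper name decodes_as_utf8 are
illustrative assumptions, not anything taken from WordPress itself.

```python
# The same text encoded two ways. "café" is just an illustrative sample.
text = "café"
latin1_bytes = text.encode("iso-8859-1")  # 'é' becomes the single byte 0xE9
utf8_bytes = text.encode("utf-8")         # 'é' becomes the two bytes 0xC3 0xA9


def decodes_as_utf8(data: bytes) -> bool:
    """Hypothetical helper: report whether data is a valid UTF-8 sequence."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(decodes_as_utf8(utf8_bytes))    # valid UTF-8
print(decodes_as_utf8(latin1_bytes))  # a lone 0xE9 is not valid UTF-8
```

A client that treats the ISO-8859-1 bytes as UTF-8 hits exactly this
decoding failure.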
I bring this up on the mailing list both to help Joseph get more
opinions about whether it's a good idea to advertise the encoding on
the XML document, and to bring it to the attention of other client
developers, in case their code would be affected by the issue.
The symptom "in the wild" is that parsing XML-RPC responses for some
blogs will fail if your parser assumes the content is UTF-8 and comes
upon one of these accented characters.
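
The symptom, and the effect of advertising the encoding, can be
sketched with Python's standard xml.etree.ElementTree parser. The
response fragment below is a made-up stand-in rather than a real
WordPress response, and parse_string_value is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Hypothetical response fragment: the text "café" encoded as ISO-8859-1.
BODY = "<value><string>café</string></value>".encode("iso-8859-1")

# An XML declaration that advertises the encoding actually used.
DECLARATION = b'<?xml version="1.0" encoding="ISO-8859-1"?>'


def parse_string_value(document: bytes):
    """Return the text of the <string> element, or None if parsing fails."""
    try:
        return ET.fromstring(document).find("string").text
    except ET.ParseError:
        return None


# No declaration: the parser assumes UTF-8 and rejects the 0xE9 byte.
undeclared = parse_string_value(BODY)

# With the declaration: the same bytes parse and yield the expected text.
declared = parse_string_value(DECLARATION + BODY)

print(undeclared)
print(declared)
```

In this sketch the only difference between the failing and the
succeeding parse is the declaration prepended to the document.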
Daniel