[wp-trac] [WordPress Trac] #63863: Standardize UTF-8 handling and fallbacks in 6.9
WordPress Trac
noreply at wordpress.org
Tue Oct 21 02:34:11 UTC 2025
#63863: Standardize UTF-8 handling and fallbacks in 6.9
--------------------------------------+-----------------------
Reporter: dmsnell | Owner: dmsnell
Type: enhancement | Status: assigned
Priority: normal | Milestone: 6.9
Component: Charset | Version: trunk
Severity: normal | Resolution:
Keywords: has-patch has-unit-tests | Focuses:
--------------------------------------+-----------------------
Comment (by dmsnell):
In [changeset:"61000" 61000]:
{{{
#!CommitTicketReference repository="" revision="61000"
Charset: `wp_has_noncharacters()` for more-specific Unicode handling.
Noncharacters are code points that are permanently reserved in the Unicode
Standard for internal use. They are not recommended for use in open
interchange of Unicode text data. However, they are valid code points and
will not cause a string to return as invalid.
Still, HTML and XML both impose semantic rules on their use and it may be
important for code to know whether they are present in a string. This
patch introduces a new function, `wp_has_noncharacters()`, which answers
this question.
See https://www.unicode.org/versions/Unicode17.0.0/core-
spec/chapter-23/#G12612
Developed in https://github.com/WordPress/wordpress-develop/pull/9827
Discussed in https://core.trac.wordpress.org/ticket/63863
See #63863.
}}}
--
Ticket URL: <https://core.trac.wordpress.org/ticket/63863#comment:44>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform
More information about the wp-trac
mailing list