[wp-trac] [WordPress Trac] #63863: Standardize UTF-8 handling and fallbacks in 6.9

WordPress Trac noreply at wordpress.org
Tue Oct 21 02:34:11 UTC 2025


#63863: Standardize UTF-8 handling and fallbacks in 6.9
--------------------------------------+-----------------------
 Reporter:  dmsnell                   |       Owner:  dmsnell
     Type:  enhancement               |      Status:  assigned
 Priority:  normal                    |   Milestone:  6.9
Component:  Charset                   |     Version:  trunk
 Severity:  normal                    |  Resolution:
 Keywords:  has-patch has-unit-tests  |     Focuses:
--------------------------------------+-----------------------

Comment (by dmsnell):

 In [changeset:"61000" 61000]:
 {{{
 #!CommitTicketReference repository="" revision="61000"
 Charset: `wp_has_noncharacters()` for more-specific Unicode handling.

 Noncharacters are code points that are permanently reserved in the Unicode
 Standard for internal use. They are not recommended for use in open
 interchange of Unicode text data. However, they are valid code points and
 will not cause a string to return as invalid.

 Still, HTML and XML both impose semantic rules on their use and it may be
 important for code to know whether they are present in a string. This
 patch introduces a new function, `wp_has_noncharacters()`, which answers
 this question.

 See https://www.unicode.org/versions/Unicode17.0.0/core-
 spec/chapter-23/#G12612

 Developed in https://github.com/WordPress/wordpress-develop/pull/9827
 Discussed in https://core.trac.wordpress.org/ticket/63863

 See #63863.
 }}}

-- 
Ticket URL: <https://core.trac.wordpress.org/ticket/63863#comment:44>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list