What chr(int).chr(int) decompositions do these 4 letters have? Ș, ș, Ț, ț
I really need to get WordPress to "sanitize" them and replace them with S, s, T, t when it creates post slugs.
I know the code for this is in wp-includes/formatting.php and it's the "remove_accents" function, but I can't for the life of me figure how these letters decompose into chr(int).chr(int), and I would really love to find out since I'd like to have these diacritics included in that 开发者_如何学运维list.
I try to make it my business NOT to know or to care, and certainly not keep a table in my own code :)
echo iconv('utf-8','ascii//translit','Ș, ș, Ț, ț');
//Output: S, s, T, t
Assuming you are using utf-8:
Ș -> \xc8\x98 -> 200,152
ș -> \xc8\x99 -> 200,153
Ț -> \xc8\x9a -> 200,154
ț -> \xc8\x9b -> 200,155
精彩评论