How many non-printing characters are in common use?

2022-12-09 17:32 问答作者：

When writing interpreters for PDF, HTML and other documents we need to deal with a variety of white-space characters and additi开发者_JAVA百科onal non-printing characters. The ANSI ones are well defined, but how many others are likely to be found in practice? A typical example is the cluster in ISO10646 (I think):

&ensp;      &#8194;             en space
&emsp;  &#8195;             em space
&thinsp;    &#8201;             thin space
&zwnj;  &#8204;     ‌   ‌   zero width non-joiner
&zwj;   &#8205;     ‍   ‍   zero width joiner
&lrm;   &#8206;     ‎   ‎   left-to-right mark
&rlm;   &#8207;     ‏   ‏   right-to-left mark

(For obvious reasons the characters do not appear above!).

Unicode will be with us, in increasing quantity, for a long time. If an HTML or XML document is written in UTF-8 encoded Unicode, then you should expect any and all of these to appear.

In Unicode (Unicode Character Database) the following codepoints are defined as whitespace:

U+0009–U+000D (control characters, containing Tab, CR and LF)
U+0020 SPACE
U+0085 NEL (control character next line)
U+00A0 NBSP (NO-BREAK SPACE)
U+1680 OGHAM SPACE MARK
U+180E MONGOLIAN VOWEL SEPARATOR
U+2000–U+200A (different sorts of spaces)
U+2028 LS (LINE SEPARATOR)
U+2029 PS (PARAGRAPH SEPARATOR)
U+202F NNBSP (NARROW NO-BREAK SPACE)
U+205F MMSP (MEDIUM MATHEMATICAL SPACE)
U+3000 IDEOGRAPHIC SPACE

In development world there's at least one more (most often used in web development)

&nbsp;  // non-breaking space

But the more you get to design world the more you see various space/invisible characters. Publishing software normally has

space - the regular SPACE
en space
em space
thin space
hair space
non-breaking space
non-breaking fixed width space
sixth space
quarter space
third space
punctuation space
flush space
figure space
...

继续阅读：pdf whitespace

How many non-printing characters are in common use?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？