Emacs 23 uses a character set four times larger than Unicode - why?

Posted: 2022-12-10 01:46

From Emacs 23.1 NEWS:

*** The Emacs character set is now a superset of Unicode. (It has about four times the code space, which should be plenty).

And more details later on:

*** In multibyte buffers and strings, characters are represented by UTF-8 byte sequences. The character code space is now 0x0..0x3FFFFF with no gap; code points 0x0..0x10FFFF are Unicode characters of the same code points, while code points 0x3FFF80..0x3FFFFF are raw 8-bit bytes.
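The three ranges in that NEWS entry can be sketched as a small classifier. This is only an illustration in Python, not Emacs code: the function names and the "beyond-unicode" label are made up here, and the in-order mapping of the 128 raw-byte code points onto bytes 0x80..0xFF is an assumption that the quoted text implies but does not spell out.

```python
# Boundaries taken from the quoted NEWS entry.
UNICODE_MAX = 0x10FFFF    # last Unicode code point
RAW_BYTE_MIN = 0x3FFF80   # first code point standing for a raw 8-bit byte
EMACS_MAX = 0x3FFFFF      # last code point in the Emacs 23 code space


def classify(code_point: int) -> str:
    """Return which part of the code space a code point falls in."""
    if not 0 <= code_point <= EMACS_MAX:
        raise ValueError(f"outside the code space: {code_point:#x}")
    if code_point <= UNICODE_MAX:
        return "unicode"
    if code_point >= RAW_BYTE_MIN:
        return "raw-byte"
    # Hypothetical label for everything between Unicode and the raw bytes.
    return "beyond-unicode"


def raw_byte_value(code_point: int) -> int:
    """Byte behind a raw-byte code point.

    Assumption: the 128 code points map in order onto bytes 0x80..0xFF.
    """
    if classify(code_point) != "raw-byte":
        raise ValueError(f"not a raw-byte code point: {code_point:#x}")
    return code_point - RAW_BYTE_MIN + 0x80


if __name__ == "__main__":
    for cp in (0x41, 0x10FFFF, 0x110000, 0x3FFF80, 0x3FFFFF):
        print(f"{cp:#08x} -> {classify(cp)}")
```

The raw-byte range is what lets a multibyte buffer hold bytes that are not valid text without losing them.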

According to Wikipedia, the BMP of the UCS has 65,536 code points, the latest version of Unicode contains more than 107,000 characters, and the UCS has more than one million code points (0x0..0x10FFFF, i.e. 1,114,112). 0x3FFFFF is more than four million.
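The "about four times" figure can be checked with a few lines of arithmetic. A minimal sketch in Python; the constant names are chosen here for illustration, and the values come from the NEWS text quoted above.

```python
# Inclusive upper bounds of each code space.
BMP_MAX = 0xFFFF
UNICODE_MAX = 0x10FFFF
EMACS_MAX = 0x3FFFFF

# Each code space starts at 0, so its size is the upper bound plus one.
bmp_size = BMP_MAX + 1
unicode_size = UNICODE_MAX + 1
emacs_size = EMACS_MAX + 1

# How many times larger the Emacs code space is than Unicode's.
ratio = emacs_size / unicode_size

if __name__ == "__main__":
    print(f"BMP:     {bmp_size:>9,}")
    print(f"Unicode: {unicode_size:>9,}")
    print(f"Emacs:   {emacs_size:>9,}")
    print(f"Emacs / Unicode = {ratio:.2f}")
```

The ratio comes out to 64/17, roughly 3.76, which matches the "about four times" wording in NEWS.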

What problems does this solve, or how else is it beneficial to have an internal character set that is a superset of Unicode?

Unicode is designed to encompass the required character sets for all human languages, which is certainly useful for globalisation/localisation of your code, but because Emacs is the tool of the gods themselves, it has to also encompass every character that may be used by deities of all kinds (including but not limited to the eldritch runes of the Great Old Ones), spacefaring races (including but not limited to our future alien overlords), ultra-intelligent machine intelligences (including but not limited to our future robot masters) and every other being that desires infinite cosmic power. That is potentially a whole lot of characters!

Or it could be that UTF-8, as an encoding scheme, has much more room than the Unicode character set takes up, and Emacs simply supports more of that space, but I prefer my explanation above.
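To see how much room the UTF-8 scheme has, here is a rough Python sketch based on the original UTF-8 bit pattern, which allowed sequences of up to 6 bytes (modern UTF-8 is restricted to 4 bytes and U+10FFFF). The helper names are invented for this example, and it makes no claim about the exact byte layout Emacs uses internally.

```python
def payload_bits(length: int) -> int:
    """Payload bits in a UTF-8-style sequence of `length` bytes (1..6)."""
    if not 1 <= length <= 6:
        raise ValueError("length must be between 1 and 6")
    if length == 1:
        return 7  # 0xxxxxxx
    # Lead byte: `length` one-bits, a zero, then (7 - length) payload bits.
    # Each continuation byte (10xxxxxx) adds 6 more.
    return (7 - length) + 6 * (length - 1)


def min_length(code_point: int) -> int:
    """Shortest sequence length whose payload can hold `code_point`."""
    if code_point < 0:
        raise ValueError("code point must be non-negative")
    for length in range(1, 7):
        if code_point < (1 << payload_bits(length)):
            return length
    raise ValueError(f"too large for a 6-byte sequence: {code_point:#x}")


if __name__ == "__main__":
    for n in range(1, 7):
        print(f"{n} byte(s): {payload_bits(n):2d} bits, max {(1 << payload_bits(n)) - 1:#x}")
    print("U+10FFFF needs", min_length(0x10FFFF), "bytes")
    print("0x3FFFFF needs", min_length(0x3FFFFF), "bytes")
```

Four bytes carry 21 bits (up to 0x1FFFFF), which already exceeds Unicode's 0x10FFFF. A 22-bit value such as 0x3FFFFF would need the 5-byte form under this scheme.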
